
Marek Melichar, ODODD
IIPC Preservation Working Group
Place: The Hague

Date (from-to): [...]

10 May: travel to The Hague; 11:50 departure from [...]
11 May: The Hague; General [...]
12 May: 9:30 [...] The Hague; General [...]; then travel to Prague, 19:00 departure from Amsterdam Schiphol

NK, 0136, see [...]

25 May 2011

[MEETING OF THE] IIPC PRESERVATION [WORKING] GROUP, 11 AND 12 MAY 2011, THE HAGUE

Marek Melichar, 13 May 2011

CONTENTS

JHOVE 2
Migration ARC > WARC
Preservation metadata for the web archive
[...] e-Depot

[MEETING OF THE] IIPC WG FOR PRESERVATION IN [...]

JHOVE 2

[...] BnF with the company ATOS Origin [...] WARC or HTML. The discussion will be about [...]

JHOVE 1 [...]

[...] obtain [...] from the web archive, i.e. install JHOVE2 in [...] infrastructure. If we were able to monitor the validation process from [...]

MIGRATION ARC > WARC

Debate on migrating ARC to WARC: it is a preservation action [...]

Compare ARC and WARC and decide accordingly [...]

[...] PRESERVATION GROUP

PRESERVATION METADATA FOR THE WEB ARCHIVE

BnF is working on an implementation of PREMIS for the web archive, either [...] ARC or WARC [...]

[...] ARCHIVE

[...] NK [...] for IOP: ingest of data from the web archive

[RISKS] OF SOFTWARE AND FORM[ATS]

documentation for [...]

httpwwwignaciogccomnetpreserverisksphp

[...] E-DEPOT

- the e-Depot holds [...] (journals, digitized masters and papers, also the web archive)
- a digitized master in 20 s from tape
- [...] e-Depot, and they will be available in English to everyone
- [...] of the e-Depot
- documentation of processes in the e-Depot for reporting, management, IT, cost analyses, etc.

[...] e-Depot

- digitization in [...]
- [...] xy etc.), so [...] process data store, also for management and CRM
- Data model: follow OPF and what they develop

[...]

- filter [...]
- Projects SCAPE and KEEP: [...]
- EU [...]
- ISO standard for web archives; how much web archive content in the Czech Republic [...]
- COST; Recollection tool (loc gov), see web; quality assurance in the context of web archives
- Marketing for web archives [...]
- archive [...] rights [...]
- Popularization of web archives among technicians and IT communities
- collection policy, digital strategy
- [...] far from chaos [...] and then [...] at KB

Salt Lake City, USA

Date (from-to): [...]

14 May: departure Prague-[...]; 16 May: PREMIS workshop; 17-19 May: Archiving 2011 conference; 20 May: departure SLC-[...]

15 May: departure Washington-Detroit-Salt Lake City; 15 May: workshops T1D Color in Image Capture Archiving and T2A Scanner & Camera Imaging Performance Benchmarking compliance and Workflow Monitoring; 17-19 May: Archiving 2011 conference

NK, Archiving conference, IOP-NDK

0136

N/A, 6 June 2011

Date: 6 June 2011, signature [...]

PREMIS tutorial

----------------------------------------------------------

Implementation of a High Performance Architecture for Managing and Storing Web-Harvested Collections
Michael Smorul and Joseph JaJa, University of Maryland (USA)

httpswikiumiacsumdeduadaptimages66bArchiving11-smorulpdf

- web archiving
- the WARC manager application
- [...]

Using Tape for Large-Scale Digital Preservation
Gary Wright, FamilySearch (USA)

- [...] (The Church of Jesus Christ of Latter-day Saints)
- NDK - DRPS: ingest tools; storage type: storage grid, NetApp FAS3170
- [...] information lifecycle management [...] optimization of the storage layer between the Grid and Rosetta
- human error [...]
- HW errors
- validation of data integrity
- maximization [...]
- integrity verification
- metadata, content [...]

Moving On: When it is Time to Re-Archive
Michael Selway, Quantum Corporation (USA)

- [...] with HDDs there is no other way
- Data migration from [...]: how to approach the migration [...]
- HDD: [...] years
- [...] and it could [...]
- what to do with [...]: split migration (full split migration)
- [...]

FamilySearch: An End-to-End Process for Scanning, Characterizing, Preserving and Providing Access to Very Large Collections of Vital Records
Tom Creighton, FamilySearch (USA), Jonathan Tilbury, Tessella plc (UK), and Mark Evans, Tessella Inc (USA)

- at FamilySearch: microfilms, from the mid-19th century [...]
- the DPS is their [...]

The Audit and Certification of FDsys

- FDsys [...] audit
- TRAC, httpwwwgpogovfdsys
- TRAC: HathiTrust, UNT, Portico, MetaArchive, Chronopolis [...]
- compliance scale 1-[...]

How Long is Long-Term Data Storage? (Focal)
Barry M Lunt, Brigham Young University, and Douglas Hansen, Wayne Rust and Mark Worthington, Millenniata Inc (USA)

- FLASH technology: 10-[...]; leakage of electrons and [...]
- [...] and yet errors appeared
- Library 1: 21 [...]
- Library 2: 18 [...]
- [...] 1050 years
- HDD: 1-7 years

Quality Assurance of Digital Information in Long-Term Digital Preservation
[...] University of Technology (Sweden)

- significant properties
- [...] in documents from [...]

Towards Interoperable Preservation Repositories: Repository Exchange Package Use Cases and Best Practices
Joseph Pawletko, New York University, and Priscilla Caplan, Florida Center for Library Automation (USA)

- TIPR [...]
- RXP: METS and PREMIS semantics; metadata files are not inside the METS but alongside it, in [...]
- succession [...]
- disaster recovery and [...] as an AIP
- SW migration
- diversification
- data migration in [...]

SARKK: Comprehensive Digital Archive Services for Finnish Municipalities
[...]-Savon Tietohallinto Oy (Finland)

- a company that [...] in ICT, long-[...]
- [...] with preservation planning functionality [...]

Magnetic Tape Technology: economic advantages for preservation
Gary Francis, Oracle (USA)

- an Oracle presentation
- Clipper Group 2010, "In search for the long-term archiving solution: tape delivers significant TCO advantage over disk"
- 5 TB on 1 cartridge (Sun/Oracle)

-----------------------------------------

Color In Digital Preservation
Robert Buckley, University of Rochester/NewMarket Imaging, Steven Puglia, National Archives and Records Administration, and Michael Stelmach, Library of Congress (USA)

- [...] etc.
- [...] of colours, by gamut: AdobeRGB, ProPhoto RGB
- [...] and colour reproduction

Multispectral Image Archiving of Watermarks in Historical Papers
Peter Meinlschmidt, Wilhelm-Klauditz-Institut, Fraunhofer-Institute for Wood Research, and Volker [...], Braunschweig (Germany)

- reflected light
- transmitted light
- [...]-spectral imaging [...]

Implementing a Quality Assurance Program for Monitoring Scanner Performance
Michael J Horsley and John T Berezich, National Archives and Records Administration (USA)

- microfilm, scan and [...]
- NARA 2004 Guidelines: httpsdocsgooglecomviewerurl=http3A2F2Fwwwarchivesgov2Fpreservation2Ftechnical2Fguidelinespdf
- Metamorfoze
- etc., see slides
- quantitative performance: see slides
- web-based database, SharePoint [...]

Preservation in a Digital Age
Jay Verkler, FamilySearch (USA)

- [...] with Tessella for SDB [...]
- data loss is intrinsic > [...]
- preservation as a service

Curation of the End-of-Term Web Archive: Classification and Metrics
Kathleen Murray, Lauren Ko and Mark Phillips, University of North Texas (USA)

- eotcd archive: httpresearchlibraryuntedueotcdwikiMain_Page
- 16 TB of data [...]
- [...] (not WARCs) [...] then it [...] joined together

DAITSS Grows Up: Migrating to a Second Generation Preservation System (Focal)
Priscilla Caplan and Carol Chou, Florida Center for Library Automation (USA)

- DAITSS 1 could not be installed anywhere else :-) [...] instructions
- [...] in the processes
- [...] PSP
- refresh function
- disseminate: do a refresh and export the new AIP as a DIP
- withdraw: remove the AIP from storage, retaining provenance
- [...] 80 TB [...]

A Community Driven Micro Services Architecture Supporting Long Term Digital Preservation
Mark Evans and Bill Steel, Tessella Inc (USA), and Robert Sharpe, James Carr, Alan Gairey and Jonathan Tilbury, Tessella plc (UK)

- [...] with NDIIPP [...] in micro-services
- [...] joined into a process; they can then be [...] arbitrarily
- workflow [...]
- [...] of the bit-stream
- [...] the cloud (S3, httpawsamazoncoms3); SaaS functionality will come soon: multi-tenancy, [...] functionality, policy, etc.

Note [...] from the USA: [...] 44 TB of SIPs in the test (10 MB JP2 files)

SDB community: [...] founded in 2008 [...]

[...] System
Lisa LaPlant and Blake Edwards, US Government Printing Office (USA)

- FDsys, OAIS, [...] MODS [...]
- <part>2</part>
- DMD: data model definition [...]
- httpwwwgpogovfdsyssearchpagedetailsactionst=pragueampgranuleId=amppackageId=DCPD-200900228

-----------------------------------------

Preservation Starts from the Beginning
Michael Wash, US Department of Transportation (USA)

- Autographic Kodak, 1916 [...]: "following the No. 3A Autographic Kodak Special of 1916, which was the first rangefinder camera. It had a Kodak Anastigmat f/6.3 lens and a Kodamatic shutter with speeds from 1/2 to 1/200 sec. plus bulb and time mode." (httpcamerapediawikiacomwikiNo_1A_Autographic_Kodak_Special)
- Kodak Advantix [...] stopped [...]

Colorite: A Flexible Cross-Platform Software Solution for Automatic Image Quality Analysis Using Arbitrary Targets
Henrik Johansson, National Library of Sweden (Sweden)

- [...]
  o the target is detected automatically [...] so that this process [...]
  o TIFF, JPEG, JP2, PNG
  o state-of-the-art feature-based image matching algorithm; ImageMagick
  o the algorithm [...]
  o [...] can be switched off in the future to speed the process up for BATCH processing

[...] of Henrik Johansson

What if the Image Quality Analysis Rates My D[...]
Dietmar Wueller, Image Engineering (Germany)

- Golden Thread
- UTT target
- [...]-lens reflex

Establishing Resolution Requirements for Digitizing Transmissive Content: A Use Case Approach
Michael Stelmach, Library of Congress, Don Williams, Image Science Associates LLC, and Steven Puglia, National Archives and Records Administration (USA)

- based on the 10% SFR limiting-resolution criterion: how much of the image information will be captured
  o first half of the 20th century: 1200-1600 PPI
  o second half of the 20th century: up to 2800 PPI

Digitise More, Pay Less: Optimising the Workprocess for both Heritage Institute and Imaging Provider
Olaf Slijkhuis, Pictura Imaginis (the Netherlands)

- [...] document, quality, parameters, etc.
- [...] checking of the image

[...] debate with FamilySearch, 20 May 2011

- [...] also on digital preservation issues
- [...] will have to provide free of charge [...] and synchronize
- an effort on their side to cooperate with archives and libraries

CO-FINANCED FROM THE EU STRUCTURAL FUNDS (EUROPEAN REGIONAL DEVELOPMENT FUND) THROUGH THE IOP

Business trip report
Project: Creation of the National Digital Library

CZ 10611000706386

Name and surname of the participant: Jan Hutař

Workplace according to the organizational structure: ODF 81

Position: head of department

Reason for the trip: attending the iPRES 2011 conference

Place (city): Singapore

Place (country): Singapore

Date (from-to): 30 October - 5 November 2011

Detailed schedule:
30-31 October: flight Prague > Dubai > Singapore
1 November: start of the conference, tutorials
2-4 November: conference
4-5 November: return flight Singapore > Dubai > Prague

Fellow travellers from NK: Mgr. Marek Melichar (paid from project 0136)

Funding: IOP, Creation of the National Digital Library

Relation to the project: obtaining new information about digital preservation and about projects in other libraries; consultations with colleagues and companies

Trip objectives: see relation to the project; use all outputs for planning and running the NDK project; use them for future digital preservation solutions at NK/NDK

Fulfilment of trip objectives: fulfilled; see the detailed notes below and the proceedings at SPS


Further detailed information: SUMMARY AND CONTRIBUTION TO THE NDK PROJECT

- a noticeable rise of long-term preservation solutions based on emulation (in previous years: migration) > the two approaches look likely to complement each other
- a shift toward preserving complex data, databases, etc.; NK does not address this yet
- many contributions are usable at NK and NDK (web archiving and preservation at the national library of France, audits, emulation, information on the SDB system (Tessella) and on the RODA system, certification (see Rouchon), etc.)
- information on problems and solutions when using tools such as JHOVE, PRONOM and others in a real environment
- 2 contributions on backing up optical discs, a current problem at NK as well; ideally follow the procedures described in the proceedings
- a clear need for international cooperation and adherence to standards so that such cooperation is possible
- for more detail see below

Support of project publicity: N/A

Related materials

Material: conference proceedings
Storage location: SPS, folder with business trip reports

Date the report was submitted: 15 November 2011

Signature of the submitter: [...]

Signature of the supervisor: 15 November 2011

Posted on the intranet


Received at the international department

Seamus Ross: digital curation and preservation
- preserving data sets: use of statistical databases and research data vs. preservation of textual information
- personal data: these are no longer just photos in a box (Flickr etc.)
- banks: lots of personal data, with future use for historians; it must be kept, and institutions do so as well
- see mckinsey.com, big data full report (pdf)
- why do DP at all (slide 16): future generations expect it; for historians and scientists, so that they have some sources; a record of the present for the future; an information ecosystem to enable storytelling
- the emphasis moves from preserving textual information to preserving complex databases

A capability model for DP, Ch. Becker et al.
- research within the SHAMAN and SCAPE projects
- SoS: systems of systems
- 3 kinds of systems (UML)
- a DPS where DP is a functional requirement
- SoS: a business system as a system within a system; the data then flow into a DPS
- a DPS where DP is not a functional requirement but the system does it anyway (DP-ready system): a business system with DP functionality
- but how to get DP into enterprise systems: a model for implementing DP in any system; within the SHAMAN project, a capability-based reference architecture
- governance, business and technical (operation) capability: a basis for decisions and for assessing the current state
- capability maturity model (CMM): assessment and improvement processes, as in SW development

Olivier Rouchon: certification and quality at CINES
- they store theses, digitized items, multimedia, documents, scientific data sets
- a data centre for the whole of France
- they have format experts (XML); 11 people
- 15TB of data
- certification: national law; CINES is the national centre for DP of theses; they have a department, people and money for it; for the procedure and preparation see below
- preparation for certification: testing DRAMBORA, DSA, TRAC, ISO 16363 and ISO 16919
- step: 2009 DRAMBORA audit; 2 risk reviews per year on how their mitigation is progressing
- step: formalization of business processes, 14 processes according to ISO 9001
- management, operational and support processes (presented at iPRES 2010)
- 2009: external pre-audit, 2 people, 19 man-days, based on all available standards, by means of documentation review and interviews
- 2010: SIAF audit, 4 months; the National Archives of France do it for every archive that stores public data, every 3 years; the report had 800 pages


- 2010: Data Seal of Approval, part of the EU framework for audit and certification of trusted repositories (an MoU between the three certification activities)
- 2011: within the APARSEN project they also did ISO 16363; they went through that audit together with DANS and UKDA
- first internal, in January and April 2011 (60 man-days), then external, with 12 experts (KB, BL, NASA, etc.), in June 2011 (3 days)

MoU: DSA > ISO 16363 as the second step (internal) > ISO 16363 extended. An audit gets faster the more of them you do, i.e. if it is done regularly it is not so time-consuming.

The National Library of New Zealand passed TRAC certification in autumn 2011.

Andreas Rauber: the impact of preservation actions on the repository, i.e. what happens to the repository itself
- simulation of a repository (RepoSim) for analysis and for testing migration: what happens when files grow larger, what if the repository holds more types of formats, etc.
- RepoSim: a flexible simulator, irregular patterns; so far an internal version; Hibernate, Java, MySQL
- one can specify which formats it accepts and their description, ingest settings, hypothetical tools (mainly for migration), and rules for preservation actions (migration to which format, which version, which files, how many times; rules + filters)
- a virtual migration can be run; it produces graphs that show how long it will take, etc.
- what to migrate, how, into what and over what period: it runs virtually and we see the result
- good for IT and HW planning
- good for planning various scenarios, comparing them with the expected development, and planning HW development and investments
- they still have to add the option of entering deletion policies, reports, etc.
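The kind of question a virtual migration answers can be illustrated with a minimal sketch. This is not RepoSim, which the notes describe only as an internal Java tool. The class names, the holdings and the tool throughput are all hypothetical, and the model (total volume divided by throughput) is a deliberate simplification.

```python
from dataclasses import dataclass


@dataclass
class FormatHolding:
    """Files of one format held in a hypothetical repository."""
    fmt: str
    file_count: int
    avg_size_mb: float


@dataclass
class MigrationRule:
    """Hypothetical rule: migrate every file of source_fmt to target_fmt."""
    source_fmt: str
    target_fmt: str
    tool_throughput_mb_s: float  # assumed speed of one migration worker


def simulate_migration(holdings, rule, workers=1):
    """Return (files_migrated, hours_needed) for one virtual migration run."""
    matching = [h for h in holdings if h.fmt == rule.source_fmt]
    files = sum(h.file_count for h in matching)
    total_mb = sum(h.file_count * h.avg_size_mb for h in matching)
    seconds = total_mb / (rule.tool_throughput_mb_s * workers)
    return files, seconds / 3600


# Made-up holdings and rule, for illustration only
holdings = [
    FormatHolding("TIFF", 100_000, 36.0),
    FormatHolding("PDF", 50_000, 2.0),
]
rule = MigrationRule("TIFF", "JP2", tool_throughput_mb_s=10.0)

for workers in (1, 4):
    files, hours = simulate_migration(holdings, rule, workers)
    print(f"{workers} worker(s): {files} files, {hours:.1f} h")
```

Varying the holdings or the number of workers gives the scenario comparison the talk describes as useful for HW and investment planning.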

José Barateiro: Risk assessment in DP of e-science data and processes
- DP as risk management
- ISO 31000: definition of risk management
- similar to DRAMBORA
- there are many standards for risk management
- the ISO 31000 methodology broken down into individual steps
- TIMBUS project (httptimbusprojectnet); one of the partners is SAP (Germany)

mad talks
- open source SW for LTP: RODA is back and is being developed within the SCAPE project; new features, plans for further development and for building a user community
- 4 posters on emulation: emulation within KEEP, emulation for reading rooms in libraries, the OPF ecosystem registry
- TOTEM: a metadata standard for describing the technical environment for emulation


ANDS, Ross Wilkinson
- data centres in Australia, at least 3, for different areas of life
- ANDS has existed for almost 3 years; funded by the Australian government
- a huge amount of data will never be used or read by a human, only by automated mining processes
- research data must be stored and preserved because it may no longer be possible to recreate them; so that they can be reused, so that new conclusions can be drawn from them, and so that scientists have them available
- this has to be done in cooperation; a single institution cannot do it on its own
- who deals with storing scientific data in the Czech Republic? The Academy of Sciences? CESNET?
- similar data centres also exist in Great Britain

Rob Sharpe: Considerations for High Throughput Digital Preservation
A presentation by Tessella on their performance testing of ingest into SDB at FamilySearch.

- SDB has been under development since 2002, when the first customer was the National Archives (UK)
- new customer: the UK Parliament
- test with FamilySearch
- 20 TB ingested per day; scanned materials; a workflow with antivirus, characterization (PRONOM, JHOVE), etc.
- 1 package is roughly 1 GB; 20,000 packages per day
- 2 Dell PowerEdge R710 servers, total price at most 20,000 pounds
- the limiting factor turned out to be the read speed of the disks that hold the data at the start of ingest; they therefore needed 130 parallel disks (50,000 pounds)
- storing to tape is also slow; they therefore needed 8 parallel tape writers (30,000 pounds)
- storage costs 100 pounds per TB
- 7.3 PB per year
- conclusion: writing and reading are slow; tools such as JHOVE and PRONOM are fast enough; storage costs also turned out to be high
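The figures in this list fit together, which a short calculation makes explicit. The sketch below assumes decimal units (1 TB = 10^12 bytes) and a 365-day year; the notes do not say which units the speaker used.

```python
TB = 10**12  # decimal terabyte (assumption; the notes do not specify)
GB = 10**9
MB = 10**6
SECONDS_PER_DAY = 24 * 60 * 60

daily_ingest_bytes = 20 * TB   # 20 TB ingested per day
package_bytes = 1 * GB         # one package is roughly 1 GB
cost_per_tb_pounds = 100       # storage cost quoted in the talk

packages_per_day = daily_ingest_bytes // package_bytes
sustained_mb_per_s = daily_ingest_bytes / SECONDS_PER_DAY / MB
yearly_tb = (daily_ingest_bytes * 365) // TB
yearly_pb = yearly_tb / 1000
yearly_storage_cost_pounds = yearly_tb * cost_per_tb_pounds

print(f"packages per day:      {packages_per_day}")
print(f"sustained throughput:  {sustained_mb_per_s:.0f} MB/s")
print(f"volume per year:       {yearly_pb} PB")
print(f"storage cost per year: {yearly_storage_cost_pounds} pounds")
```

The 20,000 packages per day and 7.3 PB per year match the notes. The sustained rate of about 231 MB/s is only the average needed to land 20 TB in 24 hours; each workflow step that re-reads the data multiplies the load on storage.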

For the ingest of data from the FamilySearch project they needed to guarantee a throughput of 20 TB of data per day while keeping adequate procedures for processing the data according to the requirements of OAIS and of the client. The point of the project was to identify the bottlenecks of ingesting a large volume of data.

Processes such as generating hashes or checking them, identifying formats and extracting technical metadata usually require, at large volumes, a fast storage system. In the FamilySearch project they want to ingest into SDB (content acquisition, content preparation, ingest: fixity check, content/metadata integrity check, characterization, i.e. format identification and validation and extraction of technical metadata) at up to 700 MB per second.

They looked at how to parallelize such a massive workflow efficiently while minimizing costs. According to their findings, parallelization makes it possible to get around performance problems of tools such as DROID and JHOVE; overall, software performance was not a problem, contrary to their expectations. The bigger problems are in the HW, which has to be able to write fast enough.

In other words, the bottleneck was in the HW and in moving data from place to place rather than in the performance of digital preservation tools.
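A parallel fixity check of the kind discussed above can be sketched as follows. It does not show how SDB implements it; the function names and the choice of SHA-256 and a thread pool are assumptions made for illustration.

```python
import hashlib
from concurrent.futures import ThreadPoolExecutor


def sha256_of(path, chunk_size=1024 * 1024):
    """Hash a file in fixed-size chunks so large files never sit in memory."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def fixity_check(expected, workers=4):
    """Return the paths whose current hash differs from the recorded one.

    `expected` maps a file path to the SHA-256 hex digest recorded earlier.
    Files are hashed concurrently by a pool of `workers` threads.
    """
    paths = list(expected)
    with ThreadPoolExecutor(max_workers=workers) as pool:
        actual = list(pool.map(sha256_of, paths))
    return [p for p, digest in zip(paths, actual) if digest != expected[p]]
```

Calling `fixity_check({"a.tif": "<recorded digest>"})` returns an empty list when every file still matches. Adding workers only helps while the storage can deliver data fast enough, which is the bottleneck the talk reports.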

Benefit for NK

There is no need to fear the performance of SW such as DROID or JHOVE.

Ross King: Evolving domains, problems and solutions for LT DP
- information on projects such as SCAPE
- programme: httpcordiseuropaeufp7icttelearn digicultreport research digital preservation_enpdf (Stephan Strodl, Petar Petrov and Andreas Rauber, Vienna University of Technology, Austria): a nice timeline of preservation projects, a whitepaper about the past of European DP
- funding spent on DP research is gradually growing; projects and funding alone will not solve anything
- ARCOMEM: archiving of web archives, a socially driven web preservation model
- social web analysis
- archive enrichment
- ENSURE: evaluation between cost and value; automation of the preservation cycle; testbeds: healthcare, clinical trials, financial services
- SCAPE
- preservation planning and action workflows: how to make them scalable
- creating an infrastructure for scalable preservation actions
- developing a policy-based preservation planning tool with automatic preservation watch
- 3 testbeds: web archives, large-scale repositories, research data sets
- all the projects will produce prototype SW
- digital lifecycle approach
- preservation planning plays a role in all these projects, together with virtualization
- a slide with trends in DP over recent years
- Research on Digital Preservation within projects co-funded by the European Union in the ICT [...]
- ENSURE
- SCAPE
- Wf4Ever: httpwwwwf4ever projectorgabout
- TIMBUS: SW is not enough; it focuses on the context of organizations; LTP is not only about objects but about services, etc.
- TOTEM

Benefit for NK

Follow projects in the area of long-term preservation of digital data. The latest EU projects such as SCAPE will speed up the development of concrete tools for long-term preservation of digital data.

Record keeping in temporary command settings, Erik Borglund
- preserving documentation of crisis situations that arises from the work of the police etc.
- how to capture the context: flipcharts, videos and written notes can be kept
- the context of analogue documents is not a problem; the problem is with digital items and conversations
- the national archive should take care of it, but it only accepts paper documents or, for example, photos from the meeting place
- the question is whether archiving file material is the same as archiving the course of a meeting in digital form


webarchiving session

BnF
- 200 TB of web-archived data
- 15 million ARCs; they must characterize and validate them, which is time-consuming
- they store them in a shared repository
- SPAR (the LTP system of the French national library) has a capacity of 16PB
- they use JHOVE2 for characterization and are creating a module for ARCs
- they do not want to characterize and validate the content of ARCs, only to identify formats
- PREMIS in METS would be too long, so they will record only metadata at the level of the information package (AIP) that are the same for the whole package; that is, a property is stated once and then only the information about which files it applies to is added, instead of repeating that information for every file
- they created a special metadata format
- i.e. they are able to ask the LTP system "give me all information packages that contain format XY" etc.; there is no need to index the metadata of the contents, which would take a long time; they have the same approach for digitized books
- different DP policies and levels of validation for different types of web archive data: complete harvests vs. thematic harvests
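The package-level approach described for BnF, stating a property once and listing the files it applies to, can be sketched in a few lines. This is not BnF's metadata format, which the notes do not describe; the function names and the sample data are hypothetical.

```python
from collections import defaultdict


def factor_by_property(per_file, key):
    """Group file names by the value of one property.

    Instead of repeating e.g. the format on every file entry, each distinct
    value is stated once together with the list of files it applies to.
    """
    grouped = defaultdict(list)
    for name, props in per_file.items():
        grouped[props[key]].append(name)
    return dict(grouped)


def packages_containing(packages, fmt):
    """Answer 'which packages contain format X' from package-level data only."""
    return [pid for pid, formats in packages.items() if fmt in formats]


# Hypothetical per-file metadata of one harvested package
per_file = {
    "a.html": {"format": "text/html"},
    "b.html": {"format": "text/html"},
    "c.png": {"format": "image/png"},
}
package_level = factor_by_property(per_file, "format")
print(package_level)
# {'text/html': ['a.html', 'b.html'], 'image/png': ['c.png']}

packages = {"AIP-1": package_level, "AIP-2": {"image/gif": ["d.gif"]}}
print(packages_containing(packages, "image/png"))  # ['AIP-1']
```

The format query only inspects the package-level keys, so the per-file content metadata never has to be indexed.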

NL NZ
- 2 harvests, 20 TB in total
- they are dealing with metadata: how much metadata is a lot and how much is too little
- library policy says that as much metadata as possible must be stored, but that would be a problem in terms of metadata size
- for selective web harvesting they have a finished workflow (WCT); everything is catalogued

IA
- 16 billion URLs
- the oldest from 1996
- growth is 3 TB per day, 1 PB per year

Euan Cochrane, Dirk von Suchodoletz: Replicating Installed Application and Information Environments onto Emulated or Virtualized Hardware

- capturing and preserving a complete environment on emulated HW
- e.g. taking the prime minister's desktop environment and storing it in an archive
- display problems
- computer forensics
- an option for preserving scientific data and records
- it is all about how to replicate an HDD and run the environment stored on it in a virtual environment
- solution: they pulled the HDDs out of several old PCs > identify the HW requirements (HDD analysis > automatic estimate of the requirements; it is part of every PC environment) > select emulation/virtualization SW (a tool registry such as TOTEM from the KEEP project) > convert the HDD into a disk image suitable for emulation > try to boot the disk image on emulated HW > add drivers
- problems with licences, personal data protection, authenticity (20% of things change: colours etc.)
- QEMU, SPARC processor emulator


Klaus Rechert: Remote Emulation for Migration Services in a Distributed Preservation Framework

- using emulation as a tool for migration
- tools for migrating certain formats are often not available
- the digital object is placed into an emulated environment (a virtual machine); it is then visible in the environment of the emulated system, where it can be opened in the original or another suitable application, saved in a different format and stored again in the virtual machine

Bram Lohman: Emulation as a Business Solution: the Emulation Framework (KEEP project)

- emulation framework: 7 emulators, 6 platforms (x86, Amiga, etc.), 23 file formats
- a solution for managing emulation tools
- setup of emulation processes
- an environment that contains emulators; if an application or a file is loaded into it, it should run as in the original environment
- the environment also contains a tool that shows, for a given file, which format it is and which environment is needed to run it, based on PRONOM; the environment can then be prepared directly and the file run in it; it loads the SW image from an application database (OPF) that is being built

Geoffrey Brown: Developing Virtual CD-ROM Collections: The Voyager Company Publications

- publications of one specific publisher on CD; interactive applications for Mac; of 200 published titles only about 50 are available today
- emulation on today's systems
- HDD snapshot directly in the emulator, i.e. it takes one click and is very fast
- SheepShaver emulator

Evaluation of Danish large migration project

- before 1998 they had no formats set by law
- between 2005 and 2008 they introduced standards
- the evaluation concerns the set standards and the migration to them at the national archive
- the evaluation was done for the funder
- between 2005 and 2008 they spent 30 person-years on migration; they had 10-15 people on it; invested 190 thousand USD; total costs 26 million USD
- in reality it is not much data: what they migrated was about 1777GB
- various parts of the archive: tapes, population data, data on CD-R, registries and data [...] electronically
- they could not read all the files, especially on tapes
- 5 different types of tapes
- some had to be rescued at great expense


Total cost of producing the preservation standard: 10 man-years, 12 thousand USD, including the manual and implementation recommendations.

Pilot: planning and management of the project, and verifying the information packages.

The aim of the pilot was to obtain a better budget and project plan.

Some problematic data in old formats, such as old databases, need smart people who do repetitive work that takes a long time; they needed good knowledge management to make it efficient.

Migration method: they wrote the requirements for a tool and a description of how manual migration should be done.

Data preparation (restructure data and register metadata of IPs) and preparation of documentation for the migrated IPs.

Software development: in-house development.

They needed 50 person-years for 1 [...]

Conclusions

Migration of standard data is cheaper, migration from some standard tapes is cheaper, etc.

Most of the costs (80%) fell on non-standardized data, in producing the migration software. Developing a tool for migrating heterogeneous or non-standard data is the most expensive part.

What they learned: the old data had not been analysed sufficiently.

Project management was loose; they lost money.

Knowledge management: a good description of old data and of all their types, generations, locations, etc. does not exist at NK and we will have difficulties with it; migrating old data at NK will be problematic.

Angela Dappert: robust migration workflow for offline media
- What is an archival object? (nice slide) A CD is not an archival object; for them it is a hand-held carrier. Better is a bit-stable object, which can have a backup etc., leading to an archival object, which has additional metadata; logical preservation
- A CD is not searchable, cannot be replicated easily, has a large manual overhead, and its rendering technology becomes obsolete very quickly
- Endangered Archives project: optical disks, CD-R, external HDs, tapes; 67 terabytes in total
- the OFFLINE hand-held carriers in the Endangered Archives project were highly variable; they contained data with DRM, under copyright, and a number of such problems
- the options they chose between for each data source:
- a disk image: a single file that contains everything that is on the carrier
- or extraction of only some of the files
- How important is the carrier itself? We need some information about it; it may hold traces of deleted data and we may want to keep them. They made disk images of everything possible: hybrid DVDs, audio discs that also held data, etc.
- Which disk image format should they use? Not one disk image format for all data; a special disk image format for each [...]
- They did it with robots (disk copying robots); in some cases large-scale disk copying robots could not be used: they are good at producing CDs but not at ripping data from CDs

CO-FINANCED FROM THE EU STRUCTURAL FUNDS (EUROPEAN REGIONAL DEVELOPMENT FUND) THROUGH THE IOP


- They built their own application with disk stacks and some smaller robots. They considered LIFO and FIFO and in the end used FIFO; LIFO had problems with the CD lifter.

At the KB they thought through a fairly complex workflow for how to describe all this.

Each robot had its own PC.

They had problems with a number of things; see the presentation.

They did not find good software for managing the images, only command-line tools, with which non-technical staff would make many mistakes.

It takes a lot of people before the content gets online.

The staff must be well trained and flexible, but also able to do tedious jobs: systematic and patient.

NOTE: important for NK, where the transfer of data from disks will also have to be dealt with, and has already been dealt with before.


KEEP project, Antonio Ciuffreda: towards an integrated migration environment

- Disk transfer tool: converts a disk into an image file. The output also carries additional metadata about the file system, the MD5 of the file, and so on.
- KEEP is building a Transfer Tool Framework.
- Magnetic media: disk transfer tools for floppy disks, commercial and open source.
- Disk2FDI: a commercial DOS tool. It makes a very precise image of a floppy disk, but takes 1 hour per disk, and the image is then very large, ten times larger than the floppy disk itself. About 2260 disks were tested; emulation was tested as well.
- Catweasel: a commercial tool in the form of a PCI card; runs on Linux and Windows XP and has a GUI. High error rate but faster; the quality of the image files was low.
- Nibtools: a free tool; G64 and D64 formats; covers only the C64; runs on DOS, Windows and Linux, but only from the command line. It needs a Commodore disk drive and special cables. A few disks were tested; about half of them then did not work in the emulator.
- Optical media: 5 transfer tools were used, all on the same CDs, DVDs and games.

1 Alcohol 120%: commercial; can bypass DRM and similar protection; supports Windows systems. 12 of 13 discs worked; 2 MB per second.
2 Daemon Tools: commercial; several types of image file (ISO, MDS, MDF); supports Windows; three of 13 did not work.
3 CloneCD: commercial; uses IMG or ISO; bypasses SafeDisc 3 protection; supports Windows; has a GUI; 11 of 13 were OK.
4 Blindwrite: commercial; supports DVD, Blu-ray, WM, Xbox and other special discs. Generates ISO and some proprietary image formats; one did not work; transfer speed
5 ImgBurn: does not read subchannel information into the image file (a film cannot be seeked, etc.); it is open source; generates DVD, BIN, CUE and IMG; Windows and Linux; 4 did not work; it is fast.
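
The disk transfer step noted above (an image file accompanied by file-system metadata and an MD5) can be sketched in Python as follows. This is only a minimal illustration and not the KEEP tool: the function names, the JSON sidecar layout and the `file_system` label supplied by the operator are all assumptions made for this example.

```python
import hashlib
import json
from pathlib import Path


def md5_of_file(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def describe_image(image_path: Path, file_system: str) -> dict:
    """Build a hypothetical metadata record for one disk image."""
    return {
        "image_file": image_path.name,
        "size_bytes": image_path.stat().st_size,
        "file_system": file_system,  # supplied by the operator, not detected
        "md5": md5_of_file(image_path),
    }


def write_sidecar(image_path: Path, file_system: str) -> Path:
    """Write the record next to the image as <image name>.json."""
    sidecar = image_path.with_name(image_path.name + ".json")
    record = describe_image(image_path, file_system)
    sidecar.write_text(json.dumps(record, indent=2))
    return sidecar
```

Calling `write_sidecar(Path("disk001.img"), "FAT12")` would leave `disk001.img.json` beside the image, so the checksum can be re-verified after every later copy.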

Conclusions

For magnetic media, commercial and non-commercial tools do not differ in performance; Disk2FDI is precise but very slow. KEEP will use NibTools.

Optical media: copy protection must be taken into account. The tools perform similarly; there will always be errors in the images, between 30 and 10 percent. BlindWrite can handle game discs (Xbox, etc.). KEEP will use ImgBurn because it is open source.

For complex images, Blindwrite is better.

Benefit for NK

Consider whether NK should actually run a project to migrate the content of CDs and DVDs to online media. The presentations give concrete experience with robotic processing and show the problems encountered in designing the workflow, choosing the type of ISO image, and so on.

BnF web archiving

They have three layers:

Harvest definition (collection)

Harvest instance (crawling metadata)


ARC files

A collection can be, for example, a selective one ("elections 2000"); below it come the individual harvest instances, and below them the ARC files. They collect the logs, the configuration and the reports, and store these in ARC as well: a special metadata ARC for each crawling instance.

PREMIS

Object, agent, event

Objects: 1. ARC files and metadata ARCs; 2. harvest instances

Harvest event in PREMIS: an event of creation of the content files

Events: reports as extensions of the event (host report and harvest report)

Agents: the administrator, the software, and the institutions or organizations that perform the harvest
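
The three layers and the PREMIS entities listed above can be sketched as a small data model. This is a hedged illustration only: the class and field names are hypothetical and do not reproduce the BnF schema or the PREMIS XML element names.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Agent:
    name: str
    kind: str  # e.g. "person", "software", "organization"


@dataclass
class ArcFile:
    file_name: str
    # True for the ARC that holds the logs, configuration and reports
    is_metadata_arc: bool = False


@dataclass
class HarvestEvent:
    event_type: str
    agents: List[Agent]
    # Reports attached to the event, e.g. host report and harvest report
    reports: List[str] = field(default_factory=list)


@dataclass
class HarvestInstance:
    label: str
    event: HarvestEvent
    arc_files: List[ArcFile] = field(default_factory=list)

    def content_arcs(self) -> List[ArcFile]:
        """Return only the ARC files that hold harvested content."""
        return [arc for arc in self.arc_files if not arc.is_metadata_arc]


@dataclass
class HarvestDefinition:
    collection: str
    instances: List[HarvestInstance] = field(default_factory=list)
```

In this sketch a harvest definition owns its harvest instances, each instance owns its ARC files, and the harvest event links the instance to the agents that performed the crawl.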

containerMD

http://bibnum.bnf.fr/containerMD-v1/documentation/containerMD-v1.html

Dedicated metadata for material from the web archive.

http://bibnum.bnf.fr/containerMD-v1

Different SLAs for different types of material and for data from different harvests. With a shared repository they expect various benefits: skills for individual formats need not be duplicated within the institution.

Next year there should also be a JHOVE2 module for WARC.


Memento: meta search

Benefit for NK

Their web archiving model could be used at NK.

Cost models: the Danish national library + TU Wien

- Stephan Strodl, TU Wien: they have their own cost model, but only small scale; automated preservation action cost, it seems.
- The Danish national library built its own model, which should be universal and usable anywhere.
- They measured the cost of submission according to the PAIMAS standard.
- When calculating costs they use OAIS and PAIMAS: they map activities onto these models and then estimate the cost of the processes accordingly.
- costmodelfordigitalpreservation.dk
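
The approach described above, mapping activities onto a reference model and summing the cost per process, can be sketched as follows. This is an illustrative assumption, not the Danish model: the activity names, hours and hourly rates are invented, and only the six OAIS functional entity names are taken from the OAIS standard.

```python
from collections import defaultdict
from dataclasses import dataclass
from typing import Dict, Iterable

# The six functional entities defined by the OAIS reference model
OAIS_ENTITIES = {
    "Ingest",
    "Archival Storage",
    "Data Management",
    "Administration",
    "Preservation Planning",
    "Access",
}


@dataclass(frozen=True)
class Activity:
    name: str
    oais_entity: str
    hours: float
    hourly_rate: float  # hypothetical figure, any currency


def cost_by_entity(activities: Iterable[Activity]) -> Dict[str, float]:
    """Sum hours * rate for every activity, grouped by OAIS entity."""
    totals: Dict[str, float] = defaultdict(float)
    for activity in activities:
        if activity.oais_entity not in OAIS_ENTITIES:
            raise ValueError(f"unknown OAIS entity: {activity.oais_entity}")
        totals[activity.oais_entity] += activity.hours * activity.hourly_rate
    return dict(totals)
```

The point of the grouping is comparability: once every activity is tied to a named functional entity, estimates from different institutions can be compared entity by entity.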

Benefit for NK

Relevant to project 0136, which dealt with the options for estimating the costs of long-term storage.

Meet RODA, a Full-Fledged Digital Repository for Long-Term Preservation

- Originally a project of the Portuguese national archive, which we have been following for several years. RODA is now supported, and partly further developed, by an independent company. So far RODA supports only an archival metadata format (EAD), but further development should include library formats as well.
- RODA is now part of the SCAPE project, within which the system can be developed further and scaled for use in mass production.
- http://redmine.keep.pt/projects/roda-public

Benefit for NK

Follow further development, possibly also for the INCAD + KNAV projects; for LTP development in smaller institutions this could be an interesting alternative in the future.

Report on a business trip abroad

Name and surname of the participant: Mgr. Andrea Fojtů (AF)

Workplace according to the organizational structure: 1.5 Department of Long-Term Preservation of Digital Data (ODODD)
Workplace (position): 1.5.2 Department of Digital Repository Content Management
Reason for the trip: participation in the conference Aligning National Approaches to Digital Preservation
Place (city): Tallinn
Place (country): Estonia
Date (from-to): 22.-26. 5. 2011

Detailed schedule:
Monday 23. 5.: conference registration, guided tour of the National Library; Keynote Address by Laura Campbell, Library of Congress, USA; Panel 1: Technical Alignment; Panel 2: Organizational Alignment
Tuesday 24. 5.: Keynote Address by Gunnar Sahlin, National Library of Sweden; Panel 3: Standards Alignment; Panel 4: Legal Alignment; Breakout Sessions for panels 3 & 4
Wednesday 25. 5.: Panel 5: Education Alignment; Panel 6: Economic Alignment; Breakout Sessions for panels 5 & 6; Synthesis; Closing remarks

Fellow travellers from NK: PhDr. Bohdana Stoklasová (BS), Ing. Tomáš Svoboda (TS)

Funding: IOP NDK
Objectives of the trip: attendance at a conference with international participation; making contacts in the area of long-term preservation of digital documents and electronic legal deposit; a more detailed insight into the long-term preservation of digital documents, (especially) in national libraries

Fulfilment of the objectives (specifically): The conclusions of the conference Aligning National Approaches to Digital Preservation largely mirror the conclusions of the workshop The Future of the Past: Shaping new visions for EU research in digital preservation (report available at http://cordis.europa.eu/fp7/ict/telearn-digicult/future-of-the-past_en.pdf), for example regarding the missing economic model for the commercial sector, which would treat long-term preservation as an integral part of all its processes. Contact was established with staff of the national libraries of Estonia and Finland (staff working on web archiving and on long-term preservation in general).

Programme and further details: The main aim of the conference was to align national approaches to the long-term preservation of digital documents across all areas: technical, organizational, educational, standardization, economic and financial.

Materials brought back: conference programme, leaflets of the exhibiting companies (Tessella, Equella, Guardtime) + further materials, notes

Date of submission of the report: 8. 6. 2011
Signature of the person submitting the report:

Signature of the superior:

Posted on the intranet:
Received by the international department:

Annex to this report: conference notes in English

Annex: conference notes in English

Exploring What We Can Do Together: Strategic Alliance for International Collaboration (Laura Campbell)

- 185 digital preservation partners in more than 25 countries (education, research, LAM)
- strategic goals: National Content Stewardship Network (national digital collection, technical architecture, public policy, outreach)
- NDIIPP Content Domain Map: a mind map of geospatial, audiovisual, image & text, and web content
- then & now: cognitive surplus vs. digital libraries / digital preservation
- solution framework: actively working together, special interest groups, establishing a common index, international digital collection (freely available)

PANEL 1: Technical Alignment (The role of testing). Panel head: Prof. Dr. Michael Seadle

- to collaborate on requiring and implementing rigorous and independent tests

DNB Contribution to the Tallinn Alignment (Sabine Schrimpf)

- key theme is infrastructure: the network of hardware and software that permits the operation of application software
- the question of interoperability is crucial (standards, technical specifications)
- software elements
- the components of the DP infrastructure were compared to pallets on railroads
  o source: PARSE.Insight Roadmap 2020
- persistent ID resolvers, certification process
- kopal (ingest, koLibRI)
- nestor (German Network of Expertise)
- DP4Lib: Digital Preservation as a Service; reduce dependency between components
  o redundant storage at different locations
  o koLibRI modules
- LuKII: set up as an economical LOCKSS network in Germany
- SHAMAN
- APARSEN: wants to bring coherence and cohesion to digital preservation research
  o trends in DP research projects
  o modular DP systems
  o distributed as SOA
  o elimination of technology dependencies

EDINA: The UK LOCKSS Alliance (Adam Rusbridge)

- EDINA offers underlying technical support & coordination
- threats to digital stewardship:
  o failure (media, HW, SW, network, format obsolescence, natural disaster, economic/organizational failure)
  o attack (insider/outsider)
  o operator error
- source: Requirement to Digital Preservation
- projects PEPRS and PECAN help identify coverage and requirements for DP

Public testing (Michael Seadle)

- traditional physical archiving relies heavily on trusted institutions
- distrust, not trust, needs to be the basis of digital archiving; testing therefore plays a key role
- goals of testing: demonstrate functionality, reveal weaknesses, provide data for planning improvements
- key issues for testing: integrity, authenticity (can the origin or genuineness be shown?), usability (can migration/emulation be demonstrated?), access, financial integrity
- DrWho (drwho1com)
- bit stream testing is the most important; authenticity and usability may be impaired
  o the type of storage media, the number of copies, and the frequency of checking and replacement get us to the relevant results
  o no reliable metrics exist, however (what is an acceptable loss, etc.)
- without well-documented, peer-reviewed, publicly available test results, librarians are buying archiving systems on faith

Presentation without a title (Andy Rauber)

- evaluation vs. testing vs. benchmarking
- DP testing: closer to evaluation than to testing, and far from benchmarking (a few tests, but nowhere near a definition of benchmarks)
  o existing evaluations are not repeatable
  o focus on the simple things
  o building the frameworks before having clear test scenarios
- necessary to move towards comparative benchmarking
- what is needed: a commitment that we want a culture of benchmarking and comparative evaluation; an understanding of what we want to benchmark; benchmark data + ground truth; measurement scales and measures that remain constant; knowledge bases to collect these

Organizing digital preservation on an international level (Michelle Gallinger, NDIIPP)

- focus on a national DP agenda
- community driven, action oriented (National Digital Stewardship Alliance)
  o present a distributed national digital collection for the benefit of citizens

The European Research Arena (David Giaretta)

- technologies: GEANT, EGEE/EGI
- EU research projects: TIMBUS, BLOG4EVER, SCAPE, ENSURE, APARSEN, ARCOMEM, WF4EVER
  o SCIDIP-ES (2011-2014)
- Alliance for Permanent Access (APA): formed as a legal entity 3 years ago
  o opportunities for networking
- ISO 16363: Audit and Certification of Trustworthy Digital Repositories
- ISO 16919: Requirements for Bodies Providing Audit and Certification

Observation from the MetaArchive Cooperative program (Martin Halbert)

- distributed DP programs differ from other programs
  o replication of content, distribution of the replicated copies to distinct geographical locations, and a network organization to connect these replicated copies
- MetaArchive: established in 2003, funded by NDIIPP
  o seeks to foster broader awareness of digital preservation issues
- IIPC: members are all institutions that focus on WA
  o 39 members: national and university libraries + other organizations (Internet Archive)
  o ISO standard WARC format for web archives + Heritrix and NutchWAX
  o growing membership (Africa & South America)

DAY 2

Keynote address: International and National Collaboration in the digital age (Gunnar Sahlin)

- 2012: a new law for e-deposit
- Samsök (search together) in 2005 (upgrade, new system)
- Swepub and long-term preservation
- consortium of the Swedish research libraries for licensing e-journals and databases (ICOLC)
- Open Access and e-publishing (all universities have their repositories for e-pubs)
- NL aggregator for Europeana (TEL, Apres, Athena, EU screen)
  o common system for the preservation of digital materials
  o common search portal for materials from the Swedish National Library and the Swedish National Archive

Raivo Ruusalepp

- standards: RAC, DSA, CIDOC (CRM), PAIMAS, ISA (DG), DDI
- use of information security standards for digital preservation
- information security: administrative and technical (physical = data security vs. IT = communication)
- company-implemented security measurement with typical cyber-crime scenarios
- survey of security:
  o provision for information security in national legislation and development plans (12 of the respondents; ISO 27000 series: only 2 formal audits, the rest are looking into it; 1/2 of the respondents do not use standards or formal measurements to control information security)
  o IT & disaster plan: 65 (data recovery from the off-site location tested: 0)
- alignment:
  o better use of community standards for information security and preservation
  o agreement on security requirements

Standards-based approach to preservation planning (Matthew Woollard)

- ISO 27001: very expensive implementation, 100 000 pounds
- basic: Data Seal of Approval guidelines (help you understand your business better)
- Audit and Certification of Trustworthy Digital Repositories
- Memorandum of Understanding to Create a European Framework
- ISO 16363 external; DSA or ISO 16363 self-audit

Best Practices & Standards (Bram van der Werf)

- self-assessment: trust
- audits/certification: trust
- ISO 30300 (draft): Record Management

PANEL 4: Legal Alignment

Legal deposit & Web Archiving (Adrienne Muir)

- legal deposit provisions: purpose, scope, deposit mechanisms, roles & responsibilities & liability, access provisions, sanctions
- implementation: definitions, scope (offline/online, freely available/pay wall), technology neutral, incremental
- legal deposit vs. voluntary (interim/hybrid approach, model agreements and licenses, flexibility)
- other legal issues: intellectual property rights, preservation, access, unlawful material, privacy/data protection
- voluntary approaches have disadvantages but may be necessary and can be useful

Breakout Session

- standards bring alignment by themselves, but only if you use them to the full, not halfway
- depends on the community (users): enforceable or voluntary compliance (standards)
- next steps for standards alignment:
  o corpora as benchmarks
  o export/import completeness
  o educational standards
  o validation tools
  o accredited training courses to accredit auditors
  o framework standards

DAY 3

PANEL 5: Educational Alignment

- key elements of the DCC Curation Lifecycle Model
- framework for the education alignment (USA: grad programs for digital preservation, new models for grad programs, internship programs (diverse knowledge), workshops)
  o sharing tools
  o national programs
- issues related to digital preservation:
  o nature of costs and business models
  o strategies for selection & appraisal
  o ground roles and responsibilities
  o effectiveness and demand for services
- focus of the panel: factors influencing the actual sustainability of a digital archive
- 2 considerations: collaboration + user demand
- challenges + gaps: span national boundaries, public + private funding, education, exportation, DP certification, competition, funding gaps, policy, selection criteria, roles & responsibilities, standards
- Magazzini Digitali: e-legal deposit in Italy
- PADI: a failed project (discontinued in 2010)

Presentation without a title (Neil Grindley)

- how much does it cost to manage information? what institutional financial strategies are required to facilitate effective preservation? what general economic frameworks are required to enable information to persist and be accessible?
- JISC 2010 Infrastructure for Education and Research Programme
- archival storage and preservation activities constitute a very small proportion of the overall costs: 1531 access 55 outreach acquisition ingest
  o approx. 333 Euros for a set of 1000 records
- KRDS2 p. 83: future tool development supporting automation of ingest

Sustainable Preservation in North America: ADPNet & Friends (Aaron Trehub)

- solution: distributed digital preservation (in at least 3 copies vs. LOCKSS 6 copies)
- DDP + LOCKSS = PLN (open SW developed at Stanford)
- MetaArchive, COPPUL (Canada), ADPNet: www.adpn.org
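
The reason distributed preservation asks for several copies can be shown with a simplified majority vote: replicas are compared by digest, and a replica that disagrees with a strict majority is overwritten. This is a hedged sketch of the general idea only; it is not the actual LOCKSS polling protocol, and the function names and node labels are hypothetical.

```python
import hashlib
from collections import Counter
from typing import Dict


def digest(data: bytes) -> str:
    """Return the hex SHA-256 digest of a byte string."""
    return hashlib.sha256(data).hexdigest()


def repair_by_majority(replicas: Dict[str, bytes]) -> Dict[str, bytes]:
    """Return a repaired copy of `replicas` (node name -> content).

    Nodes that disagree with a strict majority receive the majority
    content. If no strict majority exists, nothing is changed.
    """
    votes = Counter(digest(data) for data in replicas.values())
    winner, count = votes.most_common(1)[0]
    if count * 2 <= len(replicas):
        return dict(replicas)  # no strict majority: leave everything as is
    good = next(data for data in replicas.values() if digest(data) == winner)
    return {node: good for node in replicas}
```

With only two replicas a disagreement cannot be resolved, which is one way to see why the notes speak of at least 3 copies.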



Salt Lake City, USA

Date (from-to):

14. 5. departure from Prague; 16. 5. PREMIS workshop; 17.-19. 5. Archiving 2011 conference; 20. 5. departure from SLC

15. 5. flight Washington-Detroit-Salt Lake City; 15. 5. workshops T1D Color in Image Capture Archiving and T2A Scanner & Camera Imaging Performance: Benchmarking, Compliance and Workflow Monitoring; 17.-19. 5. Archiving 2011 conference

NK: Archiving conference, IOP-NDK

0136

NA; 6. 6. 2011

Date: 6. 6. 2011; signature

PREMIS tutorial

----------------------------------------------------------

Implementation of a High Performance Architecture for Managing and Storing Web-Harvested Collections (Michael Smorul and Joseph JaJa, University of Maryland, USA)

https://wiki.umiacs.umd.edu/adapt/images/6/6b/Archiving11-smorul.pdf

- web archiving
- the WARC manager application

Using Tape for Large-Scale Digital Preservation (Gary Wright, FamilySearch, USA)

- (The Church of Jesus Christ of Latter-day Saints)

NDK - DRPS, Ingest Tools. Storage type: storage grid, NetApp FAS3170

- information lifecycle management; optimization of the storage layer between the grid and Rosetta
- human error
- HW errors
- validation of data integrity
- maximization
- integrity verification
- metadata, content

Moving On: When it is Time to Re-Archive (Michael Selway, Quantum Corporation, USA)

- with HDDs there is no other way
- data migration: how to approach it
- HDD: years
- what to do about split migration (full split migration)

FamilySearch: An End-to-End Process for Scanning, Characterizing, Preserving and Providing Access to Very Large Collections of Vital Records (Tom Creighton, FamilySearch, USA; Jonathan Tilbury, Tessella plc, UK; and Mark Evans, Tessella Inc., USA)

- at FamilySearch: microfilms from the mid-19th century onwards
- DPS is their

The Audit and Certification of FDsys

- FDsys audit
- TRAC; http://www.gpo.gov/fdsys
- TRAC: HathiTrust, UNT, Portico, MetaArchive, Chronopolis
- compliance scale: 1-

How Long is Long-Term Data Storage? (Focal) (Barry M. Lunt, Brigham Young University; Douglas Hansen, Wayne Rust and Mark Worthington, Millenniata Inc., USA)

- FLASH technology: 10-
- electron leakage
- and yet errors appeared
- Library 1: 21
- Library 2: 18
- 1050 years
- HDD: 1-7 years

Quality Assurance of Digital Information in Long-Term Digital Preservation (University of Technology, Sweden)

- significant properties
- in documents

Towards Interoperable Preservation Repositories: Repository Exchange Package Use Cases and Best Practices (Joseph Pawletko, New York University, and Priscilla Caplan, Florida Center for Library Automation, USA)

- TIPR
- RXP: METS and PREMIS semantics; the metadata files are not inside the METS but next to it
- succession
- disaster recovery
- as an AIP
- software migration
- diversification
- data migration

SARKK: Comprehensive Digital Archive Services for Finnish Municipalities

-Savon Tietohallinto Oy (Finland)

- a company; ICT; long-
- preservation planning functionality

Magnetic Tape Technology: economic advantages for preservation (Gary Francis, Oracle, USA)

- Oracle presentation
- Clipper Group 2010: In Search of the Long-Term Archiving Solution: Tape Delivers Significant TCO Advantage over Disk
- 5 TB per cartridge (Sun/Oracle)

-----------------------------------------

Color In Digital Preservation (Robert Buckley, University of Rochester/NewMarket Imaging; Steven Puglia, National Archives and Records Administration; and Michael Stelmach, Library of Congress, USA)

- colour gamut
- AdobeRGB, ProPhoto RGB
- colour reproduction

Multispectral Image Archiving of Watermarks in Historical Papers (Peter Meinlschmidt, Wilhelm-Klauditz-Institut, Fraunhofer-Institute for Wood Research, and Volker

Braunschweig, Germany)

- reflected light
- transmitted light
- -spectral imaging

Implementing a Quality Assurance Program for Monitoring Scanner Performance (Michael J. Horsley and John T. Berezich, National Archives and Records Administration, USA)

- DAITSS
- microfilm, scan
- NARA 2004 Guidelines: https://docs.google.com/viewer?url=http%3A%2F%2Fwww.archives.gov%2Fpreservation%2Ftechnical%2Fguidelines.pdf
- Metamorfoze
- etc., see the slides
- quantitative performance: see the slides
- web-based database, SharePoint

Preservation in a Digital Age (Jay Verkler, FamilySearch, USA)

- with Tessella, for SDB
- data loss is intrinsic
- preservation as a service

Curation of the End-of-Term Web Archive: Classification and Metrics (Kathleen Murray, Lauren Ko and Mark Phillips, University of North Texas, USA)

- EOTCD archive: http://research.library.unt.edu/eotcd/wiki/Main_Page
- 16 TB of data
- (not WARCs)
- then joined together

DAITSS Grows Up: Migrating to a Second Generation Preservation System (Focal) (Priscilla Caplan and Carol Chou, Florida Center for Library Automation, USA)

- DAITSS 1 could not be installed elsewhere
- instructions
- in processes
- PSP
- refresh function
- disseminate: do a refresh and export the new AIP as a DIP
- withdraw: remove the AIP from storage, retaining provenance
- 80 TB

A Community Driven Micro Services Architecture Supporting Long Term Digital Preservation (Mark Evans and Bill Steel, Tessella Inc., USA; Robert Sharpe, James Carr, Alan Gairey and Jonathan Tilbury, Tessella plc, UK)

- with NDIIPP; micro-services
- joined into a process
- workflow
- bit-stream
- cloud (S3, http://aws.amazon.com/s3); SaaS functionality; multi-tenancy soon
- policy functionality and the like

Note from the USA: 44 TB SIP in a test (10 MB JP2 files)

SDB community: founded in 2008

System

(Lisa LaPlant and Blake Edwards, US Government Printing Office, USA)

- FDsys, OAIS, dig, MODS
- <part>2</part>
- DMD: data model definition
- http://www.gpo.gov/fdsys/search/pagedetails.action?st=prague&granuleId=&packageId=DCPD-200900228

-----------------------------------------

Preservation Starts from the Beginning (Michael Wash, US Department of Transportation, USA)

Autographic Kodak, 1916

"following the No. 3A Autographic Kodak Special of 1916, which was the first rangefinder camera. It had a Kodak Anastigmat f/6.3 lens and a Kodamatic shutter with speeds from 1/2 to 1/200 sec, plus bulb and time mode." http://camerapedia.wikia.com/wiki/No._1A_Autographic_Kodak_Special

Kodak Advantix

Colorite: A Flexible Cross-Platform Software Solution for Automatic Image Quality Analysis Using Arbitrary Targets (Henrik Johansson, National Library of Sweden)

  o the target is detected automatically
  o TIFF, JPEG, JP2, PNG
  o state-of-the-art feature-based image matching algorithm; ImageMagick
  o can be switched off in the future to speed up the process for batch processing

What if the Image Quality Analysis Rates My D (Dietmar Wueller, Image Engineering, Germany)

- Golden Thread
- UTT target
- -lens reflex

Establishing Resolution Requirements for Digitizing Transmissive Content: A Use Case Approach (Michael Stelmach, Library of Congress; Don Williams, Image Science Associates LLC; and Steven Puglia, National Archives and Records Administration, USA)

- based on the 10% SFR limiting-resolution criterion: how much of the image information will be captured
  o 1st half of the 20th century: 1200-1600 PPI
  o 2nd half of the 20th century: up to 2800 PPI

Digitise More, Pay Less: Optimising the Workprocess for both Heritage Institute and Imaging Provider (Olaf Slijkhuis, Pictura Imaginis, the Netherlands)

- document, quality, parameters, etc.
- image control

Debate with FamilySearch, 20. 5. 2011

- also on the issue of digital preservation
- will have to provide free of charge
- and synchronize
- an effort on their part to cooperate with archives and libraries


Business trip report
Project: Creation of the National Digital Library (Vytvoření Národní digitální knihovny)

CZ.1.06/1.1.00/07.06386

Name and surname of the participant: Jan Hutař

Workplace according to the organizational structure: ODF 8.1

Workplace (position): head of division

Reason for the trip: attending the iPRES 2011 conference

Place (city): Singapore

Place (country): Singapore

Date (from-to): 30. 10. - 5. 11. 2011

Detailed schedule: 30.-31. 10. flight Prague > Dubai > Singapore

1. 11. start of the conference, tutorials

2. 11. - 4. 11. conference

4.-5. 11. return flight Singapore > Dubai > Prague

Fellow travellers from NK: Mgr. Marek Melichar (paid from project 0136)

Funding: IOP, Creation of the National Digital Library

Relation to the project: obtaining new information on digital preservation and on projects in other libraries; consultations with colleagues and companies

Objectives of the trip: see the relation to the project; use all outputs for the planning and running of the NDK project and for the future handling of digital preservation at NK/NDK

Fulfilment of the objectives: fulfilled; see the detailed notes below and the proceedings on SPS


Further details: SUMMARY AND BENEFIT FOR THE NDK PROJECT

- a noticeable rise of emulation as an approach to long-term preservation (in previous years it was migration); the two approaches will, it seems, complement each other
- a shift towards the preservation of complex data, databases and the like; NK does not deal with this yet
- many papers are applicable to NK and NDK (web archiving and preservation at the national library of France, audits, emulation, information on the SDB system (Tessella) and on the RODA system, certification (see Rouchon), etc.)
- information on problems and solutions in using tools such as JHOVE and PRONOM in a real environment
- 2 papers on backing up optical discs: a current problem at NK as well; ideally follow the procedures described in the proceedings
- a clear need for international cooperation and for compliance with standards, so that such cooperation is possible
- for more detail see below

Support for project publicity: NA

Related materials

Material: conference proceedings. Location: SPS, folder with business trip reports

Date of submission of the report: 15. 11. 2011

Signature of the person submitting the report:

Date, signature:

Signature of the superior: 15. 11. 2011

Posted on the intranet:


Received by the international department:

Seamus Ross: digital curation and preservation

- preserving data sets: use of statistical databases and research data vs. preservation of textual information
- personal data: no longer just photos in a box (Flickr and the like)
- banks: a lot of personal data, of future use to historians; it must be preserved, and institutions are doing so too
- see mckinsey.com, big data full report (PDF)
- so why do DP? (slide 16): future generations expect it; so that historians and scientists have some sources; a legacy of the present for the future; an information ecosystem to enable storytelling
- the emphasis is shifting from preserving textual information to preserving complex databases

A capability model for DP, Ch. Becker et al.

- research carried out within the SHAMAN and SCAPE projects
- SoS: systems of systems
- 3 kinds of systems (UML):
- a DPS where DP is a functional requirement
- SoS: a business system, a system within a system; the data then flow into the DPS
- a DPS where DP is not a functional requirement but the system does it anyway (a DP-ready system): a business system with DP functionality
- how to get DP into enterprise systems: a model for implementing DP in any system, developed within SHAMAN as a capability-based reference architecture
- governance, business and technical (operational) capabilities: a basis for decisions and for assessing the current state
- capability maturity model (CMM): assessment and improvement processes, as in software development

Olivier Rouchon: certification and quality at CINES

- they store theses, digitized material, multimedia, documents and scientific data sets
- a data centre for the whole of France
- they have experts on formats and XML; 11 people
- 15 TB of data
- certification: under national law CINES is the national centre for DP of theses; they have a department, people and money for it; for the procedure and preparations see below
- preparation for certification: testing DRAMBORA, DSA, TRAC, ISO 16363 and ISO 16919
- step: 2009 DRAMBORA audit; 2 risk reviews per year checking how the risks are being dealt with
- step: formalization of business processes, 14 processes according to ISO 9001: management, operational and support processes (presented at iPRES 2010)
- 2009: external pre-audit, 2 people, 19 man-days, based on all available standards, carried out by reviewing documentation and interviews
- 2010: SIAF audit, 4 months; the French national archives do this for every archive that stores public data, with an audit every 3 years; the report had 800 pages


- 2010: Data Seal of Approval, part of the EU framework for audit and certification of trusted repositories (an MoU between the three certification activities)
- 2011: within the APARSEN project they also did ISO 16363; they went through that audit together with DANS and UKDA
- first internal, in January and April 2011 (60 man-days), then external, 12 experts (KB, BL, NASA etc.) in June 2011 (3 days)

MoU: DSA > ISO 16363 as the second step (internal) > ISO 16363 extended. An audit gets faster the more of them you do, i.e. if it is done regularly it is not so time-consuming.

The National Library of New Zealand passed TRAC certification in autumn 2011.

Andreas Rauber: impact of preservation actions on the repository

- what happens to the repository itself
- repository simulation with RepoSim, for analysis and for testing migration: what happens when files grow, when the repository holds more format types, and so on
- RepoSim is a flexible simulator that supports irregular patterns; so far an internal version (Hibernate, Java, MySQL)
- it lets you specify which formats the repository accepts and their description, ingest settings, hypothetical tools (mainly for migration), and rules for preservation actions (migration to which format and version, which files, how many times; rules plus filters)
- a virtual migration can be run; it produces graphs showing how long it will take and so on. What to migrate, how, into what and over what period all runs virtually, and the result can be inspected
- good for IT and HW planning
- good for planning different scenarios, comparing them with the expected development, and planning HW expansion and investment
- still to be added: the option to set deletion policies, reports, etc.
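The idea of a virtual migration can be illustrated with a very small sketch. This is not RepoSim; the class names, the compound-growth assumption and all figures (file counts, sizes, tool throughput) are hypothetical values chosen only to show how holdings, a migration rule and a growth scenario combine into a time estimate.

```python
from dataclasses import dataclass


@dataclass
class FormatHolding:
    """Files of one format held in the simulated repository (illustrative)."""
    fmt: str
    file_count: int
    avg_size_mb: float


@dataclass
class MigrationRule:
    """Hypothetical rule: migrate one format to another with a tool of given speed."""
    source_fmt: str
    target_fmt: str
    throughput_mb_per_s: float  # assumed speed of the migration tool


def simulate_migration(holdings, rules, yearly_growth=0.0, years=0):
    """Return estimated migration hours per source format.

    The file count is assumed to grow by `yearly_growth` per year, compounded
    over `years`, before the migration is run.
    """
    by_fmt = {h.fmt: h for h in holdings}
    hours = {}
    for rule in rules:
        holding = by_fmt.get(rule.source_fmt)
        if holding is None:
            continue  # the rule matches nothing in this repository
        files = holding.file_count * (1 + yearly_growth) ** years
        total_mb = files * holding.avg_size_mb
        hours[rule.source_fmt] = total_mb / rule.throughput_mb_per_s / 3600
    return hours


holdings = [
    FormatHolding("TIFF", 1_000_000, 20.0),
    FormatHolding("PDF", 200_000, 2.0),
]
rules = [MigrationRule("TIFF", "JPEG2000", throughput_mb_per_s=50.0)]

now = simulate_migration(holdings, rules)
later = simulate_migration(holdings, rules, yearly_growth=0.2, years=5)
print(f"TIFF migration now: {now['TIFF']:.1f} h, after 5 years: {later['TIFF']:.1f} h")
```

Running the same rule against several growth scenarios gives the kind of comparison the talk describes as useful for hardware and investment planning.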

José Barateiro: Risk assessment in DP of e-science data and processes

- DP as risk management
- ISO 31000: definition of risk management, similar to DRAMBORA
- there are many standards for risk management
- the ISO 31000 methodology broken down into individual steps
- TIMBUS project (http://timbusproject.net); one of the partners is SAP (Germany)

Mad talks

- open source SW for LTP: RODA is back, being developed within the SCAPE project; new functions, plans for further development and for building a user community
- 4 posters on emulation: emulation within KEEP, emulation for library reading rooms, the OPF ecosystem registry
- TOTEM: a metadata standard for describing the technical environment for emulation


ANDS, Ross Wilkinson

- data centres in Australia, at least 3 for different areas of life
- ANDS has existed for almost 3 years; funded by the Australian government
- huge volumes of data will never be used or read by a human, only mined by automated processes
- research data must be stored and preserved because it may no longer be possible to recreate them; they should be kept so that they can be reused, so that new conclusions can be drawn from them, and so that scientists have them available
- this has to be done in cooperation; it cannot rest on a single institution
- who deals with storing scientific data in the Czech Republic? The Academy of Sciences? CESNET?
- similar data centres also exist in Great Britain

Rob Sharpe: Considerations for High Throughput Digital Preservation

A presentation by Tessella on their performance testing of ingest into SDB at FamilySearch.

- SDB has been in development since 2002, when the first customer was the National Archives (UK)
- new customer: the UK Parliament
- test with FamilySearch
- 20 TB ingested per day; scanned material; a workflow with antivirus, characterization (PRONOM, JHOVE), etc.
- 1 package is roughly 1 GB, 20 thousand packages per day
- 2 Dell PowerEdge R710 servers, together costing at most 20,000 pounds
- the limiting factor turned out to be the read speed of the disks holding the data at the start of ingest, so they needed 130 parallel disks (50 thousand pounds)
- writing to tape was also slow, so they needed 8 parallel tape writes (30 thousand pounds)
- storage costs 100 pounds per TB
- 7.3 PB per year
- conclusion: writing and reading are slow; tools such as JHOVE and PRONOM are fast enough; storage costs also turned out to be high
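The figures quoted in the talk can be cross-checked with simple arithmetic. The sketch below assumes decimal units (1 TB = 10^12 bytes), which the notes do not state, and the variable names are illustrative only.

```python
# Figures taken from the talk notes; decimal units are an assumption.
TB = 10**12
daily_ingest_bytes = 20 * TB       # 20 TB ingested per day
package_bytes = 10**9              # roughly 1 GB per package
cost_per_tb_gbp = 100              # storage cost in pounds per TB

packages_per_day = daily_ingest_bytes // package_bytes
sustained_mb_per_s = daily_ingest_bytes / 86_400 / 10**6  # average over 24 hours
yearly_tb = 20 * 365
yearly_pb = yearly_tb / 1000
yearly_storage_cost_gbp = yearly_tb * cost_per_tb_gbp

print(f"packages per day:      {packages_per_day}")
print(f"sustained throughput:  {sustained_mb_per_s:.1f} MB/s")
print(f"volume per year:       {yearly_pb} PB")
print(f"storage cost per year: {yearly_storage_cost_gbp} GBP")
```

The package count and the yearly volume agree with the 20 thousand packages and 7.3 PB given in the notes; the yearly storage cost is a derived figure, not one quoted by the speaker.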

For ingesting data from the FamilySearch project they needed a throughput of 20 TB of data per day while keeping adequate data-processing procedures as required by OAIS and by the customer. The aim of the project was to identify the bottlenecks when ingesting large volumes of data.

Processes such as generating or checking hashes, identifying formats and extracting technical metadata usually require a fast storage system when volumes are large. In the FamilySearch project they want to ingest into SDB (content acquisition, content preparation, ingest, fixity check, content metadata integrity check, characterization, i.e. format identification and validation and extraction of technical metadata) at up to 700 MB per second.

The question is how to parallelize such a massive workflow efficiently while minimizing costs. According to their findings, parallelization makes it possible to get around performance problems of tools such as DROID and JHOVE; contrary to their expectations, overall software performance was not a problem. The bigger problems are in the HW, which has to be able to write fast enough.

In other words, the bottleneck was in the HW and in moving data from place to place rather than in the performance of the digital preservation tools.
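As an illustration of parallelizing one of the steps named above, the following minimal sketch computes fixity checksums for several files concurrently and then re-checks them. It is not Tessella's implementation; the function names, the choice of SHA-256, the worker count and the temporary test files are assumptions made for the example.

```python
import hashlib
import tempfile
from concurrent.futures import ThreadPoolExecutor
from pathlib import Path


def sha256_of(path, chunk_size=1024 * 1024):
    """Stream a file through SHA-256 so large packages need not fit in memory."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def checksum_many(paths, workers=4):
    """Hash several files concurrently; returns {path: hex digest}."""
    paths = list(paths)
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return dict(zip(paths, pool.map(sha256_of, paths)))


with tempfile.TemporaryDirectory() as tmp:
    # Create a few small stand-in "packages" for the demonstration.
    files = []
    for i in range(8):
        package = Path(tmp) / f"package_{i}.bin"
        package.write_bytes(bytes([i]) * 100_000)
        files.append(package)

    manifest = checksum_many(files)          # checksums recorded at ingest
    fixity_ok = checksum_many(files) == manifest  # later fixity check
    print(len(manifest), "files hashed, fixity check passed:", fixity_ok)
```

Threads are used here because reading files is I/O-bound, which matches the observation that disk and tape speed, not the tools, limited throughput.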

Benefit for NK


There is no need to worry about the performance of software such as DROID or JHOVE.

Ross King: Evolving domains, problems and solutions for LT DP

- information about projects such as SCAPE
- Stephan Strodl, Petar Petrov and Andreas Rauber (all Vienna University of Technology, Austria), "Research on Digital Preservation within projects co-funded by the European Union in the ICT programme": http://cordis.europa.eu/fp7/ict/telearn-digicult/report-research-digital-preservation_en.pdf. It includes a nice timeline of preservation projects; a whitepaper about the past of European DP
- funding spent on DP research is gradually growing; projects and funding alone will not solve anything
- ARCOMEM: archiving for web archives, a socially driven web preservation model; social web analysis; archive enrichment
- ENSURE: evaluation between cost and value, automation of the preservation cycle; testbeds in healthcare, clinical trials and financial services
- SCAPE: preservation planning and action workflows and how to make them scalable; building an infrastructure for scalable preservation actions; developing a policy-based preservation planning tool with automatic preservation watch; 3 testbeds (web archives, large-scale repositories, research data sets)
- all the projects will produce prototype SW
- digital lifecycle approach
- preservation planning plays a role in all of these projects, together with virtualization
- a slide with trends in DP over recent years
- Wf4Ever: http://www.wf4ever-project.org/about
- TIMBUS: software is not enough; the project concentrates on the context of organizations; LTP is not only about objects but also about services, etc.
- TOTEM

Benefit for NK

Follow projects in the field of long-term preservation of digital data. The latest EU projects, such as SCAPE, will speed up the development of concrete tools for long-term preservation of digital data.

Record keeping in temporary command settings, Erik Borglund

- preserving documentation of crisis situations produced by the work of the police and similar bodies
- how to capture the context: flipcharts, videos and minutes can be kept, but what about the context?
- analogue documents are not a problem; the problem lies with digital material and conversations
- the national archive should take care of this, but it accepts only paper documents or, for example, photos from the scene
- open question: is archiving the case files the same as archiving the course of the proceedings in digital form?


Web archiving session

BnF

- 200 TB of web-archived data, 15 million ARC files; they have to be characterized and validated, which is time-consuming
- stored in a shared repository
- SPAR (the LTP system of the French national library) has a capacity of 16PB
- they use JHOVE2 for characterization and are building a module for ARC files
- they do not want to characterize and validate the content inside the ARC files, only to identify formats
- PREMIS in METS would be too long, so they will record only metadata at the level of the information package (AIP) that are the same for the whole package; that is, a property is stated once and is followed only by the list of files it applies to, instead of repeating the information for every file
- they created a special metadata format for this
- as a result they can ask the LTP system, for example, "give me all information packages that contain format XY"; there is no need to index the metadata of the contents, which would take a long time; they use the same approach for digitized books
- different DP policies and validation levels for different types of web archive data: complete harvests vs. thematic harvests
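The package-level approach described above (state a property once, then list the files it applies to) can be sketched as follows. This is only an illustration of the idea, not the BnF metadata format; the function names, package identifiers and file names are hypothetical.

```python
from collections import defaultdict


def summarize_package(files):
    """Group file names by format so each format is stated once per package.

    `files` is an iterable of (file_name, format) pairs.
    """
    by_format = defaultdict(list)
    for name, fmt in files:
        by_format[fmt].append(name)
    return dict(by_format)


def packages_containing(packages, fmt):
    """Answer 'give me all packages that contain format XY' from the summaries."""
    return sorted(pid for pid, summary in packages.items() if fmt in summary)


packages = {
    "aip-001": summarize_package([
        ("a.html", "text/html"),
        ("b.html", "text/html"),
        ("c.gif", "image/gif"),
    ]),
    "aip-002": summarize_package([("d.pdf", "application/pdf")]),
}

print(packages["aip-001"])
print(packages_containing(packages, "image/gif"))
```

The query touches only one summary per package, which is why per-file metadata does not need to be indexed.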

NL NZ

- 2 harvests, 20 TB in total
- they are dealing with metadata: how much metadata is a lot and how much is too little
- library policy says that as much metadata as possible must be stored, but that would be a problem in terms of metadata size
- for selective web harvesting they have a finished workflow (WCT); everything is catalogued

IA

- 16 billion URLs
- the oldest from 1996
- growth is 3 TB per day, 1 PB per year

Euan Cochrane, Dirk von Suchodoletz: Replicating Installed Application and Information Environments onto Emulated or Virtualized Hardware

- capturing and preserving a whole environment on emulated HW
- for example, take the prime minister's desktop environment and store it in an archive
- problems with rendering
- computer forensics
- an option for preserving scientific data and records
- the whole thing is about how to replicate an HDD and run the environment on it in a virtual environment
- solution: they pulled the HDDs out of several old PCs > identify the HW requirements (HDD analysis > automatic estimate of the requirements, since they are part of every PC environment) > choose emulation/virtualization SW (a tool registry such as TOTEM from the KEEP project) > convert the HDD into a disk image suitable for emulation > try to boot the disk image on the emulated HW > add drivers
- problems with licences, protection of personal data and authenticity (20% of things change, colours etc.)
- QEMU, SPARC processor emulator


Klaus Rechert: Remote Emulation for Migration Services in a Distributed Preservation Framework

Using emulation as a migration tool.

Migration tools are often unavailable for certain formats. The digital object is placed into the emulated environment (a virtual machine), where it is visible inside the emulated system; it can be opened in the original or another suitable application, saved in a different format and stored back in the virtual machine.

Bram Lohman: Emulation as a Business Solution: the Emulation Framework (KEEP project)

- emulation framework: 7 emulators, 6 platforms (x86, Amiga, etc.), 23 file formats
- a solution for managing emulation tools
- setup of emulation processes
- an environment that contains emulators; if an application or file is loaded into it, it should run as in its original environment
- the environment also includes a tool that, based on PRONOM, shows what format a file is and what environment is needed to run it; that environment can be prepared straight away and the file run in it
- it loads the SW image from the OPF application database, which is being built

Geoffrey Brown: Developing Virtual CD-ROM Collections: The Voyager Company Publications

- publications of one particular publisher on CD, interactive applications for Mac; of the 200 published, only about 50 are available today
- emulation on today's systems
- HDD snapshot directly in the emulator, i.e. it takes one click and is very fast
- SheepShaver emulator

Evaluation of a large Danish migration project

- before 1998 the formats were not prescribed by law
- between 2005 and 2008 they introduced standards
- the evaluation covers the prescribed standards and the migration into them at the national archive
- the evaluation was done for the body that funded the work
- between 2005 and 2008 they spent 30 person-years on migration; they had 10 to 15 people on it; 190 thousand USD was invested, total costs 26 million USD

In reality it is not much data: what they migrated was about 1777GB.

- different parts of the archive: tapes, population data, data on CD-R, registries and fully electronic data
- they could not read all the files, especially those on tapes
- 5 different types of tape
- some had to be rescued at great expense


Total cost of producing the preservation standard: 10 man-years, 12 thousand USD, including the manual and implementation recommendations.

Pilot: planning and managing the project and verifying the information packages.

The aim of the pilot was to arrive at a better budget and project plan.

Some problematic data in old formats, such as old databases, need smart people doing repetitive work that takes a long time; they needed good knowledge management to make this efficient.

Migration method: they wrote requirements for the tool and a description of how manual migration should be done.

Data preparation (restructuring data and registering the metadata of the IPs) and preparing documentation of the migrated IPs.

Software development: in-house development.

They needed 50 person-years per 1

Conclusions

Migrating standard data is cheaper, migrating from some standard tapes is cheaper, and so on.

Most of the costs, 80%, fell on non-standardized data, in building the migration software. Developing a tool for migrating heterogeneous or non-standard data is the most expensive part.

What they learned: their old data had not been analysed well enough.

Their project management was loose and they lost money.

Knowledge management: a good description of the old data and all their types, generations, locations, etc. does not exist at our institution and we will have difficulties with this; migrating old data at NK will be problematic.

Angela Dappert: robust migration workflow for offline media

- What is an archival object (nice slide): a CD is not an archival object; for them it is a hand-held carrier. A bit-stable object is better, since it can have a backup and so on, and it leads to the archival object, which has further metadata (logical preservation)
- A CD is not searchable, cannot be replicated easily, has a large manual overhead, and its rendering technology becomes obsolete very quickly
- Endangered archives project: optical disks, CD-R, external HDs, tapes, 67 terabytes in total
- The offline hand-held carriers in the endangered archives project were very varied; they contained data with DRM, under copyright, and with a range of similar problems
- Options they chose between for each data source:
- a disk image: one file containing everything on the carrier
- or extraction of only some files
- How important is the carrier itself? Do we need information about it? There may be traces of deleted data that we might want to keep. They made disk images of everything possible: hybrid DVDs, audio discs that also held data, etc.
- Which disk image format should they use? Not a single disk image format for all data, but a specific disk image format for each kind
- They used disk copying robots; sometimes large-scale disk copying robots could not be used, because they are good at producing CDs but not at ripping data from them


- They built their own application with disk stacks and some smaller robots; they used LIFO or FIFO and in the end settled on FIFO, because LIFO had problems with the CD lifter

At the KB they thought through a fairly complex workflow, how to describe it and so on.

They had a PC for each robot.

They had problems with a number of things; see the presentation.

They did not find good SW for managing images, only command-line tools, and non-technical staff would make a lot of mistakes with those.

It takes a lot of people before the content gets online.

The staff must be well trained and flexible, but also able to do tedious jobs: systematic, patient.

NOTE: important for NK, where transferring data from discs will also have to be dealt with, and has already been dealt with.


KEEP project, Antonio Ciuffreda: towards an integrated migration environment

- Disk transfer tool: converts a disk into an image file, which also contains further metadata about the file system, the MD5 of the file, etc.
- KEEP is building a Transfer Tool Framework
- Magnetic media: disk transfer tools for floppy disks, commercial and open source
- Disk2FDI: commercial DOS tool; a very accurate floppy disk image, but it takes 1 hour and the result is very large, ten times bigger than the floppy disk itself. Tested on about 2260 disks; they tested emulation
- Catweasel: commercial tool; a PCI card, runs on Linux and Windows XP, has a GUI. High error rate but faster; the quality of the image file was low
- NibTools: free tool; G64 and D64, covers only the C64; DOS, Windows, Linux, but command line only. It needs a Commodore disk drive and special cables. They tested a few disks; about half did not then work in the emulator
- Optical media: they used 5 transfer tools, each on the same CDs, DVDs and games

1. Alcohol 120: commercial; can bypass DRM etc.; supports Windows systems. 12 of 13 worked; 2 MB per second
2. Daemon Tools: commercial; several image file types (ISO, MDS, MDF); supports Windows; 3 of 13 did not work
3. CloneCD: commercial; uses IMG or ISO; bypasses SafeDisc 3 protection; supports Windows; has a GUI; 11 of 13 were OK
4. BlindWrite: commercial; supports DVD, Blu-ray, WM, Xbox and other special discs. Generates ISO and some proprietary image formats; one did not work; download speed
5. ImgBurn: does not read subchannel information into the image file (a film cannot be seeked, etc.); open source; generates DVD, BIN, CUE, IMG; Windows and Linux; 4 did not work; it is fast

Conclusions

For magnetic media, commercial and non-commercial tools do not differ in performance; Disk2FDI is accurate but very slow. KEEP will use NibTools.

Optical media: keep copy protection in mind; the tools perform similarly; there will always be errors in the images, between 30 and 10 percent; BlindWrite can handle game discs (Xbox etc.). KEEP will use ImgBurn because it is open source.

For complex images BlindWrite is better.

Benefit for NK

Consider whether NK should indeed run a project to migrate the content of CDs and DVDs to online media. The speakers present concrete experience with robotic processing and show what problems they had in designing the workflow, choosing the type of ISO image, etc.

BnF web archiving

They have three layers:

Harvest definition: collection

Harvest instance: crawling metadata


ARC files

A collection is, for example, a selective one such as elections 2000; below it come the individual harvest instances and then the ARC files. They collect logs, config and reports. These are also stored in ARC, as special ARC metadata for each crawling instance.

PREMIS

Object, agent, event

Objects: 1. ARC files and metadata ARCs; 2. harvest instances

Harvest event in PREMIS: the event is the creation of the content files

Events: reports as extensions of the event, namely the host report and the harvest report

Agents: administrator, SW, institutions, organizations that perform the harvest

ContainerMD

http://bibnum.bnf.fr/containerMD-v1/documentation/containerMD-v1.html

Special metadata for material from the web archive

http://bibnum.bnf.fr/containerMD-v1

Different SLAs for different types of material and for data from different harvests. With a shared repository they expect various benefits; skills for different formats do not need to be duplicated within the institution.

Next year there should also be a JHOVE2 module for WARC.


Memento: a meta search engine

Benefit for NK

Their web archiving model could be used at NK.

Cost models: Danish national library + TU Wien

- Stephan Strodl, TU Vienna: they have their own cost model, but only small scale; automated preservation action cost
- Danish national library: they built their own model, which is meant to be universal and usable anywhere
- they measured the cost of submission according to the PAIMAS standard
- when calculating costs they use OAIS and PAIMAS, map activities onto these models and then estimate process costs accordingly
- costmodelfordigitalpreservation.dk

Benefit for NK

Relevant to project 0136, which dealt with options for estimating the cost of long-term storage.

Meet RODA, a Full-Fledged Digital Repository for Long-Term Preservation

- Originally a project of the Portuguese national archive, which we have been following for several years. RODA is now supported, and partly further developed, by an independent company. So far RODA supports only the archival metadata format (EAD), but further development should also cover library formats
- RODA is now part of the SCAPE project, where the system can be developed further and scaled for use in mass production
- http://redmine.keep.pt/projects/roda-public

Benefit for NK

Follow further development, possibly also for the INCAD + KNAV projects; for building LTP for smaller institutions this could be an interesting alternative in the future.

Report on a foreign business trip

Name and surname of the participant: Mgr. Andrea Fojtů (AF)

Workplace according to the organizational structure: 1.5 Department of Long-Term Preservation of Digital Data (ODODD)
Workplace assignment: 1.5.2 Department of Digital Repository Content Management
Reason for the trip: participation in the conference Aligning National Approaches to Digital Preservation
Place, city: Tallinn
Place, country: Estonia
Date (from-to): 22-26 May 2011

Detailed schedule:

Monday 23 May
- conference registration, guided tour of the National Library
- Keynote Address by Laura Campbell, Library of Congress, USA
- Panel 1: Technical Alignment
- Panel 2: Organizational Alignment

Tuesday 24 May
- Keynote Address by Gunnar Sahlin, National Library of Sweden
- Panel 3: Standards Alignment
- Panel 4: Legal Alignment
- Breakout Sessions for panels 3 & 4

Wednesday 25 May
- Panel 5: Education Alignment
- Panel 6: Economic Alignment
- Breakout Sessions for panels 5 & 6
- Synthesis
- Closing remarks

Fellow travellers from NK: PhDr. Bohdana Stoklasová (BS), Ing. Tomáš Svoboda (TS)

Funding: IOP NDK

Aims of the trip: attendance at a conference with international participation; gaining contacts in the field of long-term preservation of digital documents and electronic legal deposit; a more detailed insight into long-term preservation of digital documents, (especially) in national libraries.

Fulfilment of the aims (specifically): The conclusions of the conference Aligning National Approaches to Digital Preservation largely repeat the conclusions of the workshop The Future of the Past: Shaping new visions for EU research in digital preservation (report available at http://cordis.europa.eu/fp7/ict/telearn-digicult/future-of-the-past_en.pdf), for example regarding the missing economic model for the commercial sector, which would treat long-term preservation as an integral part of all its processes. Contact was established with staff of the national libraries of Estonia and Finland (staff working on web archiving and on long-term preservation in general).

Programme and further detailed information: The main aim of the conference was to align national approaches to long-term preservation of digital documents across all areas, from technical, organizational and educational to standardization, economic and financial.

Materials brought back: conference programme, leaflets of exhibiting companies (Tessella, Equella, Guardtime) + other materials, notes

Date the report was submitted: 8 June 2011
Signature of the person submitting the report

Supervisor's signature

Posted on the intranet
Received by the international department

Appendix to this report: conference notes in English

Appendix: conference notes in English

Exploring What We Can Do Together: Strategic Alliance for International Collaboration, Laura Campbell

- 185 digital preservation partners in more than 25 countries (education, research, LAM)
- strategic goals: National Content Stewardship Network (national digital collection, technical architecture, public policy, outreach)
- NDIIPP Content Domain Map: a mind map of geospatial, audiovisual, image & text and web content
- then & now: cognitive surplus vs. digital libraries/digital preservation
- solution framework: actively working together, special interest groups, establishing a common index, international digital collection (freely available)

PANEL 1: Technical Alignment (The role of testing), Prof. Dr. Michael Seadle (Panel Head)

to collaborate on requiring and implementing rigorous and independent tests

DNB Contribution to the Tallinn Alignment, Sabine Schrimpf

- key theme is infrastructure: a network of hardware and software that permits the operation of application SW
- the question of interoperability is crucial (standards, technical specifications)
- SW elements
- components of the DP infrastructure were compared to the pallets at railroads
  o source: PARSE.Insight Roadmap 2020
- persistent ID resolvers, certification process
- kopal (ingest, koLibRI)
- nestor (German Network of Expertise)
- DP4Lib: Digital Preservation as a Service; reduce dependency between components
  o redundant storage at different locations
  o koLibRI modules
- LuKII: set up as an economical LOCKSS network in Germany
- SHAMAN
- APARSEN wants to bring coherence and cohesion to digital preservation research
  o trends in DP research projects
  o modular DP systems
  o distributed as SOA
  o elimination of technology dependencies

EDINA: The UK LOCKSS Alliance, Adam Rusbridge

- EDINA offers underlying technical support & coordination
- threats to digital stewardship
  o failure (media, HW, SW, network, format obsolescence, natural disaster, economic/organizational failure)
  o attack (insider/outsider)
  o operator error
- source: Requirement to Digital Preservation
- projects PEPRS and PECAN help identify coverage and requirements for DP

Public testing, Michael Seadle

- traditional physical archiving relies heavily on trusted institutions
- distrust, not trust, needs to be the basis of digital archiving; testing therefore plays a key role
- goals of testing: demonstrate functionality, reveal weaknesses, provide data for planning improvements
- key issues for testing: integrity, authenticity (can the origin or genuineness be shown), usability (can migration/emulation be demonstrated), access, financial integrity
- DrWho (drwho1com)
- bit stream testing is the most important; authenticity and usability may be impaired
  o the type of storage media, the number of copies and the frequency of checking and replacement get us to the relevant results
  o no reliable metrics exist, however (what is an acceptable loss, etc.)
- without well documented, peer reviewed, publicly available test results, librarians are buying archiving systems on faith

Presentation without a title, Andy Rauber

- evaluation vs. testing vs. benchmarking
- DP today: evaluation rather than testing, and far from benchmarking (a few tests, but nowhere near a definition of benchmarks)
  o existing evaluations are not repeatable
  o focus on the simple things
  o building the frameworks before having clear test scenarios
- necessary to move towards comparative benchmarking
- what is needed: a commitment that we want a culture of benchmarking and comparative evaluation; an understanding of what we want to benchmark; benchmark data + ground truth; measurement scales and measures that remain constant; knowledge bases to collect these

Organizing digital preservation on an international level, Michelle Gallinger (NDIIPP)

- focus on a national DP agenda
- community driven, action oriented (National Digital Stewardship Alliance)
  o present a distributed national digital collection for the benefit of citizens

The European Research Arena, David Giaretta

- technologies: GEANT, EGEE/EGI
- EU research projects: TIMBUS, BLOG4EVER, SCAPE, ENSURE, APARSEN, ARCOMEM, WF4EVER
  o SCIDIP-ES (2011-2014)
- Alliance for Permanent Access (APA), formed as a legal entity 3 years ago
  o opportunities for networking
- ISO 16363: Audit and Certification of Trustworthy Digital Repositories
- ISO 16919: Requirements for Bodies Providing Audit and Certification

Observation from the MetaArchive Cooperative program Martin Halbert

- distributed DP programs differ from other programs
o replication of content, distribution of these replicated copies to distinct geographical locations, and network organization to connect these replicated copies
- MetaArchive established in 2003, funded by NDIIPP
o seeks to foster broader awareness of digital preservation issues
- IIPC members are all institutions that focus on WA
o 39 members: national and university libraries + other organizations (Internet Archive)
o ISO standard WARC format for web archives + Heritrix and NutchWAX
o growing membership (Africa & South America)

DAY 2

Keynote address
International and National Collaboration in the digital age Gunnar Sahlin

- 2012: a new law for e-deposit
- Samsök (search together) in 2005 (upgrade, new system)
- Swepub and long-term preservation
- consortium of the Swedish research libraries for licensing e-journals and databases (ICOLC)
- Open Access and e-publishing (all universities have their repositories for e-pubs)
- NL as aggregator for Europeana, TEL, Apres, Athena, EUscreen
o common system for the preservation of digital materials
o common search portal for materials from the Swedish National Library and the Swedish National Archive

Raivo Ruusalepp

- standards: RAC, DSA, CIDOC (CRM), PAIMAS, ISA (DG), DDI
- use of information security standards for digital preservation
- information security: administrative and technical (physical = data security vs. IT = communication)
- company-implemented security measurement with typical cyber-crime scenarios
- survey of security
o provision for information security in national legislation and development plans (12 of the respondents ISO 27000 series, only 2 formal audits, the rest are looking into it; ½ of the respondents do not use standards or formal measurements to control information security)
o IT & disaster plan 65 (data recovery from the off-site location tested 0)
- alignment
o better use of community standards for information security and preservation
o agreement on security requirements

Standards based approach to preservation planning MatthewWoollard

- ISO 27001: very expensive implementation (100 000 pounds)
- Basic Data Seal of Approval Guidelines (helps you understand your business better)
- Audit and Certification of Trustworthy Digital Repositories
- Memorandum of Understanding to Create a European Framework
- ISO 16363 external; DSA or ISO 16363 self-audit

Best Practices & Standards Bram van der Werf

- self-assessment trust
- audits/certification trust
- ISO 30300 (draft) Record Management

PANEL 4 Legal Alignment

Legal deposit & Web Archiving Adrienne Muir

- legal deposit provisions: purpose, scope, deposit mechanisms, roles & responsibilities & liability, access provisions, sanctions
- implementation: definitions, scope (offline/online, freely available/pay wall), technology-neutral, incremental
- legal deposit vs. voluntary (interim/hybrid approach, model agreements and licenses, flexibility)
- other legal issues: intellectual property rights, preservation, access, unlawful material, privacy/data protection
- voluntary approaches have disadvantages but may be necessary and can be useful

Breakout Session

- standards bring alignment by themselves, but only if you use them to the full, not halfway
- depends on the community (users): enforceable or voluntary compliance (standards)
- next steps for standards alignment
o corpora as benchmarks
o export/import completeness
o educational standards
o validation tools
o accredited training courses to accredit auditors
o framework standards

DAY 3

PANEL 5 Educational Alignment

- key elements of the DCC Curation Lifecycle Model
- Framework for the Education Alignment (USA: grad programs for digital preservation, new models for grad programs, internship programs (diverse knowledge), workshops)
o sharing tools
o national programs
- issues related to digital preservation
o nature of costs and business models
o strategies for selection & appraisal
o ground roles and responsibilities
o effectiveness and demand for services
- focus of the panel: factors influencing the actual sustainability of a digital archive
- 2 considerations: collaboration + user demand
- challenges + gaps: span national boundaries, public + private funding, education, exportation, DP certification, competition, funding gaps, policy, selection criteria, roles & responsibilities, standards
- Magazzini Digitali: e-legal deposit in Italy
- PADI: a failed project (discontinued in 2010)

Presentation without a title Neil Grindley

- how much does it cost to manage information? what institutional financial strategies are required to facilitate effective preservation? what general economic frameworks are required to enable information to persist and be accessible?
- JISC 2010 Infrastructure for Education and Research Programme
- archival storage and preservation activities constitute a very small proportion of the overall costs: 1531 access 55 outreach acquisition ingest
o approx. 333 Euros for a set of 1000 records
- KRDS2 p. 83: future tool development supporting automation of ingest

Sustainable Preservation in North America ADPNet amp Friends Aaron Trehub

- solution: distributed digital preservation (in at least 3 copies vs. LOCKSS 6 copies)
- DDP + LOCKSS = PLN; open SW developed at Stanford
- MetaArchive, COPPUL (Canada), ADPNet wwwadpnorg
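Why distributed preservation asks for at least 3 copies can be sketched with a simple majority vote over the digests reported by each node. This is a deliberately simplified illustration, not the actual LOCKSS polling protocol; the function name and the node labels are hypothetical.

```python
from collections import Counter
from typing import Dict, List, Optional, Tuple


def vote(digests: Dict[str, str]) -> Tuple[Optional[str], List[str]]:
    """Return (winning digest, nodes that disagree with it).

    The winner is None when no digest has a strict majority, in which
    case every node is reported as needing attention.
    """
    if not digests:
        return None, []
    counts = Counter(digests.values())
    winner, votes = counts.most_common(1)[0]
    if votes * 2 <= len(digests):
        return None, sorted(digests)
    return winner, sorted(node for node, d in digests.items() if d != winner)
```

With two copies a disagreement cannot be resolved, while with three a single damaged copy is outvoted and can be repaired from the others.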


3

KUPINY IIPC WG PRO PRESERVATION V

JHOVE 2 BN firmou ATOS Origin

WARC nebo html Bude se jednat o tom aby z

Jhove 1

ziskat s webarchivu Tzn Nainstalovat jhove2 v

infrastruktPokud bychom byli schopni proces validace monitorovat z

ARC > WARC MIGRATION The debate on migrating ARC to WARC; it is a preservation action-

Compare ARC and WARC and decide accordingly

4

A PRESERVATION GROUP

PRESERVATION METADATA FOR THE WEB ARCHIVE

BnF is working on implementing PREMIS for WA, either

ARC or WARC

s

B ARCHIV

vice s

NK

pro IOP ingest dat z WA

ZIK SOFTWARU A FORM

dokumentace k

5

httpwwwignaciogccomnetpreserverisksphp

M E-DEPOTU

E-

- they are in (journals, digitized masters and papers, also WA)

- a digitized master in 20 s from tape

-

-

-

-

- -

k

-depot a budou k dispozici v AJ vsem

e-depot

- E-depotu

-

documentation of processes in the e-Depot for reporting, management, IT, cost analyses, etc.

6

V - -depotu

digitalizace v

-

-

-

xy atd) tak arconu process data store i pro management a CRM

Data model - follow OPF and what they develop

NTACE

filtrov f -

Projekty SCAPE a KEEP - ti v

EU ever

ISO standard for WA; how much WA content in the Czech Republic

COST, Recollection tool (loc gov), SEE WEB, quality assurance in the context of WA

Marketing for WA -

7

archiv le ights speci

-

Popularization of WA among technicians and IT communities

collection policy digital strategy

chaosu daleko

a pak j v v KB

8

-

Salt Lake City USA

Date (from-to) -

14.5. departure from Prague; 16.5. PREMIS workshop; 17.-19.5. Archiving 2011 conference; 20.5. departure from SLC

15.5. departure Washington-Detroit-Salt Lake City; 15.5. workshops T1D Color in Image Capture Archiving and T2A Scanner & Camera Imaging Performance: Benchmarking, Compliance and Workflow Monitoring; 17.-19.5. Archiving 2011 conference

NK, Archiving conference, IOP-NDK

0136

NA 6.6.2011

Date 6.6.2011 Signature Date Signature

Date Signature

tutorial PREMIS

---------------------------------------------------------- Implementation of a High Performance Architecture for Managing and Storing Web-Harvested Collections Michael Smorul and Joseph JaJa University of Maryland (USA)

httpswikiumiacsumdeduadaptimages66bArchiving11-smorulpdf

- web archiving - WARC manager application

Using Tape for Large-Scale Digital Preservation Gary Wright FamilySearch (USA)

- (The Church of Jesus Christ of Latter-day Saints)

-

NDK - DRPS Ingest Tools; storage type: storage grid, NetApp FAS3170

information lifecycle management; optimization of the storage layer between the Grid and Rosetta

- human error - HW errors - data integrity validation - maximization - integrity verification

metadata conten

Moving On When it is Time to Re-Archive Michael Selway Quantum Corporation (USA) CHW) D

jinak to nejde u HDD

Migrace dat z Jak se na to migrac

- HDD - - roky

a mohl by s

- - - - -

co s split migration (full split migrace)

- - - - potaz

FamilySearch An End-to-End Process for Scanning Characterizing Preserving and Providing Access to Very Large Collections of Vital Records Tom Creighton FamilySearch (USA) Jonathan Tilbury Tessella plc (UK) and Mark Evans Tessella Inc (USA)

- At FamilySearch: microfilms from the mid-19th century

- DPS je jejich s - -

The Audit and Certification of FDsys

- FDsys -auditem

TRAC httpwwwgpogovfdsys

- - TRAC hathi UNT Portico metaarchive chronopolis v

- stupnice compliancy 1- -

How Long is Long-Term Data Storage (Focal) Barry M Lunt Brigham Young University and Douglas Hansen Wayne Rust and Mark Worthington Millenniata Inc (USA)

- - FLASH technologie 10- plny elektron

odlivu elektronu a

-

and yet errors appeared

- Library 1 21 - Library 2 18

- 1050 years - HDD 1-7 years -

Quality Assurance of Digital Information in Long-Term Digital Preservation University of Technology (Sweden)

- significant properties - dokumentech z -

Towards Interoperable Preservation Repositories Repository Exchange Package Use Cases and Best Practices Joseph Pawletko New York University and Priscilla Caplan Florida Center for Library Automation (USA)

- TIPR

- - - TIPR

-

- RXP: METS and PREMIS semantics; metadata files not inside the METS but alongside it

- - succession - - - - disaster recovery

as AIP - SW migration - - diversification - data migration -

SARKK Comprehensive Digital Archive Services for Finnish Municipalities

-Savon Tietohallinto Oy (Finland)

- firma kterou v ICT long-

- - - preserv planning funkcionalitou -

-

Magnetic Tape Technology economic advantages for preservation Gary Francis Oracle USA

- Oracle presentation - - Clipper Group 2010, in search for the long-term archiving solution: tape delivers significant TCO advantage over disk - 5TB on 1 cartridge (Sun/Oracle)

-----------------------------------------

Color In Digital Preservation Robert Buckley University of RochesterNewMarket Imaging Steven Puglia National Archives and Records Administration and Michael Stelmach Library of Congress (USA)

-

atd -

o barev GAMUTem

AdobeRGB Pro FotoRGB

- a reprodukci barvy

Multispectral Image Archiving of Watermarks in Historical Papers Peter Meinlschmidt Wilhelm-Klauditz-Institut Fraunhofer-Institute for Wood Research and Volker

Braunschweig (Germany)

- Reflected light - Transmitted light

-spectral imaging -

o

o o o o

Implementing a Quality Assurance Program for Monitoring Scanner Performance Michael J Horsley and John T Berezich National Archives and Records Administration (USA) DAITSS

- -

mikrofilm sken a r - NARA 2004 Guidelines

httpsdocsgooglecomviewerurl=http3A2F2Fwwwarchivesgov2Fpreservation2Ftechnical2Fguidelinespdf

- Metamorfose - etc., see slides - quantitative performance slides - web-based database, SharePoint -

Preservation in a Digital Age Jay Verkler FamilySearch (USA)

- Tessellou pro SDB budou k 1 - - - data loss is intrinsic gt - - - och

- preservation as a service

Curation of the End-of-Term Web Archive Classification and Metrics Kathleen Murray Lauren Ko and Mark Phillips University of North Texas (USA)

- - eotcd archive httpresearchlibraryuntedueotcdwikiMain_Page - - 16TB of data - - (not WARCs)

pak to pospojo

DAITSS Grows Up Migrating to a Second Generation Preservation System (Focal) Priscilla Caplan and Carol Chou Florida Center for Library Automation (USA)

- DAITSS 1 could not be installed anywhere else -) -

instructions

- v procesech

- - PSP

- refresh funkce - disseminate- do a refresh and export new AIP as DIP - withdraw remove AIP from storage retaining provenance

-

1

- - 80TB -

A Community Driven Micro Services Architecture Supporting Long Term Digital Preservation Mark Evans and Bill Steel Tessella Inc (USA) and Robert Sharpe James Carr Alan Gairey and Jonathan Tilbury Tessella plc (UK)

s NDIIPP v micro-services

spojeno do procesu- lze pak libovol

- - - -

-

workflow - -

- bit-streamu - cloudu (S3

httpawsamazoncoms3) SaaS funkcionalitu bude brzy ti tenancy

funkcionalitou policy apod

Note from the USA

44TB of SIPs in the test (10MB JP2 files)

SDB community - - founded 2008 - -

System

Lisa LaPlant and Blake Edwards US Government Printing Office (USA) FDsys OAIS dig MODS - z

<part>2</part>

DMD data model definition z

httpwwwgpogovfdsyssearchpagedetailsactionst=pragueampgranuleId=amppackageId=DCPD-200900228

----------------------------------------- Preservation Starts from the Beginning Michael Wash US Department of Transportation (USA)

Autographic Kodak 1916 na druhou

following the No. 3A Autographic Kodak Special of 1916, which was the first rangefinder camera. It had a Kodak Anastigmat f/6.3 lens and a Kodamatic shutter with speeds from 1/2 to 1/200 sec. plus bulb and time mode. httpcamerapediawikiacomwikiNo_1A_Autographic_Kodak_Special Kodak Advantix

zastavil

Colorite A Flexible Cross-Platform Software Solution for Automatic Image QualityAnalysis Using Arbitrary Targets Henrik Johansson National Library of Sweden (Sweden)

- a - -

o the target is detected automatically

o TIFF, JPEG, JP2, PNG
o state-of-the-art feature-based image matching algorithm, ImageMagick

o can be switched off in the future to speed up the process for BATCH

proce o

Henrika Johanssona What if the Image Quality Analysis Rates My D Dietmar Wueller Image Engineering (Germany)

- Golden Thread - UTT target -

o

-lens reflex

Establishing Resolution Requirements for Digitizing Transmissive Content A Use Case Approach Michael Stelmach Library of Congress Don Williams Image Science Associates LLC and Steven Puglia National Archives and Records Administration (USA)

-

- Based on 10 SFR limiting resolution criteria how much the image information will be captured

o 1st half of the 20th century: 1200-1600 PPI
o 2nd half of the 20th century: up to 2800 PPI

Digitise More Pay Less Optimising the Workprocess for both Heritage Institute and Imaging Provider Olaf Slijkhuis Pictura Imaginis (the Netherlands)

- ument kvalita parametry atd - -

- trolu

obrazu

debate with FamilySearch 20.5.2011 -------

also on Digital Preservation issues

will have to provide free of charge

and synchronize

an effort on their part to cooperate with archives and libraries

CO-FINANCED FROM EU STRUCTURAL FUNDS (EUROPEAN REGIONAL DEVELOPMENT FUND) THROUGH THE IOP

Business trip report
Project: Creation of the National Digital Library (Vytvoření Národní digitální knihovny)

CZ 10611000706386

Name and surname of the participant: Jan Hutař

Workplace according to the organizational structure: ODF 81

Position: head of department

Purpose of the trip: attending the iPRES 2011 conference

Place (city): Singapore

Place (country): Singapore

Date (from-to): 30.10.-5.11.2011

Detailed schedule:
30.-31.10. flight Prague > Dubai > Singapore
1.11. start of the conference, tutorials
2.11.-4.11. conference
4.-5.11. return flight Singapore > Dubai > Prague

Fellow traveller from NK: Mgr. Marek Melichar (funded from project 0136)

Funding: IOP, Creation of the National Digital Library

Relation to the project: gaining new information on digital preservation and on projects in other libraries; consultations with colleagues and companies

Goals of the trip: see relation to the project; use all outputs for planning and running the NDK project and for future solutions to digital preservation at NK/NDK

Fulfilment of the trip goals: fulfilled; see the detailed notes below and the proceedings at SPS


Further detailed information: SUMMARY AND BENEFIT FOR THE NDK PROJECT

- a noticeable rise of emulation as an approach to long-term preservation (in past years it was migration); the two approaches seem likely to complement each other
- a shift towards preserving complex data, databases, etc.; NK is not addressing this yet
- many papers are applicable to NK and NDK as well (web archiving and preservation at the national library of France, audits, emulation, information on the SDB system (Tessella) and on the RODA system, certification, see Rouchon, etc.)
- information on problems and solutions when using tools such as JHOVE, PRONOM, etc. in a real environment
- 2 papers on backing up optical discs, a current problem at NK too; ideally follow the procedures described in the proceedings
- a clear need for international cooperation and for adherence to standards so that such cooperation is possible

for details see below

Support of project publicity: NA

Related materials

Material / Place of storage

conference proceedings: SPS, folder with business trip reports

Date of report submission: 15.11.2011

Signature of the person submitting the report

Date Signature

Signature of the superior 15.11.2011

Uploaded to the intranet


Received at the international department

Seamus Ross digital curation and preservation
- preserving data sets: use of statistical databases and research data vs. preservation of textual information
- personal data: no longer just photos in a box (Flickr etc.)
- banks: a lot of personal data, future use by historians; it needs to be kept, and institutions do keep it
- see mckinseycom big data full report pdf
- why do DP, then (slide 16): future generations expect it; for historians and scientists, so that they have some sources; a reference about the present for the future; information ecosystem to enable storytelling
- emphasis moving from the preservation of textual information to the preservation of complex databases

A capability model for DP Ch Becker et al
- research within the SHAMAN and SCAPE projects
- SoS: systems of systems
- 3 kinds of systems (UML)
- DPS as a functional requirement
- SoS: business system, a system within a system; data then flow into the DPS
- DPS where DP is not a functional requirement but the system does it anyway (DP-ready system): a business system with DP functionality
- but how to get DP into enterprise systems: a model for implementing DP in any system, within the SHAMAN project: capability-based reference architecture
- governance, business and technical (operation) capability: a basis for decisions and for assessing the current state
- capability maturity model (CMM): process assessment and improvement, as in SW development

Olivier Rouchon certification and quality at CINES
- they store theses, digitized items, multimedia, documents, scientific data sets
- a data centre for the whole of France
- they have experts on formats and XML; 11 people
- 15TB of data
- certification: national law; CINES is the national centre for DP of theses; they have a department, people and money for it; procedure and preparations see below
- preparation for certification: testing DRAMBORA, DSA, TRAC, ISO 16363 and ISO 16919
- step: 2009 DRAMBORA audit; 2 risk reviews per year, checking how their resolution is progressing
- step: formalization of business processes, 14 processes according to ISO 9001
- management, operational and support processes (presented at iPRES 2010)
- 2009 external pre-audit: 2 people, 19 man-days, based on all available standards, by checking documentation and holding interviews
- 2010 SIAF audit: 4 months; carried out by the National Archives of France for every archive that stores public data; they audit every 3 years; the report had 800 pages


- 2010 Data Seal of Approval, part of the EU framework for audit and certification of trusted repositories (MoU between the three certification activities)
- 2011 within the APARSEN project they also did ISO 16363; together with DANS and UKDA they went through that audit
- first internal, January and April 2011 (60 man-days), then external: 12 experts (KB, BL, NASA, etc.) in June 2011 (3 days)

MoU: DSA > ISO 16363 as the second step (internal) > ISO 16363 extended
The audit gets faster the more of them you do, i.e. if it is done regularly it is not so time-consuming

The National Library of New Zealand passed TRAC certification in autumn 2011

Andreas Rauber impact of preservation actions on the repository
- what happens to the repository itself
- repository simulation: RepoSim
- for analysis and for testing migration: what happens when files grow, when the repository holds more types of formats, etc.
- RepoSim: a flexible simulator, irregular patterns
- so far an internal version: Hibernate, Java, MySQL
- one can specify which formats are accepted and their description, ingest, settings of hypothetical tools (mainly for migration), and rules for preservation actions (migration to which format, which version, which files, how many times; rules + filters)
- a virtual migration can be run; it produces graphs that show how long it will take, etc.
- what to migrate, how, to what and over what period: it runs virtually and we see the result
- good for IT and HW planning
- good for planning different scenarios, comparing with the expected development, planning HW development and investment
- still to be finished: the option to enter deletion policies, reports, etc.
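The kind of question such a simulator answers (how long will a migration take once the holdings have grown?) can be sketched in a few lines. This is not RepoSim code; the class, the growth model with fixed yearly rates and all parameter names are hypothetical and only illustrate the idea of a "virtual migration".

```python
from dataclasses import dataclass


@dataclass
class FormatHolding:
    name: str       # format label, e.g. "TIFF"
    files: int      # number of files held in this format
    avg_mb: float   # average file size in MB


def migration_hours(holding: FormatHolding, throughput_mb_s: float) -> float:
    """Hours needed to migrate all files of one format at a given tool throughput."""
    total_mb = holding.files * holding.avg_mb
    return total_mb / throughput_mb_s / 3600


def project(holding: FormatHolding, years: int,
            file_growth: float, size_growth: float) -> FormatHolding:
    """Grow file count and average size by fixed yearly rates (assumed model)."""
    return FormatHolding(
        holding.name,
        round(holding.files * (1 + file_growth) ** years),
        holding.avg_mb * (1 + size_growth) ** years,
    )
```

Comparing `migration_hours(h, rate)` with `migration_hours(project(h, 5, 0.2, 0.1), rate)` gives the sort of planning figure for IT and HW that the notes describe.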

José Barateiro Risk assessment in DP of e-science data and processes
- DP as risk management
- ISO 31000: definition of risk management
- similar to DRAMBORA
- there are many standards for risk management
- elaboration of the ISO 31000 methodology into individual steps
- TIMBUS project httptimbusprojectnet; one of the partners is SAP (Germany)

mad talks
- open source SW for LTP: RODA is back, being developed within the SCAPE project; new functions, plans for further development and for building a user community
- 4 posters on emulation: emulation within KEEP, emulation for library reading rooms, OPF ecosystem, registry
- TOTEM: a metadata standard for describing the technical environment for emulation


ANDS Ross Wilkinson
- data centres in Australia, at least 3, for different areas of life
- ANDS has existed for almost 3 years; money from the Australian government
- huge amounts of data will never be used or read by a human, only by automated processes (data mining)
- research data must be stored and preserved because it may no longer be possible to create them again; preserved so that they can be reused, so that new conclusions can be drawn from them, and so that scientists have them available
- it has to be done in cooperation; it cannot be done by a single institution alone
- who deals with storing scientific data in the Czech Republic: Academy of Sciences, CESNET
- similar data centres also exist in Great Britain

Rob Sharpe Considerations for High Throughput Digital Preservation
Presentation by Tessella: their performance testing of ingest into SDB at FamilySearch

- SDB has been in development since 2002, when the first customer was the National Archives UK
- new customer: UK Parliament
- test with FamilySearch
- 20TB ingest per day; scanned materials; workflow with antivirus, characterization (PRONOM, JHOVE), etc.
- 1 package is roughly 1GB; 20 thousand packages per day
- 2 Dell PowerEdge R710 servers, together costing at most 20000 pounds
- the limiting factor turned out to be the read speed of the disks on which the data are stored at the start of ingest; they therefore needed 130 parallel disks (50 thousand pounds)
- stored on tape, which is also slow; they therefore needed 8 parallel tape writes (30 thousand pounds)
- storage costs 100 pounds per TB
- 7.3 PB per year
- conclusion: writing and reading are slow; tools such as JHOVE and PRONOM are fast enough; storage costs also proved to be high
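The figures in the list above are mutually consistent, which a short calculation shows. This is a worked-arithmetic sketch; decimal units (1 TB = 10^12 bytes, 1 PB = 1000 TB) and a 365-day year of continuous ingest are assumptions, since the notes do not state them.

```python
TB = 10**12                      # decimal terabyte in bytes (assumption)
SECONDS_PER_DAY = 24 * 60 * 60


def sustained_mb_per_s(tb_per_day: float) -> float:
    """Average transfer rate in MB/s needed to move tb_per_day within one day."""
    return tb_per_day * TB / SECONDS_PER_DAY / 10**6


def packages_per_day(tb_per_day: float, package_gb: float = 1.0) -> float:
    """Number of packages per day for a given package size in GB."""
    return tb_per_day * 1000 / package_gb


def pb_per_year(tb_per_day: float, days: int = 365) -> float:
    """Yearly volume in PB if the daily rate is sustained every day."""
    return tb_per_day * days / 1000


def yearly_storage_cost_gbp(tb_per_day: float, gbp_per_tb: float = 100.0,
                            days: int = 365) -> float:
    """Storage cost of one year of ingest at a flat price per TB."""
    return tb_per_day * days * gbp_per_tb
```

For 20TB per day this gives roughly 231 MB/s sustained, 20 thousand 1GB packages per day and 7.3 PB per year; at 100 pounds per TB, one year of ingest would cost about 730 thousand pounds to store.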

For ingesting data from the FamilySearch project they needed to ensure a throughput of 20TB of data per day while keeping adequate procedures for processing the data according to the requirements of OAIS and of the client. The aim of the project was to identify the bottlenecks of ingesting large volumes of data.

Processes such as generating hashes or checking them, identifying formats and extracting technical metadata usually require a fast storage system when volumes are large. In the FamilySearch project they want to ingest into SDB (content acquisition, content preparation, ingest, fixity check, content metadata integrity check, characterization, i.e. identification and validation of formats and extraction of technical MD) at most 700MB per second.

They looked at how to parallelize such a massive workflow efficiently while minimizing costs. According to their findings, parallelization makes it possible to get around performance problems of tools such as DROID and JHOVE; contrary to their expectations, overall software performance was not a problem. The bigger problems are in the HW, which has to be able to write fast enough.

I.e. the bottleneck was in the HW and in moving data from place to place rather than in the performance of the digital preservation tools.
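The parallelization described above can be sketched for one of the steps named in the notes, hash generation. This is not how SDB does it; the thread pool, the worker count and the function names are assumptions for illustration only.

```python
import hashlib
from concurrent.futures import ThreadPoolExecutor
from pathlib import Path
from typing import Dict, Iterable


def file_digest(path: Path) -> str:
    """SHA-256 of one file, read in 1 MiB chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(1 << 20), b""):
            digest.update(chunk)
    return digest.hexdigest()


def digest_many(paths: Iterable[Path], workers: int = 8) -> Dict[str, str]:
    """Hash many files concurrently; results keep the input order."""
    items = list(paths)
    with ThreadPoolExecutor(max_workers=workers) as pool:
        digests = list(pool.map(file_digest, items))
    return {str(p): d for p, d in zip(items, digests)}
```

Adding workers only helps while the storage can deliver data to them, which matches the finding that disk and tape speed, not the tools, limited the ingest.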

Benefit for NK

No need to fear the performance of SW such as DROID or JHOVE

Ross King Evolving domains, problems and solutions for LT DP
- information on projects such as SCAPE
- programme httpcordiseuropaeufp7icttelearn digicultreport research digital preservation_enpdf (Stephan Strodl, Petar Petrov, Andreas Rauber, all Vienna University of Technology, Austria): a nice timeline of preservation projects, a whitepaper about the past of European DP
- funding spent on DP research is gradually growing; projects and money do not solve anything by themselves
- ARCOMEM: archiving for web archives, socially driven web preservation model
- social web analysis
- archive enrichment
- ENSURE: evaluation between cost and value, automation of the preservation cycle; testbeds: healthcare, clinical trials, financial services
- SCAPE
- preservation planning and action workflows: how to make them scalable
- building an infrastructure for scalable preservation actions
- developing a policy-based preservation planning tool with automatic preservation watch
- 3 testbeds: WA, large-scale repositories, research data sets
- all the projects will produce prototype SW
- digital lifecycle approach
- preservation planning plays a role in all these projects, together with virtualization
- slide with trends in DP over recent years
- Research on Digital Preservation within projects co-funded by the European Union in the ICT
- Ensure
- Scape
- Wf4Ever httpwwwwf4ever projectorgabout
- Timbus: SW is not enough; it focuses on the context of organizations; LTP is not only about objects but also about services, etc.
- Totem

Benefit for NK

Follow projects in the area of long-term preservation of digital data. The latest EU projects such as SCAPE will speed up the development of concrete tools for long-term preservation of digital data.

Record keeping in temporary command settings Erik Borglund
- preserving documentation of crisis situations arising from the activities of the police etc.
- how to capture context: flipcharts, videos and written notes can be kept, but what about the context; for analogue documents it is not a problem, the problem is with digital items and conversations
- the national archive should take care of this, but it only takes paper documents or e.g. photos from the meeting place
- question: is archiving records material the same as archiving the course of a meeting in digital form


webarchiving session

BnF
- 200TB of web-archived data
- 15 million ARCs; they have to be characterized and validated, which is time-consuming
- they store them in a shared repository
- SPAR (the LTP system of the French national library) has a capacity of 16PB
- they use JHOVE2 for characterization and are building a module for ARCs
- they do not want to characterize and validate the content of ARCs, only to identify formats
- PREMIS in METS would be too long, so they will record only metadata at the level of the information package (AIP) that are the same for the whole package; i.e. a property is stated once and then only the information about which files match it is added, instead of repeating that information for every file
- they created a special metadata format
- i.e. they are able to ask the LTP system "give me all information packages that contain format XY" etc.; there is no need to index the metadata of the contents, which would take a long time; they use the same approach for digitized books
- different DP policies and levels of validation for different types of WA data: complete harvests vs. thematic harvests
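The package-level approach described above (state a property once, then list the files it applies to) can be sketched as a small data structure. This is not BnF's metadata format; the function names, the dictionary layout and the example identifiers are hypothetical.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, Tuple


def group_by_format(files: Iterable[Tuple[str, str]]) -> Dict[str, List[str]]:
    """Record each format once and list the file identifiers that share it."""
    groups: Dict[str, List[str]] = defaultdict(list)
    for file_id, fmt in files:
        groups[fmt].append(file_id)
    return dict(groups)


def packages_containing(index: Dict[str, Dict[str, List[str]]], fmt: str) -> List[str]:
    """Answer 'give me all packages that contain format X' from package-level records."""
    return sorted(pkg for pkg, formats in index.items() if fmt in formats)
```

A query such as `packages_containing(index, "image/gif")` then only touches one short record per package instead of per-file metadata for millions of harvested files.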

NL NZ
- 2 harvests, 20 TB together
- they are dealing with metadata: how much metadata is a lot and how much is too little
- library policy says that as much metadata as possible must be stored; that would however be a problem in terms of metadata size
- for selective web harvest they have a finished workflow, WCT; everything is catalogued

IA
- 16 billion URLs
- the oldest from 1996
- the growth is 3TB per day, 1PB per year

Euan Cochrane, Dirk von Suchodoletz Replicating Installed Application and Information Environments onto Emulated or Virtualized Hardware

- capturing and preserving a whole environment on emulated HW
- e.g. take the desktop environment of the prime minister and store it in the archive
- display problems
- computer forensics
- an option for preserving scientific data and records
- it is all about how to replicate an HDD and run the environment that is on it in a virtual environment
- solution
- they pulled HDDs out of several old PCs > identify the HW requirements (HDD analysis > estimate of the requirements automatically; it is part of every PC environment) > choose emulation/virtualization SW (tool registry such as TOTEM from the KEEP project) > convert the HDD into a disk image suitable for emulation > try to boot the disk image on emulated HW > add drivers
- problems with licences, personal data protection, authenticity (20 things change, colours etc.)
- QEMU, SPARC processor emulator


Klaus Rechert Remote Emulation for Migration Services in a Distributed Preservation Framework
- using emulation as a tool for migration
- tools for migrating certain formats are often not available
- we put the digital object into an emulated environment (virtual machine); we then see it in the environment of the emulated system, can open it in the original or another suitable application, save it as a different format and store it again in the virtual machine

Bram Lohman Emulation as a Business Solution: the Emulation Framework (KEEP project)

- emulation framework: 7 emulators, 6 platforms (x86, Amiga, etc.), 23 file formats
- a solution for managing emulation tools
- setup of emulation processes
- an environment that contains emulators; if we load an application or a file into it, it should run as in the original environment
- the environment also contains a tool that shows, for a file, what format it is and what environment is needed to run it, based on PRONOM; the environment can be prepared right away and the file run in it
- it loads SW images from the OPF application database, which is being built

Geofrey Brown: Developing Virtual CD-ROM Collections: The Voyager Company Publications

- publications of one particular publisher on CD, interactive applications for Mac; of the 200 published, only about 50 are available today
- emulation on today's systems
- HDD snapshot directly in the emulator, i.e. it takes one click and is very fast
- SheepShaver emulator

Evaluation of a large Danish migration project

Before 1998 formats were not prescribed by law. Between 2005 and 2008 they introduced standards. The evaluation concerns the prescribed standards and the migration into them at the national archive. The evaluation was done for the funder. Between 2005 and 2008 they spent 30 person-years on migration, with 10-15 people working on it; 190 thousand USD was invested, total costs 26 million USD.

It is not much data; what they actually migrated was about 1777 GB.

Various parts of the archive: tapes, population data, data on CD-R, registries, and electronically populated data. They could not read all files, especially on tapes. There were 5 different types of tape. Some had to be rescued at great expense.


Total cost of producing the preservation standard: 10 person-years, 12 thousand USD, including the manual and implementation recommendations.

Pilot: project planning and management, and verification of the information packages.

The goal of the pilot was to get a better budget and project plan.

Some problematic data in old formats, such as old databases, need smart people who do repetitive, long-running work; they needed good knowledge management to make this efficient.

Migration method: they wrote requirements for a tool and a description of how manual migration should be done.

Data preparation (restructure the data and register the metadata of the IPs) and preparation of documentation for the migrated IPs.

Software development: in-house development.

They needed 50 person-years per 1 ...

Conclusions

Migrating standard data is cheaper, migrating from some standard tapes is cheaper, etc.

Most of the costs, 80%, fell on non-standardized data, in building the migration software. Developing a tool for migrating heterogeneous or non-standard data is the most expensive part.

What they learned: the old data had not been analysed sufficiently.

Project management was loose; they lost money.

Knowledge management: a good description of old data and all their types, generations, locations etc. does not exist at our institution and we will have trouble with it; migration of old data at NK will be problematic.

Angela Dappert: robust migration workflow for offline media

- What is an archival object (nice slide): a CD is not an archival object; for them it is a hand-held carrier. A bit-stable object is better: it can have a backup etc., and it leads to an archival object that carries further metadata (logical preservation).
- A CD is not searchable, cannot be replicated easily, has a large manual overhead, and its rendering technology becomes obsolete very quickly.
- Endangered Archives project: optical disks, CD-R, external HDs, tapes, 67 terabytes in total.
- The offline hand-held carriers in the Endangered Archives project were very variable; they contained data with DRM, under copyright, and a number of similar problems.
- Options they chose between for each data source:
- a disk image: one file that contains everything on the carrier
- or extraction of only some of the files
- How important is the carrier itself? We need some information about it; it may hold traces of deleted data that we might want to keep. They made disk images of everything possible: hybrid DVDs, audio discs that also held data, etc.
- Which disk image should they use? Not one disk image format for all data; each special case gets its own disk image format.
- They did it with disk copying robots; sometimes large-scale disk copying robots could not be used, because they are good at producing CDs but not at ripping data from CDs.


- They built their own application with disc stacks and some smaller robots; the robots used LIFO or FIFO, and in the end they chose FIFO because LIFO had problems with the CD lifter.
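The difference between the two stack disciplines mentioned above is only the order in which loaded discs reach the drive. A minimal sketch, with hypothetical disc labels and a made-up function name, is:

```python
from collections import deque


def processing_order(loaded: list[str], policy: str) -> list[str]:
    """Return the order in which discs leave the input stack.

    FIFO: the first disc loaded is ripped first.
    LIFO: the last disc loaded (top of the pile) is ripped first.
    """
    if policy not in ("FIFO", "LIFO"):
        raise ValueError(f"unknown policy: {policy}")
    stack = deque(loaded)
    order = []
    while stack:
        order.append(stack.popleft() if policy == "FIFO" else stack.pop())
    return order


if __name__ == "__main__":
    discs = ["disc-001", "disc-002", "disc-003"]  # hypothetical labels
    print(processing_order(discs, "FIFO"))
    print(processing_order(discs, "LIFO"))
```

With FIFO the output order matches the loading order, which makes it easier to keep the physical pile and the list of resulting image files in step.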

At the KB they thought through a fairly complex workflow, how to describe it, etc.

Each robot had its own PC.

They had problems with a number of things, see the presentation.

They did not find good SW for managing the images, only command-line tools, and non-technical staff would make a lot of mistakes with those.

It takes a lot of people before the content gets online.

Staff must be well trained and flexible, but also able to do tedious jobs: systematic, patient.

NOTE: important for NK, where the transfer of data from discs will also have to be dealt with, and has already been dealt with.


KEEP project, Antonio Cuiffreda: towards integrated migration environment

- Disk transfer tool: converts a disk into an image file. The result also carries further metadata about the file system, the MD5 of the file, etc.
- KEEP is building a Transfer Tool Framework.
- Magnetic media: disk transfer tools for floppy disks, commercial and open source.
- Disk2FDI: commercial DOS tool, very accurate floppy disk image; it takes 1 hour and the result is very large, ten times bigger than the floppy disk itself. They tested about 2260 disks and tested emulation.
- Catweasel: commercial tool, a PCI card, runs on Linux and Win XP, has a GUI. High error rate but faster; the quality of the image file was low.
- Nibtools: free tool, G64 and D64, covers only the C64; DOS, Win, Linux, but command line only. It needs a Commodore disk drive and special cables. They tested a few disks; about half then did not work in the emulator.

- Optical media: they used 5 transfer tools, with the same CDs, DVDs and games for all of them:
1. Alcohol 120: commercial, can bypass DRM etc., supports Win systems. Of 13 discs, 12 worked; 2 MB per second.
2. Daemon Tools: commercial, several image file types (ISO, MDS, MDF), supports Win; three of 13 did not work.
3. CloneCD: commercial, uses IMG or ISO, bypasses SafeDisc 3 protection, supports Win, has a GUI; 11 of 13 were OK.
4. BlindWrite: commercial, supports DVD, Blu-ray, WM, Xbox and other special discs. Generates ISO and some proprietary ISO image formats; one disc did not work; transfer speed not recorded.
5. ImgBurn: does not read subchannel information into the image file (so it is not possible to seek within a film etc.); it is open source, generates DVD, BIN, CUE, IMG; Win and Linux; 4 discs did not work; it is fast.
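The disk transfer step noted at the start of this list (an image file accompanied by metadata such as the file system and an MD5 checksum) can be sketched as below. This is an illustrative sketch, not the KEEP tool: the record layout, field names, and file name are assumptions; only `hashlib` and `pathlib` from the Python standard library are used.

```python
import hashlib
import json
from pathlib import Path


def md5_of_file(path: Path, chunk_size: int = 1 << 20) -> str:
    """Compute the MD5 of a file in chunks, so large images fit in memory."""
    digest = hashlib.md5()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def describe_image(path: Path, filesystem: str) -> dict:
    """Build a small metadata record for a disk image (hypothetical layout)."""
    return {
        "image": path.name,
        "size_bytes": path.stat().st_size,
        "filesystem": filesystem,  # supplied by the operator, not detected
        "md5": md5_of_file(path),
    }


if __name__ == "__main__":
    image = Path("floppy_0001.img")  # hypothetical image file
    if image.exists():
        print(json.dumps(describe_image(image, "FAT12"), indent=2))
```

Storing the checksum next to the image lets a later fixity check confirm that the image has not changed since transfer.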

Conclusions

For magnetic media, commercial and non-commercial tools do not differ in performance; Disk2FDI is accurate but very slow. KEEP will use NibTools.

Optical media: keep copy protection in mind; the tools have similar performance; there will always be errors in the images, between 30 and 10 percent; BlindWrite can handle game discs (Xbox etc.). KEEP will use ImgBurn because it is open source.

For complex images BlindWrite is better.
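The "between 30 and 10 percent" error range can be compared with the per-tool counts in the notes. The sketch below assumes that all five optical tools were run on the same 13 discs; the notes state 13 explicitly for only three of them, so the denominators for BlindWrite and ImgBurn are an assumption.

```python
# Failures per tool as recorded in the notes.
# The common denominator of 13 is an assumption for BlindWrite and ImgBurn.
DISCS_TESTED = 13
FAILED = {
    "Alcohol 120": 1,
    "Daemon Tools": 3,
    "CloneCD": 2,
    "BlindWrite": 1,
    "ImgBurn": 4,
}


def failure_rate(failures: int, total: int = DISCS_TESTED) -> float:
    """Return the share of discs that could not be imaged or emulated."""
    return failures / total


if __name__ == "__main__":
    for tool, failures in sorted(FAILED.items(), key=lambda item: item[1]):
        print(f"{tool:<13} {failure_rate(failures):6.1%}")
```

Under that assumption the rates run from about 8% (one failure) to about 31% (four failures), which is broadly in line with the range quoted in the conclusions.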

Benefit for NK

Consider whether NK should really run a project to migrate the content of CDs and DVDs to online media. The talks present concrete experience with robotic processing and show the problems encountered in designing the workflow, choosing the type of ISO image, etc.

BnF web archiving

They have three layers:

Harvest definition: collection

Harvest instance: crawling metadata


ARC files

A collection is, for example, the selective collection "elections 2000"; below it are the individual harvest instances and then the ARC files. They collect logs, config and reports. These are also stored in ARC: a special metadata ARC for every crawling instance.

PREMIS

Object, agent, event

Objects: 1. ARC files and metadata ARCs; 2. harvest instances

Harvest event in PREMIS: the event is the creation of content files

Events: reports as extensions of the event, host report and harvest report

Agents: administrator, SW, institutions, organizations that perform the harvest
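The object/agent/event split described above can be modelled in a few lines. The sketch below illustrates the relationships only; the class and field names, identifiers, and example values are made up for this note and are not PREMIS schema element names or BnF's actual implementation.

```python
from dataclasses import dataclass, field


@dataclass
class Agent:
    name: str
    agent_type: str  # e.g. "person", "software", "organization"


@dataclass
class PremisObject:
    identifier: str
    category: str  # e.g. "ARC file", "metadata ARC", "harvest instance"


@dataclass
class Event:
    event_type: str
    agents: list[Agent]
    objects: list[PremisObject]
    # Reports attached as extensions of the event (host report, harvest report).
    reports: list[str] = field(default_factory=list)


def harvest_event(instance_id: str, arc_ids: list[str], crawler: str) -> Event:
    """Describe one harvest as an event that creates content files."""
    objects = [PremisObject(instance_id, "harvest instance")]
    objects += [PremisObject(arc_id, "ARC file") for arc_id in arc_ids]
    return Event(
        event_type="creation of content files",
        agents=[Agent(crawler, "software")],
        objects=objects,
        reports=["host report", "harvest report"],
    )


if __name__ == "__main__":
    # Hypothetical identifiers.
    event = harvest_event("harvest-0001", ["a.arc", "b.arc"], "crawler")
    print(event.event_type, len(event.objects), event.reports)
```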

ContainerMD

httpbibnumbnffrcontainerMD v1documentationcontainerMD v1html

special metadata for material from the web archive

httpbibnumbnffrcontainerMD v1

A different SLA for different types of material and for different data from different harvests; shared repository, from which they expect various benefits; skills for different formats do not have to be duplicated within the institution.

Next year there should also be a JHOVE2 module for WARC.


Memento: metasearch

Benefit for NK

Their web archiving model could be used at NK.

Cost models: Danish National Library + TU Wien

- Stephan Strodl, TU Wien: they have their own cost model, but apparently only for the cost of small-scale automated preservation actions
- Danish National Library: they built their own model, which should be universal and usable anywhere
- They measured the cost of submission according to the PAIMAS standard
- When calculating costs they use OAIS and PAIMAS: they map activities onto these models and then estimate process costs accordingly
- Costmodelfordigitalpreservationdk

Benefit for NK

Relevant to project 0136, which dealt with ways of estimating the costs of long-term storage.

Meet RODA, a Full-Fledged Digital Repository for Long-Term Preservation

- Originally a project of the Portuguese national archive; we have been following it for several years. RODA is now supported by an independent company, which also partly continues its development. So far RODA supports only the archival metadata format (EAD), but further development should include library formats as well.

- RODA is now part of the SCAPE project, where it will be possible to develop the system further and scale it for use in mass production.

- httpredminekeepptprojectsroda public

Benefit for NK

Follow further development, possibly also for the INCAD + KNAV projects; for LTP development at smaller institutions this could be an interesting alternative in the future.

Report from a business trip abroad

Name and surname of the participant: Mgr. Andrea Fojtů (AF)

Workplace according to the organizational structure: 15 Department of Long-Term Preservation of Digital Data (ODODD)

Workplace assignment: 152 Department of Digital Repository Content Management

Reason for the trip: participation in the conference Aligning National Approaches to Digital Preservation

Place, city: Tallinn

Place, country: Estonia

Date (from-to): 22-26 May 2011

Detailed schedule:

Monday 23 May: conference registration, guided tour of the National Library, Keynote Address by Laura Campbell (Library of Congress, USA), Panel 1 Technical Alignment, Panel 2 Organizational Alignment

Tuesday 24 May: Keynote Address by Gunnar Sahlin (National Library of Sweden), Panel 3 Standards Alignment, Panel 4 Legal Alignment, Breakout Sessions for panels 3 & 4

Wednesday 25 May: Panel 5 Education Alignment, Panel 6 Economic Alignment, Breakout Sessions for panels 5 & 6, Synthesis, Closing remarks

Fellow travellers from NK: PhDr. Bohdana Stoklasová (BS), Ing. Tomáš Svoboda (TS)

Funding: IOP NDK

Goals of the trip: attendance at a conference with international participation; gaining contacts in the area of long-term preservation of digital documents and electronic legal deposit; a more detailed insight into long-term preservation of digital documents, (especially) in national libraries

Fulfilment of the goals (specifically): the conclusions of the conference Aligning National Approaches to Digital Preservation largely repeat the conclusions of the workshop The Future of the Past: Shaping new visions for EU research in digital preservation (report available at httpcordiseuropaeufp7icttelearn digicultfuture of thepast_enpdf), e.g. regarding the missing economic model for the commercial sector, which would treat long-term preservation as an inseparable part of all its processes. Contact was established with staff of the national libraries of Estonia and Finland (staff responsible for web archiving and for long-term preservation in general).

Programme and further details: the main goal of the conference was to align national approaches to long-term preservation of digital documents across all areas, from technical, organizational and educational to standardization, economic and financial.

Materials brought back: conference programme, leaflets of exhibiting companies (Tessella, Equella, Guardtime) + further materials, notes

Date of submission of the report: 8 June 2011

Signature of the person submitting the report

Signature of the superior

Posted on the Intranet

Received by the international department

Appendix to this report: conference notes in English

Appendix: conference notes in English

Exploring What We Can Do Together: Strategic Alliance for International Collaboration, Laura Campbell

- 185 digital preservation partners in more than 25 countries (education, research, LAM)
- strategic goals: National Content Stewardship Network (national digital collection, technical architecture, public policy, outreach)
- NDIIPP Content Domain Map: a mind map of geospatial, audiovisual, image & text, and web content
- then & now: cognitive surplus vs digital libraries/digital preservation
- solution framework: actively working together, special interest groups, establishing a common index, international digital collection (freely available)

PANEL 1: Technical Alignment (The role of testing), Prof. Dr. Michael Seadle (Panel Head)

to collaborate on requiring and implementing rigorous and independent tests

DNB Contribution to the Tallinn Alignment, Sabine Schrimpf

- key theme is infrastructure: a network of hardware and software that permits operation of application SW
- question of interoperability is crucial (standards, technical specifications)
- SW elements/components of the DP infrastructure were compared to the pallets at railroads
o Source: PARSE.Insight Roadmap 2020
- persistent ID resolvers, certification process
- kopal (ingest: koLibRI)
- nestor (German Network of Expertise)
- DP4Lib: Digital Preservation as a Service; reduce dependency between components
o redundant storage at different locations
o koLibRI modules
- LuKII: set up as an economical LOCKSS network in Germany
- SHAMAN
- APARSEN wants to bring coherence and cohesion to digital preservation research
o trends in DP research projects
o modular DP systems
o distributed as SOA
o elimination of technology dependencies

EDINA: The UK LOCKSS Alliance, Adam Russbridge

- EDINA offers underlying technical support & coordination
- threats to digital stewardship
o failure (media, HW, SW, network, format obsolescence, natural disaster, economic/organization failure)
o attack (insider/outsider)
o operator error
- source: Requirement to Digital Preservation
- projects PEPRS and PECAN help identify coverage and requirements for DP

Public testing, Michael Seadle

- traditional physical archiving relies heavily on trusted institutions
- distrust, not trust, needs to be the basis of digital archiving; testing therefore plays a key role
- goals of testing: demonstrate functionality, reveal weaknesses, provide data for planning improvements
- key issues for testing: integrity, authenticity (can the origin or genuineness be shown?), usability (can migration/emulation be demonstrated?), access, financial integrity
- DrWho (drwho1com)
- bit stream testing is the most important; authenticity and usability may be impaired
o the type of storage media, the number of copies + frequencies of checking and replacement get us to the relevant results
o no reliable metrics exist, however (what is an acceptable loss etc.)
- without well documented, peer reviewed, publicly available test results, librarians are buying archiving systems on faith
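Bit stream testing of the kind described above comes down to periodically re-computing a checksum for every copy and comparing it with the value recorded at ingest. A minimal sketch, assuming SHA-256 as the checksum and holding the copies in memory for brevity (a real audit would read them from the storage sites), is given below; the function and site names are illustrative.

```python
import hashlib


def sha256_of(data: bytes) -> str:
    """Return the SHA-256 hex digest of a byte string."""
    return hashlib.sha256(data).hexdigest()


def audit_copies(expected_digest: str, copies: dict[str, bytes]) -> list[str]:
    """Return the names of copies whose digest no longer matches.

    expected_digest is the value recorded at ingest; copies maps a
    storage location name to the bytes currently held there.
    """
    return sorted(
        name for name, data in copies.items()
        if sha256_of(data) != expected_digest
    )


if __name__ == "__main__":
    original = b"archived bit stream"
    recorded = sha256_of(original)
    copies = {
        "site-a": original,
        "site-b": original,
        "site-c": b"archived bit strean",  # one flipped character
    }
    print(audit_copies(recorded, copies))  # lists the damaged copy
```

How many copies to keep and how often to run such an audit are exactly the parameters for which, according to the talk, no reliable metrics exist yet.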

Presentation without a title, Andy Rauber

- evaluation vs testing vs benchmarking
- DP is at the stage of evaluation rather than testing, and far from benchmarking (a few tests, but not near a definition of benchmarks)
o existing evaluations are not repeatable
o focus on the simple things
o building the frameworks before having clear test scenarios
- necessary to move towards comparative benchmarking
- what is needed: a commitment that we want a culture of benchmarking and comparative evaluation; understanding of what we want to benchmark; benchmark data + ground truth; measurement scales and measures that remain constant; knowledge bases to collect these

Organizing digital preservation on an international level, Michelle Gallinger (NDIIPP)

- focus on a national DP agenda
- community driven, action oriented (National Digital Stewardship Alliance)
o present a distributed national digital collection for the benefit of citizens

The European Research Arena, David Giaretta

- technologies: GEANT, EGEE/EGI
- EU research projects: TIMBUS, BLOG4EVER, SCAPE, ENSURE, APARSEN, ARCOMEM, WF4EVER
o SCIDIP-ES (2011-2014)
- Alliance for Permanent Access (APA) formed as a legal entity 3 years ago
o opportunities for networking
- ISO 16363 Audit and Certification of Trustworthy Digital Repositories
- ISO 16919 Requirements for Bodies Providing Audit and Certification

Observation from the MetaArchive Cooperative program, Martin Halbert

- distributed DP programs differ from other programs
o replication of content, distribution of these replicated copies to distinct geographical locations, and network organization to connect these replicated copies
- MetaArchive established in 2003, funded by NDIIPP
o seeks to foster broader awareness of digital preservation issues
- IIPC members are all institutions that focus on WA
o 39 members: national + university libraries + other organizations (Internet Archive)
o ISO standard WARC format for web archives + Heritrix and NutchWAX
o growing membership (Africa & South America)

DAY 2

Keynote address: International and National Collaboration in the digital age, Gunnar Sahlin

- 2012: a new law for e-deposit
- Samsök (search together) in 2005 (upgrade, new system)
- Swepub and long term preservation
- Consortium of the Swedish research libraries for licensing e-journals and databases (ICOLC)
- Open Access and e-publishing (all universities have their repositories for e-pubs)
- NL as aggregator for Europeana: TEL, Apres, Athena, EU screen
o common system for the preservation of digital materials
o common search portal for materials from the Swedish National Library and Swedish National Archive

Raivo Ruusalepp

- standards: RAC, DSA, CIDOC (CRM), PAIMAS, ISAD(G), DDI
- use of information security standards for digital preservation
- information security: administrative and technical (physical = data security vs IT = communication)
- company implemented security measurement with typical cyber crime scenarios
- survey of security
o provision for information security in national legislation and development plans (12 of the respondents: ISO 27000 series; only 2 formal audits, the rest are looking into it; ½ of the respondents do not use standards or formal measurements to control information security)
o IT & disaster plan: 65 (data recovery from the off-site location tested: 0)
- alignment
o better use of community standards for information security and preservation
o agreement on security requirements

Standards based approach to preservation planning, Matthew Woollard

- ISO 27001: very expensive implementation, 100 000 pounds
- Basic: Data Seal of Approval Guidelines (helps understand your business better)
- Audit and Certification of Trustworthy Digital Repositories
- Memorandum of Understanding to Create a European Framework
- ISO 16363 external; DSA or ISO 16363 self audit

Best Practices & Standards, Bram van der Werf

- self assessment: trust
- audits/certification: trust
- ISO 30300 (draft): Record Management

PANEL 4: Legal Alignment

Legal deposit & Web Archiving, Adrienne Muir

- legal deposit provisions: purpose, scope, deposit mechanisms, roles & responsibilities & liability, access provisions, sanctions
- implementation: definitions, scope (offline/online, freely available/pay wall), technology neutral, incremental
- legal deposit vs voluntary (interim/hybrid approach, model agreements and licenses, flexibility)
- other legal issues: intellectual property rights, preservation, access, unlawful material, privacy/data protection
- voluntary approaches have disadvantages but may be necessary and can be useful

Breakout Session

- standards bring along alignment by themselves, but only if you use them to the full, not halfway
- depends on community (users): enforceable or voluntary compliance (standards)
- next step for standard alignment
o corpora as benchmarks
o export/import completeness
o educational standards
o validation tools
o accredited training courses to accredit auditors
o framework standards

DAY 3

PANEL 5: Educational Alignment

- key elements of the DCC Curation Lifecycle Model
- Framework for the Education Alignment (USA: grad programs for digital preservation, new models for grad programs, internship programs (diverse knowledge), workshops)
o sharing tools
o national programs
- related issues to digital preservation
o nature of costs and business models
o strategies for selection & appraisal
o ground roles and responsibilities
o effectiveness and demand for services
- focus of the panel: factors influencing the actual sustainability of a digital archive
- 2 considerations: collaboration + user demand
- challenges + gaps: span national boundaries, public + private funding, education, exportation, DP certification, competition, funding gaps, policy, selection criteria, roles & responsibilities, standards
- Magazzini Digitali: e-legal deposit in Italy
- PADI: a failed project (discontinued in 2010)

Presentation without a title, Neil Grindley

- how much does it cost to manage information? what institutional financial strategies are required to facilitate effective preservation? what general economic frameworks are required to enable information to persist and be accessible?
- JISC 2010: Infrastructure for Education and Research Programme
- archival storage and preservation activities constitute a very small proportion of the overall costs (1531 access 55 outreach, acquisition, ingest)
o approx. 333 Euros for a set of 1000 records
- KRDS2 p. 83: future tool development supporting automation of ingest

Sustainable Preservation in North America: ADPNet & Friends, Aaron Trehub

- solution: distributed digital preservation (in at least 3 copies vs LOCKSS 6 copies)
- DDP + LOCKSS = PLN; open SW developed at Stanford
- MetaArchive, COPPUL (Canada), ADPNet: wwwadpnorg
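Why the number of copies matters can be shown with a simple majority vote: when no single trusted checksum exists, the replicas can still be compared with each other, and a damaged copy is outvoted as long as most copies agree. The sketch below illustrates only that idea; the actual LOCKSS polling protocol is considerably more elaborate, and the function and node names are assumptions.

```python
import hashlib
from collections import Counter


def majority_digest(replicas: dict[str, bytes]) -> tuple[str, list[str]]:
    """Return the digest held by a strict majority and the dissenting nodes.

    Raises ValueError when no digest is held by more than half of the
    replicas, in which case the vote cannot decide which copy is good.
    """
    digests = {
        name: hashlib.sha256(data).hexdigest()
        for name, data in replicas.items()
    }
    if not digests:
        raise ValueError("no replicas to compare")
    winner, votes = Counter(digests.values()).most_common(1)[0]
    if votes <= len(digests) / 2:
        raise ValueError("no majority among replicas")
    dissenters = sorted(n for n, d in digests.items() if d != winner)
    return winner, dissenters


if __name__ == "__main__":
    good = b"journal issue 42"
    replicas = {"node-1": good, "node-2": good, "node-3": b"journal issue 4?"}
    digest, damaged = majority_digest(replicas)
    print(damaged)  # the node that disagrees with the majority
```

With 3 copies the vote tolerates one damaged replica; with 6 copies it tolerates two, since 4 of 6 is still a strict majority.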



V - -depotu

digitalizace v

-

-

-

xy atd) tak arconu process data store i pro management a CRM

Data model - Sledovat OPF toho co vyvinou

NTACE

filtrov f -

Projekty SCAPE a KEEP - ti v

EU ever

ISO standard pro WA kolik obsahu WA v CR

COST Recollection tool loc gov SEE WEB Quality assurence v kontextu WA

Marketing pro WA -


archiv le ights speci

-

Popularizace WA mezi techniky a IT komunitami

collection policy digital strategy

chaosu daleko

a pak j v v KB


-

Salt Lake City USA

Date (from-to) -

14.5. departure from Praha - -; 16.5. PREMIS workshop; 17.-19.5. Archiving 2011 conference; 20.5. departure from SLC - -

15.5. departure Washington-Detroit-Salt Lake City; 15.5. workshops T1D Color in Image Capture Archiving and T2A Scanner & Camera Imaging Performance: Benchmarking, Compliance and Workflow Monitoring; 17.-19.5. Archiving 2011 conference

NK, Archiving conference, IOP-NDK

0136

NA 6.6.2011

Date 6.6.2011, Signature; Date, Signature

Date, Signature

PREMIS tutorial

---------------------------------------------------------- Implementation of a High Performance Architecture for Managing and Storing Web-Harvested Collections Michael Smorul and Joseph JaJa University of Maryland (USA)

httpswikiumiacsumdeduadaptimages66bArchiving11-smorulpdf

- archivace webu - Aplikace warc manager

- -

-

-

- P) -

Using Tape for Large-Scale Digital Preservation Gary Wright FamilySearch (USA)

- (The Church of Jesus Christ of Latter-day Saints)

-

NDK - DRPS Ingest Tools Typ storage storage grid NetApp FAS3170

P information lifecycle management R optimalizace storage layer mezi Grid a rosettu

- - na -

- human error - - - chyby HW - validace integrity dat - - - - maximalizace - verifikace integrity

metadata conten

Moving On When it is Time to Re-Archive Michael Selway Quantum Corporation (USA) CHW) D

jinak to nejde u HDD

Migrace dat z Jak se na to migrac

- HDD - - roky

a mohl by s

- - - - -

co s split migration (full split migrace)

- - - - potaz

FamilySearch An End-to-End Process for Scanning Characterizing Preserving and Providing Access to Very Large Collections of Vital Records Tom Creighton FamilySearch (USA) Jonathan Tilbury Tessella plc (UK) and Mark Evans Tessella Inc (USA)

- Ve FamilySearch mikrofilmy od poloviny 19 st - - - - - s -

- DPS je jejich s - -

The Audit and Certification of FDsys

- FDsys -auditem

TRAC httpwwwgpogovfdsys

- - TRAC hathi UNT Portico metaarchive chronopolis v

- stupnice compliancy 1- -

How Long is Long-Term Data Storage (Focal) Barry M Lunt Brigham Young University and Douglas Hansen Wayne Rust and Mark Worthington Millenniata Inc (USA)

- - FLASH technologie 10- plny elektron

odlivu elektronu a

-

a presto se objevily chyby

- Library 1 21 - Library 2 18

- 1050 let - HDD 1-7 let -

Quality Assurance of Digital Information in Long-Term Digital Preservation University of Technology (Sweden)

- significant properties - dokumentech z -

Towards Interoperable Preservation Repositories Repository Exchange Package Use Cases and Best Practices Joseph Pawletko New York University and Priscilla Caplan Florida Center for Library Automation (USA)

- TIPR

- - - TIPR

-

- RXP mets a premis semantika soubory metadat ne v metsu ale vedle v

- - succession - - - - disaster recovery a v

jako aip - migrace SW - - diversifikace - migrace dat v -

SARKK Comprehensive Digital Archive Services for Finnish Municipalities

-Savon Tietohallinto Oy (Finland)

- firma kterou v ICT long-

- - - preserv planning funkcionalitou -

-

Magnetic Tape Technology economic advantages for preservation Gary Francis Oracle USA

- Oracle prezentace - - clipper group 2010 in search for the long-term archiving solution tape delivers significant

TCO advantage over disk - 5TB na 1 cartridge sunoracle

-----------------------------------------

Color In Digital Preservation Robert Buckley University of RochesterNewMarket Imaging Steven Puglia National Archives and Records Administration and Michael Stelmach Library of Congress (USA)

-

atd -

o barev GAMUTem

AdobeRGB Pro FotoRGB

- a reprodukci barvy

Multispectral Image Archiving of Watermarks in Historical Papers Peter Meinlschmidt Wilhelm-Klauditz-Institut Fraunhofer-Institute for Wood Research and Volker

Braunschweig (Germany)

- Reflected light - Transmited light

-spectral imaging -

o

o o o o

Implementing a Quality Assurance Program for Monitoring Scanner Performance Michael J Horsley and John T Berezich National Archives and Records Administration (USA) DAITSS

- -

mikrofilm sken a r - NARA 2004 Guidelines

httpsdocsgooglecomviewerurl=http3A2F2Fwwwarchivesgov2Fpreservation2Ftechnical2Fguidelinespdf

- Metamorfose - Atd viz sildes - Quantitive performance slides - Web based database sharepoint zace -

Preservation in a Digital Age Jay Verkler FamilySearch (USA)

- Tessellou pro SDB budou k 1 - - - data loss is intrinsic gt - - - och

- preservation as a service

Curation of the End-of-Term Web Archive Classification and Metrics Kathleen Murray Lauren Ko and Mark Phillips University of North Texas (USA)

- - eotcd archiv httpresearchlibraryuntedueotcdwikiMain_Page - - pro - 16TB dat - - (ne Warcy)

pak to pospojo

DAITSS Grows Up Migrating to a Second Generation Preservation System (Focal) Priscilla Caplan and Carol Chou Florida Center for Library Automation (USA)

- Daitss 1 se nedal nainstalovat jinde si -) -

instructions

- v procesech

- - PSP

- refresh funkce - disseminate- do a refresh and export new AIP as DIP - withdraw remove AIP from storage retaining provenance

-

1

- - 80TB -

A Community Driven Micro Services Architecture Supporting Long Term Digital Preservation Mark Evans and Bill Steel Tessella Inc (USA) and Robert Sharpe James Carr Alan Gairey and Jonathan Tilbury Tessella plc (UK)

s NDIIPP v micro-services

spojeno do procesu- lze pak libovol

- - - -

-

workflow - -

- bit-streamu - cloudu (S3

httpawsamazoncoms3) SaaS funkcionalitu bude brzy ti tenancy

funkcionalitou policy apod

Pozn z USA

v 44TB SIP v testu (10MB JP2 soubory)

komunita SDB - - vznik 2008 - -

System

Lisa LaPlant and Blake Edwards US Government Printing Office (USA) FDsys OAIS dig MODS - z

<part>2</part>

DMD data model definition z

httpwwwgpogovfdsyssearchpagedetailsactionst=pragueampgranuleId=amppackageId=DCPD-200900228

----------------------------------------- Preservation Starts from the Beginning Michael Wash US Department of Transportation (USA)

Autographic Kodak 1916 na druhou

following the No 3A Autographic Kodak Special of 1916 which was the first rangefinder camera It had a Kodak Anastigmat f63 lens and a Kodamatic shutter with speeds from 12 to 1200 sec plus bulb and time mode httpcamerapediawikiacomwikiNo_1A_Autographic_Kodak_Special Kodak Advantix

zastavil

Colorite A Flexible Cross-Platform Software Solution for Automatic Image QualityAnalysis Using Arbitrary Targets Henrik Johansson National Library of Sweden (Sweden)

- a - -

o Target se detekuje automa se aby tento proce

o TIFF JPEG JP2 PNG o o -of-art feature based image matching algoritm ImageMagic

o Alg

t o o v budoucnu vypnout aby se proces urychlil pro BATCH

proce o

Henrika Johanssona What if the Image Quality Analysis Rates My D Dietmar Wueller Image Engineering (Germany)

- Golden thread - UTT targer -

o

-lens reflex

Establishing Resolution Requirements for Digitizing Transmissive Content A Use Case Approach Michael Stelmach Library of Congress Don Williams Image Science Associates LLC and Steven Puglia National Archives and Records Administration (USA)

-

- Based on 10 SFR limiting resolution criteria how much the image information will be captured

o first half of the 20th century: 1200-1600 PPI
o second half of the 20th century: up to 2800 PPI

Digitise More Pay Less Optimising the Workprocess for both Heritage Institute and Imaging Provider Olaf Slijkhuis Pictura Imaginis (the Netherlands)

- ument kvalita parametry atd - -

- trolu

obrazu

Debate with FamilySearch, 20.5.2011

also on Digital Preservation issues

will have to provide free of charge

and synchronize

an effort on their side to cooperate with archives and libraries

CO-FINANCED FROM EU STRUCTURAL FUNDS (EUROPEAN REGIONAL DEVELOPMENT FUND) THROUGH THE IOP

Business trip report. Project: Creation of the National Digital Library

CZ.1.06/1.1.00/07.06386

Name and surname of trip participant: Jan Hutař

Workplace (per organizational structure): ODF 81

Workplace (position): head of department

Reason for trip: attending the iPRES 2011 conference

Place (city): Singapore

Place (country): Singapore

Date (from-to): 30.10. - 5.11.2011

Detailed schedule:
30.-31.10. flight Prague > Dubai > Singapore
1.11. start of the conference, tutorials
2.11.-4.11. conference
4.-5.11. return flight Singapore > Dubai > Prague

Fellow travellers from NK: Mgr. Marek Melichar (funded from project 0136)

Funding: IOP, Creation of the National Digital Library

Relation to the project: obtaining new information on digital preservation issues and on projects in other libraries; consultations with colleagues and companies

Trip goals: see relation to the project; use all outputs for the planning and running of the NDK project and for future handling of digital preservation issues in NK/NDK

Fulfilment of trip goals: fulfilled; see the detailed notes below and the proceedings at SPS


Further detailed information: SUMMARY AND BENEFIT FOR THE NDK PROJECT

- a noticeable rise of emulation as an approach to long-term preservation (in previous years it was migration) > the two approaches seem likely to complement each other
- a shift towards preserving complex data, databases, etc.; NK does not address this yet
- many contributions are applicable to NK and NDK as well (web archiving and preservation at the National Library of France, audits, emulation, information on the SDB system (Tessella) and on the RODA system, certification, see Rouchon, etc.)
- information on problems with, and solutions for, using tools such as JHOVE, PRONOM and others in a real environment
- 2 contributions on backing up optical discs, a current problem in NK too; ideally follow the procedures described in the proceedings
- a clear need for international cooperation and adherence to standards so that such cooperation is possible
- for details see below

Project publicity support: N/A

Related materials

Material / Storage location

conference proceedings: SPS, folder with business trip reports

Report submission date: 15.11.2011

Signature of the submitter

Date / Signature

Signature of the superior: 15.11.2011

Posted on the intranet


Received by the international department

Seamus Ross: digital curation and preservation
- preserving data sets: use of statistical databases and research data vs. preservation of textual information
- personal data: no longer just photos in a box (Flickr etc.)
- banks: a lot of personal data with future use for historians; it must be preserved, and institutions are already doing so
- see mckinseycom big data full report pdf
- so why do DP (slide 16): future generations expect it; for historians and researchers, so that they have some sources; a legacy of the present for the future; an information ecosystem to enable storytelling
- emphasis is shifting from preserving textual information to preserving complex databases

A capability model for DP, Ch. Becker et al.
- research within the SHAMAN and SCAPE projects
- SoS: systems of systems
- 3 kinds of systems (UML):
- DPS with DP as a functional requirement
- SoS: a business system, a system within a system; data then flows into the DPS
- DPS where DP is not a functional requirement but the system does it anyway (DP-ready system): a business system with DP functionality
- but how to get DP into enterprise systems? A model for implementing DP in any system, within the SHAMAN project: capability-based reference architecture
- governance, business and technical (operation) capabilities: a basis for decisions and for assessing the current state
- capability maturity model (CMM): processes for assessment and improvement, as in SW development

Olivier Rouchon: certification and quality at CINES
- they store theses, digitized items, multimedia, documents, scientific data sets
- a data centre for the whole of France
- they have format experts (XML), 11 people
- 15TB of data
- certification: under national law CINES is the national centre for DP of theses; they have a department, people and money for it; for the procedure and preparations see below
- preparation for certification: testing DRAMBORA, DSA, TRAC, ISO 16363 and ISO 16919
- step (2009): DRAMBORA audit, 2 risk reviews per year on how their resolution is progressing
- step: formalization of business processes, 14 processes according to ISO 9001: management, operational and support processes (presented at iPRES 2010)
- 2009: external pre-audit, 2 people, 19 man-days, based on all available standards, by means of documentation review and interviews
- 2010: SIAF audit, 4 months; carried out by the National Archives of France for every archive that stores public data; they audit every 3 years; the report had 800 pages


- 2010: Data Seal of Approval, part of the EU framework for audit and certification of trusted repositories (MoU between the three certification activities)
- 2011: within the APARSEN project they also did ISO 16363; they went through that audit together with DANS and UKDA
- first internal, January and April 2011 (60 man-days), then external, 12 experts (KB, BL, NASA, etc.) in June 2011 (3 days)

MoU: DSA > ISO 16363 as a second step (internal) > ISO 16363 extended. An audit gets faster the more of them you do, i.e. if it is done regularly it is not so time-consuming.

The National Library of New Zealand went through TRAC certification in autumn 2011

Andreas Rauber: impact of preservation actions on the repository
- what happens to the repository itself
- repository simulation: RepoSim
- for analysis and for testing migration: what happens when files grow, what if the repository holds more types of formats, etc.
- RepoSim: a flexible simulator, irregular patterns
- so far an internal version: Hibernate, Java, MySQL
- it lets you specify which formats the repository accepts, their description, ingest settings, hypothetical tools (mainly for migration), and rules for preservation actions (migration to which format, which version, which files, how many times; rules + filters)
- option to run a virtual migration; graphs are produced that tell how long it will take, etc.
- what to migrate, how, to what and over what period: it runs virtually and we see the result
- good for IT and HW planning
- good for planning various scenarios, comparison with the expected development, planning HW expansion and investment
- still to be added: the ability to enter deletion policies, reports, etc.
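The kind of "virtual migration" described in these notes can be illustrated with a minimal sketch. This is not RepoSim (which the notes describe as an internal Java/Hibernate/MySQL tool); the class names, the fixed tool throughput and the size ratio are all assumptions made for illustration only.

```python
from dataclasses import dataclass


@dataclass
class Holding:
    """Files of one format held by the simulated repository."""
    fmt: str
    file_count: int
    avg_size_mb: float


@dataclass
class MigrationRule:
    """Hypothetical rule: migrate source_fmt to target_fmt with an assumed tool speed."""
    source_fmt: str
    target_fmt: str
    tool_mb_per_s: float   # assumed throughput of the migration tool
    size_ratio: float      # assumed target size / source size


def simulate_migration(holdings, rules):
    """Return, per matching rule, the estimated duration and resulting volume."""
    rule_by_fmt = {rule.source_fmt: rule for rule in rules}
    report = []
    for holding in holdings:
        rule = rule_by_fmt.get(holding.fmt)
        if rule is None:
            continue  # no preservation action defined for this format
        volume_mb = holding.file_count * holding.avg_size_mb
        report.append({
            "from": holding.fmt,
            "to": rule.target_fmt,
            "hours": volume_mb / rule.tool_mb_per_s / 3600,
            "volume_after_mb": volume_mb * rule.size_ratio,
        })
    return report


# Illustrative numbers only: 1000 TIFF files of 36 MB each, migrated at 10 MB/s.
holdings = [Holding("TIFF", 1000, 36.0), Holding("PDF", 500, 2.0)]
rules = [MigrationRule("TIFF", "JP2", tool_mb_per_s=10.0, size_ratio=0.5)]
report = simulate_migration(holdings, rules)
```

Even a toy model like this shows why such a simulator is useful for HW planning: duration and resulting storage volume fall out of a few parameters that can be varied per scenario.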

José Barateiro: Risk assessment in DP of e-science data and processes
- DP as risk management
- ISO 31000: definition of risk management
- similar to DRAMBORA
- there are many standards for risk management
- elaboration of the ISO 31000 methodology into individual steps
- TIMBUS project (httptimbusprojectnet): one of the partners is SAP (Germany)

mad talks
- open source SW for LTP: RODA is back, being developed within the SCAPE project; new functions, plans for development and for building a user community
- 4 posters on emulation: emulation within KEEP, emulation for reading rooms in libraries, OPF ecosystem registry
- TOTEM: a metadata standard for describing the technical environment for emulation


ANDS, Ross Wilkinson
- data centres in Australia, at least 3 for different areas of life
- ANDS has existed for almost 3 years, funded by the Australian government
- a huge amount of data that will never be used or read by a human, only mined by automated processes
- research data must be stored and protected because it may no longer be possible to recreate it; so that it can be reused, so that new conclusions can be drawn from it, and so that researchers have it at hand
- it has to be done in cooperation; it cannot be done by a single institution alone
- who handles the storage of scientific data in the Czech Republic? Academy of Sciences? CESNET?
- similar data centres also exist in Great Britain

Rob Sharpe: Considerations for High Throughput Digital Preservation
Presentation by Tessella: their testing of ingest performance into SDB at FamilySearch

- SDB has been in development since 2002, when the first customer was the National Archives UK
- new customer: UK Parliament
- test with FamilySearch
- 20TB ingested per day, scanned materials; workflow with antivirus, characterization (PRONOM, JHOVE), etc.
- 1 package is roughly 1GB, 20 thousand packages per day
- 2 servers, Dell PowerEdge R710, total price max. 20,000 pounds
- the limiting factor turned out to be the read speed of the disks holding the data at the start of ingest; they therefore needed 130 parallel disks (50 thousand pounds)
- storing on tape is also slow; they therefore needed 8 parallel tape writes (30 thousand pounds)
- storage costs 100 pounds per TB
- 7.3 PB per year
- conclusion: writing and reading are slow; tools such as JHOVE and PRONOM are fast enough; storage costs also proved high

For the ingest of data from the FamilySearch project they needed to sustain a throughput of 20TB of data per day while keeping adequate procedures for processing the data according to the requirements of OAIS and of the customer. The aim of the project was to identify the bottlenecks of ingesting large volumes of data.

Processes such as generating hashes or checking them, format identification and extraction of technical metadata usually require a fast storage system at large volumes. In the FamilySearch project they want to ingest into SDB (content acquisition, content preparation, ingest, fixity check, content metadata integrity check, characterization, i.e. format identification and validation and extraction of technical metadata) at most 700MB per second.
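A fixity check of the kind listed above can be sketched as follows. This is a generic illustration, not Tessella's SDB workflow; the function names, the SHA-256 default and the manifest layout (path mapped to expected digest) are assumptions.

```python
import hashlib


def file_digest(path, algorithm="sha256", chunk_size=1024 * 1024):
    """Hash a file in fixed-size chunks so large files never sit fully in memory."""
    digest = hashlib.new(algorithm)
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def check_fixity(manifest, algorithm="sha256"):
    """manifest maps path -> expected hex digest; returns the paths that do not match."""
    return [
        path
        for path, expected in manifest.items()
        if file_digest(path, algorithm) != expected
    ]
```

Every byte has to be read from storage once per check, which is consistent with the observation in the notes that read speed, not the tool itself, tends to be the limit.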

In other words: how to parallelize such a massive workflow efficiently while minimizing costs. According to their findings, parallelization makes it possible to work around the performance problems of tools such as DROID and JHOVE; overall, software performance was not a problem, contrary to their expectations. The bigger problems lie in the HW, which has to be able to write fast enough.

That is, the bottleneck was in the HW and in moving data from place to place rather than in the performance of the digital preservation tools.
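The figures quoted in these notes can be cross-checked with simple arithmetic. The sketch below assumes decimal units (1 TB = 10^12 bytes) and a perfectly even load over 24 hours; the function and key names are illustrative.

```python
SECONDS_PER_DAY = 24 * 60 * 60


def ingest_figures(tb_per_day=20, package_gb=1, pounds_per_tb=100):
    """Derive average rate, package count, yearly volume and yearly storage cost."""
    bytes_per_day = tb_per_day * 10**12
    return {
        "mb_per_s": bytes_per_day / SECONDS_PER_DAY / 10**6,
        "packages_per_day": tb_per_day * 1000 // package_gb,
        "pb_per_year": tb_per_day * 365 / 1000,
        "storage_pounds_per_year": tb_per_day * 365 * pounds_per_tb,
    }


figures = ingest_figures()
# mb_per_s is about 231.5, packages_per_day is 20000, pb_per_year is 7.3
```

The 20 thousand packages per day and 7.3 PB per year in the notes follow directly from 20TB per day. The average of roughly 231 MB per second is only a lower bound on what the storage must deliver, because each workflow step reads or writes the data again, which fits the quoted peak of 700MB per second.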

Benefit for NK

Do not be afraid of the performance of SW such as DROID or JHOVE

Ross King: Evolving domains, problems and solutions for LT DP
- information on SCAPE and similar projects
- programme: httpcordiseuropaeufp7icttelearn digicultreport research digital preservation_enpdf
- Stephan Strodl, Petar Petrov and Andreas Rauber (all Vienna University of Technology, Austria): a nice timeline for preservation projects, a whitepaper about the past of European DP
- Funding spent on DP research is gradually growing. Projects and funding alone will not solve anything
- ARCOMEM: archiving for web archives, socially driven web preservation model
- social web analysis
- archive enrichment
- ENSURE: evaluation between cost and value, automation of the preservation cycle; testbeds: healthcare, clinical trials, financial services
- SCAPE
- preservation planning and action workflows: how to make them scalable
- creating an infrastructure for scalable preservation actions
- developing a policy-based preservation planning tool with automatic preservation watch
- 3 testbeds: web archives, large-scale repositories, research data sets
- all projects will produce prototype SW
- digital lifecycle approach
- preservation planning plays a role in all these projects, together with virtualization
- slide with trends in DP over recent years
- Research on Digital Preservation within projects co-funded by the European Union in the ICT
- Ensure
- Scape
- Wf4Ever httpwwwwf4ever projectorgabout
- Timbus: SW is not enough; focuses on the context of organizations; LTP is not only about objects but also about services, etc.
- Totem

Benefit for NK

Follow projects in the area of long-term preservation of digital data. The latest EU projects such as SCAPE will speed up the development of concrete tools for long-term preservation of digital data

Record keeping in temporary command settings, Erik Borglund
- preservation of documentation of crisis situations arising from the work of the police etc.
- how to capture context: flipcharts, videos and notes can be kept; context is not a problem for analogue documents, the problem is with digital items and conversations
- the national archive should take care of this, but it accepts only paper documents or e.g. photos from the scene
- the question is whether archiving the file material is the same as archiving the course of the proceedings in digital form


webarchiving session

BnF
- 200TB of web-archived data, 15 million ARC files
- they have to characterize and validate them, which is time-consuming
- stored in a shared repository
- SPAR (the LTP system of the National Library of France) has a capacity of 16PB
- they use JHOVE2 for characterization and are building a module for ARC files
- they do not want to characterize and validate the content of the ARC files, only identify formats

PREMIS in METS would be too long, so they will record only metadata at the level of the information package (AIP) that are the same for the whole package; that is, a property is expressed once and then only the information about which files it applies to is added, instead of repeating that information for every file. They created a special metadata format, so they can ask the LTP system "give me all information packages that contain format XY", etc.; there is no need to index the metadata of the contents, which would take long. They take the same approach for digitized books. Different DP policies and validation levels apply to different types of web archive data: complete harvests vs. thematic harvests
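The idea of stating a property once per package and listing the files it applies to can be sketched in a few lines. This is only an illustration of the principle, not BnF's metadata format or the SPAR query interface; the function names, package identifiers and MIME-type labels are made up.

```python
from collections import defaultdict


def summarize_package(file_formats):
    """Turn {file: format} into {format: [files]} so each format is stated once."""
    by_format = defaultdict(list)
    for file_name, fmt in file_formats.items():
        by_format[fmt].append(file_name)
    return {fmt: sorted(names) for fmt, names in by_format.items()}


def packages_with_format(package_summaries, fmt):
    """Answer 'give me all packages that contain format XY' from the summaries alone."""
    return sorted(
        package_id
        for package_id, summary in package_summaries.items()
        if fmt in summary
    )


# Hypothetical packages for illustration.
summaries = {
    "aip-001": summarize_package(
        {"a.html": "text/html", "b.html": "text/html", "c.png": "image/png"}
    ),
    "aip-002": summarize_package({"d.pdf": "application/pdf"}),
}
```

The query only touches one small summary per package, which is why no index over the metadata of every contained file is needed.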

NL NZ
- 2 harvests, 20 TB in total
- they are working out metadata: how much metadata is a lot and how much is too little
- library policy says that as much metadata as possible must be stored, but that would be a problem in terms of metadata size
- for selective web harvests they have a finished workflow in WCT; everything is catalogued

IA
- 16 billion URLs
- the oldest from 1996
- growth is 3TB per day, 1PB per year

Euan Cochrane, Dirk von Suchodoletz: Replicating Installed Application and Information Environments onto Emulated or Virtualized Hardware

- capturing and preserving an entire environment on emulated HW
- e.g. take the Prime Minister's desktop environment and store it in an archive
- problems with rendering
- computer forensics
- an option for preserving scientific data and records
- it is all about replicating an HDD and running the environment it contains in a virtual environment
- solution: they pulled the HDDs out of several old PCs > identify the HW requirements (HDD analysis > automatic estimate of the requirements; this information is part of every PC environment) > choose emulation/virtualization SW (a tool registry such as TOTEM from the KEEP project) > convert the HDD to a disk image suitable for emulation > try to boot the disk image on the emulated HW > add drivers
- problems with licences, personal data protection, authenticity (20 things change, colours etc.)
- QEMU, SPARC processor emulator


Klaus Rechert: Remote Emulation for Migration Services in a Distributed Preservation Framework
- use of emulation as a tool for migration
- tools for migrating certain formats are often not available
- we insert the digital object into an emulated environment (virtual machine) and then see it inside the emulated system; we can open it in the original or another suitable application, save it in a different format, and store it back in the virtual machine

Bram Lohman: Emulation as a Business Solution: the Emulation Framework (KEEP project)

- emulation framework: 7 emulators, 6 platforms (x86, Amiga, etc.), 23 file formats
- a solution for managing emulation tools
- setup of emulation processes
- an environment that contains emulators; if we load an application or a file into it, it should run as in the original environment
- the environment also contains a tool that shows, for a file, what format it is and what environment is needed to run it; based on PRONOM, that environment can be prepared right away and the file run in it
- it loads the SW image from the OPF application database, which is being built

Geoffrey Brown: Developing Virtual CD-ROM Collections: The Voyager Company Publications

- publications of one particular publisher on CD, interactive applications for Mac; of the 200 published, only about 50 are available now
- emulation on today's systems
- HDD snapshot directly in the emulator, i.e. it takes one click and is very fast
- SheepShaver emulator

Evaluation of Danish large migration project
- Before 1998 they had no formats set by law
- Between 2005 and 2008 they introduced standards
- The evaluation concerns the standards that were set and the migration to them in the national archive
- The evaluation was done for the funder
- Between 2005 and 2008 they spent 30 person-years on the migration, had 10-15 people on it, invested 190 thousand USD; total costs 26 million USD

It is not much data: what they actually migrated was about 1777GB

Various parts of the archive: tapes, population data, data on CD-R, registries, and data electronically [...]
They could not read all files, especially on tapes
5 different types of tape
Some had to be rescued at great expense


Total cost of producing the preservation standard: 10 man-years, 12 thousand USD, including the manual and implementation recommendations

Pilot: project planning and management, and verifying the information packages

The aim of the pilot was to obtain a better budget and project plan

Some problematic data in old formats, such as old databases, need smart people doing repetitive, long-running work; they needed good knowledge management to make it efficient

Migration method: they wrote requirements for the tool and a description of how manual migration should be done

Data preparation (restructuring the data and registering the metadata of the IPs) and preparation of the documentation of the migrated IPs

Software development: in-house development

They needed 50 person-years per 1

Conclusions

migration of standardized data is cheaper; migration from some standard tapes is cheaper, etc.

Most of the costs (80%) fell on non-standardized data, in producing the migration software. Developing a tool for migrating heterogeneous or non-standard data is the most expensive part

What they learned: the old data had not been analysed sufficiently

Project management: it was loose and they lost money

Knowledge management: a good description of old data and all their types, generations, locations, etc. does not exist at our institution and we will have trouble with it; migration of old data in NK will be problematic

Angela Dappert: robust migration workflow for offline media
- What is an archival object: nice slide; a CD is not an archival object, for them it is a hand-held carrier; better is a bit-stable object, which can have a backup etc., leading to an archival object that has additional metadata, logical preservation
- A CD is not searchable, cannot be replicated easily, has a large manual overhead, and the rendering technology becomes obsolete very quickly
- Endangered Archives project: optical disks, CD-R, external HDs, tapes; 67 terabytes in total
- The OFFLINE hand-held carriers in the Endangered Archives project were very variable; they contained data with DRM, under copyright, and a number of such problems
- Options they chose between for each data source:
- Disk image: one file containing everything on the carrier
- Or extraction of only some files
- How important is the carrier itself? We need some information about it; it may hold traces of deleted data that we may want to keep. They made disk images of everything possible: hybrid DVDs, audio discs that also held data, etc.
- Which disk image should they use? Not a single disk image format for all data; a special disk image format for each type
- They did it with robots (disk copying robots); some large-scale disk copying robots could not be used: they are good at producing CDs but not at ripping data from CDs


- They built their own application with disk stacks and some smaller robots; the robots used LIFO or FIFO; in the end they used FIFO, as LIFO had problems with the CD lifter

At KB they thought through a fairly complex workflow, how to describe it, etc.

They had a PC at each robot

They had problems with a number of things, see presentation

They did not find good SW for image management, only command-line tools, and non-technical staff would make a lot of errors with those

It takes a lot of people before the content gets online

The staff must be well trained and flexible, but also able to do tedious jobs: systematic, patient

NOTE: important for NK, where the transfer of data from discs will also be addressed, and has already been addressed


KEEP project, Antonio Ciuffreda: towards integrated migration environment

- Disk transfer tool: converts a disk into an image file. It also contains additional metadata about the file system, the MD5 of the file, etc.
- KEEP is producing a Transfer Tool Framework
- Magnetic media: disk transfer tools for floppy disks, commercial and open source
- Disk2FDI: commercial DOS tool, very accurate floppy disk image; it takes 1 hour and the result is then very large, ten times larger than the floppy disk itself. Tested on about 2260 disks; they tested emulation
- Catweasel: commercial tool, a PCI card, runs on Linux and Win XP, has a GUI. High error rate but faster; image file quality was low
- Nibtools: free tool, G64 and D64, covers only C64; DOS, Win, Linux, but command line. Needs a Commodore disk drive and special cables. They tested a few disks; about half then did not work in the emulator
- Optical media: they used 5 transfer tools, all on the same CDs, DVDs and games
1. Alcohol 120%: commercial, can bypass DRM etc., supports Win systems. Of 13, 12 worked; 2MB per second
2. Daemon Tools: commercial, several types of image files (ISO, MDS, MDF), supports Win; three of 13 did not work
3. CloneCD: commercial, uses IMG or ISO, bypasses SafeDisc 3 protection, supports Win, has a GUI; 11 of 13 were OK
4. BlindWrite: commercial, supports DVD, Blu-ray, WM, Xbox and other special discs. Generates ISO and some proprietary ISO image formats; one did not work; download speed
5. ImgBurn: does not read subchannel information into the image file (a film cannot be seeked, etc.); it is open source, generates DVD, BIN, CUE, IMG; Win and Linux; 4 did not work; it is fast

Conclusions

For magnetic media, commercial and non-commercial tools do not differ in performance; Disk2FDI is accurate but very slow. KEEP will use NibTools

Optical: think about copy protection; the tools perform similarly; there will always be errors in those images, between 30 and 10 percent; BlindWrite handles game discs (Xbox etc.). KEEP will use ImgBurn because it is open source

For complex images BlindWrite is better
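The notes say a disk transfer tool records metadata such as the file system and an MD5 of the image alongside the image file. A minimal sketch of such a record follows; it is not the KEEP Transfer Tool Framework, and the function name, field names and the caller-supplied file system label are assumptions.

```python
import hashlib
import os


def describe_image(image_path, file_system, chunk_size=1024 * 1024):
    """Build a small metadata record for an existing disk image file."""
    md5 = hashlib.md5()
    with open(image_path, "rb") as image:
        # Read in chunks so multi-gigabyte images do not need to fit in memory.
        for chunk in iter(lambda: image.read(chunk_size), b""):
            md5.update(chunk)
    return {
        "image_file": os.path.basename(image_path),
        "size_bytes": os.path.getsize(image_path),
        "file_system": file_system,  # supplied by the caller, not detected here
        "md5": md5.hexdigest(),
    }
```

Recomputing the checksum later and comparing it with the stored value shows whether the image has changed since transfer; it says nothing about whether the original rip was error-free, which is the problem the tool comparison above is about.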

Benefit for NK

Consider whether it would be worthwhile for NK to actually run a project migrating the content of CDs and DVDs to online media. The presenters report concrete experience with robotic processing and show the problems they had devising the workflow, choosing the ISO image type, etc.

BnF web archiving
They have three layers:

Harvest definition: collection

Harvest instance: crawling, metadata


ARC files

A collection will be, for example, the selective "elections 2000" one, then come the individual harvest instances, and then the ARC files. They collect logs, config and reports. These are also stored in ARC: special ARC metadata files for each crawling instance

PREMIS

Object, agent, event

Objects:
1. ARC files and metadata ARCs
2. harvest instances

Harvest event in PREMIS: event = creation of content files

Events: reports as event extensions: host report and harvest report

Agents: administrator, SW, institutions, organizations that perform the harvest

ContainerMD

httpbibnumbnffrcontainerMD v1documentationcontainerMD v1html

special metadata for items from the web archive

httpbibnumbnffrcontainerMD v1

different SLAs for different types of material and for different data from different harvests; shared repository; they expect different benefits; skills for different formats need not be duplicated within the institution

next year there should also be a JHOVE2 module for WARC


Memento: meta search engine

Benefit for NK

Their web archiving model could be used in NK

Cost models: Danish National Library + TU Wien
- Stephan Strodl, TU Vienna: they have their own cost model, but only for small-scale automated preservation action costs, it seems
- Danish National Library: they built their own model, which should be universal and usable anywhere
- They measured the cost of submission according to the PAIMAS standard
- When calculating costs they use OAIS and PAIMAS: they map activities onto these models and then estimate process costs accordingly
- Costmodelfordigitalpreservationdk

Benefit for NK

For project 0136: options for estimating the costs of long-term storage were addressed there

Meet RODA, a Full-Fledged Digital Repository for Long-Term Preservation

- Originally a project of the Portuguese national archive, which we have been following for several years. The RODA system is now supported, and partly further developed, by an independent company. So far RODA supports only an archival metadata format (EAD), but further development should include library formats as well

- RODA is now part of the SCAPE project, where it will be possible to develop the system further and scale it for use in mass production

- httpredminekeepptprojectsroda public

Benefit for NK

Follow further development; possibly also for the INCAD + KNAV projects; for LTP development for smaller institutions this could be an interesting alternative in the future

Foreign business trip report

Name and surname of trip participant: Mgr. Andrea Fojtů (AF)

Workplace (per organizational structure): 15 Department of Long-term Preservation of Digital Data (ODODD)
Workplace (position): 152 Digital Repository Content Management Unit
Reason for trip: participation in the conference Aligning National Approaches to Digital Preservation
Place (city): Tallinn
Place (country): Estonia
Date (from-to): 22.-26.5.2011

Detailed schedule:
Monday 23.5.: conference registration, guided tour of the National Library; Keynote Address by Laura Campbell, Library of Congress, USA; Panel 1: Technical Alignment; Panel 2: Organizational Alignment
Tuesday 24.5.: Keynote Address by Gunnar Sahlin, National Library of Sweden; Panel 3: Standards Alignment; Panel 4: Legal Alignment; Breakout Sessions for panels 3 & 4
Wednesday 25.5.: Panel 5: Education Alignment; Panel 6: Economic Alignment; Breakout Sessions for panels 5 & 6; Synthesis; Closing remarks

Fellow travellers from NK: PhDr. Bohdana Stoklasová (BS), Ing. Tomáš Svoboda (TS)

Funding: IOP NDK
Trip goals: attendance at a conference with international participation; obtaining contacts for the area of long-term preservation of digital documents and electronic legal deposit; a more detailed insight into the issues of long-term preservation of digital documents (especially) in national libraries

Fulfilment of trip goals (specifically): The conclusions of the conference Aligning National Approaches to Digital Preservation mostly mirror the conclusions of the workshop The Future of the Past: Shaping new visions for EU research in digital preservation (report available at httpcordiseuropaeufp7icttelearn digicultfuture of thepast_enpdf), e.g. regarding the missing economic model for the commercial sector, which would perceive long-term preservation as an integral part of all its processes. Contact was established with staff of the national libraries of Estonia and Finland (staff for web archiving and for long-term preservation in general)

Programme and further details: The main aim of the conference was to align national approaches to long-term preservation of digital documents across all areas, from technical, organizational and educational to standardization, economic and financial

Materials brought back: conference programme, leaflets of exhibiting companies (Tessella, Equella, Guardtime) + further materials, notes

Report submission date: 8.6.2011
Signature of the submitter

Signature of the superior

Posted on the intranet
Received by the international department

Appendix to this report: Conference notes in English

Appendix: Conference notes in English

Exploring What We Can Do Together Strategic Alliance for International Collaboration Laura Campbell

185 digital preservation partners in more than 25 countries (education research LAM)strategic goals National Content Stewardship Network (national digital collection technicalarchitecture public policy outreachNDIIPP Content Domain Map a mind map of geospatial audiovisual imageamptext and webcontentthen amp now cognitive surplus vs digital librariesdigital preservationsolution framework actively working together special interest groups establishing acommon index international digital collection (freely available)

PANEL 1: Technical Alignment (The role of testing), Prof. Dr. Michael Seadle (Panel Head)

to collaborate on requiring and implementing rigorous and independent tests

DNB Contribution to the Tallinn Alignment Sabine Schrimpf

- key theme is infrastructure: a network of hardware and software that permits the operation of application SW
- the question of interoperability is crucial (standards, technical specifications)
- SW elements
- components of the DP infrastructure were compared to pallets on railroads
o Source: PARSE.Insight Roadmap 2020
- persistent ID resolvers, certification process
- kopal (ingest: koLibRI)
- nestor (German Network of Expertise)
- DP4lib: Digital Preservation as a Service, reduce dependency between components
o redundant storage at different locations
o koLibRI modules
- LuKII: set up as an economical LOCKSS network in Germany
- SHAMAN
- APARSEN: wants to bring coherence and cohesion to digital preservation research
o trends in DP research projects
o modular DP systems
o distributed as SOA
o elimination of technology dependencies

EDINA: The UK LOCKSS Alliance, Adam Rusbridge

- EDINA offers underlying technical support & coordination
- threats to digital stewardship:
o failure (media, HW, SW, network, format obsolescence, natural disaster, economic/organizational failure)
o attack (insider/outsider)
o operator error
- source: Requirement to Digital Preservation
- projects PEPRS and PECAN help identify coverage and requirements for DP

Public testing Michael Seadle

- traditional physical archiving relies heavily on trusted institutions
- distrust, not trust, needs to be the basis of digital archiving; testing therefore plays a key role
- goals of testing: demonstrate functionality, reveal weaknesses, provide data for planning improvements
- key issues for testing: integrity, authenticity (can the origin or genuineness be shown?), usability (can migration/emulation be demonstrated?), access, financial integrity
- DrWho (drwho1com)
- bit stream testing is the most important; authenticity and usability may be impaired
o the type of storage media, the number of copies + the frequency of checking and replacement get us to the relevant results
o no reliable metrics exist, however (what is an acceptable loss, etc.)
- without well-documented, peer-reviewed, publicly available test results, librarians are buying archiving systems on faith
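The remark about the number of copies and the frequency of checking can be illustrated with a small calculation. The sketch below is only an illustration and not part of the talk: it assumes that copies fail independently and with the same probability between two integrity checks, which real storage rarely satisfies, and the function names and example values are hypothetical.

```python
def probability_all_copies_lost(per_copy_failure_probability, copies):
    """Chance that every copy is lost between two integrity checks.

    Assumption (not from the talk): copies fail independently and with
    the same probability within one checking interval.
    """
    if not 0.0 <= per_copy_failure_probability <= 1.0:
        raise ValueError("probability must be between 0 and 1")
    if copies < 1:
        raise ValueError("at least one copy is required")
    return per_copy_failure_probability ** copies


def copies_needed(per_copy_failure_probability, acceptable_loss_probability):
    """Smallest number of copies that keeps total loss at or below the target."""
    if not 0.0 < per_copy_failure_probability < 1.0:
        raise ValueError("probability must be strictly between 0 and 1")
    if not 0.0 < acceptable_loss_probability < 1.0:
        raise ValueError("target must be strictly between 0 and 1")
    copies = 1
    while probability_all_copies_lost(per_copy_failure_probability, copies) > acceptable_loss_probability:
        copies += 1
    return copies
```

Checking more often shortens the interval and so lowers the per-copy failure probability that goes into the calculation; what counts as an acceptable loss remains the open question raised in the talk.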

Presentation without a title Andy Rauber

- evaluation vs. testing vs. benchmarking
- DP: evaluation rather than testing, far from benchmarking (a few tests, but nowhere near a definition of benchmarks)
o existing evaluations are not repeatable
o focus on the simple things
o building the frameworks before having clear test scenarios
- necessary to move towards comparative benchmarking
- what is needed: a commitment that we want a culture of benchmarking and comparative evaluation; an understanding of what we want to benchmark; benchmark data + ground truth; measurement scales and measures that remain constant; knowledge bases to collect these

Organizing digital preservation on an international level Michelle Gallinger (NDIIPP)

- focus on a national DP agenda
- community-driven, action-oriented (National Digital Stewardship Alliance)
o present a distributed national digital collection for the benefit of citizens

The European Research Arena David Giaretta

- technologies: GEANT, EGEE/EGI
- EU research projects: TIMBUS, BLOG4EVER, SCAPE, ENSURE, APARSEN, ARCOMEM, WF4EVER
o SCIDIP-ES (2011-2014)
- Alliance for Permanent Access (APA) formed as a legal entity 3 years ago
o opportunities for networking
- ISO 16363: Audit and Certification of Trustworthy Digital Repositories
- ISO 16919: Requirements for Bodies Providing Audit and Certification

Observation from the MetaArchive Cooperative program Martin Halbert

- distributed DP programs differ from other programs
o replication of content, distribution of these replicated copies to distinct geographical locations, and network organization to connect these replicated copies
- MetaArchive established in 2003, funded by NDIIPP
o seeks to foster broader awareness of digital preservation issues
- IIPC members are all institutions that focus on WA
o 39 members: national and university libraries + other organizations (Internet Archive)
o ISO standard WARC format for web archives + Heritrix and NutchWAX
o growing membership (Africa & South America)

DAY 2

Keynote address
International and National Collaboration in the digital age, Gunnar Sahlin

- 2012: a new law for e-deposit
- Samsök (search together) in 2005 (upgrade, new system)
- Swepub and long-term preservation
- Consortium of the Swedish research libraries for licensing e-journals and databases (ICOLC)
- Open Access and e-publishing (all universities have their repositories for e-pubs)
- NL: aggregator for Europeana, TEL, Apres, Athena, EUscreen
o common system for the preservation of digital materials
o common search portal for materials from the Swedish National Library and Swedish National Archive

Raivo Ruusalepp

- standards: RAC, DSA, CIDOC (CRM), PAIMAS, ISA (DG), DDI
- use of information security standards for digital preservation
- information security: administrative and technical (physical = data security vs. IT = communication)
- company implemented security measurement with typical cyber crime scenarios
- survey of security:
o provision for information security in national legislation and development plans (12 of the respondents: ISO 27000 series; only 2 formal audits, the rest are looking into it; ½ of the respondents do not use standards or formal measurements to control information security)
o IT & disaster plan: 65 (data recovery from the off-site location tested: 0)
- alignment:
o better use of community standards for information security and preservation
o agreement on security requirements

Standards based approach to preservation planning, Matthew Woollard

- ISO 27001: very expensive implementation, 100 000 pounds
- Basic: Data Seal of Approval Guidelines (helps understand your business better)
- Audit and Certification of Trustworthy Digital Repositories
- Memorandum of Understanding to Create a European Framework
- ISO 16363 external, DSA or ISO 16363 self-audit

Best Practices amp Standards Bram van der Werf

- self-assessment: trust
- audits/certification: trust
- ISO 30300 (draft): Record Management

PANEL 4 Legal Alignment

Legal deposit & Web Archiving, Adrienne Muir

- legal deposit provisions: purpose, scope, deposit mechanisms, roles & responsibilities & liability, access provisions, sanctions
- implementation: definitions, scope, offline/online, freely available/pay wall, technology-neutral, incremental
- legal deposit vs. voluntary (interim/hybrid approach, model agreements and licenses, flexibility)
- other legal issues: intellectual property rights, preservation, access, unlawful material, privacy/data protection
- voluntary approaches have disadvantages but may be necessary and can be useful

Breakout Session

- standards bring along alignment by themselves, but only if you use them to the full, not halfway
- depends on community (users): enforceable or voluntary compliance (standards)
- next step for standard alignment:
o corpora as benchmarks
o export/import completeness
o educational standards
o validation tools
o accredited training courses to accredit auditors
o framework standards

DAY 3
PANEL 5: Educational Alignment

- key elements of the DCC Curation Lifecycle Model
- Framework for the Education Alignment (USA: grad programs for digital preservation, new models for grad programs, internship programs (diverse knowledge), workshops)
o sharing tools
o national programs
- related issues to digital preservation:
o nature of costs and business models
o strategies for selection & appraisal
o ground roles and responsibilities
o effectiveness and demand for services
- focus of the panel: factors influencing the actual sustainability of a digital archive
- 2 considerations: collaboration + user demand
- challenges + gaps: span national boundaries, public + private funding, education, exportation, DP certification, competition, funding gaps, policy, selection criteria, roles & responsibilities, standards
- Magazzini Digitali: e-legal deposit in Italy
- PADI: a failed project (discontinued in 2010)

Presentation without a title Neil Grindley

- how much does it cost to manage information? what institutional financial strategies are required to facilitate effective preservation? what general economic frameworks are required to enable information to persist and be accessible?
- JISC 2010: Infrastructure for Education and Research Programme
- archival storage and preservation activities constitute a very small proportion of the overall costs: 1531 access 55 outreach acquisition ingest
o approx 333 Euros for a set of 1000 records
- KRDS2 p. 83: future tool development supporting automation of ingest

Sustainable Preservation in North America ADPNet amp Friends Aaron Trehub

- solution: distributed digital preservation (in at least 3 copies vs. LOCKSS 6 copies)
- DDP + LOCKSS = PLN, open SW developed at Stanford
- MetaArchive, COPPUL (Canada), ADPNet www.adpn.org


httpwwwignaciogccomnetpreserverisksphp

M E-DEPOTU

E-

- jsou v (journals digitized masters and papers i WA)

- digitized master za 20s z pasky

-

-

-

-

- -

k

e-depot and will be available in English to everyone

e-depot

- E-depotu

-

process documentation in the e-depot for reporting, management, IT, cost analyses, etc.


V - -depotu

digitalizace v

-

-

-

xy atd) tak arconu process data store i pro management a CRM

Data model - follow OPF and what they develop

NTACE

filtrov f -

Projekty SCAPE a KEEP - ti v

EU ever

ISO standard for WA; how much WA content in the CR

COST; Recollection tool (loc.gov); SEE WEB; quality assurance in the context of WA

Marketing for WA -


archiv le ights speci

-

Popularizing WA among technicians and IT communities

collection policy digital strategy

chaosu daleko

a pak j v v KB


-

Salt Lake City, USA

Date (from-to) -

14. 5. departure Prague- - 16. 5. PREMIS workshop -+ 17.-19. 5. Archiving 2011 conference 20. 5. departure SLC- -

15. 5. departure Washington-Detroit-Salt Lake City 15. 5. workshops T1D Color in Image Capture Archiving, T2A Scanner & Camera Imaging Performance: Benchmarking, Compliance and Workflow Monitoring 17.-19. 5. Archiving 2011 conference

NK, Archiving conference, IOP-NDK

0136

N/A 6. 6. 2011

Date: 6. 6. 2011 Signature Date Signature

Date Signature

PREMIS tutorial

---------------------------------------------------------- Implementation of a High Performance Architecture for Managing and Storing Web-Harvested Collections Michael Smorul and Joseph JaJa University of Maryland (USA)

httpswikiumiacsumdeduadaptimages66bArchiving11-smorulpdf

- web archiving - WARC manager application

- -

-

-

- P) -

Using Tape for Large-Scale Digital Preservation Gary Wright FamilySearch (USA)

- (The Church of Jesus Christ of Latter-day Saints)

-

NDK - DRPS Ingest Tools Typ storage storage grid NetApp FAS3170

P information lifecycle management R optimalizace storage layer mezi Grid a rosettu

- - na -

- human error - - - chyby HW - validace integrity dat - - - - maximalizace - verifikace integrity

metadata conten

Moving On When it is Time to Re-Archive Michael Selway Quantum Corporation (USA) CHW) D

jinak to nejde u HDD

Migrace dat z Jak se na to migrac

- HDD - - roky

a mohl by s

- - - - -

co s split migration (full split migrace)

- - - - potaz

FamilySearch An End-to-End Process for Scanning Characterizing Preserving and Providing Access to Very Large Collections of Vital Records Tom Creighton FamilySearch (USA) Jonathan Tilbury Tessella plc (UK) and Mark Evans Tessella Inc (USA)

- Ve FamilySearch mikrofilmy od poloviny 19 st - - - - - s -

- DPS je jejich s - -

The Audit and Certification of FDsys

- FDsys -auditem

TRAC http://www.gpo.gov/fdsys

- - TRAC hathi UNT Portico metaarchive chronopolis v

- stupnice compliancy 1- -

How Long is Long-Term Data Storage (Focal) Barry M Lunt Brigham Young University and Douglas Hansen Wayne Rust and Mark Worthington Millenniata Inc (USA)

- - FLASH technologie 10- plny elektron

odlivu elektronu a

-

a presto se objevily chyby

- Library 1 21 - Library 2 18

- 1050 let - HDD 1-7 let -

Quality Assurance of Digital Information in Long-Term Digital Preservation University of Technology (Sweden)

- significant properties - dokumentech z -

Towards Interoperable Preservation Repositories Repository Exchange Package Use Cases and Best Practices Joseph Pawletko New York University and Priscilla Caplan Florida Center for Library Automation (USA)

- TIPR

- - - TIPR

-

- RXP mets a premis semantika soubory metadat ne v metsu ale vedle v

- - succession - - - - disaster recovery a v

jako aip - migrace SW - - diversifikace - migrace dat v -

SARKK Comprehensive Digital Archive Services for Finnish Municipalities

-Savon Tietohallinto Oy (Finland)

- firma kterou v ICT long-

- - - preserv planning funkcionalitou -

-

Magnetic Tape Technology economic advantages for preservation Gary Francis Oracle USA

- Oracle prezentace - - clipper group 2010 in search for the long-term archiving solution tape delivers significant

TCO advantage over disk - 5TB na 1 cartridge sunoracle

-----------------------------------------

Color In Digital Preservation Robert Buckley University of RochesterNewMarket Imaging Steven Puglia National Archives and Records Administration and Michael Stelmach Library of Congress (USA)

-

atd -

o barev GAMUTem

AdobeRGB Pro FotoRGB

- a reprodukci barvy

Multispectral Image Archiving of Watermarks in Historical Papers Peter Meinlschmidt Wilhelm-Klauditz-Institut Fraunhofer-Institute for Wood Research and Volker

Braunschweig (Germany)

- Reflected light - Transmitted light

-spectral imaging -

o

o o o o

Implementing a Quality Assurance Program for Monitoring Scanner Performance Michael J Horsley and John T Berezich National Archives and Records Administration (USA) DAITSS

- -

mikrofilm sken a r - NARA 2004 Guidelines

httpsdocsgooglecomviewerurl=http3A2F2Fwwwarchivesgov2Fpreservation2Ftechnical2Fguidelinespdf

- Metamorfoze - etc., see slides - Quantitative performance slides - Web-based database, SharePoint zace -

Preservation in a Digital Age Jay Verkler FamilySearch (USA)

- Tessellou pro SDB budou k 1 - - - data loss is intrinsic gt - - - och

- preservation as a service

Curation of the End-of-Term Web Archive Classification and Metrics Kathleen Murray Lauren Ko and Mark Phillips University of North Texas (USA)

- - eotcd archive http://research.library.unt.edu/eotcd/wiki/Main_Page - - for - 16TB of data - - (not WARCs)

pak to pospojo

DAITSS Grows Up Migrating to a Second Generation Preservation System (Focal) Priscilla Caplan and Carol Chou Florida Center for Library Automation (USA)

- Daitss 1 se nedal nainstalovat jinde si -) -

instructions

- v procesech

- - PSP

- refresh funkce - disseminate- do a refresh and export new AIP as DIP - withdraw remove AIP from storage retaining provenance

-

1

- - 80TB -

A Community Driven Micro Services Architecture Supporting Long Term Digital Preservation Mark Evans and Bill Steel Tessella Inc (USA) and Robert Sharpe James Carr Alan Gairey and Jonathan Tilbury Tessella plc (UK)

s NDIIPP v micro-services

spojeno do procesu- lze pak libovol

- - - -

-

workflow - -

- bit-streamu - cloudu (S3

httpawsamazoncoms3) SaaS funkcionalitu bude brzy ti tenancy

funkcionalitou policy apod

Notes from the USA

44TB SIP in the test (10MB JP2 files)

SDB community - - founded 2008 - -

System

Lisa LaPlant and Blake Edwards US Government Printing Office (USA) FDsys OAIS dig MODS - z

<part>2</part>

DMD data model definition z

httpwwwgpogovfdsyssearchpagedetailsactionst=pragueampgranuleId=amppackageId=DCPD-200900228

----------------------------------------- Preservation Starts from the Beginning Michael Wash US Department of Transportation (USA)

Autographic Kodak 1916 na druhou

following the No. 3A Autographic Kodak Special of 1916, which was the first rangefinder camera. It had a Kodak Anastigmat f/6.3 lens and a Kodamatic shutter with speeds from 1/2 to 1/200 sec, plus bulb and time mode. http://camerapedia.wikia.com/wiki/No_1A_Autographic_Kodak_Special Kodak Advantix

zastavil

Colorite A Flexible Cross-Platform Software Solution for Automatic Image QualityAnalysis Using Arbitrary Targets Henrik Johansson National Library of Sweden (Sweden)

- a - -

o Target se detekuje automa se aby tento proce

o TIFF, JPEG, JP2, PNG o o state-of-the-art feature-based image matching algorithm, ImageMagick

o Alg

t o o v budoucnu vypnout aby se proces urychlil pro BATCH

proce o

Henrika Johanssona What if the Image Quality Analysis Rates My D Dietmar Wueller Image Engineering (Germany)

- Golden thread - UTT target -

o

-lens reflex

Establishing Resolution Requirements for Digitizing Transmissive Content A Use Case Approach Michael Stelmach Library of Congress Don Williams Image Science Associates LLC and Steven Puglia National Archives and Records Administration (USA)

-

- Based on 10 SFR limiting resolution criteria how much the image information will be captured

o 1st half of the 20th century: 1200-1600 PPI o 2nd half of the 20th century: up to 2800 PPI o o

Digitise More Pay Less Optimising the Workprocess for both Heritage Institute and Imaging Provider Olaf Slijkhuis Pictura Imaginis (the Netherlands)

- ument kvalita parametry atd - -

- trolu

obrazu

N debate with FamilySearch 20. 5. 2011 -------

also on the topic of Digital Preservation

will have to provide free of charge

and synchronize

an effort on their part to cooperate with archives and libraries

CO-FINANCED BY THE EU STRUCTURAL FUNDS (EUROPEAN REGIONAL DEVELOPMENT FUND) THROUGH THE IOP


Business trip report
Project: Creation of the National Digital Library

CZ 10611000706386

Name and surname of the participant: Jan Hutař

Workplace according to the organizational structure: ODF 81

Position: head of department

Reason for the trip: attending the iPRES 2011 conference

Place (city): Singapore

Place (country): Singapore

Date (from-to): 30. 10. - 5. 11. 2011

Detailed schedule: 30.-31. 10. flight Prague > Dubai > Singapore

1. 11. start of the conference, tutorials

2. 11. - 4. 11. conference

4.-5. 11. return flight Singapore > Dubai > Prague

Fellow traveller from NK: Mgr. Marek Melichar (paid from project 0136)

Funding: IOP, Creation of the National Digital Library

Relation to the project: obtaining new information on digital preservation and on projects in other libraries, consultations with colleagues and companies

Goals of the trip: see relation to the project; use all outputs for planning and running the NDK project; use them for future handling of digital preservation at NK/NDK

Fulfilment of the trip's goals: fulfilled, see the detailed notes below and the proceedings on SPS


Further details: SUMMARY AND BENEFIT FOR THE NDK PROJECT

- a noticeable rise of emulation as an approach to long-term preservation (in previous years it was migration) > the two approaches will, it seems, complement each other
- a shift towards the preservation of complex data, databases etc.; NK does not deal with this yet
- many papers usable for NK and NDK as well (web archiving and preservation at the National Library of France, audits, emulation, info on the SDB system (Tessella) and on the RODA system, certification, see Rouchon, etc.)
- info on problems and solutions when using tools such as JHOVE, PRONOM and others in a real environment
- 2 papers on backing up optical discs, a current problem at NK too; ideally follow the procedures described in the proceedings
- a clear need for international cooperation and for adherence to standards so that such cooperation is possible
- for more detail see below

Project publicity support: N/A

Related materials

Material / Storage location

conference proceedings / SPS, folder with business trip reports

Date of report submission: 15. 11. 2011

Signature of the person submitting the report

Date Signature

Signature of the supervisor 15. 11. 2011

Posted on the intranet


Received by the International Department

Seamus Ross: digital curation and preservation
- preserving data sets: use of statistical databases and research data vs. preservation of textual information
- personal data: no longer just photos in a box (Flickr etc.)
- banks: a lot of personal data, future use by historians, must be preserved; institutions do this too
- see mckinsey.com, big data full report (PDF)
- so why do DP (slide 16): future generations expect it; for historians and scientists, so that they have some sources; a legacy of the present for the future; an information ecosystem to enable storytelling
- emphasis shifting from the preservation of textual information to the preservation of complex databases

A capability model for DP, Ch. Becker et al.
- research within the SHAMAN and SCAPE projects
- SoS: systems of systems
- 3 kinds of systems (UML)
- DPS with DP as a functional requirement
- SoS: business system, a system within a system; the data then flow into a DPS
- DPS where DP is not a functional requirement but the system does it anyway (DP-ready system): a business system with DP functionality
- but how to get DP into enterprise systems: a model for implementing DP in any system, developed within the SHAMAN project (capability-based reference architecture)
- governance, business and technical (operation) capabilities: a basis for decisions and for assessing the current state
- capability maturity model (CMM): processes for assessment and improvement, in connection with SW development

Olivier Rouchon: certification and quality at CINES
- they store theses, digitized material, multimedia, documents, scientific data sets
- data centre for the whole of France
- they have experts on formats and XML, 11 people
- 15TB of data
- certification: under national law CINES is the national centre for DP of theses; they have a department, people and money for it; procedure and preparations see below
- preparation for certification: testing of DRAMBORA, DSA, TRAC, ISO 16363 and ISO 16919
- step: 2009, DRAMBORA audit; 2 risk reviews per year, tracking how the risks are being resolved
- step: formalization of business processes, 14 processes according to ISO 9001
- management, operational and support processes (presented at iPRES 2010)
- 2009: external pre-audit, 2 people, 19 man-days, based on all available standards, carried out through documentation review and interviews
- 2010: SIAF audit, 4 months; carried out by the National Archives of France for every archive that stores public data; they audit every 3 years; the report had 800 pages


- 2010: Data Seal of Approval, part of the EU framework for audit and certification of trusted repositories (MoU between the three certification initiatives)
- 2011: within the APARSEN project they also did ISO 16363; together with DANS and UKDA they went through that audit
- first internal, January and April 2011 (60 man-days), then external, 12 experts (KB, BL, NASA etc.) in June 2011 (3 days)

MoU: DSA > ISO 16363 as the second step (internal) > ISO 16363 extended
The audit gets faster the more of them you do, i.e. if it is done regularly it is not so time-consuming.

The National Library of New Zealand passed TRAC certification in autumn 2011

Andreas Rauber: impact of preservation actions on the repository
- what happens to the repository itself
- simulation of a repository: RepoSim
- for analysis and testing of migration: what happens when files grow in size, what happens when the repository holds more types of formats, etc.
- RepoSim: a flexible simulator, irregular patterns
- so far an internal version: Hibernate, Java, MySQL
- it is possible to specify which formats the repository accepts, their description, ingest settings, hypothetical tools (mainly for migration), and rules for preservation actions (migration to which format, which version, which files, how many times; rules + filters)
- a virtual migration can be run; it produces graphs showing how long it will take etc.
- what to migrate, how, to what and over what period: it runs virtually and we see the result
- good for IT and HW planning
- good for planning different scenarios, comparison with the expected development, planning HW development and investments
- still to be added: the ability to enter deletion policies, reports etc.
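The kind of question such a simulation answers can be sketched in a few lines. This is not RepoSim (the notes describe it as an internal Java/Hibernate/MySQL tool); it is a toy model with hypothetical names and two assumed simplifications: the number of files grows by a fixed fraction per year, and the migration tool runs at a constant throughput.

```python
from dataclasses import dataclass


@dataclass
class FormatHolding:
    """Files of one format held in the (hypothetical) repository."""
    format_name: str
    file_count: int
    average_size_mb: float


def estimate_migration_hours(holdings, source_format, throughput_mb_per_s,
                             yearly_growth=0.0, years_ahead=0):
    """Estimate how long migrating every file of `source_format` would take.

    Assumptions (not from the talk): the file count grows by a fixed
    fraction per year and the migration tool has a constant throughput.
    """
    if throughput_mb_per_s <= 0:
        raise ValueError("throughput must be positive")
    growth_factor = (1.0 + yearly_growth) ** years_ahead
    total_mb = sum(
        h.file_count * growth_factor * h.average_size_mb
        for h in holdings
        if h.format_name == source_format
    )
    return total_mb / throughput_mb_per_s / 3600.0
```

Running the same estimate for several growth rates or target years gives the comparison of scenarios that the talk recommends for HW and investment planning.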

José Barateiro: Risk assessment in DP of e-science data and processes
- DP as risk management
- ISO 31000: definition of risk management
- similar to DRAMBORA
- there are many standards for risk management
- the ISO 31000 methodology broken down into individual steps
- TIMBUS project http://timbusproject.net; one of the partners is SAP (Germany)

mad talks
- open source SW for LTP: RODA is back and is being developed within the SCAPE project; new features, plans for further development and for building a user community
- 4 posters on emulation: emulation within KEEP, emulation for reading rooms in libraries, OPF ecosystem, registries
- TOTEM: a metadata standard for describing the technical environment for emulation


ANDS, Ross Wilkinson
- data centres in Australia, at least 3, for different areas of life
- ANDS has existed for almost 3 years, funded by the Australian government
- a huge amount of data will never be used or read by a human, only by automated processes and data mining
- research data must be stored and preserved, because it may no longer be possible to re-create them, so that they can be reused, so that new conclusions can be drawn from them, and so that scientists have them available
- this must be done in cooperation; it cannot be done by a single institution alone
- who deals with the storage of scientific data in the Czech Republic? Academy of Sciences? CESNET?
- similar data centres also exist in the United Kingdom

Rob Sharpe: Considerations for High Throughput Digital Preservation
Presentation by the company Tessella: their performance testing of ingest into SDB at FamilySearch

- SDB has been in development since 2002, when the first customer was the National Archives UK
- new customer: UK Parliament
- test with FamilySearch
- 20TB ingested per day, scanned materials, workflow with antivirus check, characterization (PRONOM, JHOVE) etc.
- 1 package is roughly 1GB, 20 thousand packages per day
- 2 Dell PowerEdge R710 servers, total price at most 20,000 pounds
- the limiting factor turned out to be the read speed of the disks on which the data are stored at the start of ingest; they therefore needed 130 parallel disks (50 thousand pounds)
- stored on tape, which is also slow; they therefore needed 8 parallel tape writes (30 thousand pounds)
- storage costs 100 pounds per TB
- 7.3 PB per year
- conclusion: writing and reading are slow; tools such as JHOVE and PRONOM are fast enough; storage costs also turned out to be high
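The figures in these notes can be cross-checked with simple arithmetic. The sketch below uses only the values recorded above (20 TB per day, roughly 1 GB per package, 100 pounds per TB) and assumes decimal units and an even load over 24 hours; the variable names are illustrative.

```python
TB = 10**12
GB = 10**9
MB = 10**6

daily_ingest_bytes = 20 * TB       # from the notes: 20 TB ingested per day
package_bytes = 1 * GB             # from the notes: one package is roughly 1 GB
storage_cost_per_tb_gbp = 100      # from the notes: 100 pounds per TB

# Number of packages that must pass through ingest every day.
packages_per_day = daily_ingest_bytes // package_bytes

# Sustained rate needed if the load were spread evenly over 24 hours (assumption).
sustained_mb_per_s = daily_ingest_bytes / MB / 86_400

# Volume and storage cost accumulated over one year of daily ingest.
yearly_pb = daily_ingest_bytes * 365 / 10**15
yearly_storage_cost_gbp = daily_ingest_bytes * 365 / TB * storage_cost_per_tb_gbp
```

Under these assumptions the daily volume corresponds to 20,000 packages and a sustained rate of roughly 231 MB per second, and a year of ingest adds up to 7.3 PB and about 730,000 pounds of storage.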

To ingest the data from the FamilySearch project they needed to ensure a throughput of 20TB of data per day while keeping adequate procedures for processing the data according to the requirements of OAIS and of the client. The aim of the project was to identify the bottlenecks in ingesting large volumes of data.

Processes such as generating hashes or checking them, identifying formats and extracting technical metadata usually require a fast storage system, especially with large volumes. In the FamilySearch project they want to ingest into SDB (content acquisition, content preparation, ingest, fixity check, content/metadata integrity check, characterization, i.e. identification and validation of formats and extraction of technical metadata) at most 700MB per second.

They addressed how to parallelize such a massive workflow efficiently while minimizing costs. According to their findings, parallelization makes it possible to work around performance problems of tools such as DROID and JHOVE; contrary to their expectations, the overall performance of the software was not a problem. The bigger problems are in the HW, which must be able to write fast enough.
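A parallel fixity check of the kind described here can be sketched as follows. This is a minimal illustration using only the Python standard library, not the SDB implementation; the function names, the choice of SHA-256 and the thread pool are assumptions made for the example.

```python
import hashlib
from concurrent.futures import ThreadPoolExecutor


def sha256_of(path, chunk_size=1024 * 1024):
    """Return the SHA-256 hex digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def check_fixity(expected_digests, workers=4):
    """Recompute digests in parallel and return the paths that do not match.

    expected_digests maps a file path to the digest recorded at ingest.
    """
    paths = list(expected_digests)
    with ThreadPoolExecutor(max_workers=workers) as pool:
        actual = list(pool.map(sha256_of, paths))
    return [
        path for path, digest in zip(paths, actual)
        if digest != expected_digests[path]
    ]
```

Adding workers only helps while the storage can deliver the bytes, which is the point made in the talk: once the disks or tapes are saturated, more parallel hashing does not raise throughput.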

I.e. the bottleneck was in the HW and in moving data from place to place rather than in the performance of the digital preservation tools.

Benefit for NK


There is no need to fear the performance of SW such as DROID or JHOVE.

Ross King: Evolving domains, problems and solutions for LT DP
- info on projects such as SCAPE etc.
- programme: http://cordis.europa.eu/fp7/ict/telearn-digicult/report-research-digital-preservation_en.pdf
- Stephan Strodl, Petar Petrov and Andreas Rauber (all Vienna University of Technology, Austria): a nice Timeline for preservation projects, a whitepaper about the past of European DP
- funding spent on DP research is gradually growing; projects and funding by themselves will not solve anything
- ARCOMEM: archiving of web archives, socially driven web preservation model
- social web analysis
- archive enrichment
- ENSURE: evaluation between cost and value, automation of the preservation cycle; testbeds: healthcare, clinical trials, financial services
- SCAPE
- preservation planning and action workflows: how to make them scalable
- creating an infrastructure for scalable preservation actions
- development of a policy-based preservation planning tool with automatic preservation watch
- 3 testbeds: WA, large-scale repositories, research data sets
- all the projects will produce prototype SW
- digital lifecycle approach
- preservation planning plays a role in all these projects, together with virtualization
- slide with trends in DP over recent years
- Research on Digital Preservation within projects co-funded by the European Union in the ICT
- Ensure
- Scape
- Wf4Ever http://www.wf4ever-project.org/about
- Timbus: SW alone is not enough; focuses on the context of organizations; LTP is not only about objects but also about services etc.
- Totem

Benefit for NK

Follow projects in the area of long-term preservation of digital data. The latest EU projects such as SCAPE will speed up the development of concrete tools for the long-term preservation of digital data.

Record keeping in temporary command settings, Erik Borglund
- preservation of documentation of crisis situations arising from the work of the police etc.
- how to capture the context: flipcharts, videos and written records can be kept; the context of analogue documents is not the problem, the problem lies with digital items and conversations
- the national archive should take care of this, but it only accepts paper documents or, for example, photos from the scene
- the question is whether archiving the file material is the same as archiving the course of the proceedings in digital form


webarchiving session

BnF
- 200TB of web-archived data
- 15 million ARC files
- they have to characterize and validate them, which is time-consuming
- they store them in a shared repository
- SPAR (the LTP system of the National Library of France) has a capacity of 16PB
- they use JHOVE2 for characterization and are building a module for ARC files
- they do not want to characterize and validate the content of the ARC files, only identify formats

- PREMIS in METS would be too long, so they will record only metadata at the level of the information package (AIP) that are the same for the whole package; that is, a property is expressed once and then only information about which files match it is added, instead of repeating the information for every file
- they created a special metadata format
- i.e. they are able to ask the LTP system "give me all information packages that contain format XY" etc.; there is no need to index the metadata of the contents, which would take a long time; they take the same approach for digitized books
- different DP policies and validation levels for different types of WA data: complete harvests vs. thematic harvests
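The package-level approach can be illustrated with a small data-structure sketch. It does not reproduce the BnF metadata format, which the notes do not describe; the function names and the dictionary layout are hypothetical. Each format is recorded once per package together with the files that match it, and the question "which packages contain format XY" is answered from these summaries alone.

```python
from collections import defaultdict


def summarize_package(files):
    """Build a package-level summary: each format once, with its matching files.

    files is an iterable of (file_name, format_name) pairs.
    """
    by_format = defaultdict(list)
    for file_name, format_name in files:
        by_format[format_name].append(file_name)
    return {fmt: sorted(names) for fmt, names in by_format.items()}


def packages_containing(package_summaries, format_name):
    """Return ids of packages whose summary mentions the given format."""
    return sorted(
        package_id
        for package_id, summary in package_summaries.items()
        if format_name in summary
    )
```

The design choice is the one recorded in the notes: the summary grows with the number of distinct formats in a package rather than with the number of files, so the per-file metadata never has to be indexed to answer format queries.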

NL NZ
- 2 harvests, 20 TB altogether
- they are dealing with metadata: how much metadata is a lot and how much is too little
- library policy says that as much metadata as possible must be stored, but that would be a problem in terms of metadata size
- for selective web harvesting they have a finished workflow (WCT); everything is catalogued

IA
- 16 billion URLs, the oldest from 1996
- growth is 3 TB per day, 1 PB per year
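The two growth figures can be cross-checked with a short calculation. This is a sketch that assumes decimal units (1 PB = 1000 TB) and a 365-day year; neither assumption comes from the talk.

```python
TB_PER_PB = 1000      # assumption: decimal units
DAYS_PER_YEAR = 365   # assumption: non-leap year


def yearly_growth_pb(tb_per_day: float) -> float:
    """Convert a daily growth rate in TB into a yearly growth in PB."""
    return tb_per_day * DAYS_PER_YEAR / TB_PER_PB


# 3 TB per day works out to roughly 1.1 PB per year,
# consistent with the rounded "1 PB per year" in the notes.
growth = yearly_growth_pb(3)
```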

Euan Cochrane, Dirk von Suchodoletz: Replicating Installed Application and Information Environments onto Emulated or Virtualized Hardware

- capturing and preserving a complete environment on emulated HW
- e.g. take the desktop environment of the prime minister and store it in the archive
- display problems
- computer forensics
- an option for preserving scientific data and records
- the whole thing is about how to replicate an HDD and run the environment stored on it in a virtual environment
- solution: they pulled the HDDs out of several old PCs > identify the HW requirements (HDD analysis > the requirements are estimated automatically, since they are part of every PC environment) > select emulation/virtualisation SW (tool registry such as TOTEM from the KEEP project) > convert the HDD into a disk image suitable for emulation > try to boot the disk image on the emulated HW > add drivers
- problems with licences, personal data protection and authenticity (20% of things change, e.g. colours)
- QEMU, SPARC processor emulator


Klaus Rechert: Remote Emulation for Migration Services in a Distributed Preservation Framework

Use of emulation as a migration tool.

Migration tools for certain formats are often not available. We put the digital object into an emulated environment (a virtual machine), where it is visible inside the emulated system; we can open it in the original or another suitable application, save it in a different format and store it again in the virtual machine.

Bram Lohman: Emulation as a Business Solution: the Emulation Framework (KEEP project)

- emulation framework: 7 emulators, 6 platforms (x86, Amiga and others), 23 file formats
- a solution for managing emulation tools and setting up emulation processes
- an environment that contains the emulators; if we load an application or a file into it, it should run as in its original environment
- the environment also contains a tool that, based on PRONOM, shows what format a file is and what environment is needed to run it; the environment can then be prepared directly and the file run in it
- it loads the SW image from the OPF application database, which is being built

Geoffrey Brown: Developing Virtual CD-ROM Collections: The Voyager Company Publications

- publications of one particular publisher on CD, interactive applications for the Mac; of the 200 titles published only about 50 are available today
- emulation on today's systems
- HDD snapshot directly in the emulator, i.e. it takes one click and is very fast
- SheepShaver emulator

Evaluation of the Danish large migration project

- Before 1998 there were no formats prescribed by law.
- Between 2005 and 2008 they introduced standards.
- The evaluation covers the prescribed standards and the migration to them at the national archive. It was carried out for the body that funded the work.
- Between 2005 and 2008 they spent 30 person-years on migration with a team of 10-15 people; 190 thousand USD was invested, total costs were 26 million USD.

In reality they did not migrate much data, about 1777 GB.

Different parts of the archive: tapes, population data, data on CD-R, registries and electronically filled-in data. They could not read all the files, especially those on tapes (5 different tape types). Some had to be rescued at great expense.


Total cost of producing the preservation standard: 10 man-years, 12 thousand USD, including the manual and implementation recommendations.

Pilot: project planning and management, and verification of the information packages.

The aim of the pilot was to obtain a better budget and project plan.

Some problematic data in old formats, such as old databases, needs smart people who do repetitive, long-running work; they needed good knowledge management to make this efficient.

Migration method: they wrote requirements for a tool and a description of how manual migration should be done.

Data preparation (restructuring the data and registering the metadata of the IPs) and preparation of the documentation of the migrated IPs.

Software development: in-house.

They needed 50 person-years per 1

Conclusions

Migration of standard data is cheaper, migration from some standard tapes is cheaper, etc.

The majority, 80% of the costs, went on non-standardised data, in building the migration software. Developing a tool for migrating heterogeneous or non-standard data is the most expensive part.

What they learned: the old data had not been analysed sufficiently.

Project management was loose and they lost money.

Knowledge management: a good description of old data and of all its types, generations, locations etc. does not exist at our institution and this will cause us difficulties; migration of old data in NK will be problematic.

Angela Dappert: robust migration workflow for offline media

- What is an archival object (nice slide): a CD is not an archival object, for them it is a hand-held carrier. A bit-stable object is better: it can have a backup etc., and it leads to the archival object, which has further metadata (logical preservation).
- A CD is not searchable, cannot be replicated easily and has a large manual overhead; rendering technology becomes obsolete very quickly.
- Endangered Archives project: optical disks, CD-R, external HDs, tapes, 67 terabytes in total.
- The offline hand-held carriers in the Endangered Archives project were very varied; they contained data with DRM, under copyright, and with a number of similar problems.
- Options they chose between for each data source:
  o a disk image: a single file that contains everything on the carrier
  o or extraction of only some of the files
- How important is the carrier itself? We need some information about it; it may hold traces of deleted data, which we may want to keep. They made disk images of everything possible: hybrid DVDs, audio discs that also held data, etc.
- Which disk image should they use? Not a single disk image format for all data, but a special disk image format for each kind.
- They did it with robots (disk copying robots); sometimes large-scale disk copying robots could not be used, because they are good at producing CDs but not at ripping data from CDs.


- They built their own application with disk stacks and some smaller robots. The robots used LIFO or FIFO; in the end they used FIFO, because LIFO had problems with the CD lifter.

At the KB they thought through a fairly complex workflow, how to describe it, etc.

Each robot had its own PC.

They had problems with a number of things (see the presentation).

They did not find good SW for managing the images, only command-line tools, and non-technical staff would make a lot of mistakes with those.

It takes a lot of people before the content gets online.

The staff must be well trained and flexible, but also able to do tedious jobs: systematic, patient.

NOTE: important for NK, where the transfer of data from disks will also have to be dealt with, and already has been.


KEEP project, Antonio Ciuffreda: towards integrated migration environment

- Disk transfer tool: converts a disk into an image file. It also holds additional metadata about the file system, the MD5 of the file, etc.
- KEEP is building a Transfer Tool Framework.
- Magnetic media: disk transfer tools for floppy disks, commercial and open source.
- Disk2FDI: commercial DOS tool; produces a very accurate floppy disk image, but takes 1 hour and the result is very large, ten times larger than the floppy disk itself. They tested about 2260 disks and tested emulation.
- Catweasel: commercial tool, a PCI card; runs on Linux and Win XP, has a GUI. High error rate but faster; the quality of the image file was low.
- NibTools: free tool; G64 and D64; covers only the C64; DOS, Win, Linux, but command line only. Needs a Commodore disk drive and special cables. They tested a few disks; about half of them then did not work in the emulator.
- Optical media: they used 5 transfer tools, all on the same CDs, DVDs and games:
  1 Alcohol 120%: commercial, can bypass DRM etc., supports Windows systems. 12 of 13 worked; 2 MB per second.
  2 DAEMON Tools: commercial, several image file types (ISO, MDS, MDF), supports Windows. Three of 13 did not work.
  3 CloneCD: commercial, uses IMG or ISO, bypasses SafeDisc 3 protection, supports Windows, has a GUI. 11 of 13 were OK.
  4 BlindWrite: commercial, supports DVD, Blu-ray, WM, Xbox and other special discs. Generates ISO and some proprietary ISO image formats. One did not work; download speed
  5 ImgBurn: does not read subchannel information into the image file (a film cannot be seeked, etc.); it is open source; generates DVD, BIN, CUE, IMG; Windows and Linux. 4 did not work; it is fast.
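Recording an MD5 checksum next to a disk image, as the disk transfer tool is described as doing, might look like the sketch below. It uses only Python's standard `hashlib`. The `image_record` helper and its field names are hypothetical and do not reflect the actual KEEP metadata layout.

```python
import hashlib


def file_md5(path: str, chunk_size: int = 1024 * 1024) -> str:
    """Return the MD5 hex digest of a file, reading it in chunks to bound memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def image_record(path: str, filesystem: str) -> dict:
    """Build a small metadata record for a disk image (hypothetical field names)."""
    return {"image": path, "filesystem": filesystem, "md5": file_md5(path)}
```

Recomputing the digest later and comparing it with the stored value shows whether the image file has changed since the transfer.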

Conclusions

For magnetic media there is no difference in performance between commercial and non-commercial tools; Disk2FDI is accurate but very slow. KEEP will use NibTools.

Optical media: think about copy protection. The tools perform similarly; there will always be errors in the images, between 30 and 10 percent. BlindWrite can handle game discs (Xbox etc.). KEEP will use ImgBurn because it is open source.

For complex images BlindWrite is better.
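The "between 30 and 10 percent" range can be compared against the per-tool counts in the notes. The sketch below assumes that every tool was run on the same 13 discs; the notes state the total explicitly for only some of the tools.

```python
TOTAL_DISCS = 13  # assumption: the same 13 discs for every tool

# Number of discs that failed per tool, as recorded in the notes.
failed = {
    "Alcohol 120%": 1,
    "DAEMON Tools": 3,
    "CloneCD": 2,
    "BlindWrite": 1,
    "ImgBurn": 4,
}


def failure_rate_percent(failed_count: int, total: int = TOTAL_DISCS) -> float:
    """Share of discs that could not be imaged, in percent, rounded to one decimal."""
    return round(100 * failed_count / total, 1)


rates = {tool: failure_rate_percent(count) for tool, count in failed.items()}
```

Under that assumption the per-tool failure rates run from about 8% to about 31%, which is broadly in line with the range quoted in the conclusions.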

Benefit for NK

Consider whether NK should really run a project to migrate the content of CDs and DVDs to online media. The speakers present concrete experience with robotic processing and show what problems they had with designing the workflow, choosing the type of ISO image, etc.

BnF web archiving

They have three layers:

Harvest definition: collection

Harvest instance: crawling metadata


ARC files

A collection is, for example, the selective "elections 2000" harvest; below it are the individual harvest instances and then the ARC files. They collect the logs, config and report and store these in ARC as well: a special metadata ARC for each crawling instance.

PREMIS

Object, agent, event

Objects: 1 ARC files and metadata ARCs, 2 harvest instances

Harvest event in PREMIS: event "creation of content files"

Events: reports as extensions of the event (host report and harvest report)

Agents: administrator, SW, institutions, organisations that perform the harvest
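The object, agent and event mapping above can be pictured with a small data model. This is an illustrative sketch only: the class names, field names and the `harvest_event` helper are hypothetical and do not reproduce the PREMIS schema or the BnF implementation.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class Agent:
    name: str
    kind: str  # e.g. "administrator", "software", "institution"


@dataclass
class PremisObject:
    identifier: str
    kind: str  # "arc_file", "metadata_arc" or "harvest_instance"


@dataclass
class Event:
    event_type: str
    objects: List[PremisObject]
    agents: List[Agent]
    # Reports attached as extensions of the event; values are left empty here.
    reports: Dict[str, Optional[str]] = field(default_factory=dict)


def harvest_event(instance_id: str, arc_ids: List[str], agents: List[Agent]) -> Event:
    """Describe one harvest as a 'creation of content files' event."""
    objects = [PremisObject(instance_id, "harvest_instance")]
    objects += [PremisObject(arc_id, "arc_file") for arc_id in arc_ids]
    return Event(
        "creation of content files",
        objects,
        agents,
        reports={"host_report": None, "harvest_report": None},
    )
```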

ContainerMD

http://bibnum.bnf.fr/containerMD-v1/documentation/containerMD-v1.html

Special metadata for material from the web archive

http://bibnum.bnf.fr/containerMD-v1

Different SLAs for different types of material and for data from different harvests. Shared repository: they expect various benefits, e.g. skills for different formats do not need to be duplicated within the institution.

Next year there should also be a JHOVE2 module for WARC.


Memento: a meta search engine

Benefit for NK

Their web archiving model could be used in NK.

Cost models: Danish national library + TU Wien

- Stephan Strodl, TU Wien: they have their own cost model, but only small scale; automated preservation action cost
- Danish national library: built their own model, which should be universal and usable anywhere
- They measured the cost of submission according to the PAIMAS standard
- When calculating costs they use OAIS and PAIMAS: they map activities onto these models and estimate process costs accordingly
- costmodelfordigitalpreservation.dk

Benefit for NK

For project 0136: the options for estimating the costs of long-term storage were addressed there.

Meet RODA, a Full-Fledged Digital Repository for Long-Term Preservation

- Originally a project of the Portuguese national archive; we have been following it for several years. RODA is now supported by an independent company, which also partly continues its development. So far RODA supports only the archival metadata format (EAD), but further development should include library formats as well.
- RODA is now part of the SCAPE project, where it will be possible to develop the system further and scale it for use in mass production.
- http://redmine.keep.pt/projects/roda-public

Benefit for NK

Follow further development, possibly also for the INCAD + KNAV projects; for LTP development at smaller institutions this could be an interesting alternative in the future.

Report from a business trip abroad

Name and surname of the participant: Mgr. Andrea Fojtů (AF)

Workplace according to the organisational structure: 1.5 Department of Long-Term Preservation of Digital Data (ODODD)
Assigned workplace: 1.5.2 Department of Digital Repository Content Management
Reason for the trip: Participation in the conference Aligning National Approaches to Digital Preservation
Place (city): Tallinn
Place (country): Estonia
Date (from-to): 22-26 May 2011

Detailed schedule:

Monday 23 May: conference registration, guided tour of the National Library
Keynote Address by Laura Campbell, Library of Congress, USA
Panel 1: Technical Alignment
Panel 2: Organizational Alignment

Tuesday 24 May: Keynote Address by Gunnar Sahlin, National Library of Sweden
Panel 3: Standards Alignment
Panel 4: Legal Alignment
Breakout Sessions for panels 3 & 4

Wednesday 25 May:
Panel 5: Education Alignment
Panel 6: Economic Alignment
Breakout Sessions for panels 5 & 6
Synthesis
Closing remarks

Fellow travellers from NK: PhDr. Bohdana Stoklasová (BS), Ing. Tomáš Svoboda (TS)

Funding: IOP NDK
Aims of the trip: attendance at a conference with international participation; making contacts in the field of long-term preservation of digital documents and electronic legal deposit; a more detailed insight into the issues of long-term preservation of digital documents, especially in national libraries

Fulfilment of the aims (specifically): The conclusions of the conference Aligning National Approaches to Digital Preservation largely repeat the conclusions of the workshop The Future of the Past: Shaping new visions for EU research in digital preservation (report available at http://cordis.europa.eu/fp7/ict/telearn-digicult/future-of-the-past_en.pdf), for example regarding the missing economic model for the commercial sector, which would treat long-term preservation as an inseparable part of all its processes. Contact was established with staff of the national libraries of Estonia and Finland (staff working on web archiving and on long-term preservation in general).

Programme and further details: The main aim of the conference was to align national approaches to long-term preservation of digital documents across all areas, from technical, organisational and educational to standardisation, economic and financial.

Materials brought back: conference programme, leaflets of exhibiting companies (Tessella, Equella, Guardtime) + other materials, notes

Date of submission of the report: 8 June 2011
Signature of the submitter

Signature of the superior

Posted on the Intranet
Received by the international department

Annex to this report: Conference notes in English

Annex: Conference notes in English

Exploring What We Can Do Together: Strategic Alliance for International Collaboration (Laura Campbell)

- 185 digital preservation partners in more than 25 countries (education, research, LAM)
- strategic goals: National Content Stewardship Network (national digital collection, technical architecture, public policy, outreach)
- NDIIPP Content Domain Map: a mind map of geospatial, audiovisual, image & text and web content
- then & now: cognitive surplus vs digital libraries / digital preservation
- solution framework: actively working together, special interest groups, establishing a common index, international digital collection (freely available)

PANEL 1: Technical Alignment (The role of testing), Prof. Dr. Michael Seadle (Panel Head)

to collaborate on requiring and implementing rigorous and independent tests

DNB Contribution to the Tallinn Alignment (Sabine Schrimpf)

- key theme is infrastructure: the network of hardware and software that permits the operation of application SW
- the question of interoperability is crucial (standards, technical specifications)
- SW elements
- components of the DP infrastructure were compared to pallets on railroads
  o source: PARSE.Insight Roadmap 2020
- persistent ID resolvers, certification process
- kopal (ingest, koLibRI)
- nestor (German Network of Expertise)
- DP4Lib: Digital Preservation as a Service, reduce dependency between components
  o redundant storage at different locations
  o koLibRI modules
- LuKII: set up as an economical LOCKSS network in Germany
- SHAMAN
- APARSEN wants to bring coherence and cohesion to digital preservation research
  o trends in DP research projects
  o modular DP systems
  o distributed as SOA
  o elimination of technology dependencies

EDINA: The UK LOCKSS Alliance (Adam Rusbridge)

- EDINA offers underlying technical support & coordination
- threats to digital stewardship
  o failure (media, HW, SW, network, format obsolescence, natural disaster, economic/organizational failure)
  o attack (insider/outsider)
  o operator error
- source: Requirements for Digital Preservation
- projects PEPRS and PECAN help identify coverage and requirements for DP

Public testing (Michael Seadle)

- traditional physical archiving relies heavily on trusted institutions
- distrust, not trust, needs to be the basis of digital archiving; testing therefore plays a key role
- goals of testing: demonstrate functionality, reveal weaknesses, provide data for planning improvements
- key issues for testing: integrity, authenticity (can the origin or genuineness be shown?), usability (can migration/emulation be demonstrated?), access, financial integrity
- DrWho (drwho1com)
- bit stream testing is the most important; authenticity and usability may be impaired
  o the type of storage media, the number of copies, and the frequency of checking and replacement get us to the relevant results
  o no reliable metrics exist, however (what is an acceptable loss, etc.)
- without well documented, peer reviewed, publicly available test results, librarians are buying archiving systems on faith
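As a rough illustration of bit stream testing across several copies, the sketch below compares SHA-256 digests and reports which storage locations disagree with the majority. The majority rule and the `audit_copies` helper are assumptions made for this example; they are not a method described in the talk.

```python
import hashlib
from collections import Counter
from typing import Dict, List, Tuple


def sha256_hex(data: bytes) -> str:
    """Return the SHA-256 hex digest of a byte string."""
    return hashlib.sha256(data).hexdigest()


def audit_copies(copies: Dict[str, bytes]) -> Tuple[str, List[str]]:
    """Compare copies of one object held at different locations.

    Returns the majority digest and the sorted list of locations whose copy
    differs from it. Assumes a clear majority exists among the copies.
    """
    digests = {location: sha256_hex(data) for location, data in copies.items()}
    majority_digest, _ = Counter(digests.values()).most_common(1)[0]
    damaged = sorted(loc for loc, d in digests.items() if d != majority_digest)
    return majority_digest, damaged
```

How often such an audit runs and how many copies it covers are exactly the parameters for which, according to the notes, no reliable metrics exist yet.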

Presentation without a title (Andy Rauber)

- evaluation vs testing vs benchmarking
- DP today: evaluation rather than testing, and far from benchmarking (a few tests, but nowhere near a definition of benchmarks)
  o existing evaluations are not repeatable
  o focus on the simple things
  o building the frameworks before having clear test scenarios
- necessary to move towards comparative benchmarking
- what is needed: a commitment that we want a culture of benchmarking and comparative evaluation; an understanding of what we want to benchmark; benchmark data + ground truth; measurement scales and measures that remain constant; knowledge bases to collect these

Organizing digital preservation on an international level (Michelle Gallinger, NDIIPP)

- focus on a national DP agenda
- community driven, action oriented (National Digital Stewardship Alliance)
  o present a distributed national digital collection for the benefit of citizens

The European Research Arena (David Giaretta)

- technologies: GEANT, EGEE/EGI
- EU research projects: TIMBUS, BLOG4EVER, SCAPE, ENSURE, APARSEN, ARCOMEM, WF4EVER
  o SCIDIP-ES (2011-2014)
- Alliance for Permanent Access (APA) formed as a legal entity 3 years ago
  o opportunities for networking
- ISO 16363: Audit and Certification of Trustworthy Digital Repositories
- ISO 16919: Requirements for Bodies Providing Audit and Certification

Observation from the MetaArchive Cooperative program (Martin Halbert)

- distributed DP programs differ from other programs
  o replication of content, distribution of the replicated copies to distinct geographical locations, and a network organization to connect the replicated copies
- MetaArchive established in 2003, funded by NDIIPP
  o seeks to foster broader awareness of digital preservation issues
- IIPC members are all institutions that focus on WA
  o 39 members: national and university libraries + other organizations (Internet Archive)
  o ISO standard WARC format for web archives + Heritrix and NutchWAX
  o growing membership (Africa & South America)

DAY 2

Keynote address: International and National Collaboration in the digital age (Gunnar Sahlin)

- 2012: a new law for e-deposit
- Samsök (search together) in 2005 (upgrade, new system)
- Swepub and long term preservation
- Consortium of the Swedish research libraries for licensing e-journals and databases (ICOLC)
- Open Access and e-publishing (all universities have their repositories for e-pubs)
- NL aggregator for Europeana, TEL, Apres, Athena, EUscreen
  o common system for the preservation of digital materials
  o common search portal for materials from the Swedish National Library and Swedish National Archive

Raivo Ruusalepp

- standards: RAC, DSA, CIDOC (CRM), PAIMAS, ISA (DG), DDI
- use of information security standards for digital preservation
- information security: administrative and technical (physical = data security vs IT = communication)
- company implemented security measurement with typical cyber crime scenarios
- survey of security
  o provision for information security in national legislation and development plan (1/2 of the respondents: ISO 27000 series; only 2 formal audits, the rest are looking into it; 1/2 of the respondents do not use standards or formal measurements to control information security)
  o IT & disaster plan: 65% (data recovery from the off site location tested: 0%)
- alignment
  o better use of community standards for information security and preservation
  o agreement on security requirements

Standards based approach to preservation planning (Matthew Woollard)

- ISO 27001: very expensive implementation, 100 000 pounds
- Basic: Data Seal of Approval Guidelines (helps understand your business better)
- Audit and Certification of Trustworthy Digital Repositories
- Memorandum of Understanding to Create a European Framework
- ISO 16363 external; DSA or ISO 16363 self audit

Best Practices & Standards (Bram van der Werf)

- self assessment: trust; audits/certification: trust
- ISO 30300 (draft): Record Management

PANEL 4: Legal Alignment

Legal deposit & Web Archiving (Adrienne Muir)

- legal deposit provisions: purpose, scope, deposit mechanisms, roles & responsibilities & liability, access provisions, sanctions
- implementation: definitions, scope (offline/online, freely available/pay wall), technology neutral, incremental
- legal deposit vs voluntary (interim/hybrid approach, model agreements and licenses, flexibility)
- other legal issues: intellectual property rights (preservation, access), unlawful material, privacy/data protection
- voluntary approaches have disadvantages but may be necessary and can be useful

Breakout Session

- standards bring alignment by themselves, but only if you use them to the full, not halfway
- depends on the community (users): enforceable or voluntary compliance (standards)
- next steps for standards alignment
  o corpora as benchmarks
  o export/import completeness
  o educational standards
  o validation tools
  o accredited training courses to accredit auditors
  o framework standards

DAY 3

PANEL 5: Educational Alignment

- key elements of the DCC Curation Lifecycle Model
- Framework for the Education Alignment (USA: grad programs for digital preservation, new models for grad programs, internship programs (diverse knowledge), workshops)
  o sharing tools
  o national programs
- related issues to digital preservation
  o nature of costs and business models
  o strategies for selection & appraisal
  o ground roles and responsibilities
  o effectiveness and demand for services
- focus of the panel: factors influencing the actual sustainability of a digital archive
- 2 considerations: collaboration + user demand
- challenges + gaps: span national boundaries, public + private funding, education, exportation, DP certification, competition, funding gaps, policy, selection criteria, roles & responsibilities, standards
- Magazzini Digitali: e-legal deposit in Italy
- PADI: a failed project (discontinued in 2010)

Presentation without a title (Neil Grindley)

- how much does it cost to manage information? what institutional financial strategies are required to facilitate effective preservation? what general economic frameworks are required to enable information to persist and be accessible?
- JISC 2010: Infrastructure for Education and Research Programme
- archival storage and preservation activities constitute a very small proportion of the overall costs: 15%; access 31%; outreach, acquisition and ingest 55%
  o approx 333 Euros for a set of 1000 records
- KRDS2, p. 83: future tool development supporting automation of ingest

Sustainable Preservation in North America: ADPNet & Friends (Aaron Trehub)

- solution: distributed digital preservation (in at least 3 copies vs LOCKSS 6 copies)
- DDP + LOCKSS = PLN; open SW developed at Stanford
- MetaArchive, COPPUL (Canada), ADPNet (www.adpn.org)


V - -depotu

digitalizace v

-

-

-

xy atd) tak arconu process data store i pro management a CRM

Data model - follow OPF and what they develop

NTACE

filtrov f -

Projekty SCAPE a KEEP - ti v

EU ever

ISO standard for WA; how much WA content in the Czech Republic

COST; Recollection tool (loc.gov); SEE WEB; quality assurance in the context of WA

Marketing for WA -


archiv le ights speci

-

Popularising WA among technicians and IT communities

collection policy digital strategy

chaosu daleko

a pak j v v KB


-

Salt Lake City USA

Date (from-to) -

14.5. departure from Prague; 16.5. PREMIS workshop; 17.-19.5. Archiving 2011 conference; 20.5. departure from SLC

15.5. departure Washington-Detroit-Salt Lake City; 15.5. workshops T1D Color in Image Capture & Archiving and T2A Scanner & Camera Imaging Performance: Benchmarking, Compliance and Workflow Monitoring; 17.-19.5. Archiving 2011 conference

NK konference Archiving IOP-NDK

0136

NA 6.6.2011

Date 6.6.2011 Signature Date Signature

Date Signature

tutorial PREMIS

---------------------------------------------------------- Implementation of a High Performance Architecture for Managing and Storing Web-Harvested Collections Michael Smorul and Joseph JaJa University of Maryland (USA)

https://wiki.umiacs.umd.edu/adapt/images/6/6b/Archiving11-smorul.pdf

- web archiving - WARC manager application

- -

-

-

- P) -

Using Tape for Large-Scale Digital Preservation Gary Wright FamilySearch (USA)

- (The Church of Jesus Christ of Latter-day Saints)

-

NDK - DRPS Ingest Tools Typ storage storage grid NetApp FAS3170

P information lifecycle management R optimalizace storage layer mezi Grid a rosettu

- - na -

- human error - - - chyby HW - validace integrity dat - - - - maximalizace - verifikace integrity

metadata conten

Moving On When it is Time to Re-Archive Michael Selway Quantum Corporation (USA) CHW) D

with HDDs there is no other way

Migrace dat z Jak se na to migrac

- HDD - - roky

a mohl by s

- - - - -

co s split migration (full split migrace)

- - - - potaz

FamilySearch An End-to-End Process for Scanning Characterizing Preserving and Providing Access to Very Large Collections of Vital Records Tom Creighton FamilySearch (USA) Jonathan Tilbury Tessella plc (UK) and Mark Evans Tessella Inc (USA)

- Ve FamilySearch mikrofilmy od poloviny 19 st - - - - - s -

- DPS je jejich s - -

The Audit and Certification of FDsys

- FDsys -auditem

TRAC http://www.gpo.gov/fdsys

- - TRAC hathi UNT Portico metaarchive chronopolis v

- stupnice compliancy 1- -

How Long is Long-Term Data Storage (Focal) Barry M Lunt Brigham Young University and Douglas Hansen Wayne Rust and Mark Worthington Millenniata Inc (USA)

- - FLASH technologie 10- plny elektron

odlivu elektronu a

-

and yet errors appeared

- Library 1 21 - Library 2 18

- 10-50 years - HDD 1-7 years -

Quality Assurance of Digital Information in Long-Term Digital Preservation University of Technology (Sweden)

- significant properties - dokumentech z -

Towards Interoperable Preservation Repositories Repository Exchange Package Use Cases and Best Practices Joseph Pawletko New York University and Priscilla Caplan Florida Center for Library Automation (USA)

- TIPR

- - - TIPR

-

- RXP: METS and PREMIS semantics; the metadata files are not in the METS but alongside it, in

- - succession - - - - disaster recovery a v

jako aip - migrace SW - - diversifikace - migrace dat v -

SARKK Comprehensive Digital Archive Services for Finnish Municipalities

-Savon Tietohallinto Oy (Finland)

- firma kterou v ICT long-

- - - preserv planning funkcionalitou -

-

Magnetic Tape Technology economic advantages for preservation Gary Francis Oracle USA

- Oracle prezentace - - clipper group 2010 in search for the long-term archiving solution tape delivers significant

TCO advantage over disk - 5TB na 1 cartridge sunoracle

-----------------------------------------

Color In Digital Preservation Robert Buckley University of RochesterNewMarket Imaging Steven Puglia National Archives and Records Administration and Michael Stelmach Library of Congress (USA)

-

atd -

o barev GAMUTem

AdobeRGB Pro FotoRGB

- a reprodukci barvy

Multispectral Image Archiving of Watermarks in Historical Papers Peter Meinlschmidt Wilhelm-Klauditz-Institut Fraunhofer-Institute for Wood Research and Volker

Braunschweig (Germany)

- Reflected light - Transmited light

-spectral imaging -

o

o o o o

Implementing a Quality Assurance Program for Monitoring Scanner Performance Michael J Horsley and John T Berezich National Archives and Records Administration (USA) DAITSS

- -

microfilm scan a r - NARA 2004 Guidelines

https://docs.google.com/viewer?url=http%3A%2F%2Fwww.archives.gov%2Fpreservation%2Ftechnical%2Fguidelines.pdf

- Metamorfoze - etc., see slides - Quantitative performance slides - Web-based database, SharePoint -

Preservation in a Digital Age Jay Verkler FamilySearch (USA)

- Tessellou pro SDB budou k 1 - - - data loss is intrinsic gt - - - och

- preservation as a service

Curation of the End-of-Term Web Archive Classification and Metrics Kathleen Murray Lauren Ko and Mark Phillips University of North Texas (USA)

- - EOTCD archive http://research.library.unt.edu/eotcd/wiki/Main_Page - - for - 16 TB of data - - (not WARCs)

pak to pospojo

DAITSS Grows Up Migrating to a Second Generation Preservation System (Focal) Priscilla Caplan and Carol Chou Florida Center for Library Automation (USA)

- DAITSS 1 could not be installed elsewhere -

instructions

- v procesech

- - PSP

- refresh funkce - disseminate- do a refresh and export new AIP as DIP - withdraw remove AIP from storage retaining provenance

-

1

- - 80TB -

A Community Driven Micro Services Architecture Supporting Long Term Digital Preservation Mark Evans and Bill Steel Tessella Inc (USA) and Robert Sharpe James Carr Alan Gairey and Jonathan Tilbury Tessella plc (UK)

s NDIIPP v micro-services

spojeno do procesu- lze pak libovol

- - - -

-

workflow - -

- bit-streamu - cloudu (S3

http://aws.amazon.com/s3) SaaS functionality will soon be multi-tenancy

functionality, policy etc.

Notes from the USA

44 TB of SIPs in the test (10 MB JP2 files)

komunita SDB - - vznik 2008 - -

System

Lisa LaPlant and Blake Edwards US Government Printing Office (USA) FDsys OAIS dig MODS - z

<part>2</part>

DMD: data model definition [...]

http://www.gpo.gov/fdsys/search/pagedetails.action?st=prague&granuleId=&packageId=DCPD-200900228

Preservation Starts from the Beginning
Michael Wash, US Department of Transportation (USA)

Autographic Kodak, 1916 [...]

"[...] following the No. 3A Autographic Kodak Special of 1916, which was the first rangefinder camera. It had a Kodak Anastigmat f/6.3 lens and a Kodamatic shutter with speeds from 1/2 to 1/200 sec, plus bulb and time mode." (http://camerapedia.wikia.com/wiki/No._1A_Autographic_Kodak_Special)

Kodak Advantix [...] stopped [...]

Colorite: A Flexible Cross-Platform Software Solution for Automatic Image Quality Analysis Using Arbitrary Targets
Henrik Johansson, National Library of Sweden (Sweden)

o The target is detected automatically [...] so that this process [...]
o TIFF, JPEG, JP2, PNG
o state-of-the-art feature-based image matching algorithm, ImageMagick
o Alg[...]
o [...] can be switched off in the future to speed up the process for batch processing
[...] of Henrik Johansson

What if the Image Quality Analysis Rates My D[...]
Dietmar Wueller, Image Engineering (Germany)

- Golden Thread
- UTT target
- [...]-lens reflex

Establishing Resolution Requirements for Digitizing Transmissive Content: A Use Case Approach
Michael Stelmach, Library of Congress; Don Williams, Image Science Associates LLC; and Steven Puglia, National Archives and Records Administration (USA)

- Based on the 10% SFR limiting-resolution criterion: how much of the image information will be captured
o first half of the 20th century: 1200-1600 PPI
o second half of the 20th century: up to 2800 PPI

Digitise More, Pay Less: Optimising the Workprocess for both Heritage Institute and Imaging Provider
Olaf Slijkhuis, Pictura Imaginis (the Netherlands)

- [...] document, quality, parameters, etc.
- [...] image [...] control

Discussion with FamilySearch, 20 May 2011

- [...] also on digital preservation issues
- [...] will have to provide free of charge
- [...] and synchronize
- an effort to cooperate with archives and libraries on their part

CO-FINANCED BY THE EU STRUCTURAL FUNDS (EUROPEAN REGIONAL DEVELOPMENT FUND) THROUGH THE IOP

Business trip report
Project: Creation of the National Digital Library

CZ.1.06/1.1.00/07.06386

Name of trip participant: Jan Hutař

Workplace (per organizational structure): ODF 81

Position: head of department

Purpose of trip: attending the iPRES 2011 conference

Place (city): Singapore

Place (country): Singapore

Dates (from-to): 30 October to 5 November 2011

Detailed schedule:
30-31 October: flight Prague > Dubai > Singapore
1 November: start of the conference, tutorials
2-4 November: conference
4-5 November: return flight Singapore > Dubai > Prague

Fellow travellers from NK: Mgr. Marek Melichar (funded from project 0136)

Funding: IOP, Creation of the National Digital Library

Relation to the project: obtaining new information on digital preservation and on projects in other libraries; consultations with colleagues and companies

Trip objectives: see relation to the project; use all outputs for planning and running the NDK project, and for future handling of digital preservation issues at NK/NDK

Fulfilment of trip objectives: fulfilled; see the detailed notes below and the proceedings on SPS


Further detailed information: SUMMARY AND BENEFIT FOR THE NDK PROJECT

- a noticeable rise of emulation as an approach to long-term preservation (in previous years it was migration) > the two approaches look set to complement each other
- a shift towards preserving complex data, databases, etc.; NK does not address this yet
- many contributions applicable to NK and NDK (web archiving and preservation at the national library of France, audits, emulation, information on the SDB system (Tessella) and on the RODA system, certification: see Rouchon, etc.)
- information on problems with, and solutions for, using tools such as JHOVE, PRONOM, etc. in a real environment
- 2 contributions on backing up optical discs, a current problem at NK too; ideally follow the procedures described in the proceedings
- a clear need for international cooperation and for adherence to standards, so that such cooperation is possible
- for more detail see below

Project publicity support: N/A

Related materials

Material: conference proceedings
Storage location: SPS, folder with business trip reports

Date report submitted: 15 November 2011

Signature of submitter:

Date, signature:

Signature of superior: 15 November 2011

Posted on intranet:


Received by the international department:

Seamus Ross: digital curation and preservation
- preserving data sets: use of statistical databases and research data vs. preservation of textual information
- personal data are no longer just photos in a box (Flickr, etc.)
- banks: a lot of personal data with future use for historians; it must be preserved, and institutions are doing so as well
- see mckinsey.com, big data full report (PDF)
- so why do DP (slide 16): future generations expect it; for historians and scientists, so that they have some sources; a legacy about the present for the future; an information ecosystem to enable storytelling
- the emphasis is shifting from preserving textual information to preserving complex databases

A capability model for DP (Ch. Becker et al.)
- research within the SHAMAN and SCAPE projects
- SoS: systems of systems
- 3 kinds of systems (UML)
- DPS, with DP as a functional requirement
- SoS: a business system, a system within a system; the data then flow into the DPS
- a DPS where DP is not a functional requirement but which does it anyway (DP-ready system): a business system with DP functionality
- but how to get DP into enterprise systems: a model for implementing DP in any system, within the SHAMAN project: a capability-based reference architecture
- governance, business and technical (operation) capability: a basis for decisions and for assessing the current state
- capability maturity model (CMM): processes for assessment and improvement, [as in] SW development

Olivier Rouchon: certification and quality at CINES
- they store theses, digitized material, multimedia, documents, scientific data sets
- data centre for the whole of France
- they have format experts (XML), 11 people
- 15 TB of data
- certification: national law; CINES is the national centre for DP of theses; they have a department, people and money for it; procedure and preparations below
- preparing for certification: testing DRAMBORA, DSA, TRAC, ISO 16363 and ISO 16919
- step: 2009 DRAMBORA audit; 2 risk reviews per year, checking how the risks are being handled
- step: formalization of business processes, 14 processes according to ISO 9001
- management, operational and support processes (presented at iPRES 2010)
- 2009: external pre-audit, 2 people, 19 man-days, based on all available standards, by means of documentation review and interviews
- 2010: SIAF audit, 4 months; carried out by the National Archives of France for every archive that stores public data; they audit every 3 years; the report had 800 pages


- 2010: Data Seal of Approval, part of the EU framework for audit and certification of trusted repositories (MoU between the three certification activities)
- 2011: within the APARSEN project they also did ISO 16363; DANS and UKDA went through the audit with them
- first internal, January and April 2011 (60 man-days), then external, 12 experts (KB, BL, NASA, etc.) in June 2011 (3 days)

MoU: DSA > ISO 16363 as the second step (internal) > ISO 16363 extended.
An audit gets faster the more of them you do, i.e. if it is done regularly it is not so time-consuming.

The National Library of New Zealand underwent TRAC certification in autumn 2011.

Andreas Rauber: impact of preservation actions on the repository
- what happens to the repository itself
- repository simulation: RepoSim
- for analysis and for testing migration: what happens when files grow, what if the repository holds more format types, etc.
- RepoSim: a flexible simulator, irregular patterns
- an internal version so far; Hibernate, Java, MySQL
- one can specify which formats it accepts, their description, ingest, settings of hypothetical tools (mainly for migration), and rules for preservation actions (migration to which format, which version, which files, how many times; rules + filters)
- option to run a virtual migration: graphs are produced that say how long it will take, etc.
- what, how, to what and over what period to migrate: it runs virtually and we see the result
- good for IT and HW planning
- good for planning various scenarios, comparing against the expected development, planning HW growth and investment
- still to be added: the option to enter deletion policies, reports, etc.
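The idea of a "virtual migration" can be illustrated with a very small sketch. This is not RepoSim (which the notes describe only as an internal Java tool); the class name, function names and the tool speed below are hypothetical and chosen purely for illustration. The sketch estimates how long a migration rule would take on current holdings and on holdings after assumed yearly growth.

```python
from dataclasses import dataclass


@dataclass
class MigrationRule:
    """Hypothetical rule: migrate every file of source_format to target_format."""
    source_format: str
    target_format: str
    seconds_per_gb: float  # assumed speed of a hypothetical migration tool


def estimate_hours(holdings_gb, rule, workers=1):
    """Estimate wall-clock hours to migrate all data in the rule's source format."""
    volume_gb = holdings_gb.get(rule.source_format, 0.0)
    return volume_gb * rule.seconds_per_gb / workers / 3600


def grow(holdings_gb, yearly_growth, years):
    """Project holdings forward, assuming the same compound growth for every format."""
    factor = (1 + yearly_growth) ** years
    return {fmt: gb * factor for fmt, gb in holdings_gb.items()}


# Illustrative numbers only, not taken from the talk.
holdings = {"TIFF": 1000.0, "PDF": 200.0}
rule = MigrationRule("TIFF", "JP2", seconds_per_gb=36.0)

print(estimate_hours(holdings, rule))             # one worker, today
print(estimate_hours(holdings, rule, workers=4))  # four parallel workers
print(estimate_hours(grow(holdings, 0.10, 2), rule))  # after two years of 10% growth
```

Comparing the three estimates is the kind of scenario planning for IT and HW that the notes attribute to the simulator.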

José Barateiro: Risk assessment in DP of e-science data and processes
- DP as risk management
- ISO 31000: definition of risk management
- similar to DRAMBORA
- there are many standards for risk management
- the ISO 31000 methodology broken down into individual steps
- TIMBUS project, http://timbusproject.net; one of the partners is SAP (Germany)

mad talks
- open-source SW for LTP: RODA is back and is being developed within the SCAPE project; new features, development plans and the creation of a user community
- 4 posters on emulation: emulation within KEEP, emulation for reading rooms in libraries, OPF ecosystem registry
- TOTEM: a metadata standard for describing the technical environment for emulation


ANDS (Ross Wilkinson)
- data centres in Australia, at least 3 for different areas of life
- ANDS has existed for almost 3 years, funded by the Australian government
- a huge amount of data will never be used or read by a human, only by automated data-mining processes
- research data must be stored and preserved because it may no longer be possible to recreate them; so that they can be reused, so that new conclusions can be drawn from them, and so that scientists have them available
- this must be done in cooperation; it cannot be done by a single institution alone
- who handles the storage of scientific data in the Czech Republic? The Academy of Sciences? CESNET?
- similar data centres also exist in Great Britain

Rob Sharpe: Considerations for High Throughput Digital Preservation
Presentation by Tessella: their performance testing of ingest into SDB at FamilySearch.

- SDB has been in development since 2002, when the first customer was the National Archives (UK)
- new customer: UK Parliament
- test with FamilySearch
- 20 TB ingest per day: scanned materials, workflow with antivirus, characterization (PRONOM, JHOVE), etc.
- 1 package is roughly 1 GB; 20,000 packages per day
- 2 Dell PowerEdge R710 servers, total price at most 20,000 pounds
- the limiting factor turned out to be the read speed of the disks on which the data sit at the start of ingest; they therefore needed 130 parallel disks (50,000 pounds)
- writing to tape is slow too; they therefore needed 8 parallel tape writes (30,000 pounds)
- storage costs 100 pounds per TB
- 7.3 PB per year
- conclusion: writing and reading are slow; tools such as JHOVE and PRONOM are fast enough; storage costs also turned out to be high

To ingest the FamilySearch data they needed to sustain a throughput of 20 TB per day while keeping adequate procedures for processing the data according to the requirements of OAIS and of the customer. The project was about identifying the bottlenecks in ingesting large volumes of data.

Processes such as generating or verifying hashes, identifying formats and extracting technical metadata usually require a large and, at high volumes, fast storage system. In the FamilySearch project they want to ingest into SDB (content acquisition, content preparation, ingest, fixity check, content/metadata integrity check, characterization, i.e. format identification and validation and extraction of technical metadata) at most 700 MB per second.

In other words: how to parallelize such a massive workflow efficiently while minimizing cost. According to their findings, parallelization makes it possible to get around performance problems of tools such as DROID and JHOVE; overall, software performance was, contrary to their expectations, not a problem. The bigger problems are in the hardware, which must be able to write fast enough.

That is, the bottleneck was in the hardware and in moving data from place to place rather than in the performance of the digital preservation tools.
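The figures quoted in these notes can be cross-checked with simple arithmetic. The sketch below is only a back-of-the-envelope calculation; it assumes decimal units (1 TB = 1,000,000 MB) and a 365-day year, and the function name is hypothetical.

```python
SECONDS_PER_DAY = 24 * 60 * 60


def ingest_figures(tb_per_day=20.0, package_gb=1.0, gbp_per_tb=100.0):
    """Derive average rates and yearly totals from the daily ingest volume.

    Assumes decimal units (1 TB = 1000 GB = 1,000,000 MB) and 365 days per year.
    """
    return {
        "avg_mb_per_second": tb_per_day * 1_000_000 / SECONDS_PER_DAY,
        "packages_per_day": tb_per_day * 1000 / package_gb,
        "pb_per_year": tb_per_day * 365 / 1000,
        "storage_gbp_per_year": tb_per_day * 365 * gbp_per_tb,
    }


for name, value in ingest_figures().items():
    print(f"{name}: {value:,.1f}")
```

Under these assumptions, 20 TB per day corresponds to 20,000 packages of 1 GB and 7.3 PB per year, which matches the notes. The average rate comes out at roughly 231 MB per second, below the 700 MB per second maximum mentioned above, and storage at 100 pounds per TB would cost about 730,000 pounds for one year of ingest.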

Benefit for NK

No need to fear the performance of SW such as DROID or JHOVE.

Ross King: Evolving domains, problems and solutions for LT DP
- information about projects such as SCAPE
- programme: http://cordis.europa.eu/fp7/ict/telearn-digicult/report-research-digital-preservation_en.pdf (Stephan Strodl, Vienna University of Technology, Austria; Petar Petrov, Vienna University of Technology, Austria; Andreas Rauber, Vienna University of Technology, Austria). A nice timeline of preservation projects; a whitepaper about the past of European DP
- funding for DP research is gradually growing; projects and funding alone solve nothing
- ARCOMEM: archiving of web archives, socially driven web preservation model
- social web analysis
- archive enrichment
- ENSURE: evaluation between cost and value, automation of the preservation cycle; testbeds: healthcare, clinical trials, financial services
- SCAPE
- preservation planning and action workflows: how to make them scalable
- building infrastructure for scalable preservation actions
- development of a policy-based preservation planning tool with automatic preservation watch
- 3 testbeds: web archives, large-scale repositories, research data sets
- all projects will produce prototype SW
- digital lifecycle approach
- preservation planning plays a role in all these projects, along with virtualization
- slide with DP trends of recent years
- Research on Digital Preservation within projects co-funded by the European Union in the ICT [...]
- Ensure
- Scape
- Wf4Ever: http://www.wf4ever-project.org/about
- Timbus: SW is not enough; it focuses on the context of organizations; LTP is not only about objects but also about services, etc.
- Totem

Benefit for NK

Follow projects in the area of long-term preservation of digital data. Recent EU projects such as SCAPE will speed up the development of concrete tools for long-term preservation of digital data.

Record keeping in temporary command settings (Erik Borglund)
- preserving documentation of crisis situations arising from the activities of the police, etc.
- how to capture context: flipcharts, videos and notes can be kept, but what about context? Analogue documents are not a problem; the problem is with digital material and conversations
- the national archive should take care of this, but it only accepts paper documents or, for example, photos from the scene
- question: is archiving the records the same as archiving the course of the proceedings in digital form?


Web archiving session

BnF
- 200 TB of web-archived data
- 15 million ARCs; they must be characterized and validated, which is time-consuming
- stored in a shared repository
- SPAR (the LTP system of the French national library) has a capacity of 16 PB
- they use JHOVE2 for characterization and are building a module for ARCs
- they do not want to characterize and validate the content of the ARCs, only identify formats
- PREMIS in METS would be too long, so they will record only metadata at the level of the information package (AIP) that are the same for the whole package; that is, a property is expressed once and then only the information about which files it applies to is added, instead of repeating the information for every file
- they created a special metadata format
- i.e. they can ask the LTP system "give me all information packages that contain format XY", etc., without having to index the metadata of the contents, which would take a long time; they take the same approach for digitized books
- different DP policies and validation levels for different types of web archive data (complete harvests vs. thematic harvests)
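The package-level approach described for BnF (state a property once, then list the files it applies to) can be sketched as a simple grouping. This is only an illustration of the idea, not BnF's metadata format or the SPAR API; the function names, package identifiers and file names are hypothetical.

```python
from collections import defaultdict


def summarize_package(file_formats):
    """Group files by format so that each format is stated once per package.

    file_formats maps a file name to its identified format.
    """
    by_format = defaultdict(list)
    for name, fmt in file_formats.items():
        by_format[fmt].append(name)
    return {fmt: sorted(names) for fmt, names in by_format.items()}


def packages_containing(package_summaries, fmt):
    """Answer 'give me all information packages that contain format XY'."""
    return sorted(pid for pid, summary in package_summaries.items() if fmt in summary)


# Hypothetical packages and files, for illustration only.
summaries = {
    "aip-001": summarize_package({"a.html": "HTML", "b.html": "HTML", "c.gif": "GIF"}),
    "aip-002": summarize_package({"d.pdf": "PDF", "e.html": "HTML"}),
}

print(summaries["aip-001"])
print(packages_containing(summaries, "GIF"))
```

The query only inspects one short summary per package, which is why no index over the metadata of every contained file is needed.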

NL NZ
- 2 harvests, 20 TB in total
- they are tackling metadata: how much metadata is a lot and how much is too little
- library policy says that as much metadata as possible must be stored, but that would be a problem in terms of metadata size
- for selective web harvesting they have a finished workflow in WCT; everything is catalogued

IA
- 16 billion URLs
- the oldest from 1996
- growth is 3 TB per day, 1 PB per year

Euan Cochrane, Dirk von Suchodoletz: Replicating Installed Application and Information Environments onto Emulated or Virtualized Hardware

- capturing and preserving an entire environment on emulated HW
- e.g. take the desktop environment of the prime minister and store it in the archive
- display problems
- computer forensics
- an option for preserving scientific data and records
- it is all about how to replicate an HDD and run the environment on it in a virtual environment
- solution: they pulled the HDDs out of several old PCs > identify the HW requirements (HDD analysis > automatic estimate of the requirements; this information is part of every PC environment) > choose the emulation/virtualization SW (tool registry, e.g. TOTEM from the KEEP project) > convert the HDD to a disk image suitable for emulation > try booting the disk image on the emulated HW > add drivers
- problems with licences, personal data protection and authenticity (20% of things change: colours, etc.)
- QEMU, SPARC processor emulator


Klaus Rechert: Remote Emulation for Migration Services in a Distributed Preservation Framework

- using emulation as a tool for migration
- often no migration tools are available for certain formats
- we insert the digital object into the emulated environment (virtual machine); we then see it inside the emulated system, can open it in the original or another suitable application, save it in a different format and store it back in the virtual machine

Bram Lohman: Emulation as a Business Solution: the Emulation Framework (KEEP project)

- emulation framework: 7 emulators, 6 platforms (x86, Amiga, etc.), 23 file formats
- a solution for managing emulation tools
- setup of emulation processes
- an environment that contains emulators; if we load an application or a file into it, it should run as in the original environment
- the environment also contains a tool that shows, based on PRONOM, what format a file is and what environment is needed to run it; the environment can be prepared right away and the file run in it; it loads a SW image from the OPF application database, which is being built

Geoffrey Brown: Developing Virtual CD-ROM Collections: The Voyager Company Publications

- publications of one specific publisher on CD, interactive applications for Mac; of 200 published, only about 50 are available today
- emulation on today's systems
- HDD snapshot directly in the emulator, i.e. it takes one click and is very fast
- SheepShaver emulator

Evaluation of the Danish large migration project

- Before 1998 they had no formats set by law
- Between 2005 and 2008 they introduced standards
- The evaluation concerns the set standards and the migration to them in the national archive
- The evaluation was done for the funder
- Between 2005 and 2008 they spent 30 person-years on migration; they had 10-15 people on it; 190 thousand USD was invested; total cost 26 million USD
- That is not much data; what they actually migrated was about 1777 GB
- Various parts of the archive: tapes, population data, data on CD-R, registries and data [...] electronically [...]
- They could not read all the files, especially on tapes
- 5 different types of tapes
- Some had to be rescued at great expense


Total cost of producing the preservation standard: 10 man-years, 12 thousand USD, including the manual and the implementation recommendations

Pilot: project planning and management, and verifying the information packages

The aim of the pilot was to obtain a better budget and project plan

Some problematic data in old formats, such as old databases, need smart people doing repetitive, long-running work; good knowledge management was needed to make this efficient

Migration method: they wrote requirements for the tool and a description of how manual migration should be done

Data preparation (restructuring data and registering IP metadata) and preparation of documentation for the migrated IPs

Software development: in-house development

They needed 50 person-years for 1 [...]

Conclusions

Migrating standard data is cheaper; migrating from some standard tapes is cheaper, etc.

Most of the cost (80%) went on non-standardized data, in building the migration software. Developing a tool for migrating heterogeneous or non-standard data is the most expensive part.

What they learned: the old data had not been analysed sufficiently

Project management was loose; they lost money

Knowledge management: a good description of old data and of all their types, generations, locations, etc. does not exist at our institution and we will have trouble with it; migrating old data at NK will be problematic

Angela Dappert: robust migration workflow for offline media
- What is an archival object (nice slide): a CD is not an archival object; for them it is a hand-held carrier. Better is a bit-stable object, which can have a backup etc., and on top of it the archival object, which has further metadata (logical preservation)
- A CD is not searchable, cannot be easily replicated, has a large manual overhead, and the rendering technology becomes obsolete very quickly
- Endangered Archives project: optical disks, CD-R, external HDs, tapes; 67 terabytes in total
- The offline hand-held carriers in the Endangered Archives project were very variable; they contained data with DRM, under copyright, and a range of such problems
- Options they chose between for each data source:
- Disk image: one file that contains everything on the carrier
- Or extraction of only some files
- How important is the carrier itself? We need some information about it; there may be traces of deleted data on it and we may want to have them. They made disk images of everything possible: hybrid DVDs, audio discs that also held data, etc.
- Which disk image should they use? Not one disk image format for all data; a specific disk image format for each
- They did it with robots, disk copying robots; some large-scale disk copying robots could not be used: they are good at producing CDs but not at ripping data from CDs


- They built their own application with disc stacks and some smaller robots; they used LIFO or FIFO and ended up with FIFO, as LIFO had problems with the CD lifter

At the KB they thought through a fairly complex workflow, how to describe it, etc.

Each robot had its own PC

They had problems with a number of things, see the presentation

They did not find good SW for managing the images, only command-line tools, with which non-technical staff would make a lot of mistakes

It takes a lot of people before the content gets online

They must be well trained and flexible, but also able to do tedious jobs: systematic, patient

NOTE: important for NK, where the transfer of data from discs will also have to be addressed, and already has been


KEEP project, Antonio Ciuffreda: towards an integrated migration environment

- Disk transfer tool: converts a disk into an image file. The image also contains further metadata about the file system, the MD5 of the file, etc.
- KEEP is building a Transfer Tool Framework
- Magnetic media: disk transfer tools for floppy disks, commercial and open source
- Disk2FDI: commercial DOS tool; produces a very accurate floppy disk image; it takes 1 hour and the result is then very large, ten times bigger than the floppy disk itself. Tested on about 2260 disks; they tested emulation
- Catweasel: commercial tool, a PCI card; runs on Linux and Win XP, has a GUI. High error rate but faster; the quality of the image file was low
- Nibtools: free tool, G64 and D64, covers only the C64; DOS, Win, Linux, but command line only. Needs a Commodore disk drive and special cables. They tested a few disks; about half of them then did not work in the emulator
- Optical media: they used 5 transfer tools, all on the same CDs, DVDs and games
1. Alcohol 120%: commercial, can get around DRM etc., supports Win systems. 12 of 13 worked; 2 MB per second
2. Daemon Tools: commercial, several image file types (ISO, MDS, MDF), supports Win; three of 13 did not work
3. CloneCD: commercial, uses IMG or ISO, gets around SafeDisc 3 protection, supports Win, has a GUI; 11 of 13 were OK
4. BlindWrite: commercial, supports DVD, Blu-ray, WM, Xbox and other special discs. Generates ISO and some proprietary ISO image formats; one did not work; download speed [...]
5. ImgBurn: does not read subchannel information into the image file (a film cannot be seeked, etc.); it is open source; generates DVD, BIN, CUE, IMG; Win and Linux; 4 did not work; it is fast

Conclusions

For magnetic media, commercial and non-commercial tools do not differ in performance; Disk2FDI is accurate but very slow. KEEP will use NibTools.

Optical media: keep copy protection in mind; the tools perform similarly; there will always be errors in the images, between 30 and 10 percent; BlindWrite handles game discs (Xbox, etc.). KEEP will use ImgBurn because it is open source.

For complex images BlindWrite is better.
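The "between 30 and 10 percent" error range can be checked against the per-tool results noted above. The sketch below assumes that every tool was tried on the same 13 test discs (the notes give the total explicitly only for some tools) and that "one did not work" for BlindWrite also refers to those 13.

```python
TEST_DISCS = 13  # assumption: all five tools were run on the same 13 discs

# Number of discs that failed per tool, as read from the notes.
failed = {
    "Alcohol 120%": 1,   # 12 of 13 worked
    "Daemon Tools": 3,   # three of 13 did not work
    "CloneCD": 2,        # 11 of 13 were OK
    "BlindWrite": 1,     # one did not work
    "ImgBurn": 4,        # 4 did not work
}


def failure_rates(failed_counts, total):
    """Return the failure rate per tool as a percentage, rounded to one decimal."""
    return {tool: round(100 * n / total, 1) for tool, n in failed_counts.items()}


rates = failure_rates(failed, TEST_DISCS)
for tool, rate in sorted(rates.items(), key=lambda item: item[1]):
    print(f"{tool}: {rate}% failed")
```

Under these assumptions the failure rates run from about 8% (one disc) to about 31% (four discs), which is consistent with the range given in the conclusions.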

Benefit for NK

Consider whether it would make sense for NK to actually run a project to migrate the content of CDs and DVDs to online media. The talks present concrete experience with robotic processing and show the problems they had with devising the workflow, choosing the type of ISO image, etc.

BnF web archiving
They have three layers:

Harvest definition: collection

Harvest instance: crawling metadata


ARC files

A collection would be, for example, the selective harvest "elections 2000", then come the individual harvest instances and then the ARCs
They collect logs, config and reports
These are also stored in an ARC, a special metadata ARC for each crawling instance

PREMIS

Object, agent, event

Objects
1. ARC files and metadata ARCs
2. harvest instances

Harvest event in PREMIS: event = creation of content files

Events: reports as extensions of the event (host report and harvest report)

Agents: administrator, SW, institutions, organizations that perform the harvest
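The three layers and the two kinds of PREMIS object described above can be pictured as a small data model. This is only an illustrative sketch under my own naming; the class and field names are hypothetical and do not come from BnF's schema or from PREMIS itself.

```python
from dataclasses import dataclass, field


@dataclass
class ArcFile:
    name: str
    is_metadata_arc: bool = False  # True for the ARC holding logs, config and reports


@dataclass
class HarvestInstance:
    identifier: str
    arcs: list = field(default_factory=list)


@dataclass
class HarvestDefinition:
    collection: str
    instances: list = field(default_factory=list)


def premis_objects(definition):
    """List (kind, identifier) pairs for the objects of one collection."""
    objects = []
    for instance in definition.instances:
        objects.append(("harvest instance", instance.identifier))
        for arc in instance.arcs:
            kind = "metadata ARC" if arc.is_metadata_arc else "ARC file"
            objects.append((kind, arc.name))
    return objects


# Hypothetical identifiers, for illustration only.
elections = HarvestDefinition(
    collection="elections 2000",
    instances=[
        HarvestInstance(
            identifier="crawl-1",
            arcs=[ArcFile("crawl-1-0001.arc"), ArcFile("crawl-1-meta.arc", True)],
        )
    ],
)

for kind, identifier in premis_objects(elections):
    print(kind, identifier)
```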

ContainerMD

http://bibnum.bnf.fr/containerMD-v1/documentation/containerMD-v1.html

special metadata for material from the web archive

http://bibnum.bnf.fr/containerMD-v1

different SLAs for different types of material and for data from different harvests; shared repository; they expect different benefits [...] for different formats; no need to duplicate within the institution

next year there should also be a JHOVE2 module for WARC


Memento: meta search engine

Benefit for NK

Their web archiving model could be used at NK

Cost models: Danish national library + TU Wien
- Stephan Strodl, TU Vienna: they have their own cost model, but apparently only for small-scale automated preservation action cost
- Danish national library: they built their own model, which is meant to be universal and usable anywhere
- They measured the cost of submission according to the PAIMAS standard
- When calculating cost they use OAIS and PAIMAS; they map activities to these models and then estimate process costs accordingly
- costmodelfordigitalpreservation.dk

Benefit for NK

Relevant to project 0136, which dealt with options for estimating the cost of long-term storage

Meet RODA, a Full-Fledged Digital Repository for Long-Term Preservation

- Originally a project of the Portuguese national archive; we have been following it for several years. RODA is now supported by an independent company, which also partly develops it further. So far RODA supports only an archival metadata format (EAD), but further development should include library formats as well
- RODA is now part of the SCAPE project, where the system can be developed further and scaled for use in mass production
- http://redmine.keep.pt/projects/roda-public

Benefit for NK

Follow further development; possibly also relevant for the INCAD + KNAV projects developing LTP; for smaller institutions this could be an interesting alternative in the future

Foreign business trip report

Name of trip participant: Mgr. Andrea Fojtů (AF)

Workplace (per organizational structure): 15 Department of Long-term Preservation of Digital Data (ODODD)
Position: 152 Digital Repository Content Management Unit
Purpose of trip: participation in the conference Aligning National Approaches to Digital Preservation
Place (city): Tallinn
Place (country): Estonia
Dates (from-to): 22-26 May 2011

Detailed schedule:
Monday 23 May: conference registration, guided tour of the National Library; Keynote Address by Laura Campbell, Library of Congress (USA); Panel 1: Technical Alignment; Panel 2: Organizational Alignment
Tuesday 24 May: Keynote Address by Gunnar Sahlin, National Library of Sweden; Panel 3: Standards Alignment; Panel 4: Legal Alignment; Breakout Sessions for panels 3 & 4
Wednesday 25 May: Panel 5: Education Alignment; Panel 6: Economic Alignment; Breakout Sessions for panels 5 & 6; Synthesis; Closing remarks

Fellow travellers from NK: PhDr. Bohdana Stoklasová (BS), Ing. Tomáš Svoboda (TS)

Funding: IOP NDK
Trip objectives: attending a conference with international participation; making contacts in the field of long-term preservation of digital documents and electronic legal deposit; gaining a more detailed insight into long-term preservation of digital documents, (mainly) in national libraries

Fulfilment of trip objectives (specifically): The conclusions of the conference Aligning National Approaches to Digital Preservation largely mirror the conclusions of the workshop The Future of the Past: Shaping new visions for EU research in digital preservation (report available at http://cordis.europa.eu/fp7/ict/telearn-digicult/future-of-the-past_en.pdf), for example regarding the missing economic model for the commercial sector, which would treat long-term preservation as an integral part of all its processes. Contact was made with staff of the national libraries of Estonia and Finland (staff working on web archiving and on long-term preservation in general).

Programme and further details: The main aim of the conference was to align national approaches to long-term preservation of digital documents across all areas, from technical, organizational and educational to standardization, economic and financial.

Materials brought back: conference programme, leaflets of exhibiting companies (Tessella, Equella, Guardtime) + other materials, notes

Date report submitted: 8 June 2011
Signature of submitter:

Signature of superior:

Posted on intranet:
Received by the international department:

Appendix to this report: Conference notes in English

Appendix: Conference notes in English

Exploring What We Can Do Together: Strategic Alliance for International Collaboration (Laura Campbell)

- 185 digital preservation partners in more than 25 countries (education, research, LAM)
- strategic goals: National Content Stewardship Network (national digital collection, technical architecture, public policy, outreach)
- NDIIPP Content Domain Map: a mind map of geospatial, audiovisual, image & text, and web content
- then & now: cognitive surplus vs digital libraries / digital preservation
- solution framework: actively working together, special interest groups, establishing a common index, international digital collection (freely available)

PANEL 1: Technical Alignment (The role of testing), Prof. Dr. Michael Seadle (Panel Head)

- to collaborate on requiring and implementing rigorous and independent tests

DNB Contribution to the Tallinn Alignment (Sabine Schrimpf)

- key theme is infrastructure: the network of hardware and software that permits the operation of application software
- the question of interoperability is crucial (standards, technical specifications)
- SW elements
- components of the DP infrastructure were compared to pallets on railroads
  o source: PARSE.Insight Roadmap 2020
- persistent ID resolvers, certification process
- kopal (ingest: koLibRI)
- nestor (German Network of Expertise)
- DP4lib: Digital Preservation as a Service, reduce dependency between components
  o redundant storage at different locations
  o koLibRI modules
- LuKII: set up as an economical LOCKSS network in Germany
- SHAMAN
- APARSEN: wants to bring coherence and cohesion to digital preservation research
  o trends in DP research projects
  o modular DP systems
  o distributed as SOA
  o elimination of technology dependencies

EDINA: The UK LOCKSS Alliance (Adam Rusbridge)

- EDINA offers underlying technical support & coordination
- threats to digital stewardship
  o failure (media, HW, SW, network, format obsolescence, natural disaster, economic/organizational failure)
  o attack (insider/outsider)
  o operator error
- source: Requirement to Digital Preservation
- projects PEPRS and PECAN help identify coverage and requirements for DP

Public testing (Michael Seadle)

- traditional physical archiving relies heavily on trusted institutions
- distrust, not trust, needs to be the basis of digital archiving; testing therefore plays a key role
- goals of testing: demonstrate functionality, reveal weaknesses, provide data for planning improvements
- key issues for testing: integrity, authenticity (can the origin or genuineness be shown?), usability (can migration/emulation be demonstrated?), access, financial integrity
- DrWho (drwho1com)
- bit stream testing is the most important; authenticity and usability may be impaired
  o the type of storage media, the number of copies + the frequency of checking and replacement get us to the relevant results
  o no reliable metrics exist, however (what is an acceptable loss, etc.)
- without well documented, peer reviewed, publicly available test results, librarians are buying archiving systems on faith
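The bit stream testing mentioned above (several copies, checked at some frequency) can be illustrated with a minimal sketch. It is not taken from the talk: the function name `audit_copies`, the use of SHA-256 and the simple majority rule are assumptions made only for this example.

```python
import hashlib
from collections import Counter


def audit_copies(copies):
    """Compare replicas of one object by checksum (hypothetical helper).

    copies maps a storage location name to the bytes held there.
    Returns (majority_digest, damaged_locations). If no digest is held by
    more than half of the copies, returns (None, all locations sorted).
    """
    digests = {loc: hashlib.sha256(data).hexdigest() for loc, data in copies.items()}
    majority, votes = Counter(digests.values()).most_common(1)[0]
    if votes <= len(copies) / 2:
        # No trustworthy majority: every copy is suspect.
        return None, sorted(digests)
    damaged = sorted(loc for loc, digest in digests.items() if digest != majority)
    return majority, damaged


if __name__ == "__main__":
    replicas = {"site_a": b"record", "site_b": b"record", "site_c": b"recorc"}
    print(audit_copies(replicas)[1])
```

With only two copies that disagree, the sketch cannot say which one is correct, which is one reason the number of copies matters for such tests.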

Presentation without a title (Andy Rauber)

- evaluation vs testing vs benchmarking
- DP testing today: evaluation rather than testing, and far from benchmarking (a few tests, but not near a definition of benchmarks)
  o existing evaluations are not repeatable
  o focus on the simple things
  o building the frameworks before having clear test scenarios
- necessary to move towards comparative benchmarking
- what is needed: a commitment that we want a culture of benchmarking and comparative evaluation; an understanding of what we want to benchmark; benchmark data + ground truth; measurement scales and measures that remain constant; knowledge bases to collect these

Organizing digital preservation on an international level (Michelle Gallinger, NDIIPP)

- focus on a national DP agenda
- community driven, action oriented (National Digital Stewardship Alliance)
  o present a distributed national digital collection for the benefit of citizens

The European Research Arena (David Giaretta)

- technologies: GEANT, EGEE/EGI
- EU research projects: TIMBUS, BLOG4EVER, SCAPE, ENSURE, APARSEN, ARCOMEM, WF4EVER
  o SCIDIP-ES (2011-2014)
- Alliance for Permanent Access (APA) formed as a legal entity 3 years ago
  o opportunities for networking
- ISO 16363: Audit and Certification of Trustworthy Digital Repositories
- ISO 16919: Requirements for Bodies Providing Audit and Certification

Observation from the MetaArchive Cooperative program (Martin Halbert)

- distributed DP programs differ from other programs
  o replication of content, distribution of these replicated copies to distinct geographical locations, and a network organization to connect these replicated copies
- MetaArchive established in 2003, funded by NDIIPP
  o seeks to foster broader awareness of digital preservation issues
- IIPC members are all institutions that focus on WA
  o 39 members: national and university libraries + other organizations (Internet Archive)
  o ISO standard WARC format for web archives + Heritrix and NutchWAX
  o growing membership (Africa & South America)

DAY 2

Keynote address: International and National Collaboration in the digital age (Gunnar Sahlin)

- 2012: a new law for e-deposit
- Samsök (search together) in 2005 (upgrade, new system)
- Swepub and long term preservation
- Consortium of the Swedish research libraries for licensing e-journals and databases (ICOLC)
- Open Access and e-publishing (all universities have their repositories for e-pubs)
- NL: aggregator for Europeana, TEL, Apres, Athena, EUscreen
  o common system for the preservation of digital materials
  o common search portal for materials from the Swedish National Library and Swedish National Archive

Raivo Ruusalepp

- standards: RAC, DSA, CIDOC (CRM), PAIMAS, ISAD(G), DDI
- use of information security standards for digital preservation
- information security: administrative and technical (physical = data security vs IT = communication)
- a company implemented security measurement with typical cyber crime scenarios
- survey of security
  o provision for information security in national legislation and development plans (12 of the respondents: ISO 27000 series; only 2 formal audits, the rest are looking into it; ½ of the respondents do not use standards or formal measurements to control information security)
  o IT & disaster plan: 65 (data recovery from the off-site location tested: 0)
- alignment
  o better use of community standards for information security and preservation
  o agreement on security requirements

Standards based approach to preservation planning (Matthew Woollard)

- ISO 27001: very expensive implementation, 100 000 pounds
- Basic: Data Seal of Approval Guidelines (helps you understand your business better)
- Audit and Certification of Trustworthy Digital Repositories
- Memorandum of Understanding to Create a European Framework
- ISO 16363 external; DSA or ISO 16363 self-audit

Best Practices & Standards (Bram van der Werf)

- self-assessment: trust; audits/certification: trust
- ISO 30300 (draft): Record Management

PANEL 4: Legal Alignment

Legal Deposit & Web Archiving (Adrienne Muir)

- legal deposit provisions: purpose, scope, deposit mechanisms, roles & responsibilities & liability, access provisions, sanctions
- implementation: definitions, scope (offline/online, freely available/pay wall), technology neutral, incremental
- legal deposit vs voluntary (interim/hybrid approach, model agreements and licenses, flexibility)
- other legal issues: intellectual property rights (preservation, access), unlawful material, privacy/data protection
- voluntary approaches have disadvantages but may be necessary and can be useful

Breakout Session

- standards bring along alignment by themselves, but only if you use them to the full, not halfway
- depends on the community (users): enforceable or voluntary compliance (standards)
- next steps for standards alignment
  o corpora as benchmarks
  o export/import completeness
  o educational standards
  o validation tools
  o accredited training courses to accredit auditors
  o framework standards

DAY 3

PANEL 5: Educational Alignment

- key elements of the DCC Curation Lifecycle Model
- Framework for the Education Alignment (USA: graduate programs for digital preservation, new models for graduate programs, internship programs (diverse knowledge), workshops)
  o sharing tools
  o national programs
- related issues to digital preservation
  o nature of costs and business models
  o strategies for selection & appraisal
  o ground roles and responsibilities
  o effectiveness and demand for services
- focus of the panel: factors influencing the actual sustainability of a digital archive
- 2 considerations: collaboration + user demand
- challenges + gaps: span national boundaries, public + private funding, education, exportation, DP certification, competition, funding gaps, policy, selection criteria, roles & responsibilities, standards
- Magazzini Digitali: e-legal deposit in Italy
- PADI: a failed project (discontinued in 2010)

Presentation without a title (Neil Grindley)

- how much does it cost to manage information? what institutional financial strategies are required to facilitate effective preservation? what general economic frameworks are required to enable information to persist and be accessible?
- JISC 2010: Infrastructure for Education and Research Programme
- archival storage and preservation activities constitute a very small proportion of the overall costs 1531 access 55 outreach acquisition ingest
  o approx. 333 Euros for a set of 1000 records
- KRDS2 p. 83: future tool development supporting automation of ingest

Sustainable Preservation in North America: ADPNet & Friends (Aaron Trehub)

- solution: distributed digital preservation (in at least 3 copies vs LOCKSS 6 copies)
- DPP + LOCKSS = PLN; open SW developed at Stanford
- MetaArchive, COPPUL (Canada), ADPNet (www.adpn.org)


archiv le ights speci

-

Popularizing web archiving (WA) among technicians and IT communities

collection policy digital strategy

chaosu daleko

a pak j v v KB


-

Salt Lake City USA

Datum (od-do) -

14. 5. departure from Prague; 16. 5. PREMIS workshop; 17.-19. 5. Archiving 2011 conference; 20. 5. departure from SLC

15. 5. departure Washington-Detroit-Salt Lake City; 15. 5. workshops T1D (Color in Image Capture Archiving) and T2A (Scanner & Camera Imaging Performance: Benchmarking, Compliance and Workflow Monitoring); 17.-19. 5. Archiving 2011 conference

NK konference Archiving IOP-NDK

0136

NA 662011

Datum 662011 Podpis Datum Podpis

Datum Podpis

tutorial PREMIS

---------------------------------------------------------- Implementation of a High Performance Architecture for Managing and Storing Web-Harvested Collections Michael Smorul and Joseph JaJa University of Maryland (USA)

https://wiki.umiacs.umd.edu/adapt/images/6/6b/Archiving11-smorul.pdf

- web archiving - WARC manager application

- -

-

-

- P) -

Using Tape for Large-Scale Digital Preservation Gary Wright FamilySearch (USA)

- (The Church of Jesus Christ of Latter-day Saints)

-

NDK - DRPS Ingest Tools Typ storage storage grid NetApp FAS3170

P information lifecycle management R optimalizace storage layer mezi Grid a rosettu

- - na -

- human error - - - chyby HW - validace integrity dat - - - - maximalizace - verifikace integrity

metadata conten

Moving On When it is Time to Re-Archive Michael Selway Quantum Corporation (USA) CHW) D

jinak to nejde u HDD

Migrace dat z Jak se na to migrac

- HDD - - roky

a mohl by s

- - - - -

co s split migration (full split migrace)

- - - - potaz

FamilySearch An End-to-End Process for Scanning Characterizing Preserving and Providing Access to Very Large Collections of Vital Records Tom Creighton FamilySearch (USA) Jonathan Tilbury Tessella plc (UK) and Mark Evans Tessella Inc (USA)

- Ve FamilySearch mikrofilmy od poloviny 19 st - - - - - s -

- DPS je jejich s - -

The Audit and Certification of FDsys

- FDsys -auditem

TRAC http://www.gpo.gov/fdsys

- - TRAC hathi UNT Portico metaarchive chronopolis v

- stupnice compliancy 1- -

How Long is Long-Term Data Storage (Focal) Barry M Lunt Brigham Young University and Douglas Hansen Wayne Rust and Mark Worthington Millenniata Inc (USA)

- - FLASH technologie 10- plny elektron

odlivu elektronu a

-

and yet errors appeared

- Library 1: 21 - Library 2: 18

- 10-50 years - HDD 1-7 years -

Quality Assurance of Digital Information in Long-Term Digital Preservation University of Technology (Sweden)

- significant properties - dokumentech z -

Towards Interoperable Preservation Repositories Repository Exchange Package Use Cases and Best Practices Joseph Pawletko New York University and Priscilla Caplan Florida Center for Library Automation (USA)

- TIPR

- - - TIPR

-

- RXP mets a premis semantika soubory metadat ne v metsu ale vedle v

- - succession - - - - disaster recovery a v

jako aip - migrace SW - - diversifikace - migrace dat v -

SARKK Comprehensive Digital Archive Services for Finnish Municipalities

-Savon Tietohallinto Oy (Finland)

- firma kterou v ICT long-

- - - preserv planning funkcionalitou -

-

Magnetic Tape Technology economic advantages for preservation Gary Francis Oracle USA

- Oracle prezentace - - clipper group 2010 in search for the long-term archiving solution tape delivers significant

TCO advantage over disk - 5TB na 1 cartridge sunoracle

-----------------------------------------

Color In Digital Preservation Robert Buckley University of RochesterNewMarket Imaging Steven Puglia National Archives and Records Administration and Michael Stelmach Library of Congress (USA)

-

atd -

o barev GAMUTem

AdobeRGB Pro FotoRGB

- a reprodukci barvy

Multispectral Image Archiving of Watermarks in Historical Papers Peter Meinlschmidt Wilhelm-Klauditz-Institut Fraunhofer-Institute for Wood Research and Volker

Braunschweig (Germany)

- Reflected light - Transmited light

-spectral imaging -

o

o o o o

Implementing a Quality Assurance Program for Monitoring Scanner Performance Michael J Horsley and John T Berezich National Archives and Records Administration (USA) DAITSS

- -

mikrofilm sken a r - NARA 2004 Guidelines

https://docs.google.com/viewer?url=http%3A%2F%2Fwww.archives.gov%2Fpreservation%2Ftechnical%2Fguidelines.pdf

- Metamorfose - Atd viz sildes - Quantitive performance slides - Web based database sharepoint zace -

Preservation in a Digital Age Jay Verkler FamilySearch (USA)

- Tessellou pro SDB budou k 1 - - - data loss is intrinsic gt - - - och

- preservation as a service

Curation of the End-of-Term Web Archive Classification and Metrics Kathleen Murray Lauren Ko and Mark Phillips University of North Texas (USA)

- - EOTCD archive http://research.library.unt.edu/eotcd/wiki/Main_Page - - for - 16TB of data - - (not WARCs)

pak to pospojo

DAITSS Grows Up Migrating to a Second Generation Preservation System (Focal) Priscilla Caplan and Carol Chou Florida Center for Library Automation (USA)

- Daitss 1 se nedal nainstalovat jinde si -) -

instructions

- v procesech

- - PSP

- refresh funkce - disseminate- do a refresh and export new AIP as DIP - withdraw remove AIP from storage retaining provenance

-

1

- - 80TB -

A Community Driven Micro Services Architecture Supporting Long Term Digital Preservation Mark Evans and Bill Steel Tessella Inc (USA) and Robert Sharpe James Carr Alan Gairey and Jonathan Tilbury Tessella plc (UK)

s NDIIPP v micro-services

spojeno do procesu- lze pak libovol

- - - -

-

workflow - -

- bit-streamu - cloudu (S3

http://aws.amazon.com/s3) SaaS functionality will soon be multi-tenancy

funkcionalitou policy apod

Pozn z USA

v 44TB SIP v testu (10MB JP2 soubory)

komunita SDB - - vznik 2008 - -

System

Lisa LaPlant and Blake Edwards US Government Printing Office (USA) FDsys OAIS dig MODS - z

<part>2</part>

DMD data model definition z

http://www.gpo.gov/fdsys/search/pagedetails.action?st=prague&granuleId=&packageId=DCPD-200900228

----------------------------------------- Preservation Starts from the Beginning Michael Wash US Department of Transportation (USA)

Autographic Kodak 1916 na druhou

following the No. 3A Autographic Kodak Special of 1916, which was the first rangefinder camera. It had a Kodak Anastigmat f/6.3 lens and a Kodamatic shutter with speeds from 1/2 to 1/200 sec. plus bulb and time mode. http://camerapedia.wikia.com/wiki/No._1A_Autographic_Kodak_Special Kodak Advantix

zastavil

Colorite A Flexible Cross-Platform Software Solution for Automatic Image QualityAnalysis Using Arbitrary Targets Henrik Johansson National Library of Sweden (Sweden)

- a - -

o Target se detekuje automa se aby tento proce

o TIFF JPEG JP2 PNG o o -of-art feature based image matching algoritm ImageMagic

o Alg

t o o v budoucnu vypnout aby se proces urychlil pro BATCH

proce o

Henrika Johanssona What if the Image Quality Analysis Rates My D Dietmar Wueller Image Engineering (Germany)

- Golden thread - UTT targer -

o

-lens reflex

Establishing Resolution Requirements for Digitizing Transmissive Content A Use Case Approach Michael Stelmach Library of Congress Don Williams Image Science Associates LLC and Steven Puglia National Archives and Records Administration (USA)

-

- Based on 10 SFR limiting resolution criteria how much the image information will be captured

o 1st half of the 20th century: 1200-1600 PPI o 2nd half of the 20th century: up to 2800 PPI o o

Digitise More Pay Less Optimising the Workprocess for both Heritage Institute and Imaging Provider Olaf Slijkhuis Pictura Imaginis (the Netherlands)

- ument kvalita parametry atd - -

- trolu

obrazu

N debata s familysearch 2052011 -------

i na problematice Digital Preservation

bude muset poskytnout zdarma

a synchronizovat

snaha spolupracovat archivy a knihovnami jejich strany

CO-FINANCED FROM EU STRUCTURAL FUNDS (EUROPEAN REGIONAL DEVELOPMENT FUND) THROUGH THE IOP


Business trip report

Project: Creation of the National Digital Library (Vytvoření Národní digitální knihovny)

CZ.1.06/1.1.00/07.06386

Name and surname of the participant: Jan Hutař

Workplace per organizational structure: ODF 8.1

Workplace (assignment): head of division

Reason for the trip: attending the iPRES 2011 conference

Place (city): Singapore

Place (country): Singapore

Date (from-to): 30. 10.-5. 11. 2011

Detailed schedule:

30.-31. 10.: flight Prague > Dubai > Singapore

1. 11.: start of the conference, tutorials

2. 11.-4. 11.: conference

4.-5. 11.: return flight Singapore > Dubai > Prague

Fellow travellers from NK: Mgr. Marek Melichar (funded from project 0136)

Funding: IOP, Creation of the National Digital Library

Relation to the project: gaining new information on digital preservation and on projects in other libraries; consultations with colleagues and companies

Goals of the trip: see relation to the project; use all outputs for the planning and running of the NDK project, and for future solutions to digital preservation in NK/NDK

Fulfilment of the goals: fulfilled, see the detailed notes below and the proceedings on SPS


Further details: SUMMARY AND BENEFIT FOR THE NDK PROJECT

- a noticeable rise of emulation as a long-term preservation approach (in previous years it was migration) > the two approaches will apparently complement each other
- a shift towards preserving complex data, databases etc.; NK does not deal with this yet
- many contributions are applicable to NK and NDK (web archiving and preservation at the national library of France, audits, emulation, information on the SDB system (Tessella) and on the RODA system, certification, see Rouchon, etc.)
- information on problems and solutions when using tools such as JHOVE, PRONOM and others in a real environment
- 2 contributions on backing up optical discs, a current problem at NK as well; ideally follow the procedures described in the proceedings
- a clear need for international cooperation and for adherence to standards so that such cooperation is possible
- for more detail see below

Support of project publicity: N/A

Related materials

Material: conference proceedings. Location: SPS, folder with business trip reports

Date of submission of the report: 15. 11. 2011

Signature of the submitter:

Date, signature:

Signature of the superior: 15. 11. 2011

Posted on the intranet:


Received by the international department:

Seamus Ross: digital curation and preservation

- preserving data sets: use of statistical databases and research data vs preservation of textual information
- personal data: no longer just photos in a box (Flickr etc.)
- banks: a lot of personal data with future use for historians, so it must be kept; institutions do this too
- see mckinsey.com, big data full report (PDF)
- why do DP at all (slide 16): future generations expect it; for historians and scientists, so that they have some sources; a legacy about the present for the future; information ecosystem to enable storytelling
- the emphasis shifts from preserving textual information to preserving complex databases

A capability model for DP (Ch. Becker et al.)

- research within the SHAMAN and SCAPE projects
- SoS: systems of systems
- 3 kinds of systems (UML):
- DPS, with DP as a functional requirement
- SoS: a business system, a system within a system; data then flow into the DPS
- DPS where DP is not a functional requirement but the system does it anyway (DP-ready system); a business system with DP functionality
- but how to get DP into enterprise systems? A model for implementing DP in any system, within the SHAMAN project: capability-based reference architecture
- governance, business and technical (operation) capability: a basis for decisions and for assessing the current state
- capability maturity model (CMM): processes of assessment and improvement, as in SW development

Olivier Rouchon: certification and quality at CINES

- they store theses, digitized items, multimedia, documents, scientific data sets
- a data centre for the whole of France
- they have experts on formats, XML; 11 people
- 15TB of data
- certification: national law; CINES is the national centre for DP of theses; they have a department, people and money for it; for the procedure and preparation see below
- preparation for certification: testing DRAMBORA, DSA, TRAC, ISO 16363 and ISO 16919
- step: 2009 DRAMBORA audit; 2 risk reviews per year, checking how the risks are being resolved
- step: formalization of business processes, 14 processes according to ISO 9001
- management, operational and support processes (presented at iPRES 2010)
- 2009: external pre-audit, 2 people, 19 man-days, based on all available standards, carried out by reviewing documentation and by interviews
- 2010: SIAF audit, 4 months; the National Archives of France do this for every archive that stores public data; they audit every 3 years; the report had 800 pages


- 2010: Data Seal of Approval, part of the EU framework for audit and certification of trusted repositories (MoU between three certification activities)
- 2011: within the APARSEN project they also did ISO 16363; they went through that audit together with DANS and UKDA
- first internal, January and April 2011 (60 man-days), then external, 12 experts (KB, BL, NASA etc.) in June 2011 (3 days)

MoU: DSA > ISO 16363 as the second step (internal) > ISO 16363 extended

The audit gets faster the more of them you do, i.e. if it is done regularly it is not so time-consuming.

The National Library of New Zealand passed TRAC certification in autumn 2011.

Andreas Rauber: impact of preservation actions on repositories

- what happens to the repository itself
- repository simulation: RepoSim
- meant for analysis and testing of migration: what happens when files grow, what if the repository holds more format types, etc.
- RepoSim: a simulator, flexible, supports irregular patterns
- so far an internal version: Hibernate, Java, MySQL
- you can specify which formats the repository accepts and their description, the ingest, hypothetical tools (mainly for migration), and rules for preservation actions (migration to which format, which versions, which files, how many times; rules + filters)
- a virtual migration can be run; the resulting graphs show how long it will take, etc.
- what to migrate, how, to what and over what period: it all runs virtually and we see the result
- good for IT and HW planning
- good for planning different scenarios, comparing them with the expected development, and planning HW development and investments
- still to be finished: the option to enter deletion policies, reports, etc.
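The idea of a virtual migration can be shown with a toy calculation. This is only an illustrative sketch and not RepoSim itself (which the notes describe as a Java/Hibernate/MySQL application); the class names, the throughput figure and the size ratio are hypothetical parameters chosen for the example.

```python
from dataclasses import dataclass


@dataclass
class FormatHolding:
    fmt: str            # format label, e.g. "TIFF"
    file_count: int
    avg_size_mb: float


@dataclass
class MigrationRule:
    source_fmt: str
    target_fmt: str
    throughput_mb_s: float  # assumed speed of a hypothetical migration tool
    size_ratio: float       # assumed target size / source size


def simulate_migration(holdings, rules, yearly_growth=0.0, years=0):
    """Estimate duration and new storage for each rule, without touching data."""
    rule_for = {rule.source_fmt: rule for rule in rules}
    growth = (1.0 + yearly_growth) ** years
    report = []
    for holding in holdings:
        rule = rule_for.get(holding.fmt)
        if rule is None:
            continue  # no preservation action defined for this format
        volume_mb = holding.file_count * holding.avg_size_mb * growth
        report.append({
            "from": holding.fmt,
            "to": rule.target_fmt,
            "hours": volume_mb / rule.throughput_mb_s / 3600,
            "new_storage_mb": volume_mb * rule.size_ratio,
        })
    return report


if __name__ == "__main__":
    holdings = [FormatHolding("TIFF", 1000, 36.0)]
    rules = [MigrationRule("TIFF", "JP2", 10.0, 0.5)]
    print(simulate_migration(holdings, rules, yearly_growth=0.2, years=3))
```

Varying `yearly_growth` and `years` gives the kind of scenario comparison for HW planning that the notes mention, although a real simulator would model far more (irregular ingest patterns, filters, tool failures).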

José Barateiro: Risk assessment in DP of e-science data and processes

- DP as risk management
- ISO 31000: definition of risk management
- similar to DRAMBORA
- there are many standards for risk management
- the ISO 31000 methodology elaborated into individual steps
- TIMBUS project, http://timbusproject.net; one of the partners is SAP (Germany)

mad talks

- open source SW for LTP: RODA is back and is being developed within the SCAPE project; new functions, development plans and the creation of a user community
- 4 posters on emulation: emulation within KEEP, emulation for library reading rooms, OPF ecosystem, registry
- TOTEM: a metadata standard for describing the technical environment for emulation


ANDS (Ross Wilkinson)

- data centres in Australia, at least 3 for different areas of life
- ANDS has existed for almost 3 years, funded by the Australian government
- a huge amount of data will never be used or read by a human, only processed and mined automatically
- research data must be stored and preserved because it may no longer be possible to recreate them; they should be kept so that they can be reused, so that new conclusions can be drawn from them and so that scientists have them available
- this must be done in cooperation; a single institution cannot do it alone
- who deals with the storage of scientific data in the Czech Republic? The Academy of Sciences? CESNET?
- similar data centres also exist in Great Britain

Rob Sharpe: Considerations for High Throughput Digital Preservation

A presentation by Tessella on their testing of ingest performance into SDB at FamilySearch.

- SDB has been developed since 2002, when the first customer was the National Archives (UK)
- new customer: the UK Parliament
- test with FamilySearch
- 20TB ingest per day; scanned materials; workflow with antivirus, characterization (PRONOM, JHOVE) etc.
- 1 package is roughly 1GB; 20,000 packages per day
- 2 Dell PowerEdge R710 servers, total price at most 20,000 pounds
- the limiting factor turned out to be the read speed of the disks on which the data are stored at the start of ingest; they therefore needed 130 parallel disks (50,000 pounds)
- storing to tape is also slow; they therefore needed 8 parallel tape writes (30,000 pounds)
- storage costs 100 pounds per TB
- 7.3 PB per year
- conclusion: writing and reading are slow; tools such as JHOVE and PRONOM are fast enough; storage costs also turned out to be high

To ingest data from the FamilySearch project they needed to ensure a throughput of 20TB of data per day while keeping sufficient procedures for processing the data according to OAIS and the client's requirements. The project was about identifying the bottlenecks in ingesting large volumes of data.

Processes such as generating or checking hashes, identifying formats and extracting technical metadata usually require a fast storage system when volumes are large. In the FamilySearch project they want to ingest into SDB (content acquisition, content preparation, ingest: fixity check, content/metadata integrity check, characterization, i.e. identification and validation of formats and extraction of technical metadata) at up to 700MB per second.
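The fixity step named above (generating and checking hashes) is I/O-bound because every byte of every file has to be read. A minimal sketch of that step follows; it is not the SDB implementation, and the function names, the default SHA-256 algorithm and the 1 MiB chunk size are assumptions made for this example.

```python
import hashlib


def file_digest(path, algorithm="sha256", chunk_size=1024 * 1024):
    """Stream a file through a hash so large files never sit fully in memory."""
    hasher = hashlib.new(algorithm)
    with open(path, "rb") as handle:
        # Read fixed-size chunks until read() returns an empty bytes object.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest()


def verify_fixity(path, expected_hex, algorithm="sha256"):
    """Return True when the stored checksum still matches the file content."""
    return file_digest(path, algorithm) == expected_hex.lower()
```

Because the cost is dominated by reading the data, such a check scales with storage read speed rather than with CPU, which fits the bottleneck the presentation reported.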

They worked on how to parallelize such a massive workflow efficiently while minimizing costs. According to their findings, parallelization makes it possible to get around the performance problems of tools such as DROID and JHOVE; contrary to their expectations, overall software performance was not the problem. The bigger problems lie in the HW, which has to be able to write fast enough.

That is, the bottleneck was in the HW and in moving data from place to place rather than in the performance of the digital preservation tools.
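The figures quoted in these notes (20TB per day, roughly 1GB per package, 100 pounds per TB) can be cross-checked with a short calculation. The sketch below assumes decimal units (1 TB = 10^12 bytes) and a 365-day year; the function name `ingest_figures` is made up for this example.

```python
SECONDS_PER_DAY = 24 * 60 * 60
TB = 10 ** 12  # decimal terabyte, an assumption of this sketch
GB = 10 ** 9


def ingest_figures(daily_tb=20, package_gb=1, pounds_per_tb=100):
    """Derive average rate, package count, yearly volume and yearly storage cost."""
    daily_bytes = daily_tb * TB
    return {
        # sustained rate needed if ingest ran evenly around the clock
        "avg_mb_per_s": daily_bytes / SECONDS_PER_DAY / 10 ** 6,
        "packages_per_day": daily_bytes // (package_gb * GB),
        "pb_per_year": daily_bytes * 365 / 10 ** 15,
        "storage_pounds_per_year": daily_tb * 365 * pounds_per_tb,
    }


if __name__ == "__main__":
    for name, value in ingest_figures().items():
        print(name, value)
```

The derived 20,000 packages per day and 7.3 PB per year match the numbers in the notes, and the average rate of roughly 230 MB/s is consistent with the quoted peak target of 700MB per second.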

Benefit for NK


Do not be afraid of the performance of SW such as DROID or JHOVE.

Ross King: Evolving domains, problems and solutions for LT DP

- information on projects such as SCAPE
- programme: http://cordis.europa.eu/fp7/ict/telearn-digicult/report-research-digital-preservation_en.pdf (Stephan Strodl, Petar Petrov, Andreas Rauber, all Vienna University of Technology, Austria). A nice "Timeline for preservation projects", a whitepaper about the past of European DP
- funds spent on DP research are gradually growing; projects and funding alone will not solve anything
- ARCOMEM: archiving of web archives, socially driven web preservation model
- social web analysis
- archive enrichment
- ENSURE: evaluation between cost and value, automation of the preservation cycle; testbeds: healthcare, clinical trials, financial services
- SCAPE
- preservation planning and action workflows: how to make them scalable
- creating an infrastructure for scalable preservation actions
- development of a policy-based preservation planning tool with automatic preservation watch
- 3 testbeds: WA, large-scale repositories, research data sets
- all projects will produce prototype SW
- digital lifecycle approach
- preservation planning plays a role in all these projects, together with virtualization
- a slide with trends in DP over recent years
- Research on Digital Preservation within projects co-funded by the European Union in the ICT
- ENSURE
- SCAPE
- Wf4Ever: http://www.wf4ever-project.org/about
- TIMBUS: SW is not enough; it focuses on the context of organizations; LTP is not only about objects but also about services etc.
- TOTEM

Benefit for NK

Follow projects in the field of long-term preservation of digital data. Recent EU projects such as SCAPE will speed up the development of specific tools for long-term preservation of digital data.

Record keeping in temporary command settings (Erik Borglund)

- preserving the documentation of crisis situations that arises from the activities of the police and similar services
- how to capture context
- flipcharts, videos and written records can be kept
- the context of analogue documents is not a problem; the problem lies with digital material and conversations
- the national archive should take care of this, but it only accepts paper documents or, for example, photos from the scene
- the question of archiving the case files is the same as that of archiving the course of the proceedings in digital form


webarchiving session

BnF

- 200TB of web-archived data, 15 million ARC files
- they have to characterize and validate them, which is time-consuming
- the data are stored in a shared repository; SPAR (the LTP system of the French national library) has a capacity of 16PB
- they use JHOVE2 for characterization and are building a module for ARC files
- they do not want to characterize and validate the content of the ARC files, only identify the formats
- PREMIS in METS would be too long, so they will record only metadata at the level of the information package (AIP) that are the same for the whole package. That is, a property is stated once and then only the information about which files it applies to is added, instead of repeating the information for every file.
- they created a special metadata format
- this means they can ask the LTP system "give me all information packages that contain format XY" and similar questions, without having to index the metadata of the content itself, which would take a long time
- they take the same approach for digitized books
- different DP policies and levels of validation for different types of web archive data: complete harvests vs. thematic harvests
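The package-level approach described for BnF can be sketched as follows. This is only an illustration under assumptions: the class, field and function names are hypothetical and do not reproduce BnF's actual metadata format. Each property value is stored once per package together with the list of files it applies to, and a format query only touches package-level records.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class PackageMetadata:
    """Hypothetical package-level record: one entry per format, not per file."""
    package_id: str
    formats: Dict[str, List[str]] = field(default_factory=dict)

    def add_file(self, name: str, fmt: str) -> None:
        # State the format once and just append the file that shares it.
        self.formats.setdefault(fmt, []).append(name)


def packages_with_format(packages, fmt):
    """Answer 'give me all packages that contain format XY'."""
    return [p.package_id for p in packages if fmt in p.formats]


# Invented example packages.
aip1 = PackageMetadata("aip-1")
aip1.add_file("index.html", "text/html")
aip1.add_file("about.html", "text/html")
aip1.add_file("logo.gif", "image/gif")
aip2 = PackageMetadata("aip-2")
aip2.add_file("photo.jpg", "image/jpeg")
```

With this layout the format `text/html` appears once in `aip-1` no matter how many HTML files the package holds.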

NL NZ

- 2 harvests, 20 TB in total
- they are dealing with metadata: how much metadata is a lot and how much is too little
- library policy says that as much metadata as possible must be stored, but that would be a problem given the size of the metadata
- for selective web harvesting they have a finished workflow in WCT; everything is catalogued

IA

- 16 billion URLs
- the oldest from 1996
- the growth is 3TB per day, 1PB per year

Euan Cochrane, Dirk von Suchodoletz: Replicating Installed Application and Information Environments onto Emulated or Virtualized Hardware

- capturing and preserving a complete environment on emulated hardware
- for example, taking the desktop environment of the prime minister and storing it in an archive
- problems with rendering
- computer forensics
- an option for preserving scientific data and records
- the whole thing is about how to replicate an HDD and run the environment stored on it in a virtual environment
- the solution:
  - they pulled the HDDs out of several old PCs > identify the hardware requirements (analysis of the HDD > automatic estimate of the requirements; this information is part of every PC environment) > choose the emulation/virtualization software (tool registry such as TOTEM from the KEEP project) > convert the HDD into a disk image suitable for emulation > try to boot the disk image on the emulated hardware > add drivers
- problems with licences, personal data protection and authenticity (20% of things change, colours etc.)
- QEMU, SPARC processor emulator


Klaus Rescher: Remote Emulation for Migration Services in a Distributed Preservation Framework

- using emulation as a tool for migration
- migration tools are often not available for certain formats
- the digital object is placed into an emulated environment (a virtual machine), where it is visible inside the emulated system; it can be opened in the original or another suitable application, saved in a different format and stored back in the virtual machine

Bram Lohman: Emulation as a Business Solution: the Emulation Framework (KEEP project)

- emulation framework: 7 emulators, 6 platforms (x86, Amiga and others), 23 file formats
- a solution for managing emulation tools
- setup of emulation processes
- an environment that contains emulators; if an application or a file is loaded into it, it should run as in the original environment
- the environment also contains a tool that shows, based on PRONOM, what format a file is and what environment is needed to run it; the environment can be prepared right away and the file run in it
- it loads a software image from the OPF application database, which is being built

Geoffrey Brown: Developing Virtual CD-ROM Collections: The Voyager Company Publications

- publications of one particular publisher on CD, interactive applications for the Mac; of the 200 titles published only about 50 are available today
- emulation on today's systems
- HDD snapshot directly in the emulator, i.e. it takes one click and is very fast
- SheepShaver emulator

Evaluation of the Danish large migration project

- Before 1998 they had no formats defined by law.
- Between 2005 and 2008 they introduced standards.
- The evaluation concerns the defined standards and the migration to them in the national archive.
- The evaluation was done for the body that funded the work.
- Between 2005 and 2008 they spent 30 person-years on the migration; they had 10-15 people on it; 190 thousand USD was invested; total costs were 26 million USD.

It is not much data; what they actually migrated was about 1777GB.

- Various parts of the archive: tapes, population data, data on CD-R, registries and electronically filled-in data.
- They could not read all the files, especially those on tapes.
- 5 different types of tape.
- Some had to be rescued at great expense.


Total cost of producing the preservation standard: 10 man-years, 12 thousand USD, including the manual and implementation recommendations.

Pilot: project planning and management, and verifying the information packages.

The aim of the pilot was to get a better budget and project plan.

Some problematic data in old formats, such as old databases, need smart people doing repetitive work that takes a long time. They needed good knowledge management to make this efficient.

Migration method: they wrote the requirements for the tool and a description of how manual migration should be done.

Data preparation (restructuring the data and registering the metadata of the IPs) and preparing the documentation of the migrated IPs.

Software development: in-house development.

They needed 50 person-years per 1

Conclusions

Migration of standard data is cheaper, migration from some standard tapes is cheaper, etc.

Most of the costs, 80%, fell on non-standardized data, in building the migration software. Developing a tool for migrating heterogeneous or non-standard data is the most expensive part.

What they learned: the old data had not been analysed sufficiently.

Project management was loose and they lost money.

Knowledge management: a good description of old data and of all their types, generations, locations etc. does not exist at our institution and we will have difficulties with it. Migration of old data at NK will be problematic.

Angela Dappert: robust migration workflow for offline media

- What is an archival object (nice slide): a CD is not an archival object; for them it is a hand-held carrier. A bit-stable object is better, since it can have a backup etc., and it leads to an archival object that has further metadata (logical preservation).
- A CD is not searchable, cannot be replicated easily and has a large manual overhead; rendering technology becomes obsolete very quickly.
- Endangered Archives project: optical disks, CD-R, external HDs, tapes, 67 terabytes in total.
- The offline hand-held carriers in the Endangered Archives project were very varied; they contained data with DRM, under copyright, and with a number of similar problems.
- Options they chose between for each data source:
  - disk image: a single file that contains everything on the carrier
  - or extraction of selected files only
- How important is the carrier itself? We need some information about it; it may hold traces of deleted data that we may want to keep. They made disk images of everything possible: hybrid DVDs, audio discs that also contained data, etc.
- Which disk image should they use? Not a single disk image format for all data; a special disk image format for each kind.
- They did it with robots (disk copying robots). In some cases large-scale disk copying robots could not be used: they are good at producing CDs but not at ripping data from CDs.


- They built their own application with disk stacks and some smaller robots. They tried LIFO and FIFO and in the end used FIFO; LIFO had problems with the CD lifter.

At the KB they thought through a fairly complex workflow, how to describe it, etc.

Each robot had its own PC.

They had problems with a number of things, see the presentation.

They did not find good software for managing the images, only command-line tools, and non-technical staff would make a lot of mistakes with those.

It takes a lot of people before the data get online.

The staff must be well educated and flexible, but also able to do tedious jobs: systematic, patient.

NOTE: important for NK, where the transfer of data from disks will also have to be addressed and has already been addressed in the past.


KEEP project, Antonio Ciuffreda: towards integrated migration environment

- Disk transfer tool: converts a disk into an image file. The image also carries further metadata about the file system, the MD5 of the file, etc.
- KEEP is producing a Transfer Tool Framework.
- Magnetic media: disk transfer tools for floppy disks, commercial and open source.
- Disk2FDI: commercial DOS tool; produces a very accurate image of a floppy disk, but it takes 1 hour and the whole image is then very large, ten times larger than the floppy disk itself. About 2260 disks were tested; emulation was tested as well.
- Catweasel: commercial tool; a PCI card, runs on Linux and Windows XP, has a GUI. High error rate but faster; the quality of the image files was low.
- Nibtools: free tool; G64 and D64 formats, covers only the C64; runs on DOS, Windows and Linux, but command line only. It needs a Commodore disk drive and special cables. They tested a few disks; about half of them did not work in the emulator afterwards.
- Optical media: they used 5 transfer tools, all on the same CDs, DVDs and games:
  1. Alcohol 120%: commercial; can bypass DRM etc.; supports Windows systems. 12 of 13 worked; 2MB per second.
  2. Daemon Tools: commercial; several image file types (ISO, MDS, MDF); supports Windows; three of 13 did not work.
  3. CloneCD: commercial; uses IMG or ISO; bypasses SafeDisc 3 protection; supports Windows; has a GUI; 11 of 13 were OK.
  4. BlindWrite: commercial; supports DVD, Blu-ray, WM, Xbox and other special discs. Generates ISO and some proprietary image formats; one did not work; download speed
  5. ImgBurn: does not read subchannel information into the image file (so a film cannot be seeked etc.); open source; generates DVD, BIN, CUE, IMG; Windows and Linux; 4 did not work; it is fast.
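The failure counts quoted for the five optical transfer tools can be turned into rates with a few lines of arithmetic. This is a sketch under one assumption taken from the notes: every tool was run on the same set of 13 discs. The dictionary keys are just labels for the tools named above.

```python
TESTED_DISCS = 13  # assumption: the same 13 discs were used for every tool

# Number of discs that did NOT work, as quoted in the notes above.
failures = {
    "Alcohol 120%": 1,   # 12 of 13 worked
    "Daemon Tools": 3,   # three of 13 did not work
    "CloneCD": 2,        # 11 of 13 were OK
    "BlindWrite": 1,     # one did not work
    "ImgBurn": 4,        # 4 did not work
}


def failure_rate(failed: int, tested: int = TESTED_DISCS) -> float:
    """Share of failed discs in percent, rounded to one decimal place."""
    return round(100 * failed / tested, 1)


rates = {tool: failure_rate(n) for tool, n in failures.items()}
worst = max(rates, key=rates.get)
```

Under that assumption the rates run from roughly 8 percent (one failed disc) to roughly 31 percent (four failed discs).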

Conclusions

For magnetic media, the performance of commercial and non-commercial tools does not differ. Disk2FDI is accurate but very slow. KEEP will use NibTools.

Optical media: copy protection has to be kept in mind. The tools perform similarly; there will always be errors in the images, between 30 and 10 percent. BlindWrite can handle game discs (Xbox etc.). KEEP will use ImgBurn because it is open source.

For complex images BlindWrite is better.

Benefit for NK

Consider whether NK should indeed run a project to migrate the content of CDs and DVDs to online media. The talks present concrete experience with robotic processing and show what problems they had with designing the workflow, choosing the type of ISO image, etc.

BnF web archiving

They have three layers:

- Harvest definition (collection)
- Harvest instance (crawling metadata)
- ARC files

A collection is, for example, the selective "elections 2000" harvest; below it come the individual harvest instances and then the ARC files. They collect logs, configuration and reports. These are also stored in ARC: a special metadata ARC for each crawling instance.

PREMIS

Object, agent, event

Objects:
1. ARC files and metadata ARCs
2. harvest instances

Harvest event as a PREMIS event: creation of content files

Events: reports as an extension of the event (host report and harvest report)

Agents: administrator, software, institutions, organizations that perform the harvest
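The layering described for BnF (harvest definition, harvest instance, ARC files) and the PREMIS roles (object, agent, event) can be pictured as a small data model. This is only a sketch: all class names, field names and sample values are hypothetical and are not taken from BnF's implementation or from the PREMIS schema.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Agent:
    name: str
    role: str  # e.g. "administrator", "software", "organization"


@dataclass
class Event:
    event_type: str
    agents: List[Agent]
    reports: List[str] = field(default_factory=list)  # report extensions


@dataclass
class HarvestInstance:
    instance_id: str
    arc_files: List[str]
    metadata_arc: str  # logs, config and reports stored in a metadata ARC
    events: List[Event] = field(default_factory=list)


@dataclass
class HarvestDefinition:
    collection: str
    instances: List[HarvestInstance] = field(default_factory=list)


def all_arc_files(definition: HarvestDefinition) -> List[str]:
    """List content ARCs and metadata ARCs of every instance in a collection."""
    return [arc for inst in definition.instances
            for arc in inst.arc_files + [inst.metadata_arc]]


# Invented sample data.
crawler = Agent("crawler software", "software")
operator = Agent("crawl administrator", "administrator")
instance = HarvestInstance(
    "instance-001",
    ["content-0001.arc", "content-0002.arc"],
    "instance-001-metadata.arc",
    [Event("creation of content files", [crawler, operator],
           ["host report", "harvest report"])],
)
definition = HarvestDefinition("elections", [instance])
```

Here the harvest event hangs off the harvest instance, so the reports and agents are recorded once per crawl rather than once per ARC file.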

ContainerMD

httpbibnumbnffrcontainerMD v1documentationcontainerMD v1html

special metadata for material from the web archive

httpbibnumbnffrcontainerMD v1

Different SLAs for different types of material and for data from different harvests. In a shared repository the users expect different benefits; skills for different formats do not need to be duplicated within the institution.

Next year a JHOVE2 module for WARC should exist as well.


Memento: meta search engine

Benefit for NK

Their web archiving model could be used at NK.

Cost models: Danish national library + TU Wien

- Stephan Strodl, TU Wien: they have their own cost model, but it seems to cover only the cost of small-scale automated preservation actions.
- The Danish national library built its own model, which is meant to be universal and usable anywhere.
- They measured the cost of submission according to the PAIMAS standard.
- When calculating costs they use OAIS and PAIMAS: they map activities onto these models and then estimate the costs of the processes accordingly.
- Costmodelfordigitalpreservationdk
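The activity-mapping approach mentioned for the Danish cost model can be sketched as follows. The functional entity names are those of the OAIS reference model; the activities, hours and hourly rate are invented for illustration and are not figures from the Danish model.

```python
from collections import defaultdict

# Functional entities of the OAIS reference model.
OAIS_ENTITIES = {
    "Ingest", "Archival Storage", "Data Management",
    "Administration", "Preservation Planning", "Access",
}

# Hypothetical activities: (name, OAIS entity, person-hours).
activities = [
    ("validate SIP", "Ingest", 40),
    ("generate AIP", "Ingest", 20),
    ("write copies to storage", "Archival Storage", 10),
    ("format watch", "Preservation Planning", 25),
]


def cost_by_entity(activities, hourly_rate):
    """Sum the cost of activities per OAIS functional entity."""
    totals = defaultdict(float)
    for name, entity, hours in activities:
        if entity not in OAIS_ENTITIES:
            raise ValueError(f"{name!r} is mapped to unknown entity {entity!r}")
        totals[entity] += hours * hourly_rate
    return dict(totals)


costs = cost_by_entity(activities, hourly_rate=50.0)  # hypothetical rate
total_cost = sum(costs.values())
```

Mapping every activity to a shared reference model first is what makes estimates from different institutions comparable.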

Benefit for NK

Relevant to project 0136, which dealt with options for estimating the costs of long-term storage.

Meet RODA, a Full-Fledged Digital Repository for Long-Term Preservation

- Originally a project of the Portuguese national archive, which we have been following for several years. The RODA system is now supported by an independent company, which also partly continues its development. So far RODA supports only an archival metadata format (EAD), but further development should include library formats as well.
- RODA is now part of the SCAPE project, where it will be possible to develop the system further and scale it for use in large-scale production.
- httpredminekeepptprojectsroda public

Benefit for NK

Follow further development, possibly also for the INCAD + KNAV projects on LTP development; for smaller institutions this could be an interesting alternative in the future.

Report on a foreign business trip

Name and surname of the participant: Mgr. Andrea Fojtů (AF)

Workplace according to the organizational structure: 1.5 Department of Long-term Preservation of Digital Data (ODODD)
Workplace assignment: 1.5.2 Department of Digital Repository Content Management
Reason for the trip: Participation in the conference Aligning National Approaches to Digital Preservation
Place (city): Tallinn
Place (country): Estonia
Date (from-to): 22-26 May 2011

Detailed schedule:
Monday 23 May: conference registration; guided tour of the National Library; Keynote Address by Laura Campbell, Library of Congress (USA); Panel 1: Technical Alignment; Panel 2: Organizational Alignment
Tuesday 24 May: Keynote Address by Gunnar Sahlin, National Library of Sweden; Panel 3: Standards Alignment; Panel 4: Legal Alignment; Breakout Sessions for panels 3 & 4
Wednesday 25 May: Panel 5: Education Alignment; Panel 6: Economic Alignment; Breakout Sessions for panels 5 & 6; Synthesis; Closing remarks

Fellow travellers from NK: PhDr. Bohdana Stoklasová (BS), Ing. Tomáš Svoboda (TS)

Funding: IOP NDK

Goals of the trip: Attendance at a conference with international participation; gaining contacts in the area of long-term preservation of digital documents and electronic legal deposit; a more detailed insight into the issues of long-term preservation of digital documents, (especially) in national libraries.

Fulfilment of the goals (specifically): The conclusions of the conference Aligning National Approaches to Digital Preservation largely mirror the conclusions of the workshop The Future of the Past: Shaping new visions for EU research in digital preservation (report available at httpcordiseuropaeufp7icttelearn digicultfuture of thepast_enpdf), for example regarding the missing economic model for the commercial sector that would treat long-term preservation as an inseparable part of all its processes. Contact was established with staff of the national libraries of Estonia and Finland (staff working on web archiving and on long-term preservation in general).

Programme and further details: The main goal of the conference was to align national approaches to long-term preservation of digital documents across all areas: technical, organizational, educational, standardization, economic and financial.

Materials brought back: conference programme, leaflets of the exhibiting companies (Tessella, Equella, Guardtime) + further materials, notes

Date of submission of the report: 8 June 2011
Signature of the person submitting the report:

Signature of the supervisor:

Posted on the Intranet:
Received by the international department:

Attachment to this report: Notes from the conference in English

Attachment: Notes from the conference in English

Exploring What We Can Do Together: Strategic Alliance for International Collaboration (Laura Campbell)

- 185 digital preservation partners in more than 25 countries (education, research, LAM)
- strategic goals: National Content Stewardship Network (national digital collection, technical architecture, public policy, outreach)
- NDIIPP Content Domain Map: a mind map of geospatial, audiovisual, image & text and web content
- then & now: cognitive surplus vs. digital libraries/digital preservation
- solution framework: actively working together, special interest groups, establishing a common index, international digital collection (freely available)

PANEL 1: Technical Alignment (The role of testing), Prof. Dr. Michael Seadle (Panel Head)

- to collaborate on requiring and implementing rigorous and independent tests

DNB Contribution to the Tallinn Alignment (Sabine Schrimpf)

- key theme is infrastructure
- network of hardware and software that permits the operation of application software
- the question of interoperability is crucial (standards, technical specifications)
- software elements
- components of the DP infrastructure were compared to the pallets at railroads
  o source: PARSE.Insight Roadmap 2020
- persistent ID resolvers, certification process
- kopal (ingest: koLibRI)
- nestor (German Network of Expertise)
- DP4Lib: Digital Preservation as a Service; reduce dependency between components
  o redundant storage at different locations
  o koLibRI modules
- LUKII: set up as an economical LOCKSS network in Germany
- SHAMAN
- APARSEN wants to bring coherence and cohesion to digital preservation research
  o trends in DP research projects
  o modular DP systems
  o distributed as SOA
  o elimination of technology dependencies

EDINA: The UK LOCKSS Alliance (Adam Rusbridge)

- EDINA offers underlying technical support & coordination
- threats to digital stewardship:
  o failure (media, HW, SW, network, format obsolescence, natural disaster, economic/organizational failure)
  o attack (insider/outsider)
  o operator error
- source: Requirement to Digital Preservation
- projects PEPRS and PECAN help identify coverage and requirements for DP

Public testing (Michael Seadle)

- traditional physical archiving relies heavily on trusted institutions
- distrust, not trust, needs to be the basis of digital archiving; testing therefore plays a key role
- goals of testing: demonstrate functionality, reveal weaknesses, provide data for planning improvements
- key issues for testing: integrity, authenticity (can the origin or genuineness be shown?), usability (can migration/emulation be demonstrated?), access, financial integrity
- DrWho (drwho1com)
- bit stream testing is the most important; authenticity and usability may be impaired
  o the type of storage media, the number of copies and the frequency of checking and replacement get us to the relevant results
  o no reliable metrics exist, however (what is an acceptable loss etc.)
- without well-documented, peer-reviewed, publicly available test results, librarians are buying archiving systems on faith
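Bit stream testing across several copies can be illustrated with a simple checksum comparison. This is a minimal sketch, not a procedure described in the talk: the function name, the site labels and the majority rule are assumptions made for the example, and the copies are passed in as byte strings rather than read from storage.

```python
import hashlib
from collections import Counter
from typing import Dict, List


def audit_copies(copies: Dict[str, bytes]) -> List[str]:
    """Return the locations whose copy disagrees with the majority digest."""
    digests = {loc: hashlib.sha256(data).hexdigest()
               for loc, data in copies.items()}
    # Assumption: the digest shared by most copies is treated as correct.
    majority_digest, _ = Counter(digests.values()).most_common(1)[0]
    return sorted(loc for loc, d in digests.items() if d != majority_digest)


# Invented example: three copies, one of them silently damaged.
copies = {"site-a": b"payload", "site-b": b"payload", "site-c": b"pay1oad"}
damaged = audit_copies(copies)
```

Running such an audit on a schedule is where the number of copies and the checking frequency noted above come in: with only two copies a mismatch can be detected but not attributed.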

Presentation without a title (Andy Rauber)

- evaluation vs. testing vs. benchmarking
- DP testing: evaluation rather than testing, and far from benchmarking (a few tests, but nowhere near a definition of benchmarks)
  o existing evaluations are not repeatable
  o focus on the simple things
  o building the frameworks before having clear test scenarios
- it is necessary to move towards comparative benchmarking
- what is needed: a commitment that we want a culture of benchmarking and comparative evaluation; an understanding of what we want to benchmark; benchmark data + ground truth; measurement scales and measures that remain constant; knowledge bases to collect these

Organizing digital preservation on an international level (Michelle Gallinger, NDIIPP)

- focus on a national DP agenda
- community driven, action oriented (National Digital Stewardship Alliance)
  o present a distributed national digital collection for the benefit of citizens

The European Research Arena (David Giaretta)

- technologies: GEANT, EGEE/EGI
- EU research projects: TIMBUS, BLOG4EVER, SCAPE, ENSURE, APARSEN, ARCOMEM, WF4EVER
  o SCIDIP-ES (2011-2014)
- Alliance for Permanent Access (APA): formed as a legal entity 3 years ago
  o opportunities for networking
- ISO 16363: Audit and Certification of Trustworthy Digital Repositories
- ISO 16919: Requirements for Bodies Providing Audit and Certification

Observation from the MetaArchive Cooperative program (Martin Halbert)

- distributed DP programs differ from other programs
  o replication of content, distribution of these replicated copies to distinct geographical locations, and a network organization to connect these replicated copies
- MetaArchive: established in 2003, funded by NDIIPP
  o seeks to foster broader awareness of digital preservation issues
- IIPC: members are all institutions that focus on web archiving
  o 39 members: national and university libraries + other organizations (Internet Archive)
  o ISO standard WARC format for web archives + Heritrix and NutchWAX
  o growing membership (Africa & South America)

DAY 2

Keynote address: International and National Collaboration in the digital age (Gunnar Sahlin)

- 2012: a new law for e-deposit
- Samsök (search together) in 2005 (upgrade, new system)
- Swepub and long-term preservation
- Consortium of the Swedish research libraries for licensing e-journals and databases (ICOLC)
- Open Access and e-publishing (all universities have their repositories for e-pubs)
- NL as aggregator for Europeana: TEL, Apres, Athena, EUscreen
  o common system for the preservation of digital materials
  o common search portal for materials from the Swedish National Library and the Swedish National Archive

Raivo Ruusalepp

- standards: RAC, DSA, CIDOC (CRM), PAIMAS, ISA (DG), DDI
- use of information security standards for digital preservation
- information security: administrative and technical (physical = data security vs. IT = communication)
- a company implemented security measurement with typical cyber crime scenarios
- survey of security:
  o provision for information security in national legislation and development plans (12 of the respondents: ISO 27000 series; only 2 formal audits; the rest are looking into it; ½ of the respondents do not use standards or formal measurements to control information security)
  o IT & disaster plan: 65 (data recovery from the off-site location tested: 0)
- alignment:
  o better use of community standards for information security and preservation
  o agreement on security requirements

Standards-based approach to preservation planning (Matthew Woollard)

- ISO 27001: very expensive implementation, 100 000 pounds
- Basic: Data Seal of Approval Guidelines (helps you understand your business better)
- Audit and Certification of Trustworthy Digital Repositories
- Memorandum of Understanding to Create a European Framework
- ISO 16363: external DSA or ISO 16363 self-audit

Best Practices & Standards (Bram van der Werf)

- self-assessment: trust; audits/certification: trust
- ISO 30300 (draft): Record Management

PANEL 4: Legal Alignment

Legal deposit & Web Archiving (Adrienne Muir)

- legal deposit provisions: purpose, scope, deposit mechanisms, roles & responsibilities & liability, access provisions, sanctions
- implementation: definitions, scope (offline/online, freely available/pay wall), technology neutral, incremental
- legal deposit vs. voluntary (interim/hybrid approach, model agreements and licenses, flexibility)
- other legal issues: intellectual property rights, preservation, access, unlawful material, privacy/data protection
- voluntary approaches have disadvantages but may be necessary and can be useful

Breakout Session

- standards bring along alignment by themselves, but only if you use them to the full, not halfway
- depends on the community (users): enforceable or voluntary compliance (standards)
- next steps for standards alignment:
  o corpora as benchmarks
  o export/import completeness
  o educational standards
  o validation tools
  o accredited training courses to accredit auditors
  o framework standards

DAY 3

PANEL 5: Educational Alignment

- key elements of the DCC Curation Lifecycle Model
- Framework for the Education Alignment (USA: grad programs for digital preservation, new models for grad programs, internship programs (diverse knowledge), workshops)
  o sharing tools
  o national programs
- issues related to digital preservation:
  o nature of costs and business models
  o strategies for selection & appraisal
  o ground roles and responsibilities
  o effectiveness and demand for services
- focus of the panel: factors influencing the actual sustainability of a digital archive
- 2 considerations: collaboration + user demand
- challenges + gaps: span national boundaries, public + private funding, education, exportation, DP certification, competition, funding gaps, policy, selection criteria, roles & responsibilities, standards
- Magazzini Digitali: e-legal deposit in Italy
- PADI: a failed project (discontinued in 2010)

Presentation without a title (Neil Grindley)

- how much does it cost to manage information; what institutional financial strategies are required to facilitate effective preservation; what general economic frameworks are required to enable information to persist and be accessible
- JISC 2010 Infrastructure for Education and Research Programme
- archival storage and preservation activities constitute a very small proportion of the overall costs: 15%; access 31%; outreach, acquisition and ingest 55%
  o approx. 333 Euros for a set of 1000 records
- KRDS2, p. 83: future tool development supporting automation of ingest

Sustainable Preservation in North America: ADPNet & Friends (Aaron Trehub)

- solution: distributed digital preservation (in at least 3 copies vs. LOCKSS 6 copies)
- DDP + LOCKSS = PLN; open software developed at Stanford
- MetaArchive, COPPUL (Canada), ADPNet: wwwadpnorg


Salt Lake City, USA

Date (from-to) -

14 May: departure from Prague; 16 May: PREMIS workshop; 17-19 May: Archiving 2011 conference; 20 May: departure from SLC

15 May: departure Washington-Detroit-Salt Lake City; 15 May: workshops T1D Color in Image Capture Archiving and T2A Scanner & Camera Imaging Performance: Benchmarking, Compliance and Workflow Monitoring; 17-19 May: Archiving 2011 conference

NK, Archiving conference, IOP-NDK

0136

NA 6 June 2011

Date: 6 June 2011, Signature; Date, Signature

Date, Signature

PREMIS tutorial

----------------------------------------------------------
Implementation of a High Performance Architecture for Managing and Storing Web-Harvested Collections
Michael Smorul and Joseph JaJa, University of Maryland (USA)

httpswikiumiacsumdeduadaptimages66bArchiving11-smorulpdf

- web archiving
- the WARC Manager application

Using Tape for Large-Scale Digital Preservation Gary Wright FamilySearch (USA)

- (The Church of Jesus Christ of Latter-day Saints)

NDK - DRPS Ingest Tools. Storage type: storage grid, NetApp FAS3170

- information lifecycle management
- optimization of the storage layer between the grid and Rosetta
- human error
- hardware errors
- data integrity validation
- maximization
- integrity verification

metadata, content

Moving On: When it is Time to Re-Archive
Michael Selway, Quantum Corporation (USA)

- with HDDs there is no other way
- data migration from
- HDD: years
- split migration (full split migration)

FamilySearch An End-to-End Process for Scanning Characterizing Preserving and Providing Access to Very Large Collections of Vital Records Tom Creighton FamilySearch (USA) Jonathan Tilbury Tessella plc (UK) and Mark Evans Tessella Inc (USA)

- At FamilySearch: microfilms from the mid-19th century
- DPS is their

The Audit and Certification of FDsys

- FDsys and the audit

TRAC: httpwwwgpogovfdsys

- TRAC: HathiTrust, UNT, Portico, MetaArchive, Chronopolis
- compliance scale 1-

How Long is Long-Term Data Storage (Focal) Barry M Lunt Brigham Young University and Douglas Hansen Wayne Rust and Mark Worthington Millenniata Inc (USA)

- FLASH technology: 10-
- leakage of electrons
- and yet errors appeared
- Library 1: 21
- Library 2: 18
- 1050 years
- HDD: 1-7 years

Quality Assurance of Digital Information in Long-Term Digital Preservation University of Technology (Sweden)

- significant properties
- in documents from

Towards Interoperable Preservation Repositories Repository Exchange Package Use Cases and Best Practices Joseph Pawletko New York University and Priscilla Caplan Florida Center for Library Automation (USA)

- TIPR
- RXP: METS and PREMIS semantics; the metadata files are not inside the METS but alongside it
- succession
- disaster recovery
- as AIP
- software migration
- diversification
- data migration

SARKK Comprehensive Digital Archive Services for Finnish Municipalities

Savon Tietohallinto Oy (Finland)

- a company in ICT, long-
- with preservation planning functionality

Magnetic Tape Technology economic advantages for preservation Gary Francis Oracle USA

- Oracle presentation
- Clipper Group 2010, "In search for the long-term archiving solution": tape delivers significant TCO advantage over disk
- 5TB per cartridge (Sun/Oracle)

-----------------------------------------

Color In Digital Preservation Robert Buckley University of RochesterNewMarket Imaging Steven Puglia National Archives and Records Administration and Michael Stelmach Library of Congress (USA)

- etc.
  o of colours by the gamut
- AdobeRGB, ProPhoto RGB
- and colour reproduction

Multispectral Image Archiving of Watermarks in Historical Papers Peter Meinlschmidt Wilhelm-Klauditz-Institut Fraunhofer-Institute for Wood Research and Volker

Braunschweig (Germany)

- Reflected light
- Transmitted light
- spectral imaging

Implementing a Quality Assurance Program for Monitoring Scanner Performance Michael J Horsley and John T Berezich National Archives and Records Administration (USA) DAITSS

- -

microfilm, scan and [...] - NARA 2004 Guidelines

https://docs.google.com/viewer?url=http%3A%2F%2Fwww.archives.gov%2Fpreservation%2Ftechnical%2Fguidelines.pdf

- Metamorfoze - etc., see slides - quantitative performance slides - web-based database, SharePoint [...]

Preservation in a Digital Age Jay Verkler FamilySearch (USA)

- Tessellou pro SDB budou k 1 - - - data loss is intrinsic gt - - - och

- preservation as a service

Curation of the End-of-Term Web Archive Classification and Metrics Kathleen Murray Lauren Ko and Mark Phillips University of North Texas (USA)

- EOTCD archive: http://research.library.unt.edu/eotcd/wiki/Main_Page - [...] - 16 TB of data - [...] (not WARCs)

then they joined it together [...]

DAITSS Grows Up Migrating to a Second Generation Preservation System (Focal) Priscilla Caplan and Carol Chou Florida Center for Library Automation (USA)

- DAITSS 1 could not be installed anywhere else :-)

instructions

- v procesech

- - PSP

- refresh function
- disseminate: do a refresh and export the new AIP as a DIP
- withdraw: remove the AIP from storage, retaining provenance

-

1

- - 80TB -

A Community Driven Micro Services Architecture Supporting Long Term Digital Preservation Mark Evans and Bill Steel Tessella Inc (USA) and Robert Sharpe James Carr Alan Gairey and Jonathan Tilbury Tessella plc (UK)

s NDIIPP v micro-services

spojeno do procesu- lze pak libovol

- - - -

-

workflow - -

- [...] bit-stream - [...] cloud (S3, http://aws.amazon.com/s3); SaaS functionality will come soon, [...] tenancy [...] functionality, policy, etc.

Pozn z USA

[...] 44 TB SIP in the test (10 MB JP2 files)

SDB community - founded 2008 - [...]

System

Lisa LaPlant and Blake Edwards US Government Printing Office (USA) FDsys OAIS dig MODS - z

<part>2</part>

DMD data model definition z

http://www.gpo.gov/fdsys/search/pagedetails.action?st=prague&granuleId=&packageId=DCPD-200900228

----------------------------------------- Preservation Starts from the Beginning Michael Wash US Department of Transportation (USA)

Autographic Kodak 1916 na druhou

following the No. 3A Autographic Kodak Special of 1916, which was the first rangefinder camera. It had a Kodak Anastigmat f/6.3 lens and a Kodamatic shutter with speeds from 1/2 to 1/200 sec. plus bulb and time mode. http://camerapedia.wikia.com/wiki/No_1A_Autographic_Kodak_Special Kodak Advantix

zastavil

Colorite: A Flexible Cross-Platform Software Solution for Automatic Image Quality Analysis Using Arbitrary Targets. Henrik Johansson, National Library of Sweden (Sweden)

- a - -

o The target is detected automatically [...] so that this process [...]

o TIFF, JPEG, JP2, PNG o [...] o state-of-the-art feature-based image matching algorithm, ImageMagick

o Algorithm [...]

o [...] can be switched off in future to speed up the process for BATCH processing

Henrika Johanssona What if the Image Quality Analysis Rates My D Dietmar Wueller Image Engineering (Germany)

- Golden Thread - UTT target -

o

-lens reflex

Establishing Resolution Requirements for Digitizing Transmissive Content A Use Case Approach Michael Stelmach Library of Congress Don Williams Image Science Associates LLC and Steven Puglia National Archives and Records Administration (USA)

-

- Based on the 10 % SFR limiting resolution criterion: how much of the image information will be captured

o 1st half of the 20th century: 1200-1600 PPI o 2nd half of the 20th century: up to 2800 PPI

Digitise More Pay Less Optimising the Workprocess for both Heritage Institute and Imaging Provider Olaf Slijkhuis Pictura Imaginis (the Netherlands)

- ument kvalita parametry atd - -

- trolu

obrazu

[...] discussion with FamilySearch, 20 May 2011

[...] also on digital preservation issues

[...] will have to provide free of charge

[...] and synchronize

an effort on their side to cooperate with archives and libraries

CO-FINANCED FROM THE EU STRUCTURAL FUNDS (EUROPEAN REGIONAL DEVELOPMENT FUND) THROUGH THE IOP


Business trip report. Project: Creation of the National Digital Library (Vytvoření Národní digitální knihovny)

CZ 10611000706386

Name of traveller: Jan Hutař

Workplace per organizational structure: ODF 81

Position: head of department

Purpose of trip: attending the iPRES 2011 conference

Place (city): Singapore

Place (country): Singapore

Dates (from-to): 30 October - 5 November 2011

Detailed schedule: 30-31 October: flight Prague > Dubai > Singapore

1 November: start of the conference, tutorials

2-4 November: conference

4-5 November: return flight Singapore > Dubai > Prague

Fellow traveller from NK: Mgr. Marek Melichar (funded from project 0136)

Funding: IOP, Creation of the National Digital Library

Relation to the project: gaining new information on digital preservation and on projects in other libraries; consultations with colleagues and vendors

Trip goals: see relation to the project; use all outputs for planning and running the NDK project and for future handling of digital preservation at NK/NDK

Fulfilment of trip goals: met; see the detailed notes below and the proceedings at SPS


Further details: SUMMARY AND BENEFIT TO THE NDK PROJECT

- a noticeable rise of emulation as an approach to long-term preservation (in past years it was migration) > the two approaches seem likely to complement each other
- a shift towards preserving complex data (databases etc.); NK does not address this yet
- many papers applicable to NK and NDK (web archiving and preservation at the National Library of France, audits, emulation, information on the SDB system (Tessella) and on the RODA system, certification, see Rouchon, etc.)
- information on problems and solutions when using tools such as JHOVE, PRONOM and others in a real environment
- 2 papers on backing up optical discs, a current problem at NK as well; ideally follow the procedures described in the proceedings
- a clear need for international cooperation and adherence to standards, so that such cooperation is possible

for details see below

Project publicity support: N/A

Related materials

Material / Storage location

conference proceedings / SPS, folder with business trip reports

Report submitted: 15 November 2011

Signature of submitter

Date / Signature

Supervisor's signature: 15 November 2011

Posted on the intranet


Received by the international department

Seamus Ross: digital curation and preservation
- preserving data sets: use of statistical databases and research data vs. preservation of textual information
- personal data: these are no longer just photos in a box (Flickr etc.)
- banks: lots of personal data, future use for historians; it must be preserved, and institutions are doing so too
- see mckinsey.com, big data, full report (PDF)
- why do DP at all (slide 16): future generations expect it; for historians and researchers, so that they have some sources; a record of the present for the future; an information ecosystem to enable storytelling
- emphasis shifting from preservation of textual information to preservation of complex databases

A capability model for DP, Ch. Becker et al.
- research within the SHAMAN and SCAPE projects
- SoS: systems of systems
- 3 kinds of systems (UML)
- DPS as a functional requirement
- SoS: business system, a system within a system; data then flows into the DPS
- a DPS where DP is not a functional requirement but the system does it anyway (DP-ready system): a business system with DP functionality
- but how to get DP into enterprise systems: a model for implementing DP in any system; within the SHAMAN project, a capability-based reference architecture
- governance, business and technical (operation) capabilities: a basis for decisions and for assessing the current state
- capability maturity model (CMM): process assessment and improvement, as in software development

Olivier Rouchon: certification and quality at CINES
- they store theses, digitized items, multimedia, documents and scientific data sets
- a data centre for the whole of France
- they have format experts (XML); 11 people
- 15 TB of data
- certification: national law; CINES is the national centre for DP of theses; they have a department, staff and money for it; for the procedure and preparation see below
- preparation for certification: testing DRAMBORA, DSA, TRAC, ISO 16363 and ISO 16919
- step: 2009, DRAMBORA audit; 2 risk reviews per year on how their mitigation is progressing
- step: formalization of business processes, 14 processes according to ISO 9001
- management, operational and support processes (presented at iPRES 2010)
- 2009: external pre-audit, 2 people, 19 man-days, based on all available standards, carried out through documentation review and interviews
- 2010: SIAF audit, 4 months; carried out by the National Archives of France for every archive that stores public data; they audit every 3 years; the report was 800 pages long


- 2010: Data Seal of Approval, part of the EU framework for audit and certification of trusted repositories (MoU between the three certification initiatives)
- 2011: within the APARSEN project they also did ISO 16363; they went through that audit together with DANS and UKDA
- first internal, in January and April 2011 (60 man-days), then external, with 12 experts (KB, BL, NASA etc.) in June 2011 (3 days)

MoU: DSA > ISO 16363 as the second step (internal) > ISO 16363 extended. An audit gets faster the more of them you do, i.e. if it is done regularly it is not so time-consuming.

The National Library of New Zealand went through TRAC certification in autumn 2011.

Andreas Rauber: impact of preservation actions on the repository
- what happens to the repository itself
- repository simulation: RepoSim
- for analysis and for testing migration: what happens when files grow, what happens when the repository holds more format types, etc.
- RepoSim: a flexible simulator, irregular patterns
- so far an internal version: Hibernate, Java, MySQL
- you can specify which formats it accepts and their description, ingest, settings, hypothetical tools (mainly for migration), and rules for preservation actions (migration to which format, which version, which files, how many times; rules + filters)
- a virtual migration can be run; it produces graphs that show how long it will take, etc.
- what to migrate, how, to what and over what period: it runs virtually and we see the result
- good for IT and HW planning
- good for planning different scenarios, comparison with the expected development, planning HW upgrades and investments
- still to be added: the option to enter deletion policies, reports, etc.
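
The idea of a virtual migration can be illustrated with a toy calculation. The sketch below is not RepoSim (the notes describe it only as an internal Java/Hibernate/MySQL tool); the class names, the rule format and the throughput figure are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass
class StoredFile:
    name: str
    fmt: str        # format label, e.g. "TIFF"
    size_mb: float  # file size in megabytes


@dataclass
class MigrationRule:
    source_fmt: str         # migrate files in this format...
    target_fmt: str         # ...to this format
    throughput_mb_s: float  # assumed speed of a hypothetical migration tool


def simulate_migration(files, rule):
    """Apply the rule virtually: nothing is converted, effort is only estimated."""
    selected = [f for f in files if f.fmt == rule.source_fmt]
    total_mb = sum(f.size_mb for f in selected)
    return {
        "files_migrated": len(selected),
        "volume_mb": total_mb,
        "estimated_hours": total_mb / rule.throughput_mb_s / 3600,
    }


# Hypothetical repository content and rule
repo = [
    StoredFile("a.tif", "TIFF", 7200.0),
    StoredFile("b.tif", "TIFF", 10800.0),
    StoredFile("c.pdf", "PDF", 5.0),
]
rule = MigrationRule("TIFF", "JP2", throughput_mb_s=5.0)
report = simulate_migration(repo, rule)
print(report)
```

Running several rules against the same file list gives a rough comparison of scenarios, which is the kind of planning use the talk described.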

José Barateiro: Risk assessment in DP of e-science data and processes
- DP as risk management
- ISO 31000: definition of risk management
- similar to DRAMBORA
- there are many standards for risk management
- the ISO 31000 methodology broken down into individual steps
- TIMBUS project (http://timbusproject.net); one of the partners is SAP (Germany)

mad talks
- open-source software for LTP: RODA is back; it is being developed within the SCAPE project; new features, plans for further development and for building a user community
- 4 posters on emulation: emulation within KEEP, emulation for library reading rooms, OPF ecosystem registry
- TOTEM: a metadata standard for describing the technical environment for emulation


ANDS, Ross Wilkinson
- data centres in Australia: at least 3, for different areas of life
- ANDS has existed for almost 3 years; funded by the Australian government
- huge volumes of data will never be used or read by a human, only by automated mining processes
- research data must be stored and preserved because it may no longer be possible to recreate them; it must be done so that they can be reused, so that new conclusions can be drawn from them, and so that scientists have them available
- this must be done collaboratively; it cannot be done by a single institution alone
- who handles the storage of scientific data in the Czech Republic? The Academy of Sciences? CESNET?
- similar data centres exist in Great Britain as well

Rob Sharpe: Considerations for High Throughput Digital Preservation
Presentation by Tessella: their performance testing of ingest into SDB at FamilySearch

- SDB has been developed since 2002, when the first customer was the National Archives (UK)
- new customer: UK Parliament
- test with FamilySearch
- 20 TB ingest per day; scanned materials; workflow with antivirus, characterization (PRONOM, JHOVE), etc.
- 1 package is roughly 1 GB; 20,000 packages per day
- 2 Dell PowerEdge R710 servers, total cost at most 20,000 pounds
- the limiting factor turned out to be the read speed of the disks on which the data are stored at the start of ingest; they therefore needed 130 parallel disks (50,000 pounds)
- storing to tape is also slow; they therefore needed 8 parallel tape writes (30,000 pounds)
- storage costs 100 pounds per TB
- 7.3 PB per year
- conclusion: writing and reading are slow; tools such as JHOVE and PRONOM are fast enough; storage costs also turned out to be high

To ingest data from the FamilySearch project they needed to ensure a throughput of 20 TB of data per day while keeping adequate procedures for processing the data according to the requirements of OAIS and of the client. The project was about identifying the bottlenecks in ingesting large volumes of data.

Processes such as generating or verifying hashes, identifying formats and extracting technical metadata usually require a large storage system that, at large volumes, is also fast. In the FamilySearch project they want to ingest into SDB (content acquisition, content preparation, ingest, fixity check, content/metadata integrity check, characterization, i.e. format identification and validation and extraction of technical metadata) at most 700 MB per second.

They addressed how to parallelize such a massive workflow efficiently while minimizing cost. According to their findings, parallelization makes it possible to work around the performance problems of tools such as DROID and JHOVE; overall, contrary to their expectations, software performance was not a problem. The bigger problems are in the hardware, which must be able to write fast enough.
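
A parallel fixity check of the kind mentioned above can be sketched as follows. This is only an illustration and not Tessella's implementation: the function names, the use of SHA-256 and the thread pool size are assumptions.

```python
import hashlib
import os
import tempfile
from concurrent.futures import ThreadPoolExecutor


def sha256_of(path, chunk_size=1024 * 1024):
    """Compute a SHA-256 digest by streaming the file in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def fixity_check(expected, workers=4):
    """Verify many files in parallel; return the paths whose digest differs."""
    paths = list(expected)
    with ThreadPoolExecutor(max_workers=workers) as pool:
        actual = list(pool.map(sha256_of, paths))
    return [p for p, digest in zip(paths, actual) if digest != expected[p]]


# Small self-contained demonstration with temporary files
with tempfile.TemporaryDirectory() as tmp:
    manifest = {}
    for i in range(3):
        path = os.path.join(tmp, f"file{i}.bin")
        with open(path, "wb") as handle:
            handle.write(bytes([i]) * 1000)
        manifest[path] = sha256_of(path)
    # Corrupt one file after the manifest was recorded
    damaged = os.path.join(tmp, "file1.bin")
    with open(damaged, "ab") as handle:
        handle.write(b"x")
    failures = fixity_check(manifest)
    failed_names = [os.path.basename(p) for p in failures]
    print(failed_names)
```

Adding workers only helps as long as the storage can deliver data fast enough, which matches the observation that the bottleneck was the hardware rather than the tools.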

I.e. the bottleneck was in the hardware and in moving data from place to place rather than in the performance of the digital preservation tools.
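
The reported figures can be cross-checked with simple arithmetic. The calculation below assumes decimal units (1 TB = 10^12 bytes) and ingest spread evenly over 24 hours; neither assumption is stated in the talk notes.

```python
TB = 10**12  # decimal terabyte (assumption; the notes do not say which unit was meant)
MB = 10**6
SECONDS_PER_DAY = 24 * 60 * 60

daily_ingest_tb = 20    # from the notes
package_size_gb = 1     # "roughly 1 GB" per package
cost_per_tb_gbp = 100   # storage cost from the notes

# Average rate needed if ingest runs around the clock
sustained_mb_s = daily_ingest_tb * TB / SECONDS_PER_DAY / MB
packages_per_day = daily_ingest_tb * 1000 / package_size_gb
yearly_pb = daily_ingest_tb * 365 / 1000
yearly_storage_cost_gbp = daily_ingest_tb * 365 * cost_per_tb_gbp

print(round(sustained_mb_s, 1))
print(packages_per_day)
print(yearly_pb)
print(yearly_storage_cost_gbp)
```

Under these assumptions the average rate is well below the 700 MB per second peak mentioned above, and 20 TB per day works out to about 7.3 PB per year.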

Benefit for NK


There is no need to worry about the performance of software such as DROID or JHOVE.

Ross King: Evolving domains, problems and solutions for LT DP
- information on SCAPE and other projects
- programme: http://cordis.europa.eu/fp7/ict/telearn-digicult/report-research-digital-preservation_en.pdf (Stephan Strodl, Petar Petrov, Andreas Rauber, all Vienna University of Technology, Austria): a nice timeline for preservation projects, a whitepaper about the past of European DP
- funding spent on DP research is gradually growing; projects and funding alone will not solve anything
- ARCOMEM: web archiving, socially driven web preservation model
- social web analysis
- archive enrichment
- ENSURE: evaluation between cost and value, automation of the preservation cycle; testbeds: healthcare, clinical trials, financial services
- SCAPE
- preservation planning and action workflows: how to make them scalable
- building an infrastructure for scalable preservation actions
- development of a policy-based preservation planning tool with automated preservation watch
- 3 testbeds: web archives, large-scale repositories, research data sets
- all projects will produce prototype software
- digital lifecycle approach
- preservation planning plays a role in all of these projects, together with virtualization
- slide with DP trends of recent years
- Research on Digital Preservation within projects co-funded by the European Union in the ICT
- Ensure
- Scape
- Wf4Ever: http://www.wf4ever-project.org/about
- Timbus: software is not enough; it focuses on the context of organizations; LTP is not only about objects but also about services etc.
- Totem

Benefit for NK

Follow projects in the area of long-term preservation of digital data. The latest EU projects such as SCAPE will speed up the development of concrete tools for long-term preservation of digital data.

Record keeping in temporary command settings, Erik Borglund
- preservation of documentation on crisis situations arising from the activities of the police etc.
- how to capture the context: flipcharts, videos and notes can be kept; the context of analogue documents is not a problem, the problem is with digital items and conversations
- the national archive should take care of this, but it only accepts paper documents or, for example, photos from the scene
- question: is archiving records material the same as archiving the course of the proceedings in digital form?


webarchiving session

BnF
- 200 TB of web-archived data
- 15 million ARC files; they must be characterized and validated, which is time-consuming
- stored in a shared repository
- SPAR (the LTP system of the French national library) has a capacity of 16 PB
- they use JHOVE2 for characterization and are building a module for ARC files
- they do not want to characterize and validate the content of the ARCs, only identify formats

PREMIS in METS would be too long, so they will record only metadata at the level of the information package (AIP) that are the same for the whole package; i.e. a property is stated once and then only the information about which files it applies to is added, instead of repeating the information for every file. They created a special metadata format for this. They can therefore ask the LTP system "give me all information packages that contain format XY" and similar; there is no need to index the metadata of the contents, which would take a long time. They take the same approach for digitized books. There are different DP policies and validation levels for different types of web archive data: complete harvests vs. thematic harvests.
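
The package-level approach can be sketched as a small data structure: each property is recorded once per package together with the list of files it applies to. The sketch does not reproduce BnF's actual metadata format; the identifiers, format labels and function names are made up for illustration.

```python
from collections import defaultdict


def summarize_formats(file_formats):
    """Turn per-file format info into one entry per format listing its files."""
    by_format = defaultdict(list)
    for file_id, fmt in file_formats.items():
        by_format[fmt].append(file_id)
    return {fmt: sorted(ids) for fmt, ids in by_format.items()}


def packages_containing(packages, fmt):
    """Answer 'which packages contain format XY' from package-level summaries."""
    return sorted(pid for pid, summary in packages.items() if fmt in summary)


# Hypothetical packages: identifiers and format labels are made up
packages = {
    "aip-001": summarize_formats(
        {"f1": "text/html", "f2": "image/gif", "f3": "text/html"}
    ),
    "aip-002": summarize_formats({"f1": "application/pdf"}),
}
print(packages["aip-001"])
print(packages_containing(packages, "text/html"))
```

The query only touches the per-package summaries, so it never has to index the individual records inside the ARC files.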

NL NZ
- 2 harvests, 20 TB in total
- they are dealing with metadata: how much metadata is a lot and how much is too little
- library policy says that as much metadata as possible must be stored, but that would be a problem in terms of metadata size
- for selective web harvesting they have a finished workflow: WCT; everything is catalogued

IA
- 16 billion URLs
- the oldest from 1996
- growth of 3 TB per day, 1 PB per year

Euan Cochrane, Dirk von Suchodoletz: Replicating Installed Application and Information Environments onto Emulated or Virtualized Hardware

- capturing and preserving a whole environment on emulated HW
- e.g. take the prime minister's desktop environment and store it in the archive
- rendering problems
- computer forensics
- an option for preserving scientific data and records
- the whole thing is about how to replicate an HDD and run the environment on it in a virtual environment
- solution:
- they pulled HDDs out of several old PCs > identify HW requirements (HDD analysis > estimate of requirements, automatically; this is part of every PC environment) > choose emulation/virtualization SW (tool registry such as TOTEM from the KEEP project) > convert the HDD into a disk image suitable for emulation > try to boot the disk image on emulated HW > add drivers
- problems with licences, personal data protection, authenticity (20 things change, colours etc.)
- QEMU, SPARC processor emulator


Klaus Rechert: Remote Emulation for Migration Services in a Distributed Preservation Framework
- use of emulation as a migration tool

Migration tools are often not available for certain formats. We insert the digital object into an emulated environment (virtual machine) and then see it in the environment of the emulated system; we can open it in the original or another suitable application, save it in a different format and store it again in the virtual machine.

Bram Lohman: Emulation as a Business Solution: the Emulation Framework (KEEP project)

- emulation framework: 7 emulators, 6 platforms (x86, Amiga etc.), 23 file formats
- a solution for managing emulation tools
- setup of emulation processes
- an environment that contains emulators; if we load an application or a file into it, it should run as in the original environment
- the environment also contains a tool that shows, for a given file, what format it is and what environment is needed to run it, based on PRONOM; that environment can be prepared right away and the file run in it; it loads the SW image from the OPF application database that is being built
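
The lookup from an identified format to a runnable environment can be sketched as a simple table. This is not the KEEP Emulation Framework API: the format identifiers, emulator names and software image names below are placeholders, not real PRONOM entries or KEEP configuration.

```python
# Hypothetical registry: identifiers and environment names are placeholders,
# not real PRONOM entries or KEEP configuration.
ENVIRONMENTS = {
    "example-fmt/1": {
        "platform": "x86",
        "emulator": "emulator-a",
        "software_image": "dos-office",
    },
    "example-fmt/2": {
        "platform": "Amiga",
        "emulator": "emulator-b",
        "software_image": "workbench",
    },
}


def plan_rendering(file_name, format_id):
    """Return the steps needed to open a file, or None if no environment is known."""
    env = ENVIRONMENTS.get(format_id)
    if env is None:
        return None
    return [
        f"load software image '{env['software_image']}'",
        f"start {env['emulator']} for platform {env['platform']}",
        f"attach and open {file_name}",
    ]


plan = plan_rendering("letter.doc", "example-fmt/1")
print(plan)
```

In a real setup the format identifier would come from an identification tool and the table from a maintained registry; here both are hard-coded to keep the example self-contained.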

Geoffrey Brown: Developing Virtual CD-ROM Collections: The Voyager Company Publications

- publications of one particular publisher on CD; interactive applications for Mac; of 200 published, only about 50 are available today
- emulation on today's systems
- HDD snapshot directly in the emulator, i.e. it takes one click and is very fast
- SheepShaver emulator

Evaluation of a large Danish migration project
- before 1998 they had no formats prescribed by law
- between 2005 and 2008 they introduced standards
- the evaluation concerns the prescribed standards and migration into them at the national archive
- the evaluation was done for the funder
- between 2005 and 2008 they spent 30 person-years on migration; they had 10-15 people on it; 190 thousand USD was invested; total cost 26 million USD

In reality it is not much data: what they migrated was about 1777 GB.

Different parts of the archive: tapes, population data, data on CD-R, registries and electronically populated data. They could not read all the files, especially on tapes. 5 different types of tape. Some had to be rescued at great expense.


Total cost of producing the preservation standard: 10 man-years, 12 thousand USD, including a manual and implementation recommendations.

Pilot: project planning and management, and verifying the information packages.

The goal of the pilot was to obtain a better budget and project plan.

Some problematic data in old formats, such as old databases, need smart people who do repetitive work over a long time; they needed good knowledge management for this to be efficient.

Migration method: they wrote requirements for the tool and a description of how manual migration should be done.

Data preparation (restructuring data and registering the metadata of the IPs) and preparation of documentation for the migrated IPs.

Software development: in-house development.

They needed 50 person-years for 1

Conclusions

Migration of standard data is cheaper; migration from some standard tapes is cheaper, etc.

Most of the cost (80 %) went to non-standardized data, in producing the migration software. Developing a tool for migrating heterogeneous or non-standard data is the most expensive part.

What they learned: they had not analysed the old data sufficiently.

Project management: theirs was loose and they lost money.

Knowledge management: a good description of old data and of all their types, generations, locations etc. does not exist at NK and we will have trouble with it; migration of old data at NK will be problematic.

Angela Dappert: robust migration workflow for offline media
- What is an archival object (a nice slide): a CD is not an archival object, for them it is a hand-held carrier; better is a bit-stable object, which can have a backup etc., leading to an archival object that has further metadata; logical preservation
- A CD is not searchable, cannot be replicated easily, has a large manual overhead, and its rendering technology becomes obsolete very quickly
- Endangered Archives project: optical disks, CD-R, external HDs, tapes; 67 terabytes in total
- the OFFLINE hand-held carriers in the Endangered Archives project were very variable; they contained data with DRM, under copyright, and a number of such problems
- options they chose between for each data source:
- disk image: one file that contains everything on the carrier
- or extraction of only some files
- How important is the carrier itself? We need to have some information about it; there may be traces of deleted data and we may want to keep them. They made disk images of everything possible: hybrid DVDs, audio discs that also held data, etc.
- Which disk image format should they use? Not a single disk image format for all data, but a special disk image format for each type
- They did it with robots (disk copying robots); sometimes large-scale disk copying robots could not be used: they are good at producing CDs but not at ripping data from CDs


- They built their own application with disk stacks and some smaller robots; they tried LIFO and FIFO and in the end used FIFO, because LIFO had problems with the CD picker

At KB they thought through a fairly complex workflow, how to describe it, etc.

Each robot had its own PC.

They had problems with a number of things; see the presentation.

They did not find good software for managing images, only command-line tools, but non-technical staff would make a lot of mistakes with those.

It takes a lot of people before the content gets online.

Staff must be well trained and flexible, but also able to do tedious jobs: systematic, patient.

NOTE: important for NK, where the transfer of data from discs will also have to be dealt with, and already has been.


KEEP project, Antonio Ciuffreda: towards integrated migration environment

- Disk transfer tool: converts a disk into an image file. The image also carries further metadata about the file system, the MD5 of the file, etc.
- KEEP is producing a Transfer Tool Framework
- Magnetic media: disk transfer tools for floppy disks, commercial and open source
- Disk2FDI: commercial DOS tool; very accurate floppy disk image; it takes 1 hour and the result is very large, ten times larger than the floppy disk itself. Tested on about 2260 disks; they tested emulation
- Catweasel: commercial tool; it is a PCI card, runs on Linux and Win XP, has a GUI. High error rate, but faster; the image file quality was low
- Nibtools: free tool; G64 and D64; covers only C64; DOS, Win, Linux, but command line only. Requires a Commodore disk drive and special cables. They tested a few disks; about half then did not work in the emulator
- Optical media: they used 5 transfer tools, all on the same CDs, DVDs and games
1. Alcohol 120%: commercial; can bypass DRM etc.; supports Win systems. 12 of 13 worked; 2 MB per second
2. Daemon Tools: commercial; several image file types (ISO, MDS, MDF); supports Win; three of 13 did not work
3. CloneCD: commercial; uses IMG or ISO; bypasses SafeDisc 3 protection; supports Win; has a GUI; 11 of 13 were OK
4. BlindWrite: commercial; supports DVD, Blu-ray, WM, Xbox and other special discs. Generates ISO and some proprietary ISO image formats; one did not work; transfer speed [...]
5. ImgBurn: does not read subchannel information into the image file (a film cannot be seeked, etc.); it is open source; generates DVD, BIN, CUE, IMG; Win and Linux; 4 did not work; it is fast
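
The basic disk transfer step described at the top of this list (copy the carrier into an image file and record an MD5 checksum) can be sketched as follows. This is not one of the tools compared above and does not handle copy protection or read errors; the function name and block size are assumptions, and in-memory streams stand in for a real device.

```python
import hashlib
import io


def transfer_to_image(source, target, block_size=64 * 1024):
    """Copy a readable binary stream to an image stream, hashing on the way."""
    md5 = hashlib.md5()
    copied = 0
    while True:
        block = source.read(block_size)
        if not block:
            break
        target.write(block)
        md5.update(block)
        copied += len(block)
    return {"bytes": copied, "md5": md5.hexdigest()}


# In-memory streams stand in for a device and an image file
fake_disk = io.BytesIO(b"\x00" * 1474560)  # size of a 1.44 MB floppy
image = io.BytesIO()
record = transfer_to_image(fake_disk, image)
print(record["bytes"])
```

With real media the source would be an opened device or drive path, and the returned record would be stored alongside the image as part of its metadata.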

Conclusions

For magnetic media, the performance of commercial and non-commercial tools does not differ; Disk2FDI is accurate but very slow. KEEP will use NibTools.

Optical media: keep copy protection in mind; the tools perform similarly; there will always be errors in the images, between 30 and 10 percent; BlindWrite handles game discs (Xbox etc.). KEEP will use ImgBurn because it is open source.

For complex images BlindWrite is better.

Benefit for NK

Consider whether it would be appropriate for NK to actually run a project to migrate the content of CDs and DVDs to online media. The speakers present concrete experience with robotic processing and show what problems they had with designing the workflow, choosing the ISO image type, etc.

BnF: web archiving. They have three layers:

Harvest definition: collection

Harvest instance: crawling metadata


ARC files

A collection will be, for example, a selective one (elections 2000); then come the individual harvest instances, and then the ARCs. They collect logs, config and reports. These are also stored in an ARC: a special metadata ARC for each crawling instance.

PREMIS

Object, agent, event

Objects: 1. ARC files and metadata ARCs; 2. harvest instances

Harvest event in PREMIS: event "creation of content files"

Events: reports as event extensions (host report and harvest report)

Agents: administrator, software, institutions, organizations that perform the harvest

ContainerMD

http://bibnum.bnf.fr/containerMD-v1/documentation/containerMD-v1.html

special metadata for items from the web archive

http://bibnum.bnf.fr/containerMD-v1

Different SLAs for different types of material and for different data from different harvests. From the shared repository they expect various benefits: skills for different formats need not be duplicated within the institution.

Next year there should also be a JHOVE2 module for WARC.


Memento: meta search [...]

Benefit for NK

Their web archiving model could be used at NK.

Cost models: Danish national library + TU Wien
- Stephan Strodl, TU Vienna: they have their own cost model, but only small-scale; automated preservation action cost [...]
- Danish national library: they built their own model, which should be universal and usable anywhere
- they measured the cost of submission according to the PAIMAS standard
- when calculating cost they use OAIS and PAIMAS, map activities onto these models and then estimate process costs accordingly
- costmodelfordigitalpreservation.dk

Benefit for NK

Relevant to project 0136, where options for estimating the cost of long-term storage were addressed.

Meet RODA: a Full-Fledged Digital Repository for Long-Term Preservation

- Originally a project of the Portuguese national archive; we have been following it for several years. The RODA system is now supported by an independent company, which also partly continues its development. So far RODA supports only the archival metadata format (EAD), but further development should include library formats as well.
- RODA is now part of the SCAPE project, where it will be possible to develop the system further and scale it for use in mass production
- http://redmine.keep.pt/projects/roda-public

Benefit for NK

Follow further development, possibly also for the INCAD + KNAV projects; for developing LTP for smaller institutions this could be an interesting alternative in the future.

Report on a Foreign Business Trip

Name and surname of the participant: Mgr. Andrea Fojtů (AF)

Workplace (per organizational structure): 15 Department of Long-term Preservation of Digital Data (ODODD)

Position: 152 Department of Digital Repository Content Management

Reason for the trip: participation in the conference Aligning National Approaches to Digital Preservation

City: Tallinn

Country: Estonia

Dates (from-to): 22-26 May 2011

Detailed schedule:

Monday 23 May: conference registration, guided tour of the National Library; Keynote Address by Laura Campbell, Library of Congress (USA); Panel 1: Technical Alignment; Panel 2: Organizational Alignment

Tuesday 24 May: Keynote Address by Gunnar Sahlin, National Library of Sweden; Panel 3: Standards Alignment; Panel 4: Legal Alignment; Breakout Sessions for panels 3 & 4

Wednesday 25 May: Panel 5: Education Alignment; Panel 6: Economic Alignment; Breakout Sessions for panels 5 & 6; Synthesis; Closing remarks

Fellow travellers from the NK: PhDr. Bohdana Stoklasová (BS), Ing. Tomáš Svoboda (TS)

Funding: IOP NDK

Trip objectives: attendance at a conference with international participation; establishing contacts in the area of long-term preservation of digital documents and electronic legal deposit; a more detailed insight into the issues of long-term preservation of digital documents, (especially) in national libraries.

Fulfilment of the objectives (specifically): The conclusions of the conference Aligning National Approaches to Digital Preservation largely mirror the conclusions of the workshop The Future of the Past: Shaping new visions for EU research in digital preservation (report available at http://cordis.europa.eu/fp7/ict/telearn-digicult/future-of-the-past_en.pdf), e.g. regarding the missing economic model for the commercial sector that would treat long-term preservation as an integral part of all its processes. Contact was established with staff of the national libraries of Estonia and Finland (staff working on web archiving and on long-term preservation in general).

Programme and further details: The main goal of the conference was to align national approaches to long-term preservation of digital documents across all areas, from technical, organizational and educational through standardization to economic and financial ones.

Materials brought back: conference programme, leaflets of exhibiting companies (Tessella, Equella, Guardtime) + other materials, notes

Date of submission: 8 June 2011

Signature of the submitter:

Signature of the supervisor:

Posted on the intranet:

Received by the international department:

Attachment to this report: conference notes in English

Attachment: Conference notes in English

Exploring What We Can Do Together: Strategic Alliance for International Collaboration, Laura Campbell

- 185 digital preservation partners in more than 25 countries (education, research, LAM)
- strategic goals: National Content Stewardship Network (national digital collection, technical architecture, public policy, outreach)
- NDIIPP Content Domain Map: a mind map of geospatial, audiovisual, image & text, and web content
- then & now: cognitive surplus vs. digital libraries / digital preservation
- solution framework: actively working together, special interest groups, establishing a common index, international digital collection (freely available)

PANEL 1: Technical Alignment (The role of testing), Prof. Dr. Michael Seadle (Panel Head)

- to collaborate on requiring and implementing rigorous and independent tests

DNB Contribution to the Tallinn Alignment, Sabine Schrimpf

- key theme is infrastructure: a network of hardware and software that permits the operation of application SW
- the question of interoperability is crucial (standards, technical specifications, SW elements)
- components of the DP infrastructure were compared to pallets at railroads
  o source: PARSE.Insight Roadmap 2020
- persistent ID resolvers, certification process
- kopal (ingest, koLibRI)
- nestor (German Network of Expertise)
- DP4lib: Digital Preservation as a Service; reduce dependency between components
  o redundant storage at different locations
  o koLibRI modules
- LuKII: set up as an economical LOCKSS network in Germany
- SHAMAN
- APARSEN: wants to bring coherence and cohesion to digital preservation research
  o trends in DP research projects
  o modular DP systems
  o distributed as SOA
  o elimination of technology dependencies

EDINA: The UK LOCKSS Alliance, Adam Rusbridge

- EDINA offers underlying technical support & coordination
- threats to digital stewardship
  o failure (media, HW, SW, network, format obsolescence, natural disaster, economic/organizational failure)
  o attack (insider/outsider)
  o operator error
- source: Requirement to Digital Preservation
- projects PEPRS and PECAN help identify coverage and requirements for DP

Public testing, Michael Seadle

- traditional physical archiving relies heavily on trusted institutions
- distrust, not trust, needs to be the basis of digital archiving; testing therefore plays a key role
- goals of testing: demonstrate functionality, reveal weaknesses, provide data for planning improvements
- key issues for testing: integrity, authenticity (can the origin or genuineness be shown), usability (can migration/emulation be demonstrated), access, financial integrity
- DrWho (drwho1com)
- bit-stream testing is the most important; authenticity and usability may be impaired
  o the type of storage media, the number of copies + the frequency of checking and replacement get us to the relevant results
  o no reliable metrics exist, however (what is an acceptable loss, etc.)
- without well-documented, peer-reviewed, publicly available test results, librarians are buying archiving systems on faith
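How the number of copies and the checking frequency interact can be illustrated with a deliberately simple model. It is a sketch, not an accepted metric (the notes stress that none exists): it assumes copies fail independently with a fixed annual probability, and that every audit restores damaged copies as long as at least one good copy survives. The function name `loss_probability` and all parameters are hypothetical.

```python
def loss_probability(annual_failure, copies, checks_per_year, years):
    """Probability that all copies fail within one audit interval at least once.

    Assumes independent copies and full repair at every audit; both are
    strong simplifications made only for illustration.
    """
    interval = 1.0 / checks_per_year
    # Chance that a single copy fails during one audit interval.
    per_copy = 1.0 - (1.0 - annual_failure) ** interval
    # Content is lost only if every copy fails in the same interval.
    per_interval = per_copy ** copies
    intervals = int(round(years * checks_per_year))
    return 1.0 - (1.0 - per_interval) ** intervals


if __name__ == "__main__":
    for copies in (1, 2, 3):
        for checks in (1, 12):
            p = loss_probability(0.05, copies, checks, years=10)
            print(f"copies={copies} checks/year={checks} loss={p:.2e}")
```

Even this toy model shows why both levers matter: extra copies help little if nobody checks them, and frequent checks cannot help a single copy.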

Presentation without a title, Andy Rauber

- evaluation vs. testing vs. benchmarking
- DP testing today is evaluation rather than testing, and far from benchmarking (a few tests, but nowhere near a definition of benchmarks)
  o existing evaluations are not repeatable
  o focus on the simple things
  o building the frameworks before having clear test scenarios
- necessary to move towards comparative benchmarking
- what is needed: a commitment that we want a culture of benchmarking and comparative evaluation; an understanding of what we want to benchmark; benchmark data + ground truth; measurement scales and measures that remain constant; knowledge bases to collect these

Organizing digital preservation on an international level, Michelle Gallinger (NDIIPP)

- focus on a national DP agenda
- community-driven, action-oriented (National Digital Stewardship Alliance)
  o present a distributed national digital collection for the benefit of citizens

The European Research Arena, David Giaretta

- technologies: GEANT, EGEE/EGI
- EU research projects: TIMBUS, BLOG4EVER, SCAPE, ENSURE, APARSEN, ARCOMEM, WF4EVER
  o SCIDIP-ES (2011-2014)
- Alliance for Permanent Access (APA) formed as a legal entity 3 years ago
  o opportunities for networking
- ISO 16363: Audit and Certification of Trustworthy Digital Repositories
- ISO 16919: Requirements for Bodies Providing Audit and Certification

Observation from the MetaArchive Cooperative program, Martin Halbert

- distributed DP programs differ from other programs
  o replication of content, distribution of these replicated copies to distinct geographical locations, and network organization to connect these replicated copies
- MetaArchive established in 2003, funded by NDIIPP
  o seeks to foster broader awareness of digital preservation issues
- IIPC members are all institutions that focus on WA
  o 39 members: national and university libraries + other organizations (Internet Archive)
  o ISO standard WARC format for web archives + Heritrix and NutchWAX
  o growing membership (Africa & South America)

DAY 2

Keynote address: International and National Collaboration in the digital age, Gunnar Sahlin

- 2012: a new law for e-deposit
- Samsök (search together) in 2005 (upgrade, new system)
- SwePub and long-term preservation
- consortium of the Swedish research libraries for licensing e-journals and databases (ICOLC)
- Open Access and e-publishing (all universities have their repositories for e-pubs)
- NL: aggregator for Europeana, TEL, Apres, Athena, EUscreen
  o common system for the preservation of digital materials
  o common search portal for materials from the Swedish National Library and Swedish National Archive

Raivo Ruusalepp

- standards: RAC, DSA, CIDOC (CRM), PAIMAS, ISA (DG), DDI
- use of information security standards for digital preservation
- information security: administrative and technical (physical = data security vs. IT = communication)
- company-implemented security measurement with typical cyber-crime scenarios
- survey of security
  o provision for information security in national legislation and development plans (12 of the respondents: ISO 27000 series; only 2: formal audits; the rest are looking into it; ½ of the respondents do not use standards or formal measurements to control information security)
  o IT & disaster plan: 65 (data recovery from the off-site location tested: 0)
- alignment
  o better use of community standards for information security and preservation
  o agreement on security requirements

Standards-based approach to preservation planning, Matthew Woollard

- ISO 27001: very expensive implementation (100 000 pounds)
- Basic: Data Seal of Approval Guidelines (helps you understand your business better)
- Audit and Certification of Trustworthy Digital Repositories
- Memorandum of Understanding to Create a European Framework
- ISO 16363 external, DSA, or ISO 16363 self-audit

Best Practices & Standards, Bram van der Werf

- self-assessment trust, audit/certification trust
- ISO 30300 (draft): Record Management

PANEL 4: Legal Alignment

Legal deposit & Web Archiving, Adrienne Muir

- legal deposit provisions: purpose, scope, deposit mechanisms, roles & responsibilities & liability, access provisions, sanctions
- implementation: definitions, scope, offline/online, freely available/pay-well, technology-neutral, incremental
- legal deposit vs. voluntary (interim/hybrid approach, model agreements and licenses, flexibility)
- other legal issues: intellectual property rights, preservation, access, unlawful material, privacy/data protection
- voluntary approaches have disadvantages but may be necessary and can be useful

Breakout Session

- standards bring alignment by themselves, but only if you use them to the full, not halfway
- depends on the community (users): enforceable or voluntary compliance (standards)
- next steps for standards alignment
  o corpora as benchmarks
  o export/import completeness
  o educational standards
  o validation tools
  o accredited training courses to accredit auditors
  o framework standards

DAY 3

PANEL 5: Educational Alignment

- key elements of the DCC Curation Lifecycle Model
- Framework for the Education Alignment (USA: grad programs for digital preservation, new models for grad programs, internship programs (diverse knowledge), workshops)
  o sharing tools
  o national programs
- related issues to digital preservation
  o nature of costs and business models
  o strategies for selection & appraisal
  o ground roles and responsibilities
  o effectiveness and demand for services
- focus of the panel: factors influencing the actual sustainability of a digital archive
- 2 considerations: collaboration + user demand
- challenges + gaps: span national boundaries, public + private funding, education, exportation, DP certification, competition, funding gaps, policy, selection criteria, roles & responsibilities, standards
- Magazzini Digitali: e-legal deposit in Italy
- PADI: a failed project (discontinued in 2010)

Presentation without a title, Neil Grindley

- how much does it cost to manage information; what institutional financial strategies are required to facilitate effective preservation; what general economic frameworks are required to enable information to persist and be accessible
- JISC 2010: Infrastructure for Education and Research Programme
- archival storage and preservation activities constitute a very small proportion of the overall costs (15%; access 31%; outreach, acquisition and ingest 55%)
  o approx 333 Euros for a set of 1000 records
- KRDS2, p. 83: future tool development supporting automation of ingest

Sustainable Preservation in North America: ADPNet & Friends, Aaron Trehub

- solution: distributed digital preservation (in at least 3 copies vs. LOCKSS 6 copies)
- DDP + LOCKSS = PLN; open SW developed at Stanford
- MetaArchive, COPPUL (Canada), ADPNet (www.adpn.org)


Salt Lake City USA

Dates (from-to): -

14 May: departure from Prague - - 16 May: PREMIS workshop -+ 17-19 May: Archiving 2011 conference; 20 May: departure from SLC - -

15 May: flight Washington-Detroit-Salt Lake City; 15 May: workshops T1D Color in Image Capture Archiving, T2A Scanner & Camera Imaging Performance: Benchmarking, Compliance and Workflow Monitoring; 17-19 May: Archiving 2011 conference

NK, Archiving conference, IOP-NDK

0136

N/A 6 June 2011

Date: 6 June 2011 Signature Date Signature

Date Signature

tutorial PREMIS

----------------------------------------------------------
Implementation of a High Performance Architecture for Managing and Storing Web-Harvested Collections, Michael Smorul and Joseph JaJa, University of Maryland (USA)

https://wiki.umiacs.umd.edu/adapt/images/6/6b/Archiving11-smorul.pdf

- web archiving - WARC manager application

- -

-

-

- P) -

Using Tape for Large-Scale Digital Preservation Gary Wright FamilySearch (USA)

- (The Church of Jesus Christ of Latter-day Saints)

-

NDK - DRPS, Ingest Tools. Storage type: storage grid, NetApp FAS3170

P information lifecycle management R optimization of the storage layer between the Grid and Rosetta

- - na -

- human error - - - HW errors - data integrity validation - - - - maximization - integrity verification

metadata conten

Moving On When it is Time to Re-Archive Michael Selway Quantum Corporation (USA) CHW) D

there is no other way with HDDs

Data migration from ... How to approach the migration

- HDD - - years

and could ...

- - - - -

what about split migration (full split migration)

- - - - potaz

FamilySearch An End-to-End Process for Scanning Characterizing Preserving and Providing Access to Very Large Collections of Vital Records Tom Creighton FamilySearch (USA) Jonathan Tilbury Tessella plc (UK) and Mark Evans Tessella Inc (USA)

- At FamilySearch, microfilms from the mid-19th century - - - - - s -

- DPS is their s - -

The Audit and Certification of FDsys

- FDsys - audit

TRAC http://www.gpo.gov/fdsys

- - TRAC: Hathi, UNT, Portico, MetaArchive, Chronopolis v

- compliance scale 1- -

How Long is Long-Term Data Storage (Focal) Barry M Lunt Brigham Young University and Douglas Hansen Wayne Rust and Mark Worthington Millenniata Inc (USA)

- - FLASH technology 10- full electron

electron leakage and

-

and yet errors appeared

- Library 1 21 - Library 2 18

- 1050 years - HDD 1-7 years -

Quality Assurance of Digital Information in Long-Term Digital Preservation University of Technology (Sweden)

- significant properties - dokumentech z -

Towards Interoperable Preservation Repositories Repository Exchange Package Use Cases and Best Practices Joseph Pawletko New York University and Priscilla Caplan Florida Center for Library Automation (USA)

- TIPR

- - - TIPR

-

- RXP: METS and PREMIS semantics; metadata files not inside the METS but alongside, in

- - succession - - - - disaster recovery and in

as AIP - SW migration - - diversification - data migration in -

SARKK Comprehensive Digital Archive Services for Finnish Municipalities

-Savon Tietohallinto Oy (Finland)

- a company which in ICT long-

- - - preservation planning functionality -

-

Magnetic Tape Technology: economic advantages for preservation, Gary Francis, Oracle (USA)

- Oracle presentation - - Clipper Group 2010, In Search of the Long-Term Archiving Solution: tape delivers significant TCO advantage over disk - 5 TB per cartridge (Sun/Oracle)

-----------------------------------------

Color In Digital Preservation Robert Buckley University of RochesterNewMarket Imaging Steven Puglia National Archives and Records Administration and Michael Stelmach Library of Congress (USA)

-

atd -

o of colours by the GAMUT

AdobeRGB, ProPhoto RGB

- and colour reproduction

Multispectral Image Archiving of Watermarks in Historical Papers Peter Meinlschmidt Wilhelm-Klauditz-Institut Fraunhofer-Institute for Wood Research and Volker

Braunschweig (Germany)

- Reflected light - Transmitted light

-spectral imaging -

o

o o o o

Implementing a Quality Assurance Program for Monitoring Scanner Performance Michael J Horsley and John T Berezich National Archives and Records Administration (USA) DAITSS

- -

microfilm scan and r - NARA 2004 Guidelines

http://www.archives.gov/preservation/technical/guidelines.pdf

- Metamorfoze - etc., see slides - quantitative performance slides - web-based database, SharePoint -

Preservation in a Digital Age Jay Verkler FamilySearch (USA)

- Tessellou pro SDB budou k 1 - - - data loss is intrinsic gt - - - och

- preservation as a service

Curation of the End-of-Term Web Archive Classification and Metrics Kathleen Murray Lauren Ko and Mark Phillips University of North Texas (USA)

- - EOTCD archive: http://research.library.unt.edu/eotcd/wiki/Main_Page - - for - 16 TB of data - - (not WARCs)

then joined it together

DAITSS Grows Up Migrating to a Second Generation Preservation System (Focal) Priscilla Caplan and Carol Chou Florida Center for Library Automation (USA)

- DAITSS 1 could not be installed anywhere else :-) -

instructions

- in processes

- - PSP

- refresh funkce - disseminate- do a refresh and export new AIP as DIP - withdraw remove AIP from storage retaining provenance

-

1

- - 80TB -

A Community Driven Micro Services Architecture Supporting Long Term Digital Preservation Mark Evans and Bill Steel Tessella Inc (USA) and Robert Sharpe James Carr Alan Gairey and Jonathan Tilbury Tessella plc (UK)

with NDIIPP, in micro-services

combined into a process - can then be arbitrarily

- - - -

-

workflow - -

- bit-stream - cloud (S3, http://aws.amazon.com/s3); SaaS functionality will soon be multi-tenancy

functionality, policy, etc.

Note from the USA

44 TB of SIPs in the test (10 MB JP2 files)

SDB community - - founded 2008 - -

System

Lisa LaPlant and Blake Edwards, US Government Printing Office (USA): FDsys, OAIS, dig MODS - z

<part>2</part>

DMD: data model definition z

http://www.gpo.gov/fdsys/search/pagedetails.action?st=prague&granuleId=&packageId=DCPD-200900228

----------------------------------------- Preservation Starts from the Beginning Michael Wash US Department of Transportation (USA)

Autographic Kodak 1916 na druhou

following the No. 3A Autographic Kodak Special of 1916, which was the first rangefinder camera. It had a Kodak Anastigmat f/6.3 lens and a Kodamatic shutter with speeds from 1/2 to 1/200 sec, plus bulb and time mode. http://camerapedia.wikia.com/wiki/No_1A_Autographic_Kodak_Special Kodak Advantix

zastavil

Colorite A Flexible Cross-Platform Software Solution for Automatic Image QualityAnalysis Using Arbitrary Targets Henrik Johansson National Library of Sweden (Sweden)

- a - -

o The target is detected automatically ... so that this process

o TIFF, JPEG, JP2, PNG o o state-of-the-art feature-based image matching algorithm, ImageMagick

o Alg

t o o can be switched off in the future to speed up the process for BATCH

processing o

Henrika Johanssona What if the Image Quality Analysis Rates My D Dietmar Wueller Image Engineering (Germany)

- Golden Thread - UTT target -

o

-lens reflex

Establishing Resolution Requirements for Digitizing Transmissive Content A Use Case Approach Michael Stelmach Library of Congress Don Williams Image Science Associates LLC and Steven Puglia National Archives and Records Administration (USA)

-

- Based on the 10% SFR limiting resolution criterion: how much of the image information will be captured

o first half of the 20th century: 1200-1600 PPI o second half of the 20th century: up to 2800 PPI o o

Digitise More Pay Less Optimising the Workprocess for both Heritage Institute and Imaging Provider Olaf Slijkhuis Pictura Imaginis (the Netherlands)

- ument kvalita parametry atd - -

- trolu

obrazu

N discussion with FamilySearch, 20 May 2011 -------

also on Digital Preservation issues

will have to provide free of charge

and synchronize

an effort on their side to cooperate with archives and libraries


Business Trip Report

Project: Creation of the National Digital Library

CZ.1.06/1.1.00/07.06386

Name and surname of the participant: Jan Hutař

Workplace (per organizational structure): ODF 81

Position: head of division

Reason for the trip: attending the iPRES 2011 conference

City: Singapore

Country: Singapore

Dates (from-to): 30 October - 5 November 2011

Detailed schedule: 30-31 October: flight Prague > Dubai > Singapore

1 November: start of the conference, tutorials

2-4 November: conference

4-5 November: return flight Singapore > Dubai > Prague

Fellow travellers from the NK: Mgr. Marek Melichar (funded from project 0136)

Funding: IOP, Creation of the National Digital Library

Relation to the project: gaining new information about digital preservation and about projects in other libraries; consultations with colleagues and companies

Trip objectives: see relation to the project; use all outputs for the planning and running of the NDK project, and for the future handling of digital preservation at the NK/NDK

Fulfilment of the objectives: fulfilled; see the detailed notes below and the proceedings on SPS


Further details: SUMMARY AND BENEFIT FOR THE NDK PROJECT

- a noticeable rise of emulation as an approach to long-term preservation (in previous years it was migration); the two approaches look set to complement each other
- a shift towards the preservation of complex data, databases, etc.; the NK does not address this yet
- many papers applicable to the NK and NDK (web archiving and preservation at the national library of France, audits, emulation, information about the SDB system (Tessella) and the RODA system, certification, see Rouchon, etc.)
- information about problems and solutions when using tools such as JHOVE, PRONOM, etc. in a real environment
- 2 papers on backing up optical discs, a current problem at the NK as well; ideally follow the procedures described in the proceedings
- a clear need for international cooperation and adherence to standards so that such cooperation is possible
- for details see below

Project publicity support: N/A

Related materials

Material: Storage location

conference proceedings: SPS, folder with business trip reports

Date of submission: 15 November 2011

Signature of the submitter

Date Signature

Signature of the supervisor: 15 November 2011

Posted on the intranet


Received by the international department

Seamus Ross: digital curation and preservation
- preserving data sets: use of statistical databases and research data vs. preservation of textual information
- personal data: no longer just photos in a box (Flickr etc.)
- banks: a lot of personal data, useful in the future for historians; it must be preserved, and the institutions do so as well
- see mckinsey.com, big data, full report (PDF)
- so why do DP (slide 16): future generations expect it; for historians and researchers, so that they have some sources; a message about the present for the future; an information ecosystem to enable storytelling
- emphasis is shifting from the preservation of textual information to the preservation of complex databases

A capability model for DP, Ch. Becker et al.
- research within the SHAMAN and SCAPE projects
- SoS: systems of systems
- 3 kinds of systems (UML)
- DPS with DP as a functional requirement
- SoS: a business system as a system within a system; data then flows into the DPS
- DPS where DP is not a functional requirement but which does it anyway (DP-ready system): a business system with DP functionality
- but how to get DP into enterprise systems: a model for implementing DP in any system; within the SHAMAN project, a capability-based reference architecture
- governance, business and technical (operation) capabilities: a basis for decisions and for assessing the current state
- capability maturity model (CMM): processes for assessment and improvement, as in SW development

Olivier Rouchon: certification and quality at CINES
- they store theses, digitized items, multimedia, documents, scientific data sets
- a data centre for the whole of France
- they have format experts (XML); 11 people
- 15 TB of data
- certification: by national law CINES is the national centre for DP of theses; they have a department, people and money for it; for the procedure and preparations see below
- preparation for certification: testing DRAMBORA, DSA, TRAC, ISO 16363 and ISO 16919
- step: 2009, DRAMBORA audit; 2 risk reviews per year on how their resolution is progressing
- step: formalization of business processes; 14 processes according to ISO 9001
- management, operational and support processes (presented at iPRES 2010)
- 2009: external pre-audit, 2 people, 19 man-days, based on all available standards, done by reviewing documentation and conducting interviews
- 2010: SIAF audit, 4 months; the National Archives of France does it for every archive that stores public data; they audit every 3 years; the report had 800 pages


- 2010: Data Seal of Approval, part of the EU framework for audit and certification of trusted repositories (MoU between the three certification activities)
- 2011: within the APARSEN project they also did ISO 16363; they went through that audit together with DANS and UKDA
- first internal, in January and April 2011 (60 man-days), then external, 12 experts (KB, BL, NASA, etc.) in June 2011 (3 days)

MoU: DSA > ISO 16363 as the second step (internal) > ISO 16363 extended
The audit gets faster the more of them you do, i.e. if it is done regularly it is not that time-consuming.

The National Library of New Zealand went through TRAC certification in autumn 2011.

Andreas Rauber: impact of preservation actions on the repository
- what happens to the repository itself
- repository simulation: RepoSim
- for analysis and for testing migration: what happens when files get bigger, what if we have more format types in the repository, etc.
- RepoSim: a flexible simulator, irregular patterns
- an internal version so far (Hibernate, Java, MySQL)
- you can specify which formats it accepts, their description, ingest settings, hypothetical tools (mainly for migration), and rules for preservation actions (migration to which format, which versions, which files, how many times; rules + filters)
- a virtual migration can be run; it produces graphs that show how long it will take, etc.
- what to migrate, how, to what and over what period: it runs virtually and we see the result
- good for IT and HW planning
- good for planning different scenarios, comparison with the expected development, planning HW growth and investment
- still to be added: the option to specify deletion policies, reports, etc.
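The idea of a virtual migration can be illustrated with a toy simulation. This is not RepoSim (which the notes describe as an internal Java/Hibernate/MySQL tool); the `MigrationRule` class, the `simulate` function, the formats and the throughput figure are all hypothetical and only show the kind of question such a simulator answers.

```python
from dataclasses import dataclass


@dataclass
class MigrationRule:
    source_format: str
    target_format: str
    year: int              # simulated year in which the rule fires
    files_per_hour: float  # assumed throughput of a hypothetical tool


def simulate(initial, yearly_ingest, growth, rules, years):
    """Return per-year holdings and the hours each virtual migration takes."""
    holdings = dict(initial)      # format -> number of files held
    ingest = dict(yearly_ingest)  # format -> files ingested per year
    history, migrations = [], []
    for year in range(1, years + 1):
        for fmt, count in list(ingest.items()):
            holdings[fmt] = holdings.get(fmt, 0) + int(count)
            ingest[fmt] = count * (1.0 + growth)  # ingest grows every year
        for rule in rules:
            if rule.year == year and holdings.get(rule.source_format, 0) > 0:
                # Move everything held in the source format to the target.
                moved = holdings.pop(rule.source_format)
                holdings[rule.target_format] = holdings.get(rule.target_format, 0) + moved
                migrations.append(
                    (year, rule.source_format, moved, moved / rule.files_per_hour)
                )
        history.append((year, dict(holdings)))
    return history, migrations


if __name__ == "__main__":
    rules = [MigrationRule("TIFF", "JP2", year=2, files_per_hour=600.0)]
    history, migrations = simulate({"TIFF": 1000}, {"TIFF": 100}, 0.0, rules, 3)
    print(history)
    print(migrations)
```

Note that in this sketch new files keep arriving in the old format after the migration, which is exactly the kind of scenario one might want to examine before buying hardware.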

José Barateiro: Risk assessment in DP of e-science data and processes
- DP as risk management
- ISO 31000: definition of risk management
- similar to DRAMBORA
- there are many standards for risk management
- the ISO 31000 methodology broken down into individual steps
- TIMBUS project (http://timbusproject.net); one of the partners is SAP (Germany)

Mad talks
- open-source SW for LTP: RODA is back and is being developed within the SCAPE project; new functions, development plans and the creation of a user community
- 4 posters on emulation: emulation within KEEP, emulation for reading rooms in libraries, OPF, ecosystem, registry
- TOTEM: a metadata standard for describing the technical environment for emulation


ANDS, Ross Wilkinson
- data centres in Australia, at least 3, for different areas of life
- ANDS has existed for almost 3 years; money from the Australian government
- a huge amount of data will never be used or read by a human, only by automated processes and data mining
- research data must be stored and preserved because it may no longer be possible to recreate it; so that it can be reused, so that new conclusions can be drawn from it, and so that scientists have it available
- this has to be done in cooperation; it cannot be done by a single institution alone
- who handles the storage of scientific data in the Czech Republic? The Academy of Sciences? CESNET?
- similar data centres also exist in the UK

Rob Sharpe: Considerations for High Throughput Digital Preservation
Presentation by Tessella: their performance testing of ingest into SDB at FamilySearch.

- SDB has been in development since 2002, when the first customer was the National Archives (UK)
- new customer: the UK Parliament
- test with FamilySearch
- 20 TB ingested per day: scanned materials, workflow with antivirus, characterization (PRONOM, JHOVE), etc.
- 1 package is roughly 1 GB; 20,000 packages per day
- 2 Dell PowerEdge R710 servers, total price at most 20,000 pounds
- the limiting factor turned out to be the read speed of the disks on which the data is stored at the start of ingest; they therefore needed 130 parallel disks (50,000 pounds)
- writing to tape is also slow; they therefore needed 8 parallel tape writes (30,000 pounds)
- storage costs 100 pounds per TB
- 7.3 PB per year
- conclusion: writing and reading are slow; tools such as JHOVE and PRONOM are fast enough; storage costs also turned out to be high
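The figures in the list above are mutually consistent, which a few lines of arithmetic confirm. The sketch below assumes decimal units (1 TB = 1,000,000 MB) and a 365-day year; the function names are ours, not Tessella's.

```python
SECONDS_PER_DAY = 24 * 60 * 60


def sustained_rate_mb_s(tb_per_day):
    """Average rate in MB/s needed to move tb_per_day (decimal units)."""
    return tb_per_day * 1_000_000 / SECONDS_PER_DAY


def yearly_volume_pb(tb_per_day, days=365):
    """Total volume per year in PB."""
    return tb_per_day * days / 1000


def yearly_storage_cost(tb_per_day, cost_per_tb, days=365):
    """Cost of storing one year of ingest at a flat price per TB."""
    return tb_per_day * days * cost_per_tb


def packages_per_day(tb_per_day, package_gb):
    """Number of packages per day for a given average package size."""
    return tb_per_day * 1000 / package_gb


if __name__ == "__main__":
    print(f"{sustained_rate_mb_s(20):.1f} MB/s sustained")
    print(f"{yearly_volume_pb(20)} PB per year")
    print(f"{yearly_storage_cost(20, 100):,.0f} pounds per year of ingest")
    print(f"{packages_per_day(20, 1):,.0f} packages per day")
```

At 100 pounds per TB, a year of ingest costs far more to store than the roughly 100,000 pounds of servers, disks and tape drives listed, which matches the closing remark about storage costs.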

For the ingest of data from the FamilySearch project they needed to ensure a throughput of 20 TB of data per day while keeping adequate procedures for processing the data according to the requirements of OAIS and of the customer. The aim of the project was to identify the bottlenecks in ingesting large volumes of data.

Processes such as generating or checking hashes, identifying formats and extracting technical metadata usually require a fast storage system at large volumes. In the FamilySearch project they want to ingest into SDB (content acquisition, content preparation, ingest, fixity check, content metadata integrity check, characterization, i.e. format identification and validation and extraction of technical MD) at most 700 MB per second.

They addressed how to parallelize such a massive workflow efficiently while minimizing costs. According to their findings, parallelization makes it possible to get around performance problems with tools such as DROID and JHOVE; overall, software performance was not a problem, contrary to their expectations. The bigger problems are in the HW, which must be able to write fast enough.
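As an illustration of parallelizing one of these steps, the sketch below computes and verifies SHA-256 checksums for a directory tree with a thread pool. It uses only the Python standard library and is not how SDB implements fixity checks; the function names, the worker count and the chunk size are assumptions. In line with the finding above, adding workers only helps while the storage can deliver data fast enough.

```python
import hashlib
from concurrent.futures import ThreadPoolExecutor
from pathlib import Path


def sha256_of(path, chunk_size=1024 * 1024):
    """Hash one file in chunks so large files do not have to fit in memory."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def checksum_tree(root, workers=8):
    """Return {relative path: sha256} for every file under root."""
    files = sorted(p for p in Path(root).rglob("*") if p.is_file())
    with ThreadPoolExecutor(max_workers=workers) as pool:
        digests = list(pool.map(sha256_of, files))
    return {str(p.relative_to(root)): d for p, d in zip(files, digests)}


def verify_tree(root, manifest, workers=8):
    """Return the relative paths whose current digest differs from the manifest."""
    current = checksum_tree(root, workers)
    return sorted(
        name for name, digest in manifest.items() if current.get(name) != digest
    )
```

A typical use would be to call `checksum_tree` at ingest, store the manifest with the package, and run `verify_tree` later as the fixity check; a missing file is reported the same way as a changed one.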

I.e. the bottleneck was in the HW and in moving data from place to place rather than in the performance of the digital preservation tools.

Benefit for the NK


No need to worry about the performance of SW such as DROID or JHOVE.

Ross King: Evolving domains, problems and solutions for LT DP
- information about SCAPE and other projects
- programme: http://cordis.europa.eu/fp7/ict/telearn-digicult/report-research-digital-preservation_en.pdf; Stephan Strodl, Petar Petrov, Andreas Rauber (all Vienna University of Technology, Austria): a nice timeline for preservation projects, a whitepaper about the past of European DP
- funding spent on DP research is gradually increasing; projects and money will not solve anything
- ARCOMEM: archiving of web archives, socially driven web preservation model
- social web analysis
- archive enrichment
- ENSURE: evaluation between cost and value, automation of the preservation cycle; testbeds: healthcare, clinical trials, financial services
- SCAPE
- preservation planning and action workflows: how to make them scalable
- creation of an infrastructure for scalable preservation actions
- development of a policy-based preservation planning tool with automatic preservation watch
- 3 testbeds: WA, large-scale repositories, research data sets
- all projects will produce prototype SW
- digital lifecycle approach
- preservation planning plays a role in all these projects, together with virtualization
- slide with trends in DP over recent years
- Research on Digital Preservation within projects co-funded by the European Union in the ICT
- ENSURE
- SCAPE
- Wf4Ever: http://www.wf4ever-project.org/about
- TIMBUS: SW is not enough; it focuses on the context of organizations; LTP is not only about objects but also about services, etc.
- TOTEM

Benefit for NK

Follow projects in the area of long-term preservation of digital data. The latest EU projects, such as SCAPE, will speed up the development of concrete tools for long-term preservation of digital data.

Erik Borglund: Record keeping in temporary command settings

- preserving documentation of crisis situations arising from the work of the police and similar bodies
- how to capture context: flipcharts, videos and written records can be kept, but context is harder
- analog documents are not the problem; the problem is with digital items and conversations
- the national archive should take care of this, but it accepts only paper documents or, for example, photos from the scene
- open question: is archiving file material the same thing as archiving the course of proceedings in digital form


webarchiving session

BnF
- 200 TB of web-archived data, 15 million ARC files
- they have to characterize and validate them, which is time-consuming
- they store them in a shared repository
- SPAR (the LTP system of the French national library) has a capacity of 16PB
- they use JHOVE2 for characterization and are building a module for ARC files
- they do not want to characterize and validate the content of the ARC files, only identify formats

- PREMIS inside METS would be too long, so they will record only metadata at the level of the information package (AIP) that are the same for the whole package; that is, a property is stated once and then only a list of the files it applies to is added, instead of repeating the information for every file
- they created a special metadata format
- they can therefore ask the LTP system "give me all information packages that contain format XY" and similar
- there is no need to index the metadata of the contents, which would take a long time
- they take the same approach for digitized books
- different DP policies and validation levels for different types of web archive data: complete harvests vs thematic harvests
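The package-level approach described above (state a property once, then list the files it applies to) can be sketched as follows. This is only an illustration of the idea, not BnF's metadata format; the file names, package identifiers and function names are hypothetical.

```python
from collections import defaultdict


def group_by_format(file_formats: dict) -> dict:
    """Invert {file: format} into {format: [files]} so each format is stated once."""
    grouped = defaultdict(list)
    for name, fmt in file_formats.items():
        grouped[fmt].append(name)
    return {fmt: sorted(names) for fmt, names in grouped.items()}


def packages_containing(packages: dict, fmt: str) -> list:
    """Return ids of packages whose package-level metadata lists `fmt`."""
    return sorted(pid for pid, formats in packages.items() if fmt in formats)


# Hypothetical per-file identification results for two packages.
aip_1 = group_by_format({"a.html": "text/html", "b.html": "text/html", "c.png": "image/png"})
aip_2 = group_by_format({"d.pdf": "application/pdf", "e.html": "text/html"})
packages = {"aip-1": aip_1, "aip-2": aip_2}

print(packages_containing(packages, "image/png"))
```

A query such as "all packages that contain format XY" then only has to look at the short package-level records, without indexing the metadata of every contained file.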

NL NZ
- 2 harvests, 20 TB in total
- they are dealing with metadata: how much metadata is a lot and how much is too little
- library policy says that as much metadata as possible must be stored, but that would be a problem given the size of the metadata
- for selective web harvesting they have a finished workflow in WCT; everything is catalogued

IA
- 16 billion URLs
- the oldest from 1996
- growth is 3 TB per day, 1 PB per year

Euan Cochrane, Dirk von Suchodoletz: Replicating Installed Application and Information Environments onto Emulated or Virtualized Hardware

- capturing and preserving a complete environment on emulated hardware
- for example, taking the desktop environment of the prime minister and storing it in the archive
- rendering problems
- computer forensics
- an option for preserving scientific data and records
- the whole thing is about how to replicate an HDD and run the environment stored on it in a virtual environment
- solution
- they pulled the HDDs out of several old PCs > identify the hardware requirements (HDD analysis > automatic estimate of requirements; this is part of every PC environment) > choose emulation/virtualization software (a tool registry such as TOTEM from the KEEP project) > convert the HDD into a disk image suitable for emulation > try to boot the disk image on the emulated hardware > add drivers
- problems with licences, personal data protection and authenticity (20 % of things change, colours etc.)
- QEMU, sparc processor emulator


Klaus Rescher: Remote Emulation for Migration Services in a Distributed Preservation Framework

- using emulation as a migration tool
- migration tools for certain formats are often not available
- the digital object is placed into an emulated environment (a virtual machine), where it becomes visible inside the emulated system; it can be opened in the original or another suitable application, saved in a different format and stored back in the virtual machine

Bram Lohman: Emulation as a Business Solution: the Emulation Framework (KEEP project)

- emulation framework: 7 emulators, 6 platforms (x86, Amiga and others), 23 file formats
- a solution for managing emulation tools
- setup of emulation processes
- an environment that contains emulators; if an application or a file is loaded into it, it should run as in its original environment
- the environment also contains a tool that shows, based on PRONOM, what format a file is in and what environment is needed to run it; that environment can then be prepared straight away and the file run in it
- it loads a software image from the OPF database of applications, which is being built

Geofrey Brown: Developing Virtual CD-ROM Collections: The Voyager Company Publications

- publications of one particular publisher on CD, interactive applications for Mac; of the 200 titles published only about 50 are available today
- emulation on today's systems
- HDD snapshot directly in the emulator, i.e. it takes one click and is very fast
- SheepShaver emulator

Evaluation of a large Danish migration project

Before 1998 they had no formats prescribed by law.
Between 2005 and 2008 they introduced standards.
The evaluation covers the prescribed standards and the migration into them at the national archive.
The evaluation was done for the funder.
Between 2005 and 2008 they spent 30 person-years on migration with 10-15 people working on it; 190 thousand USD was invested, total costs were 26 million USD.

It is not much data; what they actually migrated was about 1777GB.

Various parts of the archive: tapes, population data, data on CD-R, registries and electronically populated data.
They could not read all the files, especially on tape.
5 different types of tape.
Some had to be rescued at great expense.


Total cost of producing the preservation standard: 10 man-years, 12 thousand USD, including the manual and implementation recommendations

Pilot: project planning and management, and verifying the information packages

The goal of the pilot was to obtain a better budget and plan for the project

Some problematic data in old formats, such as old databases, need smart people doing repetitive, long-running work; they needed good knowledge management to make it efficient

Migration method: they wrote requirements for the tool and a description of how manual migration should be done

Data preparation (restructuring the data and registering the metadata of the IPs) and preparing documentation for the migrated IPs

Software development: in-house development

They needed 50 person-years for 1

Conclusions

Migrating standard data is cheaper, migrating from some standard tapes is cheaper, etc.

Most of the costs (80 %) went on non-standardized data, in building the migration software. Developing a tool for migrating heterogeneous or non-standard data is the most expensive part

What they learned: their old data had not been analysed well enough

Their project management was loose and they lost money

Knowledge management: a good description of old data and of all their types, generations, locations etc. does not exist at our institution and we will have difficulties with it; migrating old data at NK will be problematic

Angela Dappert: robust migration workflow for offline media

- What is an archival object (nice slide): a CD is not an archival object, for them it is a hand-held carrier; better is a bit-stable object, which can have a backup etc., leading to an archival object that has further metadata (logical preservation)
- A CD is not searchable, cannot be replicated easily, has a large manual overhead, and its rendering technology becomes obsolete very quickly
- Endangered archives project: optical disks, CD-R, external HDs, tapes, 67 terabytes in total
- The offline hand-held carriers in the endangered archives project were very varied; they contained data with DRM, under copyright, and with a range of similar problems
- Options they chose between for each data source
- Disk image: a single file containing everything that is on the carrier
- Or extraction of only some files
- How important is the carrier itself? We need some information about it; it may hold traces of deleted data that we might want to keep. They made disk images of everything possible: hybrid DVDs, audio discs that also contained data, etc.
- Which disk image format should they use? Not a single disk image format for all data, but a special disk image format for each type
- They did it with robots (disk copying robots); sometimes large-scale disk copying robots could not be used, because they are good at producing CDs but not at ripping data from CDs


- They built their own application with disk stacks and some smaller robots; they tried LIFO and FIFO and in the end used FIFO, because LIFO had problems with the CD lifter

At KB they thought through a fairly complex workflow, how to describe it, etc.

Each robot had its own PC

They had problems with a number of things, see presentation

They did not find good software for managing images, only command-line tools, and non-technical staff would make a lot of mistakes with those

It takes a lot of people before the content gets online

The staff must be well trained and flexible, but also able to do tedious jobs: systematic, patient

NOTE: important for NK, where transferring data from discs will also have to be dealt with, and has already been dealt with


KEEP project, Antonio Cuiffreda: towards integrated migration environment

- Disk transfer tool: converts a disk into an image file, which also contains further metadata about the file system, MD5 checksums of the files, etc.
- KEEP is building a Transfer Tool Framework
- Magnetic media: disk transfer tools for floppy disks, commercial and open source
- Disk2FDI: commercial DOS tool; a very accurate floppy disk image, but it takes 1 hour and the result is very large, ten times bigger than the floppy disk itself. He tested about 2260 disks; they tested emulation
- Catweasel: commercial tool, a PCI card; runs on Linux and Windows XP, has a GUI. High error rate, but faster; image file quality was low
- Nibtools: free tool, G64 and D64, covers only C64; DOS, Windows, Linux, but command line only. Needs a Commodore disk drive and special cables. They tested a few disks; about half of them did not then work in the emulator

- Optical media: they used 5 transfer tools, with the same CDs, DVDs and games for all of them
1. Alcohol 120: commercial, can bypass DRM etc., supports Windows systems. 12 of 13 worked; 2MB per second
2. Deamon tools: commercial, several image file types (ISOP, MDS, MDF), supports Windows; three of 13 did not work
3. CloneCD: commercial, uses IMG or ISO, bypasses safedisk3 protection, supports Windows, has a GUI; 11 of 13 were OK
4. Blindwrite: commercial, supports DVD, Blu-ray, WM, Xbox and other special discs. Generates ISO and some proprietary ISO image formats; one did not work; download speed
5. ImgBurn: does not read subchannel information into the image file (a film cannot be seeked, etc.); it is open source; generates dvd, bin, cue, img; Windows and Linux; 4 did not work; it is fast
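The counts reported for the five optical transfer tools can be turned into comparable success rates. The sketch assumes that every tool was run against the same 13 discs, which the notes suggest but do not state outright; where the notes give only the number of failures, the number of working images is derived as 13 minus failures.

```python
TOTAL_DISCS = 13  # assumption: the same 13 test discs for every tool

# Number of discs that produced a working image, per the notes.
worked = {
    "Alcohol 120": 12,
    "Daemon Tools": TOTAL_DISCS - 3,  # "three of 13 did not work"
    "CloneCD": 11,
    "Blindwrite": TOTAL_DISCS - 1,    # "one did not work"
    "ImgBurn": TOTAL_DISCS - 4,       # "4 did not work"
}


def success_rates(counts: dict, total: int) -> dict:
    """Percentage of discs imaged successfully, rounded to one decimal."""
    return {tool: round(100 * n / total, 1) for tool, n in counts.items()}


rates = success_rates(worked, TOTAL_DISCS)
# Best tools first; ties are broken alphabetically.
ranking = sorted(rates, key=lambda tool: (-rates[tool], tool))

for tool in ranking:
    print(f"{tool}: {rates[tool]} %")
```

Under this assumption the failure rates range from roughly 8 % to 31 %, which is in line with the 10 to 30 percent error range mentioned in the conclusions of the talk.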

Conclusions

For magnetic media, commercial and non-commercial tools do not differ in performance; Disk2FDI is accurate but very slow. KEEP will use NibTools

Optical media: keep copy protection in mind; the tools perform similarly; there will always be errors in the images, between 30 and 10 percent; Blindwrite can handle game discs (Xbox etc.). KEEP will use ImgBurn because it is open source

For complex images Blindwrite is better

Benefit for NK

Consider whether NK should really run a project to migrate the content of CDs and DVDs to online media. The talks presented concrete experience with robotic processing and showed what problems they had with designing the workflow, choosing the type of ISO image, etc.

BnF web archiving

They have three layers:

Harvest definition: collection

Harvest instance: crawling metadata


ARC files

A collection would be, for example, the selective "elections 2000" harvest; below it come the individual harvest instances and then the ARC files. They collect logs, configuration and reports. These are also stored in ARC files, as special metadata ARCs for each crawling instance.

PREMIS

Object, agent, event

Objects: 1. ARC files and metadata ARCs; 2. harvest instances

Harvest event in PREMIS: the event "creation of content files"

Events: reports as extensions of the event (host report and harvest report)

Agents: administrator, software, institutions, organizations that perform the harvest
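The layering described in these notes (harvest definition, harvest instance, ARC files, with PREMIS-style events and agents attached) can be pictured as a small data model. This is a hedged sketch of the relationships only; the class and field names are hypothetical and do not reproduce the PREMIS schema or BnF's implementation.

```python
from dataclasses import dataclass, field


@dataclass
class Agent:
    name: str
    role: str  # e.g. "administrator", "software", "institution"


@dataclass
class Event:
    event_type: str
    agents: list = field(default_factory=list)
    reports: list = field(default_factory=list)  # e.g. host report, harvest report


@dataclass
class HarvestInstance:
    identifier: str
    arc_files: list = field(default_factory=list)  # content ARCs and metadata ARCs
    events: list = field(default_factory=list)


@dataclass
class HarvestDefinition:
    collection: str
    instances: list = field(default_factory=list)


# Hypothetical example data, for illustration only.
crawler = Agent("crawler", "software")
harvest = Event("creation of content files", agents=[crawler],
                reports=["host report", "harvest report"])
instance = HarvestInstance("instance-1",
                           arc_files=["data-0001.arc", "metadata-0001.arc"],
                           events=[harvest])
definition = HarvestDefinition("elections", instances=[instance])
```

Keeping logs and reports with the harvest instance, rather than with each ARC file, mirrors the note that crawl metadata is stored once per crawling instance.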

ContainerMD

httpbibnumbnffrcontainerMD v1documentationcontainerMD v1html

special metadata for material from the web archive

httpbibnumbnffrcontainerMD v1

Different SLAs for different types of material and for data from different harvests; shared repository; they expect different benefits; skills for different formats do not need to be duplicated within the institution

Next year there should also be a JHOVE2 module for WARC


Memento: metasearch engine

Benefit for NK

Their web archiving model could be used at NK

Cost models: Danish national library + TU Wien

- Stephan Strodl, TU Wien: they have their own cost model, but only small scale (automated preservation action cost)
- Danish national library: they built their own model, which is meant to be universal and usable anywhere
- They measured the cost of submission according to the PAIMAS standard
- When calculating costs they use OAIS and PAIMAS: they map activities onto these models and estimate process costs from that
- Costmodelfordigitalpreservationdk

Benefit for NK

Relevant to project 0136, which dealt with options for estimating the cost of long-term storage

Meet RODA, a Full-Fledged Digital Repository for Long-Term Preservation

- Originally a project of the Portuguese national archive; we have been following it for several years. RODA is now supported, and partly also further developed, by an independent company. So far RODA supports only an archival metadata format (EAD), but further development should also cover library formats
- RODA is now part of the SCAPE project, where the system can be developed further and scaled for use in mass production
- httpredminekeepptprojectsroda public

Benefit for NK

Follow further development, possibly also for the INCAD + KNAV projects; for developing LTP in smaller institutions this could be an interesting alternative in the future

Report from a Foreign Business Trip

Name and surname of the participant: Mgr Andrea Fojtů (AF)

Workplace according to the organizational structure: 1.5 Department of Long-term Preservation of Digital Data (ODODD)
Job position: 1.5.2 Department of Digital Repository Content Management
Reason for the trip: Participation in the conference Aligning National Approaches to Digital Preservation
Place (city): Tallinn
Place (country): Estonia
Date (from-to): 22-26 May 2011

Detailed schedule:
Monday 23 May: conference registration, guided tour of the National Library; Keynote Address by Laura Campbell, Library of Congress, USA; Panel 1: Technical Alignment; Panel 2: Organizational Alignment
Tuesday 24 May: Keynote Address by Gunnar Sahlin, National Library of Sweden; Panel 3: Standards Alignment; Panel 4: Legal Alignment; Breakout Sessions for panels 3 & 4
Wednesday 25 May: Panel 5: Education Alignment; Panel 6: Economic Alignment; Breakout Sessions for panels 5 & 6; Synthesis; Closing remarks

Fellow travellers from NK: PhDr Bohdana Stoklasová (BS), Ing Tomáš Svoboda (TS)

Funding: IOP NDK

Goals of the trip: Attendance at a conference with international participation; obtaining contacts in the area of long-term preservation of digital documents and electronic legal deposit; a more detailed insight into long-term preservation of digital documents, (especially) in national libraries

Fulfilment of the goals (specifically): The conclusions of the conference Aligning National Approaches to Digital Preservation largely repeat the conclusions of the workshop The Future of the Past: Shaping new visions for EU research in digital preservation (report available at httpcordiseuropaeufp7icttelearn digicultfuture of thepast_enpdf), for example regarding the missing economic model for the commercial sector, which would treat long-term preservation as an inseparable part of all its processes. Contact was established with staff of the national libraries of Estonia and Finland (staff working on web archiving and on long-term preservation in general)

Programme and further details: The main goal of the conference was to align national approaches to long-term preservation of digital documents across all areas, from technical, organizational and educational to standardization, economic and financial

Materials brought back: conference programme, leaflets of exhibiting companies (Tessella, Equella, Guardtime) + other materials, notes

Date of submission of the report: 8 June 2011
Signature of the submitter

Signature of the superior

Posted on the Intranet
Received by the international department

Attachment to this report: Notes from the conference, in English

Attachment: Notes from the conference, in English

Exploring What We Can Do Together: Strategic Alliance for International Collaboration, Laura Campbell

- 185 digital preservation partners in more than 25 countries (education, research, LAM)
- strategic goals: National Content Stewardship Network (national digital collection, technical architecture, public policy, outreach)
- NDIIPP Content Domain Map: a mind map of geospatial, audiovisual, image & text and web content
- then & now: cognitive surplus vs digital libraries/digital preservation
- solution framework: actively working together, special interest groups, establishing a common index, international digital collection (freely available)

PANEL 1: Technical Alignment (The role of testing), Prof Dr Michael Seadle (Panel Head)

to collaborate on requiring and implementing rigorous and independent tests

DNB Contribution to the Tallinn Alignment, Sabine Schrimpf

- key theme is infrastructure
- network of hardware and software that permits the operation of application software
- the question of interoperability is crucial (standards, technical specifications)
- SW elements
- the components of the DP infrastructure were compared to pallets on railroads
  o Source: PARSEInsight Roadmap 2020
- PersistentID resolvers, certification process
- kopal (ingest, koLibRI)
- nestor (German Network of Expertise)
- DP4Lib: Digital Preservation as a Service, reduce dependency between components
  o redundant storage at different locations
  o koLibRI modules
- LUKII: set up as an economical LOCKSS network in Germany
- SHAMAN
- APARSEN: wants to bring coherence and cohesion to digital preservation research
  o trends in DP research projects
  o modular DP systems
  o distributed as SOA
  o elimination of technology dependencies

EDINA: The UK LOCKSS Alliance, Adam Russbridge

- EDINA offers underlying technical support & coordination
- threats to digital stewardship
  o failure (media, HW, SW, network, format obsolescence, natural disaster, economic/organizational failure)
  o attack (insider/outsider)
  o operator error
- source: Requirement to Digital Preservation
- projects PEPRS and PECAN help identify coverage and requirements for DP

Public testing, Michael Seadle

- traditional physical archiving relies heavily on trusted institutions
- distrust, not trust, needs to be the basis of digital archiving; testing therefore plays a key role
- goals of testing: demonstrate functionality, reveal weaknesses, provide data for planning improvements
- key issues for testing: integrity, authenticity (can the origin or genuineness be shown), usability (can migration/emulation be demonstrated), access, financial integrity
- DrWho (drwho1com)
- bit stream testing is the most important; authenticity and usability may be impaired
  o the type of storage media, the number of copies and the frequency of checking and replacement get us to the relevant results
  o no reliable metrics exist, however (what is an acceptable loss, etc.)
- without well documented, peer reviewed, publicly available test results, librarians are buying archiving systems on faith
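Bit-stream testing across several copies can be illustrated with a minimal fixity audit: hash every copy and flag those that disagree with the majority. This is a sketch under simple assumptions (in-memory byte strings stand in for stored copies, SHA-256 is an arbitrary choice, and the site names are hypothetical); it is not the test procedure presented in the talk.

```python
import hashlib
from collections import Counter


def audit_copies(copies: dict) -> dict:
    """Hash each replica and report which ones differ from the majority digest."""
    digests = {loc: hashlib.sha256(data).hexdigest() for loc, data in copies.items()}
    majority_digest, votes = Counter(digests.values()).most_common(1)[0]
    damaged = sorted(loc for loc, d in digests.items() if d != majority_digest)
    return {"majority_digest": majority_digest, "agreeing": votes, "damaged": damaged}


# Three hypothetical replicas; the copy at "site-c" has one corrupted byte.
report = audit_copies({
    "site-a": b"payload",
    "site-b": b"payload",
    "site-c": b"pay1oad",
})
print(report["damaged"])
```

With an even split between digests there is no true majority and the result depends on ordering, which is one reason the number of copies matters as much as the frequency of checking.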

Presentation without a title, Andy Rauber

- evaluation vs testing vs benchmarking
- DP: testing and testing evaluation rather than testing, far from benchmarking (few tests, but not near a definition of benchmarks)
  o existing evaluations are not repeatable
  o focus on the simple things
  o building the frameworks before having clear test scenarios
- necessary to move towards comparative benchmarking
- what is needed: a commitment that we want a culture of benchmarking and comparative evaluation; an understanding of what we want to benchmark; benchmark data + ground truth; measurement scales and measures that remain constant; knowledge bases to collect these

Organizing digital preservation on an international level, Michelle Gallinger (NDIIPP)

- focus on a national DP agenda
- community driven, action oriented (National Digital Stewardship Alliance)
  o present a distributed national digital collection for the benefit of citizens

The European Research Arena, David Giaretta

- technologies: GEANT, EGEE/EGI
- EU research projects: TIMBUS, BLOG4EVER, SCAPE, ENSURE, APARSEN, ARCOMEM, WF4EVER
  o SCIDIP ES (2011-2014)
- Alliance for Permanent Access (APA) formed as a legal entity 3 years ago
  o opportunities for networking
- ISO 16363: Audit and Certification of Trustworthy Digital Repositories
- ISO 16919: Requirements for Bodies Providing Audit and Certification

Observation from the MetaArchive Cooperative program, Martin Halbert

- distributed DP programs differ from other programs
  o replication of content, distribution of these replicated copies to distinct geographical locations, and a network organization to connect these replicated copies
- MetaArchive: established in 2003, funded by NDIIPP
  o seeks to foster broader awareness of digital preservation issues
- IIPC: members are all institutions that focus on WA
  o 39 members: national and university libraries + other organizations (Internet Archive)
  o ISO standard WARC format for web archives + Heritrix and NutchWAX
  o growing membership (Africa & South America)

DAY 2

Keynote address: International and National Collaboration in the digital age, Gunnar Sahlin

- 2012: a new law for e-deposit
- Samsök (search together) in 2005 (upgrade, new system)
- Swepub and long-term preservation
- Consortium of the Swedish research libraries for licensing e-journals and databases (ICOLC)
- Open Access and e-publishing (all universities have their repositories for e-pubs)
- NL as aggregator for Europeana: TEL, Apres, Athena, EU screen
  o common system for the preservation of digital materials
  o common search portal for materials from the Swedish National Library and the Swedish National Archive

Raivo Ruusalepp

- standards: RAC, DSA, CIDOC (CRM), PAIMAS, ISA (DG), DDI
- use of information security standards for digital preservation
- information security: administrative and technical (physical = data security vs IT = communication)
- company implemented security measurement with typical cyber crime scenarios
- survey of security
  o provision for information security in national legislation and development plans (12 of the respondents ISO 27000 series, only 2 formal audits, the rest are looking into it; ½ of the respondents do not use standards or formal measurements to control information security)
  o IT & disaster plan 65 (data recovery from the off-site location tested: 0)
- alignment
  o better use of community standards for information security and preservation
  o agreement on security requirements

Standards based approach to preservation planning, Matthew Woollard

- ISO 27001: very expensive implementation, 100 000 pounds
- Basic: Data Seal of Approval Guidelines (helps understand your business better)
- Audit and Certification of Trustworthy Digital Repositories
- Memorandum of Understanding to Create a European Framework
- ISO 16363 external, DSA or ISO 16363 self audit

Best Practices & Standards, Bram van der Werf

- self assessment, trust audits/certification trust
- ISO 30300 (draft): Record Management

PANEL 4: Legal Alignment

Legal deposit & Web Archiving, Adrienne Muir

- legal deposit provisions: purpose, scope, deposit mechanisms, roles & responsibilities & liability, access provisions, sanctions
- implementation: definitions, scope, offline/online, freely available/pay wall, technology neutral/incremental
- legal deposit vs voluntary (interim/hybrid approach, model agreements and licenses, flexibility)
- other legal issues: intellectual property rights, preservation, access, unlawful material, privacy/data protection
- voluntary approaches have disadvantages but may be necessary and can be useful

Breakout Session

- standards bring along alignment by themselves, but only if you use them to the full, not halfway
- depends on community (users): enforceable or voluntary compliance (standards)
- next steps for standards alignment
  o corpora as benchmarks
  o export/import completeness
  o educational standards
  o validation tools
  o accredited training courses to accredit auditors
  o framework standards

DAY 3

PANEL 5: Educational Alignment

- key elements of the DCC Curation Lifecycle Model
- Framework for the Education Alignment (USA: grad programs for digital preservation, new models for grad programs, internship programs (diverse knowledge), workshops)
  o sharing tools
  o national programs
- related issues to digital preservation
  o nature of costs and business models
  o strategies for selection & appraisal
  o ground roles and responsibilities
  o effectiveness and demand for services
- focus of the panel: factors influencing the actual sustainability of a digital archive
- 2 considerations: collaboration + user demand
- challenges + gaps: span national boundaries, public + private funding, education exportation, DP certification, competition, funding gaps, policy, selection criteria, roles & responsibilities, standards
- Magazzini Digitali: e-legal deposit in Italy
- PADI: a failed project (discontinued in 2010)

Presentation without a title, Neil Grindley

- how much does it cost to manage information; what institutional financial strategies are required to facilitate effective preservation; what general economic frameworks are required to enable information to persist and be accessible
- JISC 2010: Infrastructure for Education and Research Programme
- archival storage and preservation activities constitute a very small proportion of the overall costs (1531 access, 55 outreach, acquisition, ingest)
  o approx 333 Euros for a set of 1000 records
- KRDS2 p 83: future tool development supporting automation of ingest

Sustainable Preservation in North America: ADPNet & Friends, Aaron Trehub

- solution: distributed digital preservation (in at least 3 copies vs LOCKSS 6 copies)
- DPP + LOCKSS = PLN, open SW developed at Stanford
- MetaArchive, COPPUL (Canada), ADPNet: wwwadpnorg


- web archiving
- WARC manager application

- -

-

-

- P) -

Using Tape for Large-Scale Digital Preservation Gary Wright FamilySearch (USA)

- (The Church of Jesus Christ of Latter-day Saints)

-

NDK - DRPS Ingest Tools. Storage type: storage grid, NetApp FAS3170

P information lifecycle management R optimization of the storage layer between the Grid and Rosetta

- - na -

- human error
- HW errors
- data integrity validation
- maximization
- integrity verification

metadata conten

Moving On When it is Time to Re-Archive Michael Selway Quantum Corporation (USA) CHW) D

with HDDs there is no other way

Migrace dat z Jak se na to migrac

- HDD - - roky

a mohl by s

- - - - -

co s split migration (full split migrace)

- - - - potaz

FamilySearch An End-to-End Process for Scanning Characterizing Preserving and Providing Access to Very Large Collections of Vital Records Tom Creighton FamilySearch (USA) Jonathan Tilbury Tessella plc (UK) and Mark Evans Tessella Inc (USA)

- At FamilySearch: microfilms from the mid-19th century

- DPS je jejich s - -

The Audit and Certification of FDsys

- FDsys -auditem

TRAC httpwwwgpogovfdsys

- - TRAC hathi UNT Portico metaarchive chronopolis v

- stupnice compliancy 1- -

How Long is Long-Term Data Storage (Focal) Barry M Lunt Brigham Young University and Douglas Hansen Wayne Rust and Mark Worthington Millenniata Inc (USA)

- - FLASH technologie 10- plny elektron

odlivu elektronu a

-

a presto se objevily chyby

- Library 1 21 - Library 2 18

- 1050 let - HDD 1-7 let -

Quality Assurance of Digital Information in Long-Term Digital Preservation University of Technology (Sweden)

- significant properties - dokumentech z -

Towards Interoperable Preservation Repositories Repository Exchange Package Use Cases and Best Practices Joseph Pawletko New York University and Priscilla Caplan Florida Center for Library Automation (USA)

- TIPR

- - - TIPR

-

- RXP: METS and PREMIS semantics; metadata files are kept not inside the METS but alongside it

- - succession - - - - disaster recovery a v

jako aip - migrace SW - - diversifikace - migrace dat v -

SARKK Comprehensive Digital Archive Services for Finnish Municipalities

-Savon Tietohallinto Oy (Finland)

- firma kterou v ICT long-

- - - preserv planning funkcionalitou -

-

Magnetic Tape Technology economic advantages for preservation Gary Francis Oracle USA

- Oracle prezentace - - clipper group 2010 in search for the long-term archiving solution tape delivers significant

TCO advantage over disk - 5TB na 1 cartridge sunoracle

-----------------------------------------

Color In Digital Preservation. Robert Buckley, University of Rochester/NewMarket Imaging; Steven Puglia, National Archives and Records Administration; Michael Stelmach, Library of Congress (USA)

-

atd -

o barev GAMUTem

Adobe RGB, ProPhoto RGB

- and colour reproduction

Multispectral Image Archiving of Watermarks in Historical Papers. Peter Meinlschmidt, Wilhelm-Klauditz-Institut, Fraunhofer-Institute for Wood Research, and Volker

Braunschweig (Germany)

- Reflected light - Transmitted light

- spectral imaging

o

o o o o

Implementing a Quality Assurance Program for Monitoring Scanner Performance. Michael J. Horsley and John T. Berezich, National Archives and Records Administration (USA)

- microfilm, scan
- NARA 2004 Guidelines

https://docs.google.com/viewer?url=http%3A%2F%2Fwww.archives.gov%2Fpreservation%2Ftechnical%2Fguidelines.pdf

- Metamorfoze
- etc., see slides
- Quantitative performance slides
- Web-based database, SharePoint

Preservation in a Digital Age. Jay Verkler, FamilySearch (USA)

- Tessellou pro SDB budou k 1 - - - data loss is intrinsic gt - - - och

- preservation as a service

Curation of the End-of-Term Web Archive: Classification and Metrics. Kathleen Murray, Lauren Ko and Mark Phillips, University of North Texas (USA)

- EOTCD archive: http://research.library.unt.edu/eotcd/wiki/Main_Page
- 16 TB of data
- (not WARCs)

then they joined it together

DAITSS Grows Up: Migrating to a Second Generation Preservation System (Focal). Priscilla Caplan and Carol Chou, Florida Center for Library Automation (USA)

- DAITSS 1 could not be installed anywhere else :-)

instructions

- v procesech

- - PSP

- refresh function
- disseminate: do a refresh and export new AIP as DIP
- withdraw: remove AIP from storage, retaining provenance

-

1

- - 80TB -

A Community Driven Micro Services Architecture Supporting Long Term Digital Preservation. Mark Evans and Bill Steel, Tessella Inc. (USA); Robert Sharpe, James Carr, Alan Gairey and Jonathan Tilbury, Tessella plc (UK)

s NDIIPP v micro-services

spojeno do procesu- lze pak libovol

- - - -

-

workflow - -

- bit-stream
- cloud (S3, http://aws.amazon.com/s3); SaaS functionality, soon multi-tenancy functionality, policy, etc.

Note from the USA:

44 TB SIP in test (10 MB JP2 files)

SDB community - founded 2008

System

Lisa LaPlant and Blake Edwards, US Government Printing Office (USA)

- FDsys, OAIS, MODS

<part>2</part>

DMD: data model definition

http://www.gpo.gov/fdsys/search/pagedetails.action?st=prague&granuleId=&packageId=DCPD-200900228

-----------------------------------------

Preservation Starts from the Beginning. Michael Wash, US Department of Transportation (USA)

Autographic Kodak, 1916

"...following the No. 3A Autographic Kodak Special of 1916, which was the first rangefinder camera. It had a Kodak Anastigmat f/6.3 lens and a Kodamatic shutter with speeds from 1/2 to 1/200 sec, plus bulb and time mode." http://camerapedia.wikia.com/wiki/No_1A_Autographic_Kodak_Special

Kodak Advantix

zastavil

Colorite: A Flexible Cross-Platform Software Solution for Automatic Image Quality Analysis Using Arbitrary Targets. Henrik Johansson, National Library of Sweden (Sweden)

o the target is detected automatically

o TIFF, JPEG, JP2, PNG
o state-of-the-art feature-based image matching algorithm, ImageMagick

o can be switched off in the future so that the process speeds up for BATCH processing

Henrika Johanssona What if the Image Quality Analysis Rates My D Dietmar Wueller Image Engineering (Germany)

- Golden Thread - UTT target

o

-lens reflex

Establishing Resolution Requirements for Digitizing Transmissive Content: A Use Case Approach. Michael Stelmach, Library of Congress; Don Williams, Image Science Associates LLC; Steven Puglia, National Archives and Records Administration (USA)

- Based on the 10% SFR limiting resolution criterion: how much of the image information will be captured

o first half of the 20th century: 1200-1600 PPI
o second half of the 20th century: up to 2800 PPI

Digitise More, Pay Less: Optimising the Workprocess for both Heritage Institute and Imaging Provider. Olaf Slijkhuis, Pictura Imaginis (the Netherlands)

- ument kvalita parametry atd - -

- trolu

obrazu

Debate with FamilySearch, 20. 5. 2011

also on digital preservation issues

will have to be provided free of charge

and synchronized

effort to cooperate with archives and libraries on their part

CO-FINANCED FROM THE EU STRUCTURAL FUNDS (EUROPEAN REGIONAL DEVELOPMENT FUND) THROUGH THE IOP

Business trip report. Project: Creation of the National Digital Library

CZ.1.06/1.1.00/07.06386

Name of trip participant: Jan Hutař

Workplace according to organizational structure: ODF 81

Workplace (assignment): head of department

Purpose of trip: attending the iPRES 2011 conference

Place (city): Singapore

Place (country): Singapore

Date (from-to): 30. 10. - 5. 11. 2011

Detailed schedule: 30.-31. 10. flight Prague > Dubai > Singapore

1. 11. start of conference, tutorials

2. 11. - 4. 11. conference

4.-5. 11. return flight Singapore > Dubai > Prague

Fellow traveller from NK: Mgr. Marek Melichar (funded from project 0136)

Funding: IOP, Creation of the National Digital Library

Relation to the project: obtaining new information on digital preservation and on projects in other libraries; consultations with colleagues and companies

Goals of trip: see relation to the project; use all outputs for planning and running the NDK project and for future handling of digital preservation at NK/NDK

Fulfilment of trip goals: fulfilled, see the detailed notes below and the proceedings on SPS


Further detailed information: SUMMARY AND BENEFIT FOR THE NDK PROJECT

noticeable rise of long-term preservation solutions based on emulation (in previous years, migration); it seems the two approaches will complement each other

shift towards preservation of complex data, databases etc.; NK does not address this yet

many contributions usable at NK and NDK (web archiving and preservation at the national library of France, audits, emulation, information on the SDB system (Tessella) and on the RODA system, certification, see Rouchon, etc.)

information on problems and solutions when using tools such as JHOVE, PRONOM etc. in a real environment

2 contributions on backing up optical discs, a current problem at NK too; ideally follow the procedures described in the proceedings

clear need for international cooperation and adherence to standards, so that such cooperation is possible

in more detail see below

Project publicity support: N/A

Related materials

Material / Storage location

conference proceedings: SPS, folder with business trip reports

Date of report submission: 15. 11. 2011

Signature of submitter

Date, signature

Signature of superior: 15. 11. 2011

Posted on intranet


Received by the international department

Seamus Ross: digital curation and preservation
- preserving data sets: use of statistical databases and research data vs. preservation of textual information
- personal data: no longer just photos in a box (Flickr etc.)
- banks: a lot of personal data, future use for historians, must be preserved; institutions do this too
- see mckinsey.com, big data, full report (PDF)
- why do DP at all (slide 16): future generations expect it; for historians and scientists, so that they have some sources; a reference about the present for the future; information ecosystem to enable storytelling
- emphasis shifting from preservation of textual information to preservation of complex databases

A capability model for DP, Ch. Becker et al.
- research within the SHAMAN and SCAPE projects
- SoS: systems of systems
- 3 kinds of systems (UML)
- DPS as a functional requirement
- SoS: business system, a system within a system; data then flow into the DPS
- DPS where DP is not a functional requirement but is done anyway (DP-ready system): business system with DP functionality
- but how to get DP into enterprise systems: a model for implementing DP in any system; within the SHAMAN project, a capability-based reference architecture
- governance, business and technical (operation) capability: basis for decisions and for assessing the current state
- capability maturity model (CMM): processes, assessment and improvement, as in SW development

Olivier Rouchon: certification and quality at CINES
- they store theses, digitized items, multimedia, documents, scientific data sets
- data centre for the whole of France
- they have format experts (XML), 11 people
- 15TB of data
- certification: national law; CINES is the national centre for DP of theses; they have a department, people and money for it; for the procedure and preparations see below
- preparation for certification: testing DRAMBORA, DSA, TRAC, ISO 16363 and ISO 16919
- step: 2009, DRAMBORA audit, 2 risk reviews per year on how their mitigation is progressing
- step: formalization of business processes, 14 processes according to ISO 9001
- management, operational and support processes (presented at iPRES 2010)
- 2009: external pre-audit, 2 people, 19 man-days, based on all available standards, by means of documentation review and interviews
- 2010: SIAF audit, 4 months; carried out by the National Archives of France for every archive that stores public data; they audit every 3 years; the report had 800 pages


- 2010: Data Seal of Approval, part of the EU framework for audit and certification of trusted repositories (MoU between the three certification activities)
- 2011: within the APARSEN project they also did ISO 16363; together with DANS and UKDA they went through that audit
- first internal, January and April 2011 (60 man-days), then external, 12 experts (KB, BL, NASA etc.) in June 2011 (3 days)

MoU: DSA > ISO 16363 as the second step (internal) > ISO 16363 extended. An audit gets faster the more of them you do, i.e. if it is done regularly it is not so time-consuming.

The National Library of New Zealand passed TRAC certification in autumn 2011.

Andreas Rauber: impact of preservation actions on repositories
- what happens to the repository itself
- repository simulation: RepoSim
- for analysis and testing of migration: what happens when files grow, what if the repository holds more format types, etc.
- RepoSim: a flexible simulator, irregular patterns
- so far an internal version; Hibernate, Java, MySQL
- it is possible to specify which formats it accepts and their description, ingest settings, hypothetical tools (mainly for migration), and rules for preservation actions (migration to which format and version, which files, how many times; rules + filters)
- option to run a virtual migration: graphs are produced that say how long it will take etc.
- what to migrate, how, to what and for how long: it runs virtually and we see the result
- good for IT and HW planning
- good for planning different scenarios, comparison with expected development, planning HW expansion and investment
- still to be done: option to enter deletion policies, reports, etc.
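The idea of a "virtual migration" can be illustrated with a toy estimate. This is only a sketch of the principle, not RepoSim itself (which the notes describe as an internal Java tool); all class names, holdings and the tool throughput below are hypothetical values chosen for illustration.

```python
from dataclasses import dataclass


@dataclass
class FormatHoldings:
    """Hypothetical description of what the repository holds in one format."""
    name: str
    file_count: int
    avg_size_mb: float


@dataclass
class MigrationRule:
    """Hypothetical rule: migrate one format to another with a tool of given speed."""
    source_format: str
    target_format: str
    tool_throughput_mb_s: float


def estimate_migration_hours(holdings, rules):
    """Return estimated hours per source format for formats that have a rule."""
    rules_by_source = {rule.source_format: rule for rule in rules}
    hours = {}
    for item in holdings:
        rule = rules_by_source.get(item.name)
        if rule is None:
            continue  # no preservation action defined for this format
        total_mb = item.file_count * item.avg_size_mb
        hours[item.name] = total_mb / rule.tool_throughput_mb_s / 3600
    return hours


holdings = [
    FormatHoldings("TIFF", 100_000, 36.0),
    FormatHoldings("PDF", 50_000, 2.0),
]
rules = [MigrationRule("TIFF", "JP2", 10.0)]
result = estimate_migration_hours(holdings, rules)
print(result)
```

Changing the holdings (larger files, more formats) or the rules and re-running the estimate corresponds to the "what happens if" scenarios mentioned in the talk.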

José Barateiro: Risk assessment in DP of e-science data and processes
- DP as risk management
- ISO 31000: definition of risk management
- similar to DRAMBORA
- there are many standards for risk management
- elaboration of the ISO 31000 methodology into individual steps
- TIMBUS project, http://timbusproject.net; one of the partners is SAP (Germany)

mad talks
- open source SW for LTP: RODA is back, being developed within the SCAPE project; new features, plans for further development and for building a user community
- 4 posters on emulation: emulation within KEEP, emulation for library reading rooms, OPF ecosystem, registry
- TOTEM: metadata standard for describing the technical environment for emulation


ANDS, Ross Wilkinson
- data centres in Australia, at least 3 for different areas of life
- ANDS has existed for almost 3 years; funding from the Australian government
- huge amounts of data will never be used or read by a human, only by automated mining processes
- research data must be stored and preserved, because it may no longer be possible to recreate them; so that they can be reused, so that new conclusions can be drawn from them, so that scientists have them available
- this has to be done in cooperation, it cannot be done by a single institution
- who handles storage of scientific data in the Czech Republic? The Academy of Sciences? CESNET?
- similar data centres also exist in the UK

Rob Sharpe: Considerations for High Throughput Digital Preservation
Presentation by Tessella: their performance testing of ingest into SDB at FamilySearch.

- SDB has been developed since 2002, when the first customer was the National Archives (UK)
- new customer: UK Parliament
- test with FamilySearch
- 20 TB ingest per day, scanned materials, workflow with antivirus, characterization (PRONOM, JHOVE) etc.
- 1 package is roughly 1 GB, 20,000 packages per day
- 2 Dell PowerEdge R710 servers, total price at most 20,000 pounds
- the limiting factor turned out to be the read speed of the disks on which the data are stored at the start of ingest; they therefore needed 130 parallel disks (50,000 pounds)
- storing to tape is also slow; they therefore needed 8 parallel tape writes (30,000 pounds)
- storage costs 100 pounds per TB
- 7.3 PB per year
- conclusion: writing and reading are slow, tools such as JHOVE and PRONOM are fast enough; storage costs also turned out to be high
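The figures in the list above can be cross-checked with a few lines of arithmetic. A minimal sketch, assuming decimal units (1 TB = 10^12 bytes) and a 365-day year; the yearly storage cost is derived here from the quoted 100 pounds per TB and is not a number from the talk.

```python
TB = 10**12                      # decimal terabyte, an assumption
SECONDS_PER_DAY = 24 * 60 * 60

daily_ingest_bytes = 20 * TB     # 20 TB per day (from the notes)
package_bytes = 10**9            # roughly 1 GB per package (from the notes)
storage_cost_gbp_per_tb = 100    # 100 pounds per TB (from the notes)

packages_per_day = daily_ingest_bytes // package_bytes
sustained_mb_per_s = daily_ingest_bytes / SECONDS_PER_DAY / 10**6
yearly_volume_pb = daily_ingest_bytes * 365 / 10**15
# Derived value, not stated in the talk:
yearly_storage_cost_gbp = daily_ingest_bytes * 365 // TB * storage_cost_gbp_per_tb

print(packages_per_day)               # packages per day
print(round(sustained_mb_per_s, 2))   # average MB/s needed around the clock
print(yearly_volume_pb)               # PB per year
print(yearly_storage_cost_gbp)        # pounds per year at 100 pounds/TB
```

The results agree with the 20,000 packages per day and 7.3 PB per year quoted in the notes; the average sustained rate comes out at roughly 231 MB/s.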

For the ingest of data from the FamilySearch project they needed to guarantee a throughput of 20 TB of data per day while keeping adequate procedures for processing the data according to the requirements of OAIS and of the customer. The aim of the project was to identify the bottlenecks of ingesting large volumes of data.

Processes such as generating or checking hashes, identifying formats and extracting technical metadata usually require, at large volumes, a fast storage system. In the FamilySearch project they want to ingest into SDB (content acquisition, content preparation, ingest, fixity check, content/metadata integrity check, characterization, i.e. identification and validation of formats and extraction of technical MD) at most 700 MB per second.

They worked on how to parallelize such a massive workflow efficiently while minimizing costs. According to their findings, parallelization makes it possible to get around performance problems of tools such as DROID and JHOVE; overall, contrary to their expectations, software performance was not a problem. The bigger problems are in the HW, which has to be able to write fast enough.

I.e. the bottleneck was in the HW and in moving data from place to place rather than in the performance of digital preservation tools.
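How a fixity step can be parallelized is easy to sketch. The example below is an illustration only and says nothing about how SDB implements it; the function names and the worker count are assumptions. It hashes the files of one package in several threads using only the Python standard library.

```python
import hashlib
from concurrent.futures import ThreadPoolExecutor
from pathlib import Path


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Compute the SHA-256 of one file, reading it in 1 MiB chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def checksum_package(files, workers: int = 8) -> dict:
    """Hash all files of a package in parallel; returns {path: hex digest}."""
    files = list(files)
    with ThreadPoolExecutor(max_workers=workers) as pool:
        # pool.map keeps the input order, so zip pairs each path with its digest
        return dict(zip(files, pool.map(sha256_of, files)))
```

With such a setup the hashing itself scales with the number of workers, and the read speed of the underlying disks becomes the limit, which matches the conclusion of the talk.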

Benefit for NK


Do not be afraid of the performance of SW such as DROID or JHOVE.

Ross King: Evolving domains, problems and solutions for LT DP
- information on projects such as SCAPE etc.
- programme: http://cordis.europa.eu/fp7/ict/telearn-digicult/report-research-digital-preservation_en.pdf; Stephan Strodl, Petar Petrov and Andreas Rauber (Vienna University of Technology, Austria): a nice timeline of preservation projects, a white paper about the past of European DP
- funds spent on DP research are gradually growing; projects and funding alone will not solve anything
- ARCOMEM: archiving, web archives, socially driven web preservation model; social web analysis; archive enrichment
- ENSURE: evaluation between cost and value, automation of the preservation cycle; testbeds: healthcare, clinical trials, financial services
- SCAPE: preservation planning and action workflows and how to make them scalable; building infrastructure for scalable preservation actions; development of a policy-based preservation planning tool with automatic preservation watch; 3 testbeds: web archives, large-scale repositories, research data sets
- all the projects will produce prototype SW
- digital lifecycle approach
- preservation planning plays a role in all these projects, together with virtualization
- slide with trends in DP over recent years
- Research on Digital Preservation within projects co-funded by the European Union in the ICT: ENSURE, SCAPE, Wf4Ever (http://www.wf4ever-project.org/about), TIMBUS (SW is not enough; focuses on the context of organizations; LTP is not only about objects but also about services etc.), TOTEM

Benefit for NK

Follow projects in the field of long-term preservation of digital data. Recent EU projects such as SCAPE will speed up the development of concrete tools for long-term preservation of digital data.

Record keeping in temporary command settings, Erik Borglund
- preservation of documentation of crisis situations arising from the activities of the police etc.
- how to capture context: flipcharts, videos and notes can be kept, but what about the context
- analogue documents are not a problem; the problem is with digital items and conversations
- the national archive should take care of this, but it only accepts paper documents or e.g. photos from the scene
- question: is archiving the file material the same as archiving the course of the proceedings in digital form?


webarchiving session

BnF
- 200TB of web-archived data
- 15 million ARCs; they have to be characterized and validated, which is time-consuming
- they store them in a shared repository
- SPAR (the LTP system of the French national library) has a capacity of 16PB
- they use JHOVE2 for characterization and are building a module for ARCs
- they do not want to characterize and validate the content of the ARCs, only to identify formats

PREMIS in METS would be too long, so they will record only metadata at the level of the information package (AIP) that are the same for the whole package; i.e. a property is expressed once and then only the information about which files it applies to is added, instead of repeating that information for every file.
They created a special metadata format for this, i.e. they are able to ask the LTP system "give me all information packages that contain format XY" etc.; there is no need to index the metadata of the contents, which would take a long time. They take the same approach for digitized books.
Different DP policies and validation levels for different types of web archive data: complete harvests vs. thematic harvests.
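The principle of stating a property once per package and only listing the files it applies to can be sketched as follows. This is an illustration of the idea, not BnF's metadata format; the package identifiers, file names and the use of MIME types as format labels are hypothetical.

```python
from collections import defaultdict


def summarize_formats(files):
    """Group (file name, format) pairs so each format is stated once per package."""
    by_format = defaultdict(list)
    for name, fmt in files:
        by_format[fmt].append(name)
    return dict(by_format)


def packages_containing(packages, fmt):
    """Answer 'give me all information packages that contain format XY'."""
    return sorted(pid for pid, summary in packages.items() if fmt in summary)


# Hypothetical packages with package-level format summaries
packages = {
    "aip-001": summarize_formats([
        ("a.html", "text/html"),
        ("b.html", "text/html"),
        ("c.png", "image/png"),
    ]),
    "aip-002": summarize_formats([("d.pdf", "application/pdf")]),
}

print(packages_containing(packages, "image/png"))
```

The query only touches the package-level summaries, so the individual file records inside the containers never need to be indexed.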

NLNZ
- 2 harvests, 20 TB in total
- they are dealing with metadata: how much metadata is a lot and how much is too little
- library policy says that as much metadata as possible must be stored, which would however be a problem in terms of metadata size
- for selective web harvesting they have a finished workflow in WCT; everything is catalogued

IA
- 16 billion URLs
- the oldest from 1996
- growth is 3TB per day, 1PB per year

Euan Cochrane, Dirk von Suchodoletz: Replicating Installed Application and Information Environments onto Emulated or Virtualized Hardware

- capturing and preserving the whole environment on emulated HW
- e.g. take the desktop environment of the prime minister and store it in the archive
- display problems
- computer forensics
- an option for preserving scientific data and records
- it is all about how to replicate an HDD and run the environment stored on it in a virtual environment
- solution: they pulled the HDDs out of several old PCs > identify the HW requirements (HDD analysis > automatic estimate of requirements, this is part of every PC environment) > choose the emulation/virtualization SW (tool registry such as TOTEM from the KEEP project) > convert the HDD into a disk image suitable for emulation > try to boot the disk image on the emulated HW > add drivers
- problems with licences, personal data protection, authenticity (20 % of things change, colours etc.)
- QEMU, SPARC processor emulator


Klaus Rechert: Remote Emulation for Migration Services in a Distributed Preservation Framework
Use of emulation as a tool for migration.

Migration tools for certain formats are often not available. We put the digital object into an emulated environment (virtual machine), then we see it in the environment of the emulated system, we can open it in the original or another suitable application, save it in a different format and store it back into the virtual machine.

Bram Lohman: Emulation as a Business Solution: the Emulation Framework (KEEP project)

- emulation framework: 7 emulators, 6 platforms (x86, Amiga etc.), 23 file formats
- a solution for managing emulation tools
- setup of emulation processes
- an environment that contains emulators; if we load an application or a file into it, it should run as in the original environment
- the environment also contains a tool that shows, for a file, what format it is and what environment is needed to run it, based on PRONOM; the environment can be prepared right away and the file run in it
- it loads the SW image from the OPF application database, which is being built

Geoffrey Brown: Developing Virtual CD-ROM Collections: The Voyager Company Publications

- publications of one specific publisher on CD, interactive applications for Mac; of 200 published, only about 50 are available today
- emulation on today's systems
- HDD snapshot directly in the emulator, i.e. it takes one click and is very fast
- SheepShaver emulator

Evaluation of Danish large migration project
- before 1998 they had no formats prescribed by law
- between 2005 and 2008 they introduced standards
- the evaluation concerns the prescribed standards and the migration into them at the national archive
- the evaluation was done for the funder
- between 2005 and 2008 they spent 30 person-years on migration; they had 10-15 people on it; 190 thousand USD was invested; total costs 26 million USD

It is not much data; what they really migrated was about 1777GB.

Different parts of the archive: tapes, population data, data on CD-R, registries and electronically filled-in data. They could not read all the files, especially on tapes. 5 different types of tape. Some had to be rescued at great expense.


Total cost of producing the preservation standard: 10 man-years, 12 thousand USD, including a manual and implementation recommendations.

Pilot: project planning and management, and verification of the information packages.

The aim of the pilot was to get a better budget and project plan.

Some problematic data in old formats, such as old databases, need smart people doing repetitive, long-lasting work; they needed good knowledge management for this to be efficient.

Migration method: they wrote requirements for the tool and a description of how manual migration should be done.

Data preparation (restructuring the data and registering the metadata of the IPs) and preparation of documentation of the migrated IPs.

Software development: in-house development.

They needed 50 person-years for 1

Conclusions

Migration of standard data is cheaper, migration from some standard tapes is cheaper, etc.

Most of the costs (80 %) fell on non-standardized data, in producing the migration software. Developing a tool for migrating heterogeneous or non-standard data is the most expensive part.

What they learned: the old data had not been analysed sufficiently.

Project management was loose; they lost money.

Knowledge management: a good description of old data and of all their types, generations, locations etc. does not exist at our institution and we will have trouble with that; migration of old data at NK will be problematic.

Angela Dappert: robust migration workflow for offline media
- What is an archival object (nice slide): a CD is not an archival object, for them it is a hand-held carrier; better is a bit-stable object, which can have a backup etc., and from there an archival object that has further metadata, logical preservation
- a CD is not searchable, cannot be replicated easily, has a large manual overhead, and the rendering technology becomes obsolete very quickly
- Endangered Archives project: optical disks, CD-R, external HDs, tapes, 67 terabytes in total
- the offline hand-held carriers in the Endangered Archives project were very variable; they contained data with DRM, under copyright, and a range of similar problems
- options they chose between for each data source:
- disk image: a single file that contains everything on the carrier
- or extraction of only some files
- how important is the carrier itself? We need to keep some information about it; it may hold traces of deleted data and we may want to keep them. They made disk images of everything possible: hybrid DVDs, audio discs that also held data, etc.
- which disk image format should they use? Not a single disk image format for all data; a special disk image format for each kind
- they did it with robots, disk copying robots; some large-scale disk copying robots could not be used: they are good at producing CDs but not at ripping data from CDs


- they built their own application with disk stacks and some smaller robots; these used LIFO or FIFO; in the end they used FIFO, as LIFO had problems with the CD lifter

At the KB they thought through a fairly complex workflow, how to describe it, etc.

Each robot had its own PC.

They had problems with a range of things, see the presentation.

They did not find good SW for managing the images, only command-line tools, and non-technical staff would make a lot of mistakes with those.

It takes a lot of people before the data get online.

They must be well trained and flexible, but also able to do tedious jobs: systematic, patient.

NOTE: important for NK, where the transfer of data from discs will also have to be addressed and already has been.


KEEP project, Antonio Ciuffreda: towards integrated migration environment

- Disk transfer tool: converts a disk into an image file; it also contains further metadata about the file system, the MD5 of the file, etc.
- KEEP is producing a Transfer Tool Framework
- Magnetic media: disk transfer tools for floppy disks, commercial and open source
- Disk2FDI: commercial DOS tool, very accurate floppy disk image; it takes 1 hour and the result is very large, ten times larger than the floppy disk itself. They tested about 2260 disks and tested emulation
- Catweasel: commercial tool, a PCI card, runs on Linux and Win XP, has a GUI. High error rate but faster; image file quality was low
- Nibtools: free tool, G64 and D64, covers only C64; DOS, Win, Linux, but command line only. Needs a Commodore disk drive and special cables. They tested a few disks; about half then did not work in the emulator
- Optical media: they used 5 transfer tools, with the same CDs, DVDs and games for all of them

1. Alcohol 120%: commercial, can bypass DRM etc., supports Win systems. Of 13, 12 worked; 2MB per second
2. Daemon Tools: commercial, several image file types (ISO, MDS, MDF), supports Win; three of 13 did not work
3. CloneCD: commercial, uses IMG or ISO, bypasses SafeDisc 3 protection, supports Win, has a GUI; 11 of 13 were OK
4. BlindWrite: commercial, supports DVD, Blu-ray, WM, Xbox and other special discs. Generates ISO and some proprietary ISO image formats; one did not work; transfer speed
5. ImgBurn: does not read subchannel information into the image file (a film cannot be seeked etc.); it is open source; generates DVD, BIN, CUE, IMG; Win and Linux; 4 did not work; it is fast
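The failure counts above can be turned into comparable percentages. A small sketch, assuming all five tools were run on the same 13 discs; the notes state the figure 13 explicitly only for the first three tools, so for BlindWrite and ImgBurn it is an assumption.

```python
DISCS_TESTED = 13  # assumed to apply to all five tools

# Number of discs that did not work, per tool (from the notes)
failures = {
    "Alcohol 120%": 1,
    "Daemon Tools": 3,
    "CloneCD": 2,
    "BlindWrite": 1,
    "ImgBurn": 4,
}

failure_rate_pct = {
    tool: round(100 * count / DISCS_TESTED, 1)
    for tool, count in failures.items()
}

for tool, rate in sorted(failure_rate_pct.items(), key=lambda item: item[1]):
    print(f"{tool}: {rate} % failed")
```

Under that assumption the failure rates range from about 8 % to about 31 %, which is roughly in line with the "between 30 and 10 percent" errors mentioned in the conclusions.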

Conclusions

For magnetic media, the performance of commercial and non-commercial tools does not differ; Disk2FDI is accurate but very slow. KEEP will use NibTools.

Optical media: think about copy protection; the tools have similar performance; there will always be errors in the images, between 30 and 10 percent; BlindWrite can handle game discs (Xbox etc.). KEEP will use ImgBurn because it is open source.

For complex images BlindWrite is better.

Benefit for NK

Consider whether it would be appropriate for NK to actually run a project migrating the content of CDs and DVDs to online media. Concrete experience with robotic processing is presented here, showing what problems they had with designing the workflow, choosing the ISO image type, etc.

BnF web archiving
They have three layers:

Harvest definition: collection

Harvest instance: crawling metadata


ARC files

A collection will be, for example, the selective "elections 2000" harvest, then the individual harvest instances, and then the ARCs. They collect logs, config and reports. These are also stored in ARC, a special metadata ARC for each crawling instance.

PREMIS

Object, agent, event

Objects: 1. ARC files and metadata ARCs; 2. harvest instances

Harvest event in PREMIS: event = creation of content files

Events: reports as extensions of the event, host report and harvest report

Agents: administrator, SW, institutions, organizations that perform the harvest

ContainerMD

http://bibnum.bnf.fr/containerMD-v1/documentation/containerMD-v1.html

special metadata for items from the web archive

http://bibnum.bnf.fr/containerMD-v1

Different SLAs for different types of material, for different data from different harvests; shared repository; different benefits are expected; skills for different formats need not be duplicated within the institution.

Next year there should also be a JHOVE2 module for WARC.


Memento: meta search engine

Benefit for NK

Their web archiving model could be used at NK.

Cost models: Danish national library + TU Wien
- Stephan Strodl, TU Vienna: they have their own cost model, but only small-scale; automated preservation action cost, it seems
- Danish national library: they built their own model, which should be universal and usable anywhere
- they measured the cost of submission according to the PAIMAS standard
- when calculating costs they use OAIS and PAIMAS: they map activities onto these models and then estimate the cost of the processes accordingly
- costmodelfordigitalpreservation.dk
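The approach of mapping activities onto a reference model and estimating costs from that mapping can be sketched in a few lines. This is not the Danish cost model; the activities, hours and hourly rate are invented for illustration, and only the entity names (Ingest, Archival Storage, Preservation Planning) come from the OAIS functional model.

```python
from collections import defaultdict

# (activity, OAIS functional entity, estimated hours) - hypothetical values
activities = [
    ("receive and validate SIP", "Ingest", 12.0),
    ("generate AIP", "Ingest", 8.0),
    ("write two tape copies", "Archival Storage", 5.0),
    ("monitor format risks", "Preservation Planning", 3.0),
]
HOURLY_RATE = 40.0  # hypothetical labour cost per hour


def cost_per_entity(activities, hourly_rate):
    """Sum the labour cost of all activities mapped to each OAIS entity."""
    totals = defaultdict(float)
    for _name, entity, hours in activities:
        totals[entity] += hours * hourly_rate
    return dict(totals)


costs = cost_per_entity(activities, HOURLY_RATE)
print(costs)
```

Once every activity is assigned to an entity, costs of different institutions or workflows can be compared entity by entity.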

P iacutenos pro NK

K projektu 0136 tam se eily monosti odhadovaacuteniacute naacuteklad na dlouhodobeacute uloeniacute

Meet RODA, a Full-Fledged Digital Repository for Long-Term Preservation

- Originally a project of the Portuguese national archive; we have been following it for several years. RODA is now supported by an independent company, which also partly continues its development. So far RODA supports only an archival metadata format (EAD), but further development should also cover library formats.

- RODA is now part of the SCAPE project, within which the system can be developed further and scaled for use in mass production.

- http://redmine.keep.pt/projects/roda-public

Benefit for the NK

Follow further development; possibly also for the INCAD + KNAV projects. For developing LTP in smaller institutions this could be an interesting alternative in the future.

Report on a foreign business trip

Name of trip participant: Mgr. Andrea Fojtů (AF)

Workplace according to organizational structure: 15 Department of Long-Term Preservation of Digital Data (ODODD)
Position: 152 Digital Repository Content Management Department
Reason for the trip: participation in the conference Aligning National Approaches to Digital Preservation
City: Tallinn
Country: Estonia
Dates (from-to): 22-26 May 2011
Detailed schedule:

Monday 23 May: conference registration, guided tour of the National Library; Keynote Address by Laura Campbell, Library of Congress (USA); Panel 1: Technical Alignment; Panel 2: Organizational Alignment

Tuesday 24 May: Keynote Address by Gunnar Sahlin, National Library of Sweden; Panel 3: Standards Alignment; Panel 4: Legal Alignment; Breakout Sessions for panels 3 & 4

Wednesday 25 May: Panel 5: Education Alignment; Panel 6: Economic Alignment; Breakout Sessions for panels 5 & 6; Synthesis; Closing remarks

Travelling companions from the NK: PhDr. Bohdana Stoklasová (BS), Ing. Tomáš Svoboda (TS)

Funding: IOP NDK
Trip goals: attendance at a conference with international participation; establishing contacts in the area of long-term preservation of digital documents and electronic legal deposit; a more detailed insight into the issues of long-term preservation of digital documents, (especially) in national libraries

Fulfilment of trip goals (specifically): The conclusions of the conference Aligning National Approaches to Digital Preservation largely mirror the conclusions of the workshop The Future of the Past: Shaping new visions for EU research in digital preservation (report available at http://cordis.europa.eu/fp7/ict/telearn-digicult/future-of-the-past_en.pdf), for example regarding the missing economic model for the commercial sector, which would treat long-term preservation as an integral part of all its processes. Contact was established with staff of the national libraries of Estonia and Finland (staff working on web archiving and on long-term preservation in general).

Programme and further details: The main goal of the conference was to align national approaches to the long-term preservation of digital documents across all areas, from technical, organizational and educational to standardization, economic and financial.

Materials brought back: conference programme, leaflets of exhibiting companies (Tessella, Equella, Guardtime) + further materials, notes

Report submitted on: 8 June 2011
Signature of the submitter:

Signature of the supervisor:

Posted on the intranet:
Received by the international department:

Appendix to this report: conference notes in English

Appendix: Conference notes in English

Exploring What We Can Do Together: Strategic Alliance for International Collaboration, Laura Campbell

- 185 digital preservation partners in more than 25 countries (education, research, LAM)
- strategic goals: National Content Stewardship Network (national digital collection, technical architecture, public policy, outreach)
- NDIIPP Content Domain Map: a mind map of geospatial, audiovisual, image & text and web content
- then & now: cognitive surplus vs digital libraries/digital preservation
- solution framework: actively working together, special interest groups, establishing a common index, international digital collection (freely available)

PANEL 1: Technical Alignment (The role of testing), Prof. Dr. Michael Seadle (Panel Head)

to collaborate on requiring and implementing rigorous and independent tests

DNB Contribution to the Tallinn Alignment, Sabine Schrimpf

- key theme is infrastructure: a network of hardware and software that permits the operation of application software
- the question of interoperability is crucial (standards, technical specifications)
- software elements: the components of the DP infrastructure were compared to pallets on railroads
o Source: PARSE.Insight Roadmap 2020
- persistent ID resolvers, certification process
- kopal (ingest, koLibRI)
- nestor (German Network of Expertise)
- DP4Lib: Digital Preservation as a Service; reduce dependency between components
o redundant storage at different locations
o koLibRI modules
- LUKII: set up as an economical LOCKSS network in Germany
- SHAMAN
- APARSEN: wants to bring coherence and cohesion to digital preservation research
o trends in DP research projects
o modular DP systems
o distributed as SOA
o elimination of technology dependencies

EDINA: The UK LOCKSS Alliance, Adam Rusbridge

- EDINA offers underlying technical support & coordination
- threats to digital stewardship
o failure (media, HW, SW, network, format obsolescence, natural disaster, economic/organizational failure)
o attack (insider/outsider)
o operator error
- source: Requirement to Digital Preservation
- projects PEPRS and PECAN help identify coverage and requirements for DP

Public testing, Michael Seadle

- traditional physical archiving relies heavily on trusted institutions
- distrust, not trust, needs to be the basis of digital archiving; testing therefore plays a key role
- goals of testing: demonstrate functionality, reveal weaknesses, provide data for planning improvements
- key issues for testing: integrity, authenticity (can the origin or genuineness be shown?), usability (can migration/emulation be demonstrated?), access, financial integrity
- DrWho (drwho1com)
- bit stream testing is the most important; authenticity and usability may be impaired
o the type of storage media, the number of copies and the frequency of checking and replacement get us to the relevant results
o no reliable metrics exist, however (what is an acceptable loss, etc.)
- without well-documented, peer-reviewed, publicly available test results, librarians are buying archiving systems on faith
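As a rough illustration of the bit stream testing discussed in this talk, the sketch below compares several stored copies against a digest recorded at ingest. It is a minimal example using SHA-256 from Python's standard library; the site names and sample content are hypothetical, and a real audit would read files from storage rather than from memory.

```python
import hashlib


def sha256_hex(data: bytes) -> str:
    """Return the SHA-256 digest of a bit stream as a hex string."""
    return hashlib.sha256(data).hexdigest()


def audit_copies(copies: dict, expected_digest: str) -> list:
    """Return the names of copies whose digest differs from the recorded one."""
    return sorted(
        name for name, data in copies.items()
        if sha256_hex(data) != expected_digest
    )


# Hypothetical example: three copies, one of which has a flipped character.
original = b"archived bit stream"
recorded_digest = sha256_hex(original)  # digest stored at ingest time
copies = {
    "site_a": original,
    "site_b": original,
    "site_c": b"archived bit strean",
}

damaged = audit_copies(copies, recorded_digest)
print(damaged)  # ['site_c']
```

How often such an audit runs, and over how many copies, are exactly the parameters the talk says still lack agreed metrics.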

Presentation without a title, Andy Rauber

- evaluation vs testing vs benchmarking
- DP today: evaluation rather than testing, and far from benchmarking (a few tests, but nowhere near a definition of benchmarks)
o existing evaluations are not repeatable
o focus on the simple things
o building the frameworks before having clear test scenarios
- necessary to move towards comparative benchmarking
- what is needed: a commitment that we want a culture of benchmarking and comparative evaluation; an understanding of what we want to benchmark; benchmark data + ground truth; measurement scales and measures that remain constant; knowledge bases to collect these

Organizing digital preservation on an international level, Michelle Gallinger (NDIIPP)

- focus on a national DP agenda
- community driven, action oriented (National Digital Stewardship Alliance)
o present a distributed national digital collection for the benefit of citizens

The European Research Arena, David Giaretta

- technologies: GEANT, EGEE/EGI
- EU research projects: TIMBUS, BLOG4EVER, SCAPE, ENSURE, APARSEN, ARCOMEM, WF4EVER
o SCIDIP-ES (2011-2014)
- Alliance for Permanent Access (APA) formed as a legal entity 3 years ago
o opportunities for networking
- ISO 16363: Audit and Certification of Trustworthy Digital Repositories
- ISO 16919: Requirements for Bodies Providing Audit and Certification

Observation from the MetaArchive Cooperative program, Martin Halbert

- distributed DP programs differ from other programs
o replication of content, distribution of the replicated copies to distinct geographical locations, and a network organization to connect these replicated copies
- MetaArchive established in 2003, funded by NDIIPP
o seeks to foster broader awareness of digital preservation issues
- IIPC members are all institutions that focus on WA
o 39 members: national and university libraries + other organizations (Internet Archive)
o ISO standard WARC format for web archives + Heritrix and NutchWAX
o growing membership (Africa & South America)

DAY 2

Keynote address: International and National Collaboration in the digital age, Gunnar Sahlin

- 2012: a new law for e-deposit
- Samsök (search together) in 2005 (upgrade, new system)
- Swepub and long-term preservation
- consortium of the Swedish research libraries for licensing e-journals and databases (ICOLC)
- Open Access and e-publishing (all universities have their repositories for e-pubs)
- NL aggregator for the Europeana: TEL, Apres, Athena, EU screen
o common system for the preservation of digital materials
o common search portal for materials from the Swedish National Library and the Swedish National Archive

Raivo Ruusalepp

- standards: RAC, DSA, CIDOC (CRM), PAIMAS, ISAD(G), DDI
- use of information security standards for digital preservation
- information security: administrative and technical (physical = data security vs IT = communication)
- company-implemented security measurement with typical cyber crime scenarios
- survey of security
o provision for information security in national legislation and development plans (12 of the respondents: ISO 27000 series; only 2 formal audits, the rest are looking into it; ½ of the respondents do not use standards or formal measurements to control information security)
o IT & disaster plan: 65 (data recovery from the off-site location tested: 0)
- alignment
o better use of community standards for information security and preservation
o agreement on security requirements

Standards-based approach to preservation planning, Matthew Woollard

- ISO 27001: very expensive implementation, 100 000 pounds
- Basic: Data Seal of Approval Guidelines (helps you understand your business better)
- Audit and Certification of Trustworthy Digital Repositories
- Memorandum of Understanding to Create a European Framework
- ISO 16363 external, DSA or ISO 16363 self-audit

Best Practices & Standards, Bram van der Werf

- self-assessment: trust; audits/certification: trust
- ISO 30300 (draft): Record Management

PANEL 4: Legal Alignment

Legal deposit & Web Archiving, Adrienne Muir

- legal deposit provisions: purpose, scope, deposit mechanisms, roles & responsibilities & liability, access provisions, sanctions
- implementation: definitions, scope (offline/online, freely available/pay wall), technology neutral, incremental
- legal deposit vs voluntary (interim/hybrid approach, model agreements and licenses, flexibility)
- other legal issues: intellectual property rights (preservation, access), unlawful material, privacy/data protection
- voluntary approaches have disadvantages but may be necessary and can be useful

Breakout Session

- standards bring alignment by themselves, but only if you use them to the full, not halfway
- depends on the community (users): enforceable or voluntary compliance (standards)
- next steps for standards alignment
o corpora as benchmarks
o export/import completeness
o educational standards
o validation tools
o accredited training courses to accredit auditors
o framework standards

DAY 3
PANEL 5: Educational Alignment

- key elements of the DCC Curation Lifecycle Model
- Framework for the Education Alignment (USA: grad programs for digital preservation, new models for grad programs, internship programs (diverse knowledge), workshops)
o sharing tools
o national programs
- issues related to digital preservation
o nature of costs and business models
o strategies for selection & appraisal
o ground roles and responsibilities
o effectiveness and demand for services
- focus of the panel: factors influencing the actual sustainability of a digital archive
- 2 considerations: collaboration + user demand
- challenges + gaps: span national boundaries, public + private funding, education, exportation, DP certification, competition, funding gaps, policy, selection criteria, roles & responsibilities, standards
- Magazzini Digitali: e-legal deposit in Italy
- PADI: a failed project (discontinued in 2010)

Presentation without a title, Neil Grindley

- how much does it cost to manage information? what institutional financial strategies are required to facilitate effective preservation? what general economic frameworks are required to enable information to persist and be accessible?
- JISC 2010: Infrastructure for Education and Research Programme
- archival storage and preservation activities constitute a very small proportion of the overall costs: 1531 access 55 outreach acquisition ingest
o approx 333 Euros for a set of 1000 records
- KRDS2, p. 83: future tool development supporting automation of ingest

Sustainable Preservation in North America: ADPNet & Friends, Aaron Trehub

- solution: distributed digital preservation (in at least 3 copies vs LOCKSS 6 copies)
- DDP + LOCKSS = PLN, open software developed at Stanford
- MetaArchive, COPPUL (Canada), ADPNet: www.adpn.org
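The idea behind keeping several distributed copies can be sketched as a simple majority vote over content digests, with damaged copies repaired from an agreeing peer. This is a deliberately simplified illustration and not the actual LOCKSS polling protocol; the node names and content are hypothetical.

```python
import hashlib
from collections import Counter


def majority_repair(copies: dict) -> list:
    """Repair copies that disagree with the majority digest.

    copies maps a node name to the bytes it holds. Returns the sorted
    names of the nodes that were repaired in place.
    """
    digests = {node: hashlib.sha256(data).hexdigest() for node, data in copies.items()}
    winner, votes = Counter(digests.values()).most_common(1)[0]
    if votes <= len(copies) / 2:
        raise RuntimeError("no majority among copies; manual intervention needed")
    good = next(data for node, data in copies.items() if digests[node] == winner)
    repaired = [node for node in copies if digests[node] != winner]
    for node in repaired:
        copies[node] = good
    return sorted(repaired)


# Hypothetical three-node network with one corrupted copy.
nodes = {"n1": b"AU content", "n2": b"AU content", "n3": b"AU c0ntent"}
fixed = majority_repair(nodes)
print(fixed)  # ['n3']
```

With only three copies a single damaged node can still be outvoted; more copies tolerate more simultaneous failures, which is one reading of the "3 copies vs 6 copies" contrast in the notes.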


Moving On When it is Time to Re-Archive Michael Selway Quantum Corporation (USA) CHW) D

jinak to nejde u HDD

Migrace dat z Jak se na to migrac

- HDD - - roky

a mohl by s

- - - - -

co s split migration (full split migrace)

- - - - potaz

FamilySearch An End-to-End Process for Scanning Characterizing Preserving and Providing Access to Very Large Collections of Vital Records Tom Creighton FamilySearch (USA) Jonathan Tilbury Tessella plc (UK) and Mark Evans Tessella Inc (USA)

- At FamilySearch: microfilms from the mid-19th century - - - - - -

- DPS je jejich s - -

The Audit and Certification of FDsys

- FDsys -auditem

TRAC http://www.gpo.gov/fdsys

- - TRAC hathi UNT Portico metaarchive chronopolis v

- stupnice compliancy 1- -

How Long is Long-Term Data Storage (Focal) Barry M Lunt Brigham Young University and Douglas Hansen Wayne Rust and Mark Worthington Millenniata Inc (USA)

- - FLASH technologie 10- plny elektron

odlivu elektronu a

-

a presto se objevily chyby

- Library 1 21 - Library 2 18

- 1050 let - HDD 1-7 let -

Quality Assurance of Digital Information in Long-Term Digital Preservation University of Technology (Sweden)

- significant properties - dokumentech z -

Towards Interoperable Preservation Repositories Repository Exchange Package Use Cases and Best Practices Joseph Pawletko New York University and Priscilla Caplan Florida Center for Library Automation (USA)

- TIPR

- - - TIPR

-

- RXP: METS and PREMIS semantics; the metadata files are not inside the METS but alongside it

- - succession - - - - disaster recovery a v

jako aip - migrace SW - - diversifikace - migrace dat v -

SARKK Comprehensive Digital Archive Services for Finnish Municipalities

-Savon Tietohallinto Oy (Finland)

- firma kterou v ICT long-

- - - preserv planning funkcionalitou -

-

Magnetic Tape Technology economic advantages for preservation Gary Francis Oracle USA

- Oracle prezentace - - clipper group 2010 in search for the long-term archiving solution tape delivers significant

TCO advantage over disk - 5TB na 1 cartridge sunoracle

-----------------------------------------

Color In Digital Preservation Robert Buckley University of RochesterNewMarket Imaging Steven Puglia National Archives and Records Administration and Michael Stelmach Library of Congress (USA)

-

atd -

o barev GAMUTem

AdobeRGB Pro FotoRGB

- a reprodukci barvy

Multispectral Image Archiving of Watermarks in Historical Papers Peter Meinlschmidt Wilhelm-Klauditz-Institut Fraunhofer-Institute for Wood Research and Volker

Braunschweig (Germany)

- Reflected light - Transmitted light

-spectral imaging -

o

o o o o

Implementing a Quality Assurance Program for Monitoring Scanner Performance Michael J Horsley and John T Berezich National Archives and Records Administration (USA) DAITSS

- -

mikrofilm sken a r - NARA 2004 Guidelines

https://docs.google.com/viewer?url=http%3A%2F%2Fwww.archives.gov%2Fpreservation%2Ftechnical%2Fguidelines.pdf

- Metamorfoze - etc., see slides - Quantitative performance slides - Web-based database, SharePoint -

Preservation in a Digital Age Jay Verkler FamilySearch (USA)

- Tessellou pro SDB budou k 1 - - - data loss is intrinsic gt - - - och

- preservation as a service

Curation of the End-of-Term Web Archive Classification and Metrics Kathleen Murray Lauren Ko and Mark Phillips University of North Texas (USA)

- - EOTCD archive: http://research.library.unt.edu/eotcd/wiki/Main_Page - - for - 16 TB of data - - (not WARCs)

pak to pospojo

DAITSS Grows Up Migrating to a Second Generation Preservation System (Focal) Priscilla Caplan and Carol Chou Florida Center for Library Automation (USA)

- DAITSS 1 could not be installed anywhere else :-) -

instructions

- v procesech

- - PSP

- refresh funkce - disseminate- do a refresh and export new AIP as DIP - withdraw remove AIP from storage retaining provenance

-

1

- - 80TB -

A Community Driven Micro Services Architecture Supporting Long Term Digital Preservation Mark Evans and Bill Steel Tessella Inc (USA) and Robert Sharpe James Carr Alan Gairey and Jonathan Tilbury Tessella plc (UK)

s NDIIPP v micro-services

spojeno do procesu- lze pak libovol

- - - -

-

workflow - -

- bit-streamu - cloudu (S3

httpawsamazoncoms3) SaaS funkcionalitu bude brzy ti tenancy

funkcionalitou policy apod

Pozn z USA

44 TB SIP in the test (10 MB JP2 files)

komunita SDB - - vznik 2008 - -

System

Lisa LaPlant and Blake Edwards US Government Printing Office (USA) FDsys OAIS dig MODS - z

<part>2</part>

DMD data model definition z

http://www.gpo.gov/fdsys/search/pagedetails.action?st=prague&granuleId=&packageId=DCPD-200900228

----------------------------------------- Preservation Starts from the Beginning Michael Wash US Department of Transportation (USA)

Autographic Kodak 1916 na druhou

following the No 3A Autographic Kodak Special of 1916 which was the first rangefinder camera It had a Kodak Anastigmat f63 lens and a Kodamatic shutter with speeds from 12 to 1200 sec plus bulb and time mode httpcamerapediawikiacomwikiNo_1A_Autographic_Kodak_Special Kodak Advantix

zastavil

Colorite A Flexible Cross-Platform Software Solution for Automatic Image QualityAnalysis Using Arbitrary Targets Henrik Johansson National Library of Sweden (Sweden)

- a - -

o Target se detekuje automa se aby tento proce

o TIFF, JPEG, JP2, PNG o o state-of-the-art feature-based image matching algorithm, ImageMagick

o Alg

t o o v budoucnu vypnout aby se proces urychlil pro BATCH

proce o

Henrika Johanssona What if the Image Quality Analysis Rates My D Dietmar Wueller Image Engineering (Germany)

- Golden Thread - UTT target -

o

-lens reflex

Establishing Resolution Requirements for Digitizing Transmissive Content A Use Case Approach Michael Stelmach Library of Congress Don Williams Image Science Associates LLC and Steven Puglia National Archives and Records Administration (USA)

-

- Based on 10 SFR limiting resolution criteria how much the image information will be captured

o first half of the 20th century: 1200-1600 PPI
o second half of the 20th century: up to 2800 PPI

Digitise More Pay Less Optimising the Workprocess for both Heritage Institute and Imaging Provider Olaf Slijkhuis Pictura Imaginis (the Netherlands)

- ument kvalita parametry atd - -

- trolu

obrazu

Debate with FamilySearch, 20 May 2011 -------

also on the topic of Digital Preservation

will have to provide free of charge

and synchronize

an effort on their side to cooperate with archives and libraries


Business trip report
Project: Creation of the National Digital Library

CZ.1.06/1.1.00/07.06386

Name of trip participant: Jan Hutař

Workplace according to organizational structure: ODF 81

Position: head of division

Reason for the trip: attending the iPRES 2011 conference

City: Singapore

Country: Singapore

Dates (from-to): 30 October - 5 November 2011

Detailed schedule:
30-31 October: flight Prague > Dubai > Singapore

1 November: start of the conference, tutorials

2-4 November: conference

4-5 November: return flight Singapore > Dubai > Prague

Travelling companions from the NK: Mgr. Marek Melichar (funded from project 0136)

Funding: IOP, Creation of the National Digital Library

Relation to the project: obtaining new information on digital preservation and on projects in other libraries; consultations with colleagues and companies

Trip goals: see relation to the project; use all outputs for planning and running the NDK project, and for future handling of digital preservation at the NK/NDK

Fulfilment of trip goals: fulfilled, see the detailed notes below and the proceedings on SPS


Further details: SUMMARY AND BENEFIT FOR THE NDK PROJECT

- a noticeable rise of emulation as an approach to long-term preservation (in past years it was migration); the two approaches look set to complement each other

- a shift towards preserving complex data, databases and the like; the NK does not address this yet

- many contributions are also applicable to the NK and NDK (web archiving and preservation at the national library of France, audits, emulation, information on the SDB system (Tessella) and on the RODA system, certification, see Rouchon, etc.)

- information on the problems and solutions of using tools such as JHOVE, PRONOM and others in a real environment

- 2 contributions on backing up optical discs: a current problem at the NK too; ideally follow the procedures described in the proceedings

- a clear need for international cooperation and for adherence to standards so that such cooperation is possible

- for details see below

Project publicity support: N/A

Related materials

Material / Storage location

Conference proceedings / SPS, folder with business trip reports

Report submitted on: 15 November 2011

Signature of the submitter:

Date / Signature

Signature of the supervisor: 15 November 2011

Posted on the intranet:


Received by the international department:

Seamus Ross: digital curation and preservation
- preserving data sets: use of statistical databases and research data vs preservation of textual information
- personal data: no longer just photos in a box (Flickr etc.)
- banks: a great deal of personal data, of future use to historians, must be preserved; institutions do this too
- see mckinsey.com, big data full report (PDF)
- so why do DP (slide 16): future generations expect it; so that historians and scientists have sources; a legacy about the present for the future; an information ecosystem to enable storytelling
- the emphasis is shifting from preserving textual information to preserving complex databases

A capability model for DP, Ch. Becker et al.
- research within the SHAMAN and SCAPE projects
- SoS: systems of systems
- 3 kinds of systems (UML)
- a DPS where DP is a functional requirement
- SoS: a business system, a system within a system; the data then flow into the DPS
- a DPS where DP is not a functional requirement but which does it anyway (a DP-ready system): a business system with DP functionality
- but how to get DP into enterprise systems: a model for implementing DP in any system; within the SHAMAN project, a capability-based reference architecture
- governance, business and technical (operation) capabilities: a basis for decisions and for assessing the current state
- capability maturity model (CMM): processes, assessment and improvement, as in software development

Olivier Rouchon: certification and quality at CINES
- they store theses, digitized material, multimedia, documents, scientific data sets
- a data centre for the whole of France
- they have format and XML experts; 11 people
- 15TB of data
- certification: by national law CINES is the national centre for DP of theses; they have a department, staff and money for it; procedure and preparations see below
- preparing for certification: testing DRAMBORA, DSA, TRAC, ISO 16363 and ISO 16919
- step: 2009 DRAMBORA audit, 2 risk reviews per year, how the risks are being resolved
- step: formalization of business processes, 14 processes according to ISO 9001
- management, operational and support processes (presented at iPRES 2010)

- 2009: external pre-audit, 2 people, 19 man-days, based on all available standards, carried out by checking documentation and by interviews

- 2010: SIAF audit, 4 months; the national archives of France do this for every archive that stores public data, with an audit every 3 years; the report ran to 800 pages


- 2010: Data Seal of Approval, part of the EU framework for audit and certification of trusted repositories (an MoU between the three certification activities)

- 2011: within the APARSEN project they also did ISO 16363; they went through that audit together with DANS and UKDA

- first internal, January and April 2011 (60 man-days), then external, 12 experts (KB, BL, NASA etc.) in June 2011 (3 days)

MoU: DSA > ISO 16363 as the second step (internal) > ISO 16363 extended
An audit gets faster the more of them you do, i.e. if it is done regularly it is not so time-consuming.

The national library of New Zealand passed TRAC certification in autumn 2011.

Andreas Rauber: the impact of preservation actions on repositories
- what happens to the repository itself
- repository simulation: RepoSim
- for analysis and for testing migration: what happens when files grow, what if the repository holds more types of formats, etc.
- RepoSim simulator: flexible, irregular patterns
- so far an internal version: Hibernate, Java, MySQL
- it is possible to specify which formats the repository accepts, their description, ingest, settings, hypothetical tools (mainly for migration), and rules for preservation actions (migration to which format, which versions, which files, how many times; rules + filters)
- a virtual migration can be run: it produces graphs showing how long it will take, etc.
- what to migrate, how, to what and over what period: it runs virtually and we see the result
- good for IT and HW planning
- good for planning different scenarios, comparing them with the expected development, and planning HW growth and investment
- still to be added: the option to enter deletion policies, reports, etc.
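To make the idea of a "virtual migration" concrete, here is a toy sketch of what such a simulation might compute: given holdings per format, a migration rule with an assumed tool throughput, and a yearly growth rate, it estimates how long the migration would take in each simulated year. This is not RepoSim itself; every class, parameter and figure below is a hypothetical stand-in.

```python
from dataclasses import dataclass


@dataclass
class Holding:
    fmt: str
    files: int
    avg_size_mb: float


@dataclass
class MigrationRule:
    source_fmt: str
    target_fmt: str
    throughput_mb_per_s: float  # assumed speed of a hypothetical migration tool


def simulate(holdings, rules, yearly_growth, years):
    """Estimate migration time in hours for each simulated year.

    Holdings grow by yearly_growth (0.5 means +50 % per year); only formats
    covered by a rule are migrated.
    """
    rule_by_fmt = {rule.source_fmt: rule for rule in rules}
    hours_per_year = []
    for year in range(years):
        factor = (1 + yearly_growth) ** year
        seconds = 0.0
        for holding in holdings:
            rule = rule_by_fmt.get(holding.fmt)
            if rule is None:
                continue  # no rule: this format is left untouched
            volume_mb = holding.files * holding.avg_size_mb * factor
            seconds += volume_mb / rule.throughput_mb_per_s
        hours_per_year.append(seconds / 3600)
    return hours_per_year


holdings = [Holding("TIFF", 1000, 36.0), Holding("PDF", 500, 2.0)]
rules = [MigrationRule("TIFF", "JP2", 10.0)]
hours = simulate(holdings, rules, yearly_growth=1.0, years=3)
print(hours)  # [1.0, 2.0, 4.0]
```

Plotting such a series per scenario would give the kind of planning graph the notes describe for IT and hardware investment.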

José Barateiro: Risk assessment in DP of e-science data and processes
- DP as risk management
- ISO 31000: definition of risk management
- similar to DRAMBORA
- there are many standards for risk management
- the ISO 31000 methodology broken down into individual steps
- TIMBUS project, http://timbusproject.net; one of the partners is SAP (Germany)

Mad talks
- the open source LTP software RODA is back; it is being developed within the SCAPE project: new functions, development plans and the start of a user community
- 4 posters on emulation: emulation within KEEP, emulation for library reading rooms, the OPF ecosystem, registries
- TOTEM: a metadata standard for describing the technical environment for emulation


ANDS, Ross Wilkinson
- data centres in Australia, at least 3 for different areas of life
- ANDS has existed for almost 3 years, funded by the Australian government
- a huge volume of data will never be used or read by a human, only mined by automated processes
- research data must be stored and preserved because it may no longer be possible to recreate them; so that they can be reused, so that new conclusions can be drawn from them, and so that scientists have them at their disposal
- this has to be done in cooperation; it cannot be done by a single institution on its own
- who deals with the storage of scientific data in the Czech Republic? The Academy of Sciences? CESNET?
- similar data centres also exist in the United Kingdom

Rob Sharpe: Considerations for High Throughput Digital Preservation
A presentation by the company Tessella on their testing of ingest performance into SDB at FamilySearch.

- SDB has been in development since 2002, when the first customer was the National Archive UK
- new customer: the UK parliament
- test with FamilySearch
- 20TB ingested per day, scanned material, a workflow with antivirus, characterization (PRONOM, JHOVE), etc.
- 1 package is roughly 1GB; 20 thousand packages per day
- 2 Dell PowerEdge R710 servers, total price at most 20 000 pounds
- the limiting factor turned out to be the read speed of the disks on which the data are stored at the start of ingest; they therefore needed 130 parallel disks (50 thousand pounds)
- storage on tape is slow as well; they therefore needed 8 parallel tape writes (30 thousand pounds)
- storage costs 100 pounds per TB
- 7.3 PB per year
- conclusion: writing and reading are slow, tools such as JHOVE and PRONOM are fast enough; storage costs also turned out to be high

For the ingest of data from the FamilySearch project they needed to guarantee a throughput of 20TB of data per day while keeping adequate procedures for processing the data according to the requirements of OAIS and of the customer. The aim of the project was to identify the bottlenecks in ingesting large volumes of data.

Processes such as generating or checking hashes, identifying formats and extracting technical metadata usually demand a lot and, at large volumes, a fast storage system. In the FamilySearch project they want to ingest into SDB (content acquisition, content preparation, ingest: fixity check, content metadata integrity check, characterization, i.e. format identification and validation and extraction of technical metadata) at most 700MB per second.

They looked at how to parallelize such a massive workflow efficiently while minimizing costs. According to their findings, parallelization makes it possible to work around the performance problems of tools such as DROID and JHOVE; contrary to their expectations, overall software performance was not a problem. The bigger problems lie in the hardware, which has to be able to write fast enough.

In other words, the bottleneck was in the hardware and in moving data from place to place rather than in the performance of the digital preservation tools.
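The figures quoted in these notes can be cross-checked with a few lines of arithmetic. The sketch below uses only numbers from the notes (20 TB per day, roughly 1 GB per package, 100 pounds per TB) and assumes decimal units (1 TB = 1000 GB = 1 000 000 MB) and a 365-day year; the variable names are illustrative.

```python
TB_PER_DAY = 20          # required ingest throughput from the notes
PACKAGE_GB = 1           # approximate package size from the notes
COST_PER_TB_GBP = 100    # storage cost from the notes
SECONDS_PER_DAY = 24 * 60 * 60

# Number of 1 GB packages needed to reach 20 TB in a day.
packages_per_day = TB_PER_DAY * 1000 // PACKAGE_GB

# Average sustained rate implied by 20 TB per day, in MB/s.
avg_mb_per_s = TB_PER_DAY * 1_000_000 / SECONDS_PER_DAY

# Yearly volume in PB and the yearly storage bill in pounds.
pb_per_year = TB_PER_DAY * 365 / 1000
storage_cost_per_year = TB_PER_DAY * 365 * COST_PER_TB_GBP

print(packages_per_day)        # 20000
print(round(avg_mb_per_s, 1))  # 231.5
print(pb_per_year)             # 7.3
print(storage_cost_per_year)   # 730000
```

The average of about 231.5 MB/s sits well below the 700 MB per second mentioned as the maximum, and 20 TB per day over a year matches the 7.3 PB figure in the notes.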

Benefit for the NK


No need to fear the performance of software such as DROID or JHOVE.

Ross King: Evolving domains, problems and solutions for LT DP
- information on projects such as SCAPE
- programme: http://cordis.europa.eu/fp7/ict/telearn-digicult/report-research-digital-preservation_en.pdf (Stephan Strodl, Petar Petrov and Andreas Rauber, all Vienna University of Technology, Austria). A nice timeline for preservation projects; a whitepaper about the past of European DP

- funding spent on DP research keeps growing; projects and funding alone will not solve anything
- ARCOMEM: archiving of web archives, socially driven web preservation model
- social web analysis
- archive enrichment
- ENSURE: evaluation between cost and value, automation of the preservation cycle; testbeds: healthcare, clinical trials, financial services
- SCAPE
- preservation planning and action workflows, and how to make them scalable
- creating an infrastructure for scalable preservation actions
- developing a policy-based preservation planning tool with automatic preservation watch
- 3 testbeds: WA, large-scale repositories, research data sets
- all the projects will produce prototype software
- digital lifecycle approach
- preservation planning plays a role in all these projects, together with virtualization
- a slide with trends in DP over recent years
- Research on Digital Preservation within projects co-funded by the European Union in the ICT
- Ensure
- Scape
- Wf4Ever: http://www.wf4ever-project.org/about
- Timbus: software alone is not enough, the focus is on the context of organizations; LTP is not only about objects but also about services, etc.
- Totem

Benefit for NK

Follow projects in the field of long-term preservation of digital data. The latest EU projects such as SCAPE will speed up the development of concrete tools for long-term preservation of digital data.

Record keeping in temporary command settings, Erik Borglund
Preservation of documentation of crisis situations arising from the work of the police and similar bodies. How to capture the context: flipcharts, videos and written records can be kept. The context of analogue documents is not a problem; the problem lies with digital items and conversations. The national archive should take care of this, but it accepts only paper documents or, for example, photos from the scene of the proceedings. Open question: is archiving records material the same as archiving the course of the proceedings in digital form?


webarchiving session

BnF
- 200TB of web-archived data, 15 million ARC files
- they have to characterize and validate them, which is time-consuming
- they store them in a shared repository
- SPAR (the LTP system of the French national library) has a capacity of 16PB
- they use jhove2 for characterization and are building a module for ARC files
- they do not want to characterize and validate the content of the ARC files, only identify formats

PREMIS inside METS would be too long, so they will record only metadata at the level of the information package (AIP) that are the same for the whole package. More precisely, a property is expressed once and is then followed by the list of files it applies to, instead of repeating the information for every file. They created a special metadata format for this. This lets them ask the LTP system questions such as "give me all information packages that contain format XY", without having to index the metadata of the contents, which would take a long time. They take the same approach for digitized books. Different DP policies and validation levels apply to different types of web archive data: complete harvests vs. thematic harvests.
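The package-level approach described above can be sketched as follows. This is only an illustration of the idea (state each property once per package and attach the list of files it applies to); it is not the BnF metadata format, and the package identifiers, file names and MIME types are hypothetical.

```python
from collections import defaultdict

# Hypothetical per-file records: (package id, file name, format).
records = [
    ("aip-001", "a.arc.gz#0001", "text/html"),
    ("aip-001", "a.arc.gz#0002", "image/gif"),
    ("aip-001", "a.arc.gz#0003", "text/html"),
    ("aip-002", "b.arc.gz#0001", "application/pdf"),
]


def summarize(rows):
    """Group files by format inside each package: one entry per format,
    followed by the files it applies to."""
    packages = defaultdict(lambda: defaultdict(list))
    for package_id, file_name, fmt in rows:
        packages[package_id][fmt].append(file_name)
    return {pid: dict(formats) for pid, formats in packages.items()}


def packages_with_format(summary, fmt):
    """Answer 'give me all packages that contain format XY' from the
    package-level summary alone, without touching per-file metadata."""
    return sorted(pid for pid, formats in summary.items() if fmt in formats)


summary = summarize(records)
print(packages_with_format(summary, "text/html"))
```

The query only inspects one small dictionary per package, which is the point of keeping the property at package level.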

NL NZ
- 2 harvests, 20 TB in total
- they are dealing with metadata: how much metadata is a lot and how much is too little
- library policy says that as much metadata as possible must be stored, but that would be a problem in terms of metadata size
- for selective web harvesting they have a finished WCT workflow; everything is catalogued

IA
- 16 billion URLs
- the oldest from 1996
- growth is 3TB per day, 1PB per year

Euan Cochrane, Dirk von Suchodoletz: Replicating Installed Application and Information Environments onto Emulated or Virtualized Hardware

- capturing and preserving a complete environment on emulated HW
- e.g. take the desktop environment of the prime minister and store it in an archive
- problems with rendering
- computer forensics
- an option for preserving scientific data and records
- the whole thing is about how to replicate an HDD and run the environment stored on it in a virtual environment
- the solution:
- they pulled the HDDs out of several old PCs > identify the HW requirements (analysis of the HDD > automatic estimate of the requirements, which is part of every PC environment) > choose the emulation/virtualization SW (a tool registry such as TOTEM from the KEEP project) > convert the HDD into a disk image suitable for emulation > try to boot the disk image on the emulated HW > add drivers
- problems with licences, personal data protection and authenticity (20 of the things change, colours etc.)
- QEMU, sparc processor emulator


Klaus Rescher: Remote Emulation for Migration Services in a Distributed Preservation Framework
Use of emulation as a tool for migration.

Migration tools are often not available for certain formats. We put the digital object into an emulated environment (a virtual machine), where we then see it inside the emulated system. We can open it in the original or another suitable application, save it in a different format and store it back in the virtual machine.

Bram Lohman: Emulation as a Business Solution: the Emulation Framework, KEEP project

- emulation framework: 7 emulators, 6 platforms (x86, Amiga and others), 23 file formats
- a solution for managing emulation tools and setting up emulation processes
- an environment that contains emulators; if we load an application or a file into it, it should run as in the original environment
- the environment also contains a tool that, based on PRONOM, shows what format a file is and what environment is needed to run it; the environment can then be prepared directly and the file run in it
- it loads the SW image from a database of applications that OPF is building

Geoffrey Brown: Developing Virtual CD-ROM Collections: The Voyager Company Publications

- publications of one particular publisher on CD, interactive applications for Mac; of 200 titles published only about 50 are available today
- emulation on today's systems
- HDD snapshot directly in the emulator, i.e. it takes a single click and is very fast
- SheepShaver emulator

Evaluation of a large Danish migration project
- Before 1998 they had no formats laid down by law.
- Between 2005 and 8 they introduced standards.
- The evaluation concerns the standards laid down and the migration to them in the national archive.
- The evaluation was done for the body that financed the work.
- Between 2005 and 8 they spent 30 person years on migration with 10-15 people; 190 thousand USD was invested, total costs 26 million USD.

In real terms they did not migrate much data, about 1777GB.

- Various parts of the archive: tapes, population data, data on CD-R, registries and electronically populated data.
- They could not read all the files, especially on tapes.
- 5 different types of tape.
- Some had to be rescued at great expense.


Total cost of producing the preservation standard: 10 man years, 12 thousand USD, including the manual and implementation recommendations.

Pilot: project planning and management, and verifying the information packages.

The aim of the pilot was to obtain a better budget and project plan.

Some problematic data in old formats, such as old databases, need smart people who do repetitive, long-running work; they needed good knowledge management to make this efficient.

Migration method: they wrote requirements for the tool and a description of how manual migration should be done.

Data preparation (restructure the data and register the metadata of the IP) and preparation of documentation for the migrated IPs.

Software development: in-house development.

They needed 50 person years for 1

Conclusions

Migrating standard data is cheaper, migrating from some standard tapes is cheaper, etc.

Most of the costs, 80, fell on non-standardized data, in producing the migration software. Developing a tool for migrating heterogeneous or non-standard data is the most expensive part.

What they learned: the old data had not been analysed sufficiently.

Project management was loose and they lost money.

Knowledge management: a good description of the old data and of all their types, generations, locations etc. does not exist at our institution and we will have difficulties with it; migration of old data at NK will be problematic.

Angela Dappert: robust migration workflow for offline media
- What is an archival object (nice slide): a CD is not an archive object, for them it is a hand-held carrier. A bit-stable object is better: it can have a backup etc. and leads to an archival object that has additional metadata (logical preservation).
- A CD is not searchable, cannot be replicated easily, has a large manual overhead, and the rendering technology becomes obsolete very quickly.
- Endangered Archives project: optical disks, CD-R, external HD, tapes, 67 terabytes in total.
- OFFLINE hand-held carriers in the Endangered Archives project were highly variable; they contained data with DRM, under copyright, and a number of similar problems.
- Options they chose between for each data source:
- Disk image: a single file that contains everything on the carrier.
- Or extraction of only some of the files.
- How important is the carrier itself? We need some information about it; it may hold traces of deleted data that we might want to keep. They made disk images of everything possible: hybrid DVDs, audio discs that also held data, etc.
- Which disk image should they use? Not a single disk image format for all data; a special disk image format for each kind.
- They did it with robots (disk copying robots); sometimes large-scale disk copying robots could not be used, since they are good at producing CDs but not at ripping data from CDs.


- They built their own application with disk stacks and some smaller robots, using LIFO or FIFO; in the end they used FIFO, because LIFO had problems with the CD lifter.

At the KB they thought through a fairly complex workflow, how to describe it, etc.

There was a PC at each robot.

They had problems with a number of things, see the presentation.

They did not find good software for image management, only command-line tools, and non-technical staff would make a lot of mistakes with those.

It takes a lot of people before the content gets online.

Staff must be well trained and flexible, but also able to do tedious jobs: systematic, patient.

NOTE: important for NK, where transferring data from discs will also have to be dealt with, and already has been.


KEEP project, Antonio Cuiffreda: towards integrated migration environment

- Disk transfer tool: converts a disk into an image file. It also records additional metadata about the file system, the md5 of the file, etc.
- KEEP is building a Transfer Tool Framework.
- Magnetic media: disk transfer tools for floppy disks, commercial and open source.
- Disk2FDI: commercial DOS tool; a very accurate floppy disk image, but it takes 1 hour and the result is very large, ten times bigger than the floppy disk itself. Tested on about 2260 disks; they tested emulation.
- Catweasel: commercial tool, a PCI card; runs on Linux and Win XP, has a GUI. High error rate but faster; image file quality was low.
- Nibtools: free tool, G64 and D64, covers only C64; DOS, Win, Linux, but command line only. Needs a Commodore disk drive and special cables. They tested a few disks; about half then did not work in the emulator.
- Optical media: they used 5 transfer tools, all on the same CDs, DVDs and games.
1 Alcohol 120: commercial, can bypass DRM etc., supports Win systems. 12 out of 13 worked, 2MB per second.
2 Daemon Tools: commercial, several types of image files (ISOP, MDS, MDF), supports Win; three out of 13 did not work.
3 CloneCD: commercial, uses IMG or ISO, bypasses safedisk3 protection, supports Win, has a GUI; 11 out of 13 were OK.
4 Blindwrite: commercial, supports DVD, Blu-ray, WM, Xbox and other special discs. Generates ISO and some proprietary ISO image formats; one did not work; download speed
5 ImgBurn: does not read subchannel information into the image file (you cannot seek within a film etc.); it is open source, generates dvd, bin, cue, img; Win and Linux; 4 did not work; it is fast.
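The idea of a transfer tool that stores an image file together with file-system information and an md5 checksum can be sketched like this. It is not the KEEP tool: the function names, the JSON sidecar layout and the operator-supplied `file_system` label are assumptions made for the example.

```python
import hashlib
import json
from pathlib import Path


def describe_image(image: Path, file_system: str) -> dict:
    """Return a small metadata record for a disk image file."""
    digest = hashlib.md5()
    with image.open("rb") as handle:
        for chunk in iter(lambda: handle.read(1 << 20), b""):
            digest.update(chunk)
    return {
        "image_file": image.name,
        "size_bytes": image.stat().st_size,
        "file_system": file_system,  # supplied by the operator, not detected
        "md5": digest.hexdigest(),
    }


def write_sidecar(image: Path, file_system: str) -> Path:
    """Write the record next to the image as <image name>.json."""
    sidecar = image.with_name(image.name + ".json")
    sidecar.write_text(json.dumps(describe_image(image, file_system), indent=2))
    return sidecar
```

Keeping the checksum beside the image makes it possible to re-verify the image after every later copy or move.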

Conclusions

For magnetic media, performance does not differ between commercial and non-commercial tools; Disk2FDI is accurate but very slow. KEEP will use NibTools.

Optical: copy protection has to be taken into account; the tools perform similarly; there will always be errors in the images, between 30 and 10 percent. Blindwrite can handle game discs, Xbox etc. KEEP will use ImgBurn because it is open source.

For complex images Blindwrite is better.
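The failure counts noted for the five optical tools can be turned into rates with a few lines of arithmetic. The sketch assumes that every tool was run against the same 13 test discs, which the notes state only for some of the tools, and that "one did not work" for Blindwrite means one disc out of those 13.

```python
# Failure counts as recorded in the notes.
# Assumption: each tool was run against the same 13 test discs.
DISCS = 13
failed = {
    "Alcohol 120": 1,
    "Daemon Tools": 3,
    "CloneCD": 2,
    "Blindwrite": 1,
    "ImgBurn": 4,
}


def failure_rate(failures: int, total: int = DISCS) -> float:
    """Share of discs that could not be imaged, in percent."""
    return 100 * failures / total


for tool, count in sorted(failed.items(), key=lambda item: item[1]):
    print(f"{tool:<13} {DISCS - count:>2}/{DISCS} ok, {failure_rate(count):.0f}% failed")
```

Under that assumption the rates run from about 8% to about 31%, which is in line with the range of roughly 10 to 30 percent given in the conclusions.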

Benefit for NK

Consider whether NK should really run a project to migrate the content of CDs and DVDs to online media. The presentations give concrete experience with robotic processing and show what problems they had in devising the workflow, choosing the type of ISO image, etc.

BnF web archiving
They have three layers:

Harvest definition: collection

Harvest instance: crawling metadata


ARC files

A collection is, for example, the selective "elections 2000" harvest, followed by the individual harvest instances and then the ARC files. They collect logs, config and reports. These are also stored in an ARC: a special metadata ARC for each crawl instance.

PREMIS

Object, agent, event

Objects: 1 ARC files and metadata ARCs, 2 harvest instances

Harvest event in PREMIS: event "creation of content files"

Events: reports as an extension of the event, host report and harvest report

Agents: administrator, SW, institutions and organizations that perform the harvest
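The object, agent and event roles listed above can be pictured with a few plain data classes. This is only a reading aid under stated assumptions: the class and field names are invented for the example and do not follow the PREMIS schema or the BnF implementation, and all identifiers and file names are hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class PremisObject:
    identifier: str
    category: str  # "arc file", "metadata arc" or "harvest instance"


@dataclass
class PremisAgent:
    name: str
    role: str  # "administrator", "software" or "organization"


@dataclass
class PremisEvent:
    event_type: str
    outcome_objects: list[PremisObject]
    agents: list[PremisAgent]
    # Reports attached to the event as an extension (host and harvest report).
    reports: dict[str, str] = field(default_factory=dict)


harvest = PremisEvent(
    event_type="creation of content files",
    outcome_objects=[
        PremisObject("harvest-0001", "harvest instance"),
        PremisObject("harvest-0001-00001.arc", "arc file"),
        PremisObject("harvest-0001-metadata.arc", "metadata arc"),
    ],
    agents=[
        PremisAgent("crawl operator", "administrator"),
        PremisAgent("crawler", "software"),
        PremisAgent("harvesting institution", "organization"),
    ],
    reports={"host report": "hosts-report.txt", "harvest report": "crawl-report.txt"},
)


def objects_of(event: PremisEvent, category: str) -> list[str]:
    """Identifiers of the objects of one category produced by an event."""
    return [o.identifier for o in event.outcome_objects if o.category == category]


print(objects_of(harvest, "arc file"))
```

One harvest event ties together the files it created, the agents that performed it and the reports it produced, which is the structure the notes describe.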

ContainerMD

httpbibnumbnffrcontainerMD v1documentationcontainerMD v1html

special metadata for material from the web archive

httpbibnumbnffrcontainerMD v1

Different SLAs for different types of material and for different data from different harvests; with a shared repository they expect various benefits; skills for different formats do not have to be duplicated within the institution.

Next year there should also be a jhove2 module for WARC.


Memento: a meta search engine

Benefit for NK

Their model of web archiving could be used at NK.

Cost models: Danish national library + TU Wien
- Stephan Strodl, TU Vienna: they have their own cost model, but it seems to cover only the cost of small-scale automated preservation actions.
- The Danish national library built its own model, which should be universal and usable anywhere.
- They measured the cost of submission according to the PAIMAS standard.
- When calculating costs they use OAIS and PAIMAS: they map activities onto these models and then estimate process costs accordingly.
- Costmodelfordigitalpreservationdk

Benefit for NK

Relevant to project 0136, which dealt with options for estimating the costs of long-term storage.

Meet RODA, a Full-Fledged Digital Repository for Long-Term Preservation

- Originally a project of the Portuguese national archive, which we have been following for several years. RODA is now supported, and partly further developed, by an independent company. So far RODA supports only the archival metadata format (EAD), but further development should include library formats as well.
- RODA is now part of the SCAPE project, where the system can be developed further and scaled for use in mass production.
- httpredminekeepptprojectsroda public

Benefit for NK

Follow further development; possibly also for the INCAD + KNAV projects. For developing LTP in smaller institutions this could be an interesting alternative in the future.

Report on a business trip abroad

Name and surname of the participant: Mgr Andrea Fojt (AF)

Workplace according to the organizational structure: 15 Department of Long-Term Preservation of Digital Data (ODODD)
Workplace, assignment: 152 Department of Digital Repository Content Management
Reason for the trip: participation in the conference Aligning National Approaches to Digital Preservation
Place, city: Tallinn
Place, country: Estonia
Date (from-to): 22-26 May 2011
Detailed schedule:
Monday 23 May: registration at the conference, guided tour of the National Library, Keynote Address by Laura Campbell (Library of Congress, USA), Panel 1 Technical Alignment, Panel 2 Organizational Alignment
Tuesday 24 May: Keynote Address by Gunnar Sahlin (National Library of Sweden), Panel 3 Standards Alignment, Panel 4 Legal Alignment, Breakout Sessions for panels 3 & 4
Wednesday 25 May: Panel 5 Education Alignment, Panel 6 Economic Alignment, Breakout Sessions for panels 5 & 6, Synthesis, Closing remarks

Fellow travellers from NK: PhDr Bohdana Stoklasová (BS), Ing Tomáš Svoboda (TS)

Funding: IOP NDK
Aims of the trip: attending a conference with international participation; making contacts in the field of long-term preservation of digital documents and electronic legal deposit; a more detailed insight into the issues of long-term preservation of digital documents, (especially) in national libraries

Fulfilment of the aims (specifically): The conclusions of the conference Aligning National Approaches to Digital Preservation largely repeat the conclusions of the workshop The Future of the Past: Shaping new visions for EU research in digital preservation (report available at httpcordiseuropaeufp7icttelearn digicultfuture of thepast_enpdf), for example regarding the missing economic model for the commercial sector, which would see long-term preservation as an inseparable part of all its processes. Contact was established with staff of the national libraries of Estonia and Finland (staff working on web archiving and on long-term preservation in general).

Programme and further details: The main aim of the conference was to align national approaches to long-term preservation of digital documents across all areas, from technical, organizational and educational to standardization, economic and financial.

Materials brought back: conference programme, leaflets of the exhibiting companies (Tessella, Equella, Guardtime) + further materials, notes

Date of submission of the report: 8 June 2011
Signature of the submitter

Signature of the superior

Posted on the intranet
Received by the international department

Annex to this report: Conference notes in English

Annex: Conference notes in English

Exploring What We Can Do Together Strategic Alliance for International Collaboration Laura Campbell

- 185 digital preservation partners in more than 25 countries (education, research, LAM)
- strategic goals: National Content Stewardship Network (national digital collection, technical architecture, public policy, outreach)
- NDIIPP Content Domain Map: a mind map of geospatial, audiovisual, image & text and web content
- then & now: cognitive surplus vs digital libraries / digital preservation
- solution framework: actively working together, special interest groups, establishing a common index, international digital collection (freely available)

PANEL 1: Technical Alignment (The role of testing), Prof Dr Michael Seadle (Panel Head)

to collaborate on requiring and implementing rigorous and independent tests

DNB Contribution to the Tallinn Alignment Sabine Schrimpf

- key theme is infrastructure
- network of hard and software that permits operation of application of SW
- question of interoperability is crucial (standards, technical specifications)
- SW elements
- components of the DP infrastructure were compared to the pallets at railroads
  o Source: PARSEInsight Roadmap 2020
- PersistentID resolvers, certification process
- kopal (ingest, KoLIBri)
- nestor (German Network of Expertise)
- DP4Lib: Digital Preservation as a Service, reduce dependency between components
  o redundant storage at different locations
  o KOLiBRI Modules
- LUKII: set up as an economical LOCKSS network in Germany
- SHAMAN
- APARSEN: wants to bring coherence and cohesion to the digital preservation research
  o trends in DP research projects
  o modular DP systems
  o distributed as SOA
  o elimination of technology dependencies

EDINA THE UK LOCKSS Alliance Adam Russbridge

- EDINA offers underlying technical support & coordination
- threats to digital stewardship
  o failure (media, HW, SW, network, format obsolescence, natural disaster, economic/organization failure)
  o attack (insider/outsider)
  o operator error
- source: Requirement to Digital Preservation
- projects PEPRS and PECAN help identify coverage and requirements for DP

Public testing Michael Seadle

- traditional physical archiving relies heavily on trusted institutions
- distrust, not trust, needs to be the basis of digital archiving; testing therefore plays a key role
- goals of testing: demonstrate functionality, reveal weaknesses, provide data for planning improvements
- key issues for testing: integrity, authenticity (can the origin or genuineness be shown), usability (can migration/emulation be demonstrated), access, financial integrity
- DrWho (drwho1com)
- bit stream testing is the most important; authenticity and usability may be impaired
  o the type of storage media, the number of copies + frequencies of checking and replacement get us to the relevant results
  o no reliable metrics exist, however (what is an acceptable loss etc)
- without well documented, peer reviewed, publicly available test results librarians are buying archiving systems on faith

Presentation without a title Andy Rauber

- evaluation vs testing vs benchmarking
- DP testing is evaluation rather than testing, and far from benchmarking (a few tests, but not near a definition of benchmarks)
  o existing evaluations are not repeatable
  o focus on the simple things
  o building the frameworks before having clear test scenarios
- necessary to move towards comparative benchmarking
- what is needed: commit that we want a culture of benchmarking and comparative evaluation; understanding of what we want to benchmark; benchmark data + ground truth; measurement scales and measures that remain constant; knowledge bases to collect these

Organizing digital preservation on an international level Michelle Gallinger (NDIIPP)

- focus on a national DP agenda
- community driven, action oriented (National Digital Stewardship Alliance)
  o present a distributed national digital collection for the benefit of citizens

The European Research Arena David Giaretta

- technologies: GEANT, EGEE/EGI
- EU research projects: TIMBUS, BLOG4EVER, SCAPE, ENSURE, APARSEN, ARCOMEM, WF4EVER
  o SCIDIP ES (2011-2014)
- Alliance for Permanent Access (APA) formed as a legal entity 3 years ago
  o opportunities for networking
- ISO 16363 Audit and Certification of Trustworthy Digital Repositories
- ISO 16919 Requirements for Bodies Providing Audit and Certification

Observation from the MetaArchive Cooperative program Martin Halbert

- distributed DP programs differ from other programs
  o replication of content, distribution of these replicated copies to distinct geographical locations, and network organization to connect these replicated copies
- MetaArchive established in 2003, funded by NDIIPP
  o seeks to foster broader awareness of digital preservation issues
- IIPC members are all institutions that focus on WA
  o 39 members: national and university libraries + other organizations (Internet Archive)
  o ISO standard WARC format for web archives + Heritrix and NutchWAX
  o growing membership (Africa & South America)

DAY 2

Keynote address
International and National Collaboration in the digital age, Gunnar Sahlin

- 2012: a new law for e-deposit
- Samsök (search together) in 2005 (upgrade, new system)
- Swepub and long term preservation
- Consortium of the Swedish research libraries for licensing e-journals and databases (ICOLC)
- Open Access and e-publishing (all universities have their repositories for e-pubs)
- NL aggregator for the Europeana: TEL, Apres, Athena, EU screen
  o common system for the preservation of digital materials
  o common search portal for materials from the Swedish National Library and Swedish National Archive

Raivo Ruusalepp

- standards: RAC, DSA, CIDOC (CRM), PAIMAS, ISA (DG), DDI
- use of information security standards for digital preservation
- information security: administrative and technical (physical = data security vs IT = communication)
- company implemented security measurement with typical cyber crime scenarios
- survey of security
  o provision for information security in national legislation and development plans (12 of the respondents; ISO 27000 series: only 2 formal audits, the rest are looking into it; ½ of the respondents do not use standards or formal measurements to control information security)
  o IT & disaster plan 65 (data recovery from the off-site location tested 0)
- alignment
  o better use of community standards for information security and preservation
  o agreement on security requirements

Standards based approach to preservation planning, Matthew Woollard

- ISO 27001: very expensive implementation, 100 000 pounds
- Basic: Data Seal of Approval Guidelines (helps understand your business better)
- Audit and Certification of Trustworthy Digital Repositories
- Memorandum of Understanding to Create a European Framework
- ISO 16363 external, DSA or ISO 16363 self audit

Best Practices & Standards, Bram van der Werf

- self assessment: trust
- audits/certification: trust
- ISO 30300 (draft) Record Management

PANEL 4 Legal Alignment

Legal deposit & Web Archiving, Adrienne Muir
- legal deposit provisions: purpose, scope, deposit mechanisms, roles & responsibilities & liability, access provisions, sanctions
- implementation: definitions, scope, offline/online, freely available/pay wall, technology neutral/incremental
- legal deposit vs voluntary (interim/hybrid approach, model agreements and licenses, flexibility)
- other legal issues: intellectual property rights, preservation, access, unlawful material, privacy/data protection
- voluntary approaches have disadvantages but may be necessary and can be useful

Breakout Session

- standards bring along alignment by themselves, but only if you use them to the full, not halfway
- depends on community (users): enforceable or voluntary compliance (standards)
- next step for standard alignment
  o corpora as benchmarks
  o export/import completeness
  o educational standards
  o validation tools
  o accredited training courses to accredit auditors
  o framework standards

DAY 3
PANEL 5: Educational Alignment

- key elements of the DCC Curation Lifecycle Model
- Framework for the Education Alignment (USA: grad programs for digital preservation, new models for grad programs, internship programs (diverse knowledge), workshops)
  o sharing tools
  o national programs
- related issues to digital preservation
  o nature of costs and business models
  o strategies for selection & appraisal
  o ground roles and responsibilities
  o effectiveness and demand for services
- focus of the panel: factors influencing the actual sustainability of a digital archive
- 2 considerations: collaboration + user demand
- challenges + gaps: span national boundaries, public + private funding, education, exportation, DP certification, competition, funding gaps, policy, selection criteria, roles & responsibilities, standards
- Magazzini Digitali: e-legal deposit in Italy
- PADI: a failed project (discontinued in 2010)

Presentation without a title Neil Grindley

- how much does it cost to manage information? what institutional financial strategies are required to facilitate effective preservation? what general economic frameworks are required to enable information to persist and be accessible?
- JISC 2010 Infrastructure for Education and Research Programme
- archival storage and preservation activities constitute a very small proportion of the overall costs: 1531 access 55 outreach acquisition ingest
  o approx 333 Euros for a set of 1000 records
- KRDS2 p 83: future tool development supporting automation of ingest

Sustainable Preservation in North America ADPNet amp Friends Aaron Trehub

- solution: distributed digital preservation (in at least 3 copies vs LOCKSS 6 copies)
- DPP + LOCKSS = PLN, open SW developed at Stanford
- MetaArchive, COPPUL (Canada), ADPNet (wwwadpnorg)


The Audit and Certification of FDsys

- FDsys -auditem

TRAC httpwwwgpogovfdsys

- - TRAC hathi UNT Portico metaarchive chronopolis v

- stupnice compliancy 1- -

How Long is Long-Term Data Storage (Focal) Barry M Lunt Brigham Young University and Douglas Hansen Wayne Rust and Mark Worthington Millenniata Inc (USA)

- - FLASH technologie 10- plny elektron

odlivu elektronu a


a presto se objevily chyby

- Library 1 21 - Library 2 18

- 1050 let - HDD 1-7 let -

Quality Assurance of Digital Information in Long-Term Digital Preservation University of Technology (Sweden)

- significant properties - dokumentech z -

Towards Interoperable Preservation Repositories Repository Exchange Package Use Cases and Best Practices Joseph Pawletko New York University and Priscilla Caplan Florida Center for Library Automation (USA)

- TIPR

- - - TIPR


- RXP mets a premis semantika soubory metadat ne v metsu ale vedle v

- - succession - - - - disaster recovery a v

jako aip - migrace SW - - diversifikace - migrace dat v -

SARKK Comprehensive Digital Archive Services for Finnish Municipalities

-Savon Tietohallinto Oy (Finland)

- firma kterou v ICT long-

- - - preserv planning funkcionalitou -


Magnetic Tape Technology economic advantages for preservation Gary Francis Oracle USA

- Oracle presentation
- Clipper Group 2010, "In search for the long-term archiving solution": tape delivers significant TCO advantage over disk
- 5TB on 1 cartridge (Sun/Oracle)

-----------------------------------------

Color In Digital Preservation Robert Buckley University of RochesterNewMarket Imaging Steven Puglia National Archives and Records Administration and Michael Stelmach Library of Congress (USA)

-

atd -

o barev GAMUTem

AdobeRGB Pro FotoRGB

- a reprodukci barvy

Multispectral Image Archiving of Watermarks in Historical Papers Peter Meinlschmidt Wilhelm-Klauditz-Institut Fraunhofer-Institute for Wood Research and Volker

Braunschweig (Germany)

- Reflected light
- Transmitted light

-spectral imaging -


Implementing a Quality Assurance Program for Monitoring Scanner Performance Michael J Horsley and John T Berezich National Archives and Records Administration (USA) DAITSS

- -

mikrofilm sken a r - NARA 2004 Guidelines

httpsdocsgooglecomviewerurl=http3A2F2Fwwwarchivesgov2Fpreservation2Ftechnical2Fguidelinespdf

- Metamorfoze
- etc., see slides
- Quantitative performance slides
- Web based database, sharepoint zace

Preservation in a Digital Age Jay Verkler FamilySearch (USA)

- Tessellou pro SDB budou k 1 - - - data loss is intrinsic gt - - - och

- preservation as a service

Curation of the End-of-Term Web Archive Classification and Metrics Kathleen Murray Lauren Ko and Mark Phillips University of North Texas (USA)

- - eotcd archiv httpresearchlibraryuntedueotcdwikiMain_Page - - pro - 16TB dat - - (ne Warcy)

pak to pospojo

DAITSS Grows Up Migrating to a Second Generation Preservation System (Focal) Priscilla Caplan and Carol Chou Florida Center for Library Automation (USA)

- Daitss 1 se nedal nainstalovat jinde si -) -

instructions

- v procesech

- - PSP

- refresh function
- disseminate: do a refresh and export the new AIP as a DIP
- withdraw: remove the AIP from storage, retaining provenance

-

1

- - 80TB -

A Community Driven Micro Services Architecture Supporting Long Term Digital Preservation. Mark Evans and Bill Steel, Tessella Inc (USA), and Robert Sharpe, James Carr, Alan Gairey and Jonathan Tilbury, Tessella plc (UK)

s NDIIPP v micro-services

spojeno do procesu- lze pak libovol

- - - -

-

workflow - -

- bit-streamu - cloudu (S3

httpawsamazoncoms3) SaaS funkcionalitu bude brzy ti tenancy

funkcionalitou policy apod

Pozn z USA

v 44TB SIP v testu (10MB JP2 soubory)

komunita SDB - - vznik 2008 - -

System

Lisa LaPlant and Blake Edwards US Government Printing Office (USA) FDsys OAIS dig MODS - z

<part>2</part>

DMD data model definition z

http://www.gpo.gov/fdsys/search/pagedetails.action?st=prague&granuleId=&packageId=DCPD-200900228

----------------------------------------- Preservation Starts from the Beginning Michael Wash US Department of Transportation (USA)

Autographic Kodak 1916 na druhou

"following the No. 3A Autographic Kodak Special of 1916, which was the first rangefinder camera. It had a Kodak Anastigmat f/6.3 lens and a Kodamatic shutter with speeds from 1/2 to 1/200 sec plus bulb and time mode." http://camerapedia.wikia.com/wiki/No_1A_Autographic_Kodak_Special Kodak Advantix

zastavil

Colorite: A Flexible Cross-Platform Software Solution for Automatic Image Quality Analysis Using Arbitrary Targets. Henrik Johansson, National Library of Sweden (Sweden)

- a - -

o Target se detekuje automa se aby tento proce

o TIFF JPEG JP2 PNG o o -of-art feature based image matching algoritm ImageMagic

o Alg

t o o v budoucnu vypnout aby se proces urychlil pro BATCH

proce o

Henrika Johanssona What if the Image Quality Analysis Rates My D Dietmar Wueller Image Engineering (Germany)

- Golden thread - UTT targer -

o

-lens reflex

Establishing Resolution Requirements for Digitizing Transmissive Content: A Use Case Approach. Michael Stelmach, Library of Congress; Don Williams, Image Science Associates LLC; and Steven Puglia, National Archives and Records Administration (USA)

-

- Based on the 10% SFR limiting-resolution criterion: how much of the image information will be captured

o First half of the 20th century: 1200-1600 PPI
o Second half of the 20th century: up to 2800 PPI

Digitise More Pay Less Optimising the Workprocess for both Heritage Institute and Imaging Provider Olaf Slijkhuis Pictura Imaginis (the Netherlands)

- ument kvalita parametry atd - -

- trolu

obrazu

N debata s familysearch 2052011 -------

i na problematice Digital Preservation

bude muset poskytnout zdarma

a synchronizovat

snaha spolupracovat archivy a knihovnami jejich strany

CO-FINANCED BY THE EU STRUCTURAL FUNDS (EUROPEAN REGIONAL DEVELOPMENT FUND) THROUGH THE IOP

Business trip report. Project: Creation of the National Digital Library (Vytvoření Národní digitální knihovny)

CZ.1.06/1.1.00/07.06386

Participant's name: Jan Hutař

Workplace (per organizational structure): ODF 81

Position: head of department

Reason for the trip: attending the iPRES 2011 conference

Place (city): Singapore

Place (country): Singapore

Dates (from-to): 30 October - 5 November 2011

Detailed schedule: 30-31 October: flight Prague > Dubai > Singapore

1 November: start of the conference, tutorials

2-4 November: conference

4-5 November: return flight Singapore > Dubai > Prague

Travelling companions from NK: Mgr. Marek Melichar (funded from project 0136)

Funding: IOP, Creation of the National Digital Library

Relation to the project: gaining new information on digital preservation and on projects in other libraries; consultations with colleagues and companies

Trip objectives: see relation to the project; use all outputs for planning and running the NDK project and for future work on digital preservation in NK/NDK

Fulfilment of objectives: met; see the detailed notes below and the proceedings on SPS


Further details: SUMMARY AND BENEFIT FOR THE NDK PROJECT

A noticeable rise of emulation as an approach to long-term preservation (in previous years it was migration); the two approaches will apparently complement each other.

A shift towards preserving complex data, databases and the like; NK does not address this yet.

Many papers are applicable to NK and NDK (web archiving and preservation at the national library of France, audits, emulation, information on the SDB system (Tessella) and on the RODA system, certification (see Rouchon), etc.).

Information on problems and solutions when using tools such as JHOVE, PRONOM and others in a real environment.

2 papers on backing up optical discs, a current problem at NK as well; ideally follow the procedures described in the proceedings.

A clear need for international cooperation and for adherence to standards so that such cooperation is possible.

For details see below.

Project publicity support: N/A

Related materials

Material / Storage location

Conference proceedings / SPS, folder with business trip reports

Report submitted on: 15 November 2011

Signature of the person submitting the report

Date / Signature

Signature of the supervisor: 15 November 2011

Posted on the intranet


Received by the international department

Seamus Ross: digital curation and preservation
- preserving data sets: use of statistical databases and research data vs. preservation of textual information
- personal data: no longer just photos in a box (Flickr etc.)
- banks: large amounts of personal data, future use by historians, must be preserved; institutions do this too
- see mckinsey.com, big data full report (PDF)
- why do DP at all (slide 16): future generations expect it; for historians and scientists, so that they have some sources; a reference about the present for the future; an information ecosystem to enable storytelling
- emphasis is shifting from preserving textual information to preserving complex databases

A capability model for DP, Ch. Becker et al.
- research within the SHAMAN and SCAPE projects
- SoS: systems of systems
- 3 kinds of systems (UML)
- DPS as a functional requirement
- SoS: business system, a system within a system; data then flow into the DPS
- DPS where DP is not a functional requirement but the system does it anyway (DP-ready system): a business system with DP functionality
- but how to get DP into enterprise systems: a model for implementing DP in any system, within the SHAMAN project; capability-based reference architecture
- governance, business and technical (operation) capabilities: a basis for decisions and for assessing the current state
- capability maturity model (CMM): process assessment and improvement, as known from software development

Olivier Rouchon: certification and quality at CINES
- they store theses, digitized material, multimedia, documents and scientific data sets
- a data centre for the whole of France
- they have format specialists (XML); 11 people
- 15TB of data
- certification: national law; CINES is the national centre for DP of theses; they have a department, people and money for it; procedure and preparations see below
- preparing for certification: testing DRAMBORA, DSA, TRAC, ISO 16363 and ISO 16919
- step (2009): DRAMBORA audit; 2 risk reviews a year of how mitigation is progressing
- step: formalization of business processes, 14 processes according to ISO 9001
- management, operational and support processes (presented at iPRES 2010)
- 2009: external pre-audit, 2 people, 19 man-days, based on all available standards, carried out through documentation review and interviews
- 2010: SIAF audit, 4 months; carried out by the National Archives of France for every archive that stores public data; they audit every 3 years; the report had 800 pages


- 2010: Data Seal of Approval, part of the EU framework for audit and certification of trusted repositories (MoU between the three certification activities)
- 2011: within the APARSEN project they also did ISO 16363; they went through that audit together with DANS and UKDA
- first internal, January and April 2011 (60 man-days), then external, 12 experts (KB, BL, NASA etc.) in June 2011 (3 days)

MoU: DSA > ISO 16363 as the second step (internal) > ISO 16363 extended. An audit gets faster the more of them you do, i.e. if it is done regularly it is not that time-consuming.

The National Library of New Zealand went through TRAC certification in autumn 2011.

Andreas Rauber: impact of preservation actions on repositories
- what happens to the repository itself
- repository simulation: RepoSim
- for analysis and for testing migration: what happens when files grow, what if the repository holds more types of formats, etc.
- RepoSim: a simulator, flexible, irregular patterns
- so far an internal version: Hibernate, Java, MySQL
- you can specify which formats the repository accepts and their description, ingest settings, hypothetical tools (mainly for migration), and rules for preservation actions (migration to which format, which version, which files, how many times; rules + filters)
- a virtual migration can be run; it produces graphs that say how long it will take, etc.
- what to migrate, how, to what and over what period: it runs virtually and we see the result
- good for IT and HW planning
- good for planning various scenarios, comparing with expected development, planning HW growth and investments
- still to be done: entering deletion policies, reports, etc.
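The idea of a "virtual migration" can be illustrated with a toy sketch. This is not RepoSim's actual model: the classes `FileRecord` and `MigrationRule`, the function `simulate_migration` and the throughput figures are all assumptions made up for illustration. It only shows how rules applied to a described collection yield an estimated duration without touching real data.

```python
from dataclasses import dataclass


@dataclass
class FileRecord:
    fmt: str        # format label, e.g. "TIFF"
    size_mb: float  # file size in megabytes


@dataclass
class MigrationRule:
    source_fmt: str
    target_fmt: str
    throughput_mb_s: float  # assumed speed of a hypothetical migration tool


def simulate_migration(files, rules):
    """Return (resulting file list, estimated seconds); no real data is read."""
    by_source = {rule.source_fmt: rule for rule in rules}
    migrated, seconds = [], 0.0
    for record in files:
        rule = by_source.get(record.fmt)
        if rule is None:
            migrated.append(record)  # no rule matches: the file stays as it is
            continue
        seconds += record.size_mb / rule.throughput_mb_s
        migrated.append(FileRecord(rule.target_fmt, record.size_mb))
    return migrated, seconds


# Hypothetical collection: three 100 MB TIFFs and one 5 MB PDF.
files = [FileRecord("TIFF", 100.0)] * 3 + [FileRecord("PDF", 5.0)]
rules = [MigrationRule("TIFF", "JP2", throughput_mb_s=20.0)]
result, seconds = simulate_migration(files, rules)
print(seconds)  # 15.0
```

Changing the file sizes or adding formats and rerunning gives a rough feel for the "what if files grow" questions mentioned in the talk.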

José Barateiro: Risk assessment in DP of e-science data and processes
- DP as risk management
- ISO 31000: definition of risk management
- similar to DRAMBORA
- there are many standards for risk management
- the ISO 31000 methodology broken down into individual steps
- TIMBUS project, http://timbusproject.net; one of the partners is SAP (Germany)

mad talks
- open source SW for LTP: RODA is back, being developed within the SCAPE project; new features, plans for further development and for building a user community
- 4 posters on emulation: emulation within KEEP, emulation for library reading rooms, OPF ecosystem, registry
- TOTEM: a metadata standard for describing the technical environment for emulation


ANDS, Ross Wilkinson
- data centres in Australia: at least 3, for different areas of life
- ANDS has existed for almost 3 years; funded by the Australian government
- huge amounts of data will never be used or read by a human, only mined by automated processes
- research data must be stored and preserved because it may no longer be possible to recreate them; they must be kept so that they can be reused, so that new conclusions can be drawn from them, and so that scientists have them available
- this has to be done in cooperation; a single institution cannot do it alone
- who deals with the storage of scientific data in the Czech Republic? The Academy of Sciences? CESNET?
- similar data centres also exist in Great Britain

Rob Sharpe: Considerations for High Throughput Digital Preservation
Presentation by Tessella: their performance testing of ingest into SDB at FamilySearch

- SDB has been developed since 2002, when the first customer was the National Archives (UK)
- new customer: the UK Parliament
- test with FamilySearch
- 20 TB ingested per day; scanned material; workflow with antivirus, characterization (PRONOM, JHOVE) etc.
- 1 package is roughly 1 GB; 20,000 packages a day
- 2 Dell PowerEdge R710 servers, together at most 20,000 pounds
- the limiting factor turned out to be the read speed of the disks on which the data sit at the start of ingest; they therefore needed 130 parallel disks (50,000 pounds)
- storing to tape is also slow; they therefore needed 8 parallel tape writes (30,000 pounds)
- storage costs 100 pounds per TB
- 7.3 PB per year
- conclusion: writing and reading are slow; tools such as JHOVE and PRONOM are fast enough; storage costs also turned out to be high

For ingesting data from the FamilySearch project they needed to ensure a throughput of 20 TB of data per day while keeping adequate procedures for processing the data according to the requirements of OAIS and of the customer. The project aimed to identify the bottlenecks in ingesting large volumes of data.
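The figures quoted above can be cross-checked with simple arithmetic. The sketch below assumes decimal units (1 TB = 1,000,000 MB) and ingest running around the clock; the function names are made up for illustration.

```python
SECONDS_PER_DAY = 24 * 60 * 60


def sustained_rate_mb_s(tb_per_day: float) -> float:
    """Average rate in MB/s needed to move tb_per_day (decimal units)."""
    return tb_per_day * 1_000_000 / SECONDS_PER_DAY


def packages_per_day(tb_per_day: float, package_gb: float) -> float:
    """Number of packages per day for a given package size in GB."""
    return tb_per_day * 1000 / package_gb


rate = sustained_rate_mb_s(20)
yearly_pb = 20 * 365 / 1000

print(round(rate, 1))           # 231.5
print(packages_per_day(20, 1))  # 20000.0
print(yearly_pb)                # 7.3
```

So 20 TB per day corresponds to an average of roughly 230 MB/s, to 20,000 packages of 1 GB, and to 7.3 PB per year, which matches the numbers in the notes.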

Processes such as generating or verifying hashes, identifying formats and extracting technical metadata usually require, at large volumes, a large and fast storage system. In the FamilySearch project they want to ingest into SDB (content acquisition, content preparation, ingest: fixity check, content/metadata integrity check, characterization, i.e. format identification and validation and extraction of technical metadata) at most 700 MB per second.

The question was how to parallelize such a massive workflow efficiently while minimizing costs. According to their findings, parallelization makes it possible to get around performance problems of tools such as DROID and JHOVE; contrary to their expectations, overall software performance was not a problem. The bigger problems are in the HW, which must be able to write fast enough.

In other words, the bottleneck was in the HW and in moving data from place to place rather than in the performance of the digital preservation tools.
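A minimal sketch of what parallelizing one of these steps, the fixity check, might look like. It is not Tessella's implementation; `sha256_of`, `check_fixity`, the choice of SHA-256 and the worker count are assumptions for illustration. A thread pool is used because the work is dominated by reading from storage, which is exactly where the notes locate the bottleneck.

```python
import hashlib
from concurrent.futures import ThreadPoolExecutor
from pathlib import Path


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Stream a file in chunks so large files do not have to fit in memory."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def check_fixity(expected: dict, workers: int = 8) -> dict:
    """Map each path to True if its current SHA-256 equals the expected value."""
    paths = list(expected)
    with ThreadPoolExecutor(max_workers=workers) as pool:
        actual = list(pool.map(sha256_of, paths))
    return {path: digest == expected[path] for path, digest in zip(paths, actual)}
```

Calling `check_fixity({Path("a.tif"): "<expected hex digest>"})` returns a dictionary of pass/fail flags per file; raising `workers` only helps as long as the underlying disks can deliver data in parallel.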

Benefit for NK

There is no need to fear the performance of software such as DROID or JHOVE.

Ross King: Evolving domains, problems and solutions for LT DP
- information on projects such as SCAPE
- programme: http://cordis.europa.eu/fp7/ict/telearn-digicult/report-research-digital-preservation_en.pdf (Stephan Strodl, Petar Petrov and Andreas Rauber, Vienna University of Technology, Austria); a nice timeline of preservation projects, a whitepaper about the past of European DP
- funding spent on DP research is gradually growing; projects and funding alone will not solve anything
- ARCOMEM: archiving of web archives, socially driven web preservation model
- social web analysis
- archive enrichment
- ENSURE: evaluation between cost and value, automation of the preservation cycle; testbeds: healthcare, clinical trials, financial services
- SCAPE
- preservation planning and action workflows: how to make them scalable
- building an infrastructure for scalable preservation actions
- development of a policy-based preservation planning tool with automatic preservation watch
- 3 testbeds: web archives, large-scale repositories, research data sets
- all the projects will produce prototype SW
- digital lifecycle approach
- preservation planning plays a role in all these projects, together with virtualization
- slide with DP trends over recent years
- Research on Digital Preservation within projects co-funded by the European Union in the ICT
- ENSURE
- SCAPE
- Wf4Ever, http://www.wf4ever-project.org/about
- TIMBUS: SW alone is not enough; focuses on the context of organizations; LTP is not only about objects but also about services etc.
- TOTEM

Benefit for NK

Follow projects in the area of long-term preservation of digital data. Recent EU projects such as SCAPE will speed up the development of concrete tools for long-term preservation of digital data.

Record keeping in temporary command settings, Erik Borglund
- preserving documentation of crisis situations arising from the work of the police etc.
- how to capture context: flipcharts, videos and written notes can be kept; the context of analogue documents is not a problem, the problem lies with digital items and conversations
- the national archive should take care of this, but it only accepts paper documents or, for example, photos from the place of the meeting
- open question: is archiving file material the same as archiving the course of a meeting in digital form?


webarchiving session

BnF
- 200TB of web-archived data
- 15 million ARCs; they have to be characterized and validated, which is time-consuming
- stored in a shared repository
- SPAR (the LTP system of the French national library) has a capacity of 16PB
- they use JHOVE2 for characterization and are building a module for ARCs
- they do not want to characterize and validate the content of the ARCs, only identify formats

PREMIS in METS would be too long, so they will record only metadata at the level of the information package (AIP) that are the same for the whole package. That is, a property is stated once and then only information about which files it applies to is added, instead of repeating the information for every file. They created a special metadata format for this, so they can ask the LTP system "give me all information packages that contain format XY" and the like; there is no need to index the metadata of the contents, which would take a long time. They take the same approach for digitized books. There are different DP policies and validation levels for different types of web archive data: complete harvests vs. thematic harvests.
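The "state a property once, then list the files it applies to" idea can be sketched as follows. This is not BnF's metadata format; the record identifiers, the use of MIME types as format labels and the function names are assumptions for illustration only.

```python
from collections import defaultdict


def group_by_property(file_formats: dict) -> dict:
    """Turn {file: format} into {format: [files]} so each value is stated once."""
    grouped = defaultdict(list)
    for name, fmt in sorted(file_formats.items()):
        grouped[fmt].append(name)
    return dict(grouped)


def packages_with_format(packages: dict, fmt: str) -> list:
    """Answer 'give me all packages that contain format XY' from package-level data."""
    return sorted(pid for pid, formats in packages.items() if fmt in formats)


# Hypothetical packages with hypothetical content records.
pkg_a = group_by_property(
    {"rec-1": "text/html", "rec-2": "text/html", "rec-3": "image/jpeg"}
)
pkg_b = group_by_property({"rec-9": "application/pdf"})
packages = {"pkg-a": pkg_a, "pkg-b": pkg_b}

print(pkg_a)  # {'text/html': ['rec-1', 'rec-2'], 'image/jpeg': ['rec-3']}
print(packages_with_format(packages, "text/html"))  # ['pkg-a']
```

The query only inspects the package-level summary, which is the point made in the notes: no per-file index of the contents is needed.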

NL NZ
- 2 harvests, 20 TB together
- they are dealing with metadata: how much metadata is a lot and how much is too little
- library policy says that as much metadata as possible must be stored, but this would be a problem in terms of metadata size
- for selective web harvesting they have a finished workflow (WCT); everything is catalogued

IA
- 16 billion URLs
- the oldest from 1996
- growth is 3TB per day, 1PB per year

Euan Cochrane, Dirk von Suchodoletz: Replicating Installed Application and Information Environments onto Emulated or Virtualized Hardware

- capturing and preserving a whole environment on emulated HW
- e.g. take the desktop environment of the prime minister and store it in an archive
- display problems
- computer forensics
- an option for preserving scientific data and records
- it is all about how to replicate an HDD and run the environment on it in a virtual environment
- solution: they pulled the HDDs out of several old PCs > identify the HW requirements (HDD analysis > requirements estimated automatically; this is part of every PC environment) > choose emulation/virtualization SW (tool registry such as TOTEM from the KEEP project) > convert the HDD to a disk image suitable for emulation > try to boot the disk image on the emulated HW > add drivers
- problems with licences, personal data protection and authenticity (20% of things change, colours etc.)
- QEMU, SPARC processor emulator


Klaus Rechert: Remote Emulation for Migration Services in a Distributed Preservation Framework
Use of emulation as a tool for migration.

Tools for migrating certain formats are often not available. We insert the digital object into an emulated environment (virtual machine) and then see it inside the emulated system; we can open it in the original or another suitable application, save it in a different format and store it back in the virtual machine.

Bram Lohman: Emulation as a Business Solution: the Emulation Framework (KEEP project)

- emulation framework: 7 emulators, 6 platforms (x86, Amiga etc.), 23 file formats
- a solution for managing emulation tools and setting up emulation processes
- an environment that contains emulators; if we load an application or a file into it, it should run as in the original environment
- the environment also contains a tool that shows, based on PRONOM, what format a file is and what environment is needed to run it; the environment can be prepared right away and the file run in it
- it loads SW images from the OPF application database, which is being built

Geoffrey Brown: Developing Virtual CD-ROM Collections: The Voyager Company Publications

- publications of one specific publisher on CD, interactive applications for Mac; of the 200 published, only about 50 are available today
- emulation on today's systems
- HDD snapshot directly in the emulator, i.e. one click and very fast
- SheepShaver emulator

Evaluation of a Danish large migration project
- Before 1998 they had no formats set by law.
- Between 2005 and 2008 they introduced standards.
- The evaluation concerns the established standards and the migration to them in the national archive.
- The evaluation was done for the funder.
- Between 2005 and 2008 they spent 30 person-years on migration; they had 10-15 people on it; 190 thousand USD was invested; total costs 26 million USD.

It is not much data in reality: what they migrated was about 1777GB.

Various parts of the archive: tapes, population data, data on CD-R, registries and data filled in electronically. They could not read all the files, especially on tapes; there were 5 different types of tape. Some had to be rescued at great expense.


Total cost of producing the preservation standard: 10 man-years, 12 thousand USD, including the manual and implementation recommendations.

Pilot: project planning and management, and verification of the information packages.

The aim of the pilot was to obtain a better budget and project plan.

Some problematic data in old formats, such as old databases, need smart people doing repetitive work for a long time; they needed good knowledge management to make it efficient.

Migration method: they wrote requirements for the tool and a description of how manual migration should be done.

Data preparation (restructuring the data and registering the metadata of the IPs) and preparation of documentation for the migrated IPs.

Software development: in-house development.

They needed 50 person-years per 1

Conclusions

Migrating standard data is cheaper, migrating from some standard tapes is cheaper, etc.

The majority of the costs, 80%, fell on non-standardized data, in producing the migration software. Developing a tool for migrating heterogeneous or non-standard data is the most expensive part.

What they learned: the old data had not been analysed sufficiently.

Project management was loose; they lost money.

Knowledge management: a good description of the old data and of all their types, generations, locations etc. does not exist at our institution and we will have difficulties with it; migrating old data at NK will be problematic.

Angela Dappert: robust migration workflow for offline media
- What is an archival object? A nice slide: a CD is not an archival object; for them it is a hand-held carrier. Better is a bit-stable object, which can have a backup etc., and on top of it an archival object that has further metadata (logical preservation).
- A CD is not searchable, cannot be replicated easily and has a large manual overhead; rendering technology becomes obsolete very quickly.
- Endangered Archives project: optical discs, CD-R, external HDs, tapes; 67 terabytes in total.
- The offline hand-held carriers in the Endangered Archives project were highly variable; they held data with DRM, under copyright, and with a range of such problems.
- Options they decided between for each data source:
- Disk image: one file that contains everything on the carrier.
- Or extraction of only some of the files.
- How important is the carrier itself? We need to have some information about it; it may carry traces of deleted data, and we may want to keep those. They made disk images of everything possible: hybrid DVDs, audio discs that also held data, etc.
- Which disk image format should they use? Not one disk image format for all data; a special disk image format for each kind.
- They did it with robots (disc copying robots); in some cases large-scale disc copying robots could not be used: they are good at producing CDs but not at ripping data from CDs.


- They built their own application with disc stacks and some smaller robots; they tried LIFO and FIFO and in the end used FIFO, because LIFO had problems with the CD lifter.

At the KB they thought through a fairly complex workflow, how to describe it, etc.

Each robot had its own PC.

They had problems with a number of things; see the presentation.

They did not find good SW for managing the images, only command-line tools, and non-technical staff would make a lot of mistakes with those.

It takes a lot of people before the content gets online.

Staff must be well trained and flexible, but also able to do tedious jobs: systematic, patient.

NOTE: important for NK, where transferring data from discs will also have to be dealt with, and already has been.


KEEP project, Antonio Ciuffreda: towards integrated migration environment

- Disk transfer tool: converts a disk into an image file. The result also contains further metadata about the file system, the MD5 of the file, etc.
- KEEP is producing a Transfer Tool Framework.
- Magnetic media: disk transfer tools for floppy disks, commercial and open source.
- Disk2FDI: commercial DOS tool, very accurate floppy disk image; it takes 1 hour and the result is very large, ten times larger than the floppy disk itself. Tested on about 2260 disks; they tested emulation.
- Catweasel: commercial tool, a PCI card; runs on Linux and Win XP, has a GUI. High error rate but faster; the quality of the image file was low.
- Nibtools: free tool, G64 and D64, covers only the C64; DOS, Win, Linux, but command line only. Needs a Commodore disk drive and special cables. They tested a few disks; about half of them then did not work in the emulator.
- Optical media: they used 5 transfer tools, all on the same CDs, DVDs and games.

1. Alcohol 120%: commercial, can bypass DRM etc., supports Win systems. Of 13, 12 worked; 2 MB per second.
2. Daemon Tools: commercial, several image file types (ISO, MDS, MDF), supports Win; three of 13 did not work.
3. CloneCD: commercial, uses IMG or ISO, bypasses SafeDisc 3 protection, supports Win, has a GUI; 11 of 13 were OK.
4. BlindWrite: commercial, supports DVD, Blu-ray, WM, Xbox and other special discs. Generates ISO and some proprietary ISO image formats; one did not work; transfer speed.
5. ImgBurn: does not read subchannel information into the image file (a film cannot be seeked etc.); it is open source; generates DVD, BIN, CUE, IMG; Win and Linux; 4 did not work; it is fast.
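The disk transfer tool at the start of this list is described as producing an image file plus metadata such as the file system and an MD5 of the file. A minimal sketch of recording such metadata next to an already created image follows; it is not the KEEP tool, and the function names, the JSON sidecar layout and the operator-supplied `file_system` value are assumptions for illustration.

```python
import hashlib
import json
from pathlib import Path


def describe_image(image_path: Path, file_system: str) -> dict:
    """Build a small metadata record for an existing disk image file."""
    data = image_path.read_bytes()  # acceptable for floppy-sized images
    return {
        "image": image_path.name,
        "size_bytes": len(data),
        "md5": hashlib.md5(data).hexdigest(),
        "file_system": file_system,  # supplied by the operator, not detected
    }


def write_sidecar(image_path: Path, file_system: str) -> Path:
    """Write the record as JSON next to the image and return the sidecar path."""
    sidecar = image_path.with_name(image_path.name + ".json")
    record = describe_image(image_path, file_system)
    sidecar.write_text(json.dumps(record, indent=2))
    return sidecar
```

For CD or DVD images the file should be hashed in chunks rather than read in one go; the single read is kept here only to keep the sketch short.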

Conclusions

For magnetic media, commercial and non-commercial tools do not differ in performance; Disk2FDI is accurate but very slow. KEEP will use NibTools.

Optical: copy protection has to be kept in mind; the tools perform similarly; there will always be errors in the images, between 30 and 10 percent; BlindWrite can handle game discs (Xbox etc.). KEEP will use ImgBurn because it is open source.

For complex images BlindWrite is better.

Benefit for NK

Consider whether it would be appropriate for NK to actually run a project to migrate the content of CDs and DVDs to online media. The talks present concrete experience with robotic processing and show what problems they had with designing the workflow, choosing the type of ISO image, etc.

BnF: web archiving
They have three layers:

Harvest definition (collection)

Harvest instance (crawling metadata)

ARC files

A collection will be, for example, the selective "elections 2000" collection, then the individual harvest instances, and then the ARCs. They collect logs, config and reports. These are also stored in an ARC: a special metadata ARC for each crawling instance.

PREMIS

Object, agent, event

Objects: 1. ARC files and metadata ARCs; 2. harvest instances

Harvest event in PREMIS: event = creation of content files

Events: reports as extensions of the event, host report and harvest report

Agents: administrator, SW, institutions, organizations that perform the harvest

ContainerMD

http://bibnum.bnf.fr/containerMD-v1/documentation/containerMD-v1.html

Special metadata for material from the web archive

http://bibnum.bnf.fr/containerMD-v1

Different SLAs for different types of material and for different data from different harvests. From the shared repository they expect various benefits: skills for different formats need not be duplicated within the institution.

Next year there should also be a JHOVE2 module for WARC.


Memento: meta search

Benefit for NK

Their web archiving model could be used at NK.

Cost models: Danish national library + TU Wien
- Stephan Strodl, TU Vienna: they have their own cost model, but only small scale; it seems to cover automated preservation action cost
- Danish national library: they built their own model, which should be universal and usable anywhere
- they measured the cost of submission according to the PAIMAS standard
- when calculating costs they use OAIS and PAIMAS: they map activities onto these models and then estimate the cost of the processes
- Costmodelfordigitalpreservation.dk
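The approach of mapping activities onto OAIS functional entities and then estimating costs can be sketched as below. This is not the Danish cost model: the activity list, the person-hours and the hourly rate are invented purely for illustration; only the entity names (Ingest, Archival Storage, Preservation Planning) come from OAIS.

```python
from collections import defaultdict

# (activity, OAIS functional entity, person-hours): all values are made up
ACTIVITIES = [
    ("negotiate submission agreement", "Ingest", 10),
    ("validate and ingest SIPs", "Ingest", 30),
    ("store and replicate AIPs", "Archival Storage", 20),
    ("monitor formats", "Preservation Planning", 5),
]


def cost_per_entity(activities, hourly_rate):
    """Sum hours * rate for every OAIS functional entity."""
    totals = defaultdict(float)
    for _name, entity, hours in activities:
        totals[entity] += hours * hourly_rate
    return dict(totals)


costs = cost_per_entity(ACTIVITIES, hourly_rate=50.0)
print(costs)
# {'Ingest': 2000.0, 'Archival Storage': 1000.0, 'Preservation Planning': 250.0}
```

The value of such a breakdown is that estimates for individual activities can be revised independently while the totals per functional entity stay comparable between institutions.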

Benefit for NK

Relevant to project 0136, which looked at options for estimating the costs of long-term storage.

Meet RODA, a Full-Fledged Digital Repository for Long-Term Preservation

- Originally a project of the Portuguese national archive; we have been following it for several years. The RODA system is now supported, and partly also further developed, by an independent company. So far RODA supports only the archival metadata format (EAD), but further development should also include library formats.
- RODA is now part of the SCAPE project, where the system can be further developed and scaled for use in mass production.
- http://redmine.keep.pt/projects/roda-public

Benefit for NK

Follow further development; possibly relevant also for the INCAD + KNAV projects developing LTP. For smaller institutions this could be an interesting alternative in the future.

Report on a foreign business trip

Name and surname of participant: Mgr. Andrea Fojtů (AF)

Workplace per organizational structure: 1.5 Department of Long-Term Preservation of Digital Data (ODODD)
Position: 1.5.2 Department of Digital Repository Content Management
Reason for trip: participation in the conference Aligning National Approaches to Digital Preservation
Place (city): Tallinn
Place (country): Estonia
Date (from-to): 22-26 May 2011

Detailed schedule:

Monday 23 May: conference registration, guided tour of the National Library; Keynote Address by Laura Campbell, Library of Congress (USA); Panel 1: Technical Alignment; Panel 2: Organizational Alignment

Tuesday 24 May: Keynote Address by Gunnar Sahlin, National Library of Sweden; Panel 3: Standards Alignment; Panel 4: Legal Alignment; Breakout Sessions for panels 3 & 4

Wednesday 25 May: Panel 5: Education Alignment; Panel 6: Economic Alignment; Breakout Sessions for panels 5 & 6; Synthesis; Closing remarks

Fellow travellers from NK: PhDr. Bohdana Stoklasová (BS), Ing. Tomáš Svoboda (TS)

Funding: IOP NDK

Trip goals: attendance at a conference with international participation; gaining contacts in the field of long-term preservation of digital documents and electronic legal deposit; a more detailed insight into long-term preservation of digital documents, especially in national libraries.

Fulfilment of trip goals (specifically): The conclusions of the conference Aligning National Approaches to Digital Preservation largely mirror the conclusions of the workshop The Future of the Past: Shaping new visions for EU research in digital preservation (report available at http://cordis.europa.eu/fp7/ict/telearn-digicult/future-of-the-past_en.pdf), e.g. regarding the missing economic model for the commercial sector, which would treat long-term preservation as an integral part of all its processes. Contact was established with staff of the national libraries of Estonia and Finland (staff responsible for web archiving and for long-term preservation in general).

Programme and further details: The main goal of the conference was to align national approaches to long-term preservation of digital documents across all areas, from technical, organizational and educational through to standardization, economic and financial.

Materials brought back: conference programme, leaflets of exhibiting companies (Tessella, Equella, Guardtime), other materials, notes

Date of report submission: 8.6.2011
Signature of submitter:

Signature of superior:

Posted on the intranet:
Received by the international department:

Attachment to this report: conference notes in English

Attachment: Conference notes in English

Exploring What We Can Do Together: Strategic Alliance for International Collaboration, Laura Campbell

- 185 digital preservation partners in more than 25 countries (education, research, LAM)
- strategic goals: National Content Stewardship Network (national digital collection, technical architecture, public policy, outreach)
- NDIIPP Content Domain Map: a mind map of geospatial, audiovisual, image & text, and web content
- then & now: cognitive surplus vs. digital libraries
- digital preservation solution framework: actively working together, special interest groups, establishing a common index, international digital collection (freely available)

PANEL 1: Technical Alignment (The role of testing), Prof. Dr. Michael Seadle (Panel Head)

- to collaborate on requiring and implementing rigorous and independent tests

DNB Contribution to the Tallinn Alignment, Sabine Schrimpf

- key theme is infrastructure
- a network of hardware and software that permits the operation of SW applications
- the question of interoperability is crucial (standards, technical specifications, SW elements)
- components of the DP infrastructure were compared to pallets on railroads
  o source: PARSE.Insight Roadmap 2020
- persistent ID resolvers, certification process
- kopal (ingest, koLibRI)
- nestor (German Network of Expertise)
- DP4Lib: Digital Preservation as a Service; reduce dependency between components
  o redundant storage at different locations
  o koLibRI modules
- LuKII: set up as an economical LOCKSS network in Germany
- SHAMAN
- APARSEN: wants to bring coherence and cohesion to digital preservation research
  o trends in DP research projects
  o modular DP systems
  o distributed as SOA
  o elimination of technology dependencies

EDINA: The UK LOCKSS Alliance, Adam Rusbridge

- EDINA offers underlying technical support & coordination
- threats to digital stewardship
  o failure (media, HW, SW, network, format obsolescence, natural disaster, economic/organizational failure)
  o attack (insider/outsider)
  o operator error
- source: Requirement to Digital Preservation
- projects PEPRS and PECAN help identify coverage and requirements for DP

Public testing, Michael Seadle

- traditional physical archiving relies heavily on trusted institutions
- distrust, not trust, needs to be the basis of digital archiving; testing therefore plays a key role
- goals of testing: demonstrate functionality, reveal weaknesses, provide data for planning improvements
- key issues for testing: integrity, authenticity (can the origin or genuineness be shown?), usability (can migration/emulation be demonstrated?), access, financial integrity
- DrWho (drwho1com)
- bit stream testing is the most important; authenticity and usability may be impaired
  o the type of storage media, the number of copies, and the frequency of checking and replacement get us to the relevant results
  o no reliable metrics exist, however (what is an acceptable loss, etc.)
- without well documented, peer reviewed, publicly available test results, librarians are buying archiving systems on faith

Presentation without a title, Andy Rauber

- evaluation vs. testing vs. benchmarking
- DP testing: evaluation rather than testing, far from benchmarking (a few tests, but nowhere near a definition of benchmarks)
  o existing evaluations are not repeatable
  o focus on the simple things
  o building the frameworks before having clear test scenarios
- necessary to move towards comparative benchmarking
- what is needed: a commitment that we want a culture of benchmarking and comparative evaluation; an understanding of what we want to benchmark; benchmark data + ground truth; measurement scales and measures that remain constant; knowledge bases to collect these

Organizing digital preservation on an international level, Michelle Gallinger (NDIIPP)

- focus on a national DP agenda
- community driven, action oriented (National Digital Stewardship Alliance)
  o present a distributed national digital collection for the benefit of citizens

The European Research Arena, David Giaretta

- technologies: GEANT, EGEE/EGI
- EU research projects: TIMBUS, BLOG4EVER, SCAPE, ENSURE, APARSEN, ARCOMEM, WF4EVER
  o SCIDIP-ES (2011-2014)
- Alliance for Permanent Access (APA) formed as a legal entity 3 years ago
  o opportunities for networking
- ISO 16363: Audit and Certification of Trustworthy Digital Repositories
- ISO 16919: Requirements for Bodies Providing Audit and Certification

Observation from the MetaArchive Cooperative program, Martin Halbert

- distributed DP programs differ from other programs
  o replication of content, distribution of these replicated copies to distinct geographical locations, and network organization to connect these replicated copies
- MetaArchive established in 2003, funded by NDIIPP
  o seeks to foster broader awareness of digital preservation issues
- IIPC members are all institutions that focus on WA
  o 39 members: national and university libraries + other organizations (Internet Archive)
  o ISO standard WARC format for web archives + Heritrix and NutchWAX
  o growing membership (Africa & South America)

DAY 2

Keynote address: International and National Collaboration in the digital age, Gunnar Sahlin

- 2012: a new law for e-deposit
- Samsök (search together) in 2005 (upgrade, new system)
- Swepub and long-term preservation
- Consortium of the Swedish research libraries for licensing e-journals and databases (ICOLC)
- Open Access and e-publishing (all universities have their repositories for e-pubs)
- NL: aggregator for Europeana; TEL, Apres, Athena, EUscreen
  o common system for the preservation of digital materials
  o common search portal for materials from the Swedish National Library and Swedish National Archive

Raivo Ruusalepp

- standards: RAC, DSA, CIDOC (CRM), PAIMAS, ISAD(G), DDI
- use of information security standards for digital preservation
- information security: administrative and technical (physical = data security vs. IT = communication)
- company implemented security measurement with typical cyber crime scenarios
- survey of security
  o provision for information security in national legislation and development plans (1/2 of the respondents; ISO 27000 series: only 2 formal audits, the rest are looking into it; ½ of the respondents do not use standards or formal measurements to control information security)
  o IT & disaster plan: 65 (data recovery from the off-site location tested: 0)
- alignment
  o better use of community standards for information security and preservation
  o agreement on security requirements

Standards based approach to preservation planning, Matthew Woollard

- ISO 27001: very expensive implementation, 100 000 pounds
- basic: Data Seal of Approval Guidelines (helps understand your business better)
- Audit and Certification of Trustworthy Digital Repositories
- Memorandum of Understanding to Create a European Framework
- ISO 16363 external; DSA or ISO 16363 self-audit

Best Practices & Standards, Bram van der Werf

- self-assessment: trust
- audits/certification: trust
- ISO 30300 (draft): Records Management

PANEL 4: Legal Alignment

Legal deposit & Web Archiving, Adrienne Muir

- legal deposit provisions: purpose, scope, deposit mechanisms, roles & responsibilities & liability, access provisions, sanctions
- implementation: definitions, scope, offline/online, freely available/paywall, technology neutral, incremental
- legal deposit vs. voluntary (interim/hybrid approach, model agreements and licenses, flexibility)
- other legal issues: intellectual property rights, preservation, access, unlawful material, privacy/data protection
- voluntary approaches have disadvantages but may be necessary and can be useful

Breakout Session

- standards bring along alignment by themselves, but only if you use them to the full, not halfway
- depends on community (users): enforceable or voluntary compliance (standards)
- next steps for standards alignment
  o corpora as benchmarks
  o export/import completeness
  o educational standards
  o validation tools
  o accredited training courses to accredit auditors
  o framework standards

DAY 3

PANEL 5: Educational Alignment

- key elements of the DCC Curation Lifecycle Model
- Framework for the Education Alignment (USA: grad programs for digital preservation, new models for grad programs, internship programs (diverse knowledge), workshops)
  o sharing tools
  o national programs
- related issues to digital preservation
  o nature of costs and business models
  o strategies for selection & appraisal
  o ground roles and responsibilities
  o effectiveness and demand for services
- focus of the panel: factors influencing the actual sustainability of a digital archive
- 2 considerations: collaboration + user demand
- challenges + gaps: span national boundaries, public + private funding, education, exportation, DP certification, competition, funding gaps, policy, selection criteria, roles & responsibilities, standards
- Magazzini Digitali: e-legal deposit in Italy
- PADI: a failed project (discontinued in 2010)

Presentation without a title, Neil Grindley

- how much does it cost to manage information? what institutional financial strategies are required to facilitate effective preservation? what general economic frameworks are required to enable information to persist and be accessible?
- JISC 2010: Infrastructure for Education and Research Programme
- archival storage and preservation activities constitute a very small proportion of the overall costs: 1531; access 55; outreach; acquisition; ingest
  o approx. 333 Euros for a set of 1000 records
- KRDS2 p. 83: future tool development supporting automation of ingest

Sustainable Preservation in North America: ADPNet & Friends, Aaron Trehub

- solution: distributed digital preservation (in at least 3 copies, vs. LOCKSS 6 copies)
- DDP + LOCKSS = PLN; open SW developed at Stanford
- MetaArchive, COPPUL (Canada), ADPNet: www.adpn.org


- TIPR
- RXP: METS and PREMIS semantics; metadata files are kept not inside the METS but alongside it [...]
- succession [...] disaster recovery [...]
- as AIP; SW migration; diversification; data migration [...]

SARKK Comprehensive Digital Archive Services for Finnish Municipalities

-Savon Tietohallinto Oy (Finland)

- a company that [...] in ICT, long-[...]
- [...] with preservation planning functionality

Magnetic Tape Technology economic advantages for preservation Gary Francis Oracle USA

- Oracle presentation
- Clipper Group 2010: In Search of the Long-Term Archiving Solution, tape delivers significant TCO advantage over disk
- 5 TB per cartridge (Sun/Oracle)

-----------------------------------------

Color in Digital Preservation, Robert Buckley, University of Rochester/NewMarket Imaging; Steven Puglia, National Archives and Records Administration; and Michael Stelmach, Library of Congress (USA)

- [...] etc.
  o [...] of colours by the gamut
  o AdobeRGB, ProPhoto RGB
- [...] and colour reproduction

Multispectral Image Archiving of Watermarks in Historical Papers Peter Meinlschmidt Wilhelm-Klauditz-Institut Fraunhofer-Institute for Wood Research and Volker

Braunschweig (Germany)

- reflected light
- transmitted light
- multispectral imaging

Implementing a Quality Assurance Program for Monitoring Scanner Performance Michael J Horsley and John T Berezich National Archives and Records Administration (USA) DAITSS

- -

- microfilm, scan [...]
- NARA 2004 Guidelines: https://docs.google.com/viewer?url=http%3A%2F%2Fwww.archives.gov%2Fpreservation%2Ftechnical%2Fguidelines.pdf
- Metamorfoze
- etc., see slides
- quantitative performance: see slides
- web-based database, SharePoint [...]

Preservation in a Digital Age Jay Verkler FamilySearch (USA)

- [...] with Tessella for SDB [...]
- data loss is intrinsic [...]
- preservation as a service

Curation of the End-of-Term Web Archive Classification and Metrics Kathleen Murray Lauren Ko and Mark Phillips University of North Texas (USA)

- EOTCD archive: http://research.library.unt.edu/eotcd/wiki/Main_Page
- 16 TB of data
- (not WARCs) [...] then joined together [...]

DAITSS Grows Up Migrating to a Second Generation Preservation System (Focal) Priscilla Caplan and Carol Chou Florida Center for Library Automation (USA)

- DAITSS 1 could not be installed anywhere else :-) [...] instructions
- [...] in processes
- PSP
- refresh function
- disseminate: do a refresh and export the new AIP as a DIP
- withdraw: remove the AIP from storage, retaining provenance
- 80 TB [...]

A Community Driven Micro Services Architecture Supporting Long Term Digital Preservation Mark Evans and Bill Steel Tessella Inc (USA) and Robert Sharpe James Carr Alan Gairey and Jonathan Tilbury Tessella plc (UK)

- [...] with NDIIPP [...] in micro-services [...] combined into a process; these can then be arbitrarily [...]
- workflow [...]
- bit-stream [...], cloud (S3, http://aws.amazon.com/s3); SaaS functionality; multi-tenancy coming soon [...]
- [...] functionality, policy, etc.

Note from the USA [...]: 44 TB of SIPs in the test (10 MB JP2 files)

- SDB community, founded in 2008 [...]

System

Lisa LaPlant and Blake Edwards, US Government Printing Office (USA)

- FDsys, OAIS, [...] MODS [...]
- <part>2</part>
- DMD: data model definition [...]
- http://www.gpo.gov/fdsys/search/pagedetails.action?st=prague&granuleId=&packageId=DCPD-200900228

-----------------------------------------

Preservation Starts from the Beginning, Michael Wash, US Department of Transportation (USA)

- Autographic Kodak 1916 [...]
- "[...] following the No. 3A Autographic Kodak Special of 1916, which was the first rangefinder camera. It had a Kodak Anastigmat f/6.3 lens and a Kodamatic shutter with speeds from 1/2 to 1/200 sec, plus bulb and time mode." (http://camerapedia.wikia.com/wiki/No_1A_Autographic_Kodak_Special)
- Kodak Advantix [...] stopped [...]

Colorite: A Flexible Cross-Platform Software Solution for Automatic Image Quality Analysis Using Arbitrary Targets, Henrik Johansson, National Library of Sweden (Sweden)

  o the target is detected automatically [...] so that this process [...]
  o TIFF, JPEG, JP2, PNG
  o state-of-the-art feature-based image matching algorithm; ImageMagick
  o [...] can be switched off in the future to speed up the process for batch processing

Henrika Johanssona What if the Image Quality Analysis Rates My D Dietmar Wueller Image Engineering (Germany)

- Golden Thread
- UTT target
- single-lens reflex

Establishing Resolution Requirements for Digitizing Transmissive Content: A Use Case Approach, Michael Stelmach, Library of Congress; Don Williams, Image Science Associates LLC; and Steven Puglia, National Archives and Records Administration (USA)

- based on the 10 SFR limiting resolution criterion: how much of the image information will be captured
  o first half of the 20th century: 1200-1600 PPI
  o second half of the 20th century: up to 2800 PPI

Digitise More Pay Less Optimising the Workprocess for both Heritage Institute and Imaging Provider Olaf Slijkhuis Pictura Imaginis (the Netherlands)

- [...] document, quality, parameters, etc.
- [...] image inspection

[...] discussion with FamilySearch, 20.5.2011

- [...] also on Digital Preservation issues
- [...] will have to provide free of charge
- [...] and synchronize
- an effort to cooperate with archives and libraries [...] on their part

CO-FINANCED BY THE EU STRUCTURAL FUNDS (EUROPEAN REGIONAL DEVELOPMENT FUND) THROUGH THE IOP

Business trip report
Project: Creation of the National Digital Library

CZ.1.06/1.1.00/07.06386

Name and surname of participant: Jan Hutař

Workplace per organizational structure: ODF 8.1

Position: head of division

Reason for trip: attending the iPRES 2011 conference

Place (city): Singapore

Place (country): Singapore

Date (from-to): 30.10.-5.11.2011

Detailed schedule:
30.-31.10.: flight Prague > Dubai > Singapore
1.11.: start of the conference, tutorials
2.11.-4.11.: conference
4.-5.11.: return flight Singapore > Dubai > Prague

Fellow travellers from NK: Mgr. Marek Melichar (paid from project 0136)

Funding: IOP, Creation of the National Digital Library

Relation to the project: obtaining new information on digital preservation and on projects in other libraries; consultations with colleagues and companies

Trip goals: see relation to the project; use all outputs for planning and running the NDK project, and for future handling of digital preservation in NK/NDK

Fulfilment of trip goals: fulfilled, see the detailed notes below and the proceedings on SPS


Further details: SUMMARY AND BENEFIT FOR THE NDK PROJECT

- a noticeable rise of emulation as an approach to long-term preservation (in previous years it was migration); the two approaches, it seems, will complement each other
- a shift towards preservation of complex data, databases etc.; NK does not address this yet
- many papers are applicable to NK and NDK (web archiving and preservation at the national library of France, audits, emulation, information on the SDB system (Tessella) and on the RODA system, certification, see Rouchon, etc.)
- information on problems and solutions when using tools such as JHOVE, PRONOM etc. in a real environment
- 2 papers on backing up optical discs, a current problem at NK as well; ideally follow the procedures described in the proceedings
- a clear need for international cooperation and for adherence to standards, so that such cooperation is possible
- in more detail see below

Publicity support for the project: N/A

Related materials

Material: conference proceedings; storage location: SPS, folder with business trip reports

Date of report submission: 15.11.2011

Signature of submitter:

Date, signature:

Signature of superior: 15.11.2011

Posted on the intranet:


Received by the international department:

Seamus Ross: digital curation and preservation

- preserving data sets: use of statistical databases and research data vs. preservation of textual information
- personal data are no longer just photos in a box (Flickr etc.)
- banks: a lot of personal data, usable by historians in the future; it must be preserved, and institutions do so as well
- see mckinsey.com, big data full report (pdf)
- so why do DP? (slide 16): future generations expect it; for historians and scientists, so that they have some sources; a reference about the present for the future; an information ecosystem to enable storytelling
- emphasis shifts from preservation of textual information to preservation of complex databases

A capability model for DP, Ch. Becker et al.

- research within the SHAMAN and SCAPE projects
- SoS: systems of systems
- 3 kinds of systems (UML):
- DPS as a functional requirement
- SoS: a business system, a system within a system; data then flow into the DPS
- DPS where DP is not a functional requirement but the system does it anyway (DP-ready system): a business system with DP functionality
- but how to get DP into enterprise systems? A model for implementing DP in any system, within the SHAMAN project: capability-based reference architecture
- governance, business and technical (operation) capability: a basis for decisions and for assessing the current state
- capability maturity model (CMM): processes for assessment and improvement, as in SW development

Olivier Rouchon: certification and quality at CINES

- they store theses, digitized items, multimedia, documents, scientific data sets
- a data centre for the whole of France
- they have format experts (XML); 11 people
- 15 TB of data
- certification: national law; CINES is the national centre for DP of theses; they have a department, people and money for it; for the procedure and preparations see below
- preparation for certification: testing DRAMBORA, DSA, TRAC, ISO 16363 and ISO 16919
- step: 2009 DRAMBORA audit; 2 risk reviews per year to check how the risks are being handled
- step: formalization of business processes, 14 processes according to ISO 9001
- management, operational and support processes (presented at iPRES 2010)
- 2009: external pre-audit, 2 people, 19 man-days, based on all available standards, by means of documentation review and interviews
- 2010: SIAF audit, 4 months; carried out by the National Archives of France for every archive that stores public data; they audit every 3 years; the report had 800 pages


- 2010: Data Seal of Approval, part of the EU framework for audit and certification of trusted repositories (MoU between the three certification activities)
- 2011: within the APARSEN project they also did ISO 16363; they went through that audit together with DANS and UKDA
- first internal, January and April 2011 (60 man-days), then external, 12 experts (KB, BL, NASA etc.) in June 2011 (3 days)

MoU: DSA > ISO 16363 as the second step (internal) > ISO 16363 extended.
The audit gets faster the more audits you do, i.e. if it is done regularly it is not that time-consuming.

The National Library of New Zealand passed TRAC certification in autumn 2011.

Andreas Rauber: impact of preservation actions on the repository

- what happens to the repository itself
- repository simulation: RepoSim
- for analysis and for testing migration: what happens when files grow larger, what if the repository holds more types of formats, etc.
- RepoSim: a flexible simulator, irregular patterns
- for now an internal version: Hibernate, Java, MySQL
- it is possible to specify which formats the repository accepts, their description, ingest, settings, hypothetical tools (mainly for migration), and rules for preservation actions (migration into which format, which versions, which files, how many times; rules + filters)
- option to run a virtual migration; graphs are produced that show how long it will take, etc.
- what to migrate, how, into what and over what period: it runs virtually and we see the result
- good for IT and HW planning
- good for planning different scenarios, comparison with the expected development, planning HW development and investments
- still to be added: the option to enter deletion policies, reports, etc.
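The idea of a virtual migration can be illustrated with a toy calculation. The sketch below is not RepoSim (which the notes describe only as an internal Java tool); it is a minimal illustration in Python, and the class names, the rule structure and the throughput figure are all hypothetical assumptions.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class FileRecord:
    name: str
    fmt: str
    size_mb: float


@dataclass(frozen=True)
class MigrationRule:
    source_fmt: str
    target_fmt: str
    throughput_mb_s: float  # assumed speed of a hypothetical migration tool


def simulate_migration(files, rules):
    """Estimate migration time without touching any data.

    Returns (total_seconds, number of migrated files per target format).
    Files whose format matches no rule are left alone.
    """
    rule_by_source = {rule.source_fmt: rule for rule in rules}
    total_seconds = 0.0
    migrated = {}
    for record in files:
        rule = rule_by_source.get(record.fmt)
        if rule is None:
            continue
        total_seconds += record.size_mb / rule.throughput_mb_s
        migrated[rule.target_fmt] = migrated.get(rule.target_fmt, 0) + 1
    return total_seconds, migrated


# Made-up holdings: two TIFF masters and one PDF.
holdings = [
    FileRecord("a.tif", "TIFF", 100.0),
    FileRecord("b.tif", "TIFF", 50.0),
    FileRecord("c.pdf", "PDF", 10.0),
]
rules = [MigrationRule("TIFF", "JP2", 5.0)]
seconds, counts = simulate_migration(holdings, rules)
print(seconds, counts)
```

Varying the holdings (larger files, more formats) or the rules and re-running gives the kind of what-if comparison the talk describes, here reduced to a single number instead of graphs.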

José Barateiro: Risk assessment in DP of e-science data and processes

- DP as risk management
- ISO 31000: definition of risk management
- similar to DRAMBORA
- there are many standards for risk management
- the ISO 31000 methodology broken down into individual steps
- TIMBUS project, http://timbusproject.net; one of the partners is SAP (Germany)

mad talks

- open source SW for LTP: RODA is back and is being developed within the SCAPE project; new functions, plans for further development and for building a user community
- 4 posters on emulation: emulation within KEEP, emulation for library reading rooms, OPF, eco system registry
- TOTEM: a metadata standard for describing the technical environment for emulation


ANDS, Ross Wilkinson

- data centres in Australia, at least 3, for different areas of life
- ANDS has existed for almost 3 years, funded by the Australian government
- a huge amount of the data will never be used or read by a human, only by automated mining processes
- research data must be stored and protected because it may no longer be possible to recreate them; so that they can be reused, so that new conclusions can be drawn from them, and so that scientists have them available
- this has to be done in cooperation; a single institution cannot do it on its own
- who deals with storage of scientific data in the Czech Republic? The Academy of Sciences? CESNET?
- similar data centres exist in Great Britain as well

Rob Sharpe: Considerations for High Throughput Digital Preservation

Presentation by Tessella on their testing of ingest performance into SDB at FamilySearch.

- SDB has been developed since 2002, when the first customer was the National Archives (UK)
- new customer: the UK Parliament
- test with FamilySearch
- 20 TB ingested per day; scanned materials; workflow with antivirus, characterization (PRONOM, JHOVE), etc.
- 1 package is roughly 1 GB; 20 thousand packages per day
- 2 Dell PowerEdge R710 servers, total price at most 20,000 pounds
- the limiting factor turned out to be the read speed of the disks on which the data are stored at the start of ingest; they therefore needed 130 parallel disks (50 thousand pounds)
- storing to tape is also slow; they therefore needed 8 parallel tape writes (30 thousand pounds)
- storage costs 100 pounds per TB
- 7.3 PB per year
- conclusion: writing and reading are slow, tools such as JHOVE and PRONOM are fast enough; high storage costs also became apparent
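The figures in the list above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming decimal units (1 TB = 10^12 bytes) and ingest on all 365 days of the year; the function names are illustrative and are not part of SDB or any Tessella tool.

```python
SECONDS_PER_DAY = 24 * 60 * 60


def sustained_mb_per_s(tb_per_day: float) -> float:
    """Average throughput needed to move tb_per_day within 24 hours."""
    return tb_per_day * 10**12 / SECONDS_PER_DAY / 10**6


def packages_per_day(tb_per_day: float, package_gb: float = 1.0) -> float:
    """Number of packages per day for a given package size in GB."""
    return tb_per_day * 1000 / package_gb


def yearly_volume_pb(tb_per_day: float, days: int = 365) -> float:
    """Yearly ingest volume in PB."""
    return tb_per_day * days / 1000


def yearly_storage_cost_gbp(tb_per_day: float, gbp_per_tb: float = 100.0,
                            days: int = 365) -> float:
    """Cost of storing one year of ingest at a flat price per TB."""
    return tb_per_day * days * gbp_per_tb


print(round(sustained_mb_per_s(20), 1))  # average MB/s for 20 TB per day
print(packages_per_day(20))              # packages of about 1 GB each
print(yearly_volume_pb(20))              # PB per year
print(yearly_storage_cost_gbp(20))       # pounds per year at 100 pounds per TB
```

Under these assumptions 20 TB per day corresponds to about 231 MB/s on average, 20,000 packages per day and 7.3 PB per year, which agrees with the numbers in the notes; at 100 pounds per TB that is 730,000 pounds of storage per year.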

To ingest data from the FamilySearch project they needed to sustain a throughput of 20 TB of data per day while keeping adequate procedures for processing the data in line with OAIS and the client's requirements. The point of the project was to identify the bottlenecks of ingesting large volumes of data.

Processes such as generating hashes or checking them, identifying formats and extracting technical metadata usually require a fast storage system at large volumes. In the FamilySearch project they want to ingest into SDB (content acquisition, content preparation, ingest/fixity check, content/metadata integrity check, characterization, i.e. identification and validation of formats and extraction of technical metadata) at most 700 MB per second.
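The hash generation and checking step mentioned above reads every byte of every file, which is why storage read speed dominates it. The sketch below is a generic illustration using Python's standard `hashlib`; the choice of SHA-256 and the helper names are assumptions, since the notes do not say which algorithm or tooling SDB uses.

```python
import hashlib
import io


def stream_digest(stream, algorithm="sha256", chunk_size=1024 * 1024):
    """Hash a binary stream chunk by chunk so large files never sit in memory."""
    digest = hashlib.new(algorithm)
    while True:
        chunk = stream.read(chunk_size)
        if not chunk:
            break
        digest.update(chunk)
    return digest.hexdigest()


def verify_fixity(stream, expected_hex, algorithm="sha256"):
    """Return True when the stream still matches the recorded checksum."""
    return stream_digest(stream, algorithm) == expected_hex.lower()


# Record a checksum at ingest, then verify a later copy against it.
recorded = stream_digest(io.BytesIO(b"abc"))
print(verify_fixity(io.BytesIO(b"abc"), recorded))
print(verify_fixity(io.BytesIO(b"abd"), recorded))
```

For files on disk the same functions can be called with `open(path, "rb")` as the stream.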

They looked at how to parallelize such a massive workflow efficiently while minimizing costs. According to their findings, parallelization makes it possible to work around performance problems of tools such as DROID and JHOVE; contrary to their expectations, overall software performance was not a problem. The bigger problems are in the HW, which has to be able to write fast enough.

That is, the bottleneck was in the HW and in moving data from place to place rather than in the performance of digital preservation tools.

Benefit for NK

No need to fear the performance of SW such as DROID or JHOVE.

Ross King: Evolving domains, problems and solutions for LT DP

- information on projects such as SCAPE
- programme: http://cordis.europa.eu/fp7/ict/telearn-digicult/report-research-digital-preservation_en.pdf; Stephan Strodl, Petar Petrov and Andreas Rauber (Vienna University of Technology, Austria): a nice timeline of preservation projects, a whitepaper about the past of European DP
- funding spent on DP research is gradually growing; projects and money will not solve anything by themselves
- ARCOMEM: archiving of web archives, socially driven web preservation model
- social web analysis
- archive enrichment
- ENSURE: evaluation between cost and value, automation of the preservation cycle; testbeds: healthcare, clinical trials, financial services
- SCAPE
- preservation planning and action workflows: how to make them scalable
- creating an infrastructure for scalable preservation actions
- development of a policy-based preservation planning tool with automatic preservation watch
- 3 testbeds: WA, large-scale repositories, research data sets
- all the projects will produce prototype SW
- digital lifecycle approach
- preservation planning plays a role in all these projects, together with virtualization
- slide with trends in DP over recent years
- Research on Digital Preservation within projects co-funded by the European Union in the ICT
- Ensure
- Scape
- Wf4Ever: http://www.wf4ever-project.org/about
- Timbus: SW is not enough; it focuses on the context of organizations; LTP is not only about objects but also about services etc.
- Totem

Benefit for NK

Follow projects in the field of long-term preservation of digital data. Recent EU projects such as SCAPE will speed up the development of concrete tools for long-term preservation of digital data.

Record keeping in temporary command settings, Erik Borglund

- preservation of documentation on crisis situations arising from the work of the police etc.
- how to capture context? Flipcharts, videos and written notes can be kept; the context of analogue documents is not a problem, the problem lies with digital items and conversations
- the national archive should take care of this, but it only accepts paper documents or, for example, photos from the scene
- the question is whether archiving the file material is the same as archiving the course of the proceedings in digital form


webarchiving session

BnF

- 200 TB of web-archived data
- 15 million ARCs; they have to be characterized and validated, which is time-consuming
- stored in a shared repository
- SPAR (the LTP system of the French national library) has a capacity of 16PB
- they use JHOVE2 for characterization and are building a module for ARCs
- they do not want to characterize and validate the contents of ARCs, only identify formats
- PREMIS in METS would be too long, so they will record only metadata at the level of the information package (AIP) that are the same for the whole package; i.e. a property is expressed once and then only the information about which files it applies to is added, instead of repeating the information for every file
- they created a special metadata format
- i.e. they are able to ask the LTP system "give me all information packages that contain format XY" etc.; there is no need to index the metadata of the contents, which would take a long time; they take the same approach for digitized books
- different DP policies and validation levels for different types of WA data: full crawls vs. thematic crawls
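The package-level approach described above (state a property once and list the files it applies to) can be sketched as a simple grouping. This is only an illustration of the idea in Python, not the special metadata format BnF created; the property name `format`, the identifiers and both function names are hypothetical.

```python
from collections import defaultdict


def compact_package_metadata(per_file):
    """Turn {file_id: {property: value}} into {(property, value): [file_ids]}.

    Each distinct property value is stored once per package, followed by the
    files it applies to, instead of being repeated for every file.
    """
    grouped = defaultdict(list)
    for file_id in sorted(per_file):
        for prop, value in per_file[file_id].items():
            grouped[(prop, value)].append(file_id)
    return dict(grouped)


def packages_containing_format(packages, fmt):
    """Answer 'give me all packages that contain format X' from package-level data."""
    return sorted(
        package_id
        for package_id, compact in packages.items()
        if ("format", fmt) in compact
    )


per_file = {
    "f1": {"format": "text/html"},
    "f2": {"format": "text/html"},
    "f3": {"format": "image/gif"},
}
compact = compact_package_metadata(per_file)
packages = {"aip-001": compact, "aip-002": {("format", "image/png"): ["g1"]}}
print(compact)
print(packages_containing_format(packages, "image/gif"))
```

The query only inspects one small record per package, which is the point made in the notes: the content-level metadata never has to be indexed.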

NL NZ: 2 harvests, 20 TB in total. They are dealing with metadata: how much metadata is a lot and how much is too little? Library policy says that as much metadata as possible must be stored, but that would be a problem in terms of metadata size. For selective web harvesting they have a finished workflow (WCT); everything is catalogued.

IA: 16 billion URLs, the oldest from 1996. Growth is 3 TB per day, 1 PB per year.
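The two growth figures above are consistent with each other, as a quick check shows. The sketch below assumes decimal units (1 PB = 1000 TB) and a 365-day year; the function name is illustrative only.

```python
TB_PER_DAY = 3        # daily growth quoted in the notes
DAYS_PER_YEAR = 365   # assumption: non-leap year


def yearly_growth_pb(tb_per_day, days=DAYS_PER_YEAR, tb_per_pb=1000):
    """Return yearly growth in petabytes, assuming 1 PB = 1000 TB."""
    return tb_per_day * days / tb_per_pb


growth = yearly_growth_pb(TB_PER_DAY)
print(f"{growth:.3f} PB per year")
```

At 3 TB per day the archive grows by roughly 1.1 PB per year, which matches the rounded "1 PB per year" in the notes.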

Euan Cochrane, Dirk von Suchodoletz: Replicating Installed Application and Information Environments onto Emulated or Virtualized Hardware

- Capturing and preserving a complete environment on emulated hardware
- E.g. take the desktop environment of the prime minister and store it in the archive
- Problems with rendering
- Computer forensics
- An option for preserving scientific data and records
- The whole idea is to replicate an HDD and run the environment stored on it in a virtual environment
- Solution: they pulled the HDDs out of several old PCs > identify the hardware requirements (analysis of the HDD > automatic estimate of the requirements; this information is part of every PC environment) > choose emulation/virtualization software (a tool registry such as TOTEM from the KEEP project) > convert the HDD into a disk image suitable for emulation > try to boot the disk image on emulated hardware > add drivers
- Problems with licences, personal data protection and authenticity (20% of things change: colours etc.)
- QEMU, SPARC processor emulator


Klaus Rechert: Remote Emulation for Migration Services in a Distributed Preservation Framework
Use of emulation as a tool for migration

Tools for migrating certain formats are often not available. The digital object is placed into the emulated environment (a virtual machine), where it becomes visible inside the emulated system. It can then be opened in the original or another suitable application, saved in a different format, and stored again in the virtual machine.

Bram Lohman: Emulation as a Business Solution: the Emulation Framework (KEEP project)

Emulation framework: 7 emulators, 6 platforms (x86, Amiga etc.), 23 file formats. It is a solution for managing emulation tools and setting up emulation processes: an environment that contains emulators, so that an application or file loaded into it should run as in its original environment. The environment also contains a tool that shows what format a file has and what environment is needed to run it. Based on PRONOM, that environment can be prepared directly and the file run in it. It loads software images from an application database that the OPF is building.

Geoffrey Brown: Developing Virtual CD-ROM Collections: The Voyager Company Publications

- Publications of one particular publisher on CD, interactive applications for Mac; of 200 titles published, only about 50 are available today
- Emulation on today's systems
- HDD snapshot directly in the emulator, i.e. it takes one click and is very fast
- SheepShaver emulator

Evaluation of the Danish large migration project
Before 1998 they had no formats set by law. Between 2005 and 2008 they introduced standards. The evaluation concerns the set standards and migration into them at the national archive, and it was carried out for the funder. Between 2005 and 2008 they spent 30 person-years on migration, with 10-15 people working on it; 190 thousand USD was invested and total costs were 26 million USD.

In reality it is not much data: what they migrated was about 1777 GB.

Different parts of the archive: tapes, population data, data on CD-R, registries and electronically filled-in data. They could not read all files, especially those on tapes (5 different types of tape). Some had to be rescued at great expense.


Total cost of producing the preservation standard: 10 man-years, 12 thousand USD, including the manual and implementation recommendations

Pilot: project planning and management, and verifying the information packages

The aim of the pilot was to obtain a better budget and plan for the project

Some problematic data in old formats, such as old databases, need clever people to do repetitive, long-running work; they needed good knowledge management to make this efficient

Migration method: they wrote requirements for a tool and a description of how manual migration should be done

Data preparation (restructuring data and registering the metadata of the IPs) and preparation of documentation for the migrated IPs

Software development: in-house development

They needed 50 person-years for 1

Conclusions

Migration of standard data is cheaper, migration from some standard tapes is cheaper, etc.

Most of the costs (80%) fell on non-standardized data, in producing the migration software. Developing a tool for migrating heterogeneous or non-standard data is the most expensive part

Lessons learned: the old data had not been analysed sufficiently

Project management was loose; they lost money

Knowledge management: a good description of old data and of all their types, generations, locations etc. does not exist at our institution and we will have difficulties with it; migration of old data at the NK will be problematic

Angela Dappert: robust migration workflow for offline media
- What is an archival object? Nice slide: a CD is not an archival object, for them it is a hand-held carrier. Better is a bit-stable object, which can have a backup etc., leading to an archival object that has further metadata (logical preservation)
- A CD is not searchable, cannot be replicated easily, has a large manual overhead, and its rendering technology becomes obsolete very quickly
- Endangered Archives project: optical disks, CD-R, external HDs, tapes; 67 terabytes in total
- The offline hand-held carriers in the Endangered Archives project were very variable; they contained data with DRM, data under copyright, and a range of similar problems
- Options they decided between for each data source:
  o Disk image: a single file that contains everything on the carrier
  o Or extraction of only some files
- How important is the carrier itself? We need some information about it; it may hold traces of deleted data that we may want to keep. They made disk images of everything possible: hybrid DVDs, audio discs that also held data, etc.
- Which disk image format should they use? Not a single disk image format for all data; each case gets its own disk image format
- They did it with disk copying robots; large-scale disk copying robots sometimes could not be used, because they are good at producing CDs but not at ripping data from CDs


- They built their own application with disc stacks and some smaller robots. They tried LIFO and FIFO and in the end used FIFO; LIFO had problems with the CD lifter

At the KB they thought through a fairly complex workflow, how to describe it, etc.

Each robot had its own PC

They had problems with a range of things, see the presentation

They did not find good software for managing images, only command-line tools, and non-technical staff would make a lot of mistakes with those

It takes a lot of people before the content gets online

Staff must be well trained and flexible, but also able to do tedious jobs: systematic, patient

NOTE: important for the NK, where the transfer of data from discs will also have to be dealt with and is already being dealt with


KEEP project, Antonio Ciuffreda: towards an integrated migration environment

- Disk transfer tool: converts a disk into an image file. The result also carries further metadata about the file system, the MD5 of the file, etc.
- KEEP is producing a Transfer Tool Framework
- Magnetic media: disk transfer tools for floppy disks, commercial and open source
- Disk2FDI: commercial DOS tool, very accurate floppy disk image; it takes 1 hour and the result is very large, ten times larger than the floppy disk itself. About 2260 disks tested; emulation tested
- Catweasel: commercial tool, a PCI card, runs on Linux and Win XP, has a GUI. High error rate but faster; image file quality was low
- Nibtools: free tool, G64 and D64, covers only the C64; DOS, Win, Linux, but command line only. Needs a Commodore disk drive and special cables. A few disks tested; about half of them then did not work in the emulator
- Optical media: they used 5 transfer tools, each with the same CDs, DVDs and games
  1. Alcohol 120%: commercial, can bypass DRM etc., supports Windows systems. Of 13 discs, 12 worked; 2 MB per second
  2. Daemon Tools: commercial, several image file types (ISO, MDS, MDF), supports Windows; three of 13 did not work
  3. CloneCD: commercial, uses IMG or ISO, bypasses SafeDisc 3 protection, supports Windows, has a GUI; 11 of 13 were OK
  4. BlindWrite: commercial, supports DVD, Blu-ray, WM, Xbox and other special discs. Generates ISO and some proprietary image formats; one ISO image did not work; download speed
  5. ImgBurn: does not read subchannel information into the image file (seeking within a film is not possible etc.); it is open source, generates DVD, BIN, CUE, IMG; Win and Linux; 4 did not work; it is fast
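The first item in the list above says that the transfer tool stores an image file together with file-system metadata and an MD5 checksum. A minimal sketch of such a record, assuming a JSON-style dictionary and hypothetical field names (this is not the KEEP tool's actual output format), could look like this:

```python
import hashlib
import json
import os
import tempfile


def md5_of_file(path, chunk_size=1024 * 1024):
    """Compute the MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def describe_image(path, file_system):
    """Build a small metadata record for a disk image (field names are illustrative)."""
    return {
        "image_file": os.path.basename(path),
        "size_bytes": os.path.getsize(path),
        "file_system": file_system,
        "md5": md5_of_file(path),
    }


# Demo with a tiny stand-in file instead of a real disk image.
with tempfile.TemporaryDirectory() as tmp:
    image_path = os.path.join(tmp, "floppy-0001.img")
    with open(image_path, "wb") as handle:
        handle.write(b"hello")
    record = describe_image(image_path, file_system="FAT12")

print(json.dumps(record, indent=2))
```

Storing the checksum next to the image makes it possible to verify later that the image has not changed since the transfer.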

Conclusions

For magnetic media, commercial and non-commercial tools do not differ in performance; Disk2FDI is accurate but very slow. KEEP will use NibTools

Optical media: copy protection has to be considered; the tools have similar performance; there will always be errors in the images, between 30 and 10 percent; BlindWrite can handle game discs (Xbox etc.). KEEP will use ImgBurn because it is open source

For complex images BlindWrite is better
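The per-tool results reported for optical media can be turned into failure rates and compared with the 10 to 30 percent range quoted in the conclusions. The sketch below assumes that every tool was tested on the same 13 discs; the notes state this explicitly only for some of the tools, so the BlindWrite and ImgBurn figures rest on that assumption.

```python
TOTAL_DISCS = 13  # assumption: same test set of 13 discs for every tool

# Number of discs that worked, derived from the notes.
working = {
    "Alcohol 120%": 12,
    "Daemon Tools": TOTAL_DISCS - 3,
    "CloneCD": 11,
    "BlindWrite": TOTAL_DISCS - 1,
    "ImgBurn": TOTAL_DISCS - 4,
}


def failure_rate(ok, total=TOTAL_DISCS):
    """Return the share of failed discs in percent."""
    return 100.0 * (total - ok) / total


for tool, ok in sorted(working.items(), key=lambda item: item[1], reverse=True):
    print(f"{tool:<13} {ok:>2}/{TOTAL_DISCS}  failed: {failure_rate(ok):.1f}%")
```

Under these assumptions the failure rates range from about 8% to about 31%, which is in line with the range given in the conclusions.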

Benefit for the NK

Consider whether the NK should actually run a project to migrate the contents of CDs and DVDs to online media. These presentations give concrete experience with robotic processing and show the problems encountered in designing the workflow, choosing the type of ISO image, etc.

BnF web archiving
They have three layers:

Harvest definition: collection

Harvest instance: crawling metadata


ARC files

A collection is, for example, the selective collection "Elections 2000", followed by the individual harvest instances and then the ARC files. They collect logs, configuration and reports. These are also stored in ARC: a special metadata ARC for each crawling instance

PREMIS

Object, agent, event

Objects: 1. ARC files and metadata ARCs; 2. harvest instances

Harvest event in PREMIS: event "creation of content files"

Events: reports as extensions of the event (host report and harvest report)

Agents: administrator, software, institutions, organizations that perform the harvest
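The object / agent / event structure described above can be modelled with a few small data classes. This is a sketch for orientation only: the class and field names are hypothetical and do not reproduce the PREMIS schema or BnF's implementation, and the identifiers are made up.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class PremisObject:
    identifier: str
    kind: str  # e.g. "ARC file", "metadata ARC", "harvest instance"


@dataclass
class PremisAgent:
    name: str
    role: str  # e.g. "administrator", "software", "institution"


@dataclass
class PremisEvent:
    event_type: str
    objects: List[PremisObject] = field(default_factory=list)
    agents: List[PremisAgent] = field(default_factory=list)
    reports: List[str] = field(default_factory=list)  # event extensions


def objects_of_kind(event, kind):
    """Return the identifiers of all objects of a given kind linked to an event."""
    return [obj.identifier for obj in event.objects if obj.kind == kind]


# Hypothetical harvest event for one crawl.
harvest_event = PremisEvent(
    event_type="creation of content files",
    objects=[
        PremisObject("harvest-instance-001", "harvest instance"),
        PremisObject("crawl-001-00001.arc", "ARC file"),
        PremisObject("crawl-001-metadata.arc", "metadata ARC"),
    ],
    agents=[
        PremisAgent("crawl operator", "administrator"),
        PremisAgent("crawler", "software"),
        PremisAgent("harvesting library", "institution"),
    ],
    reports=["host report", "harvest report"],
)

print(objects_of_kind(harvest_event, "ARC file"))
```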

ContainerMD

http://bibnum.bnf.fr/containerMD-v1/documentation/containerMD-v1.html

Special metadata for material from the web archive

http://bibnum.bnf.fr/containerMD-v1

Different SLAs for different types of material and for data from different harvests; from the shared repository they expect various benefits; skills for different formats need not be duplicated within the institution

Next year there should also be a JHOVE2 module for WARC


Memento: meta search engine

Benefit for the NK

Their web archiving model could be used at the NK

Cost models: Danish national library + TU Wien
- Stephan Strodl, TU Vienna: they have their own cost model, but only small-scale; automated preservation action cost, it seems
- Danish national library: built their own model, which is meant to be universal and usable anywhere
- They measured the cost of submission according to the PAIMAS standard
- When calculating costs they use OAIS and PAIMAS: they map activities onto these models and estimate process costs accordingly
- costmodelfordigitalpreservation.dk

Benefit for the NK

For project 0136: options for estimating the costs of long-term storage were addressed there

Meet RODA, a Full-Fledged Digital Repository for Long-Term Preservation

- Originally a project of the Portuguese national archive, which we have been following for several years. The RODA system is now supported by an independent company, which also partly continues its development. So far RODA supports only an archival metadata format (EAD), but further development should include library formats as well

- RODA is now part of the SCAPE project, where it will be possible to develop the system further and scale it for use in mass production

- http://redmine.keep.pt/projects/roda-public

Benefit for the NK

Follow further development, possibly also for the INCAD + KNAV projects; for developing LTP for smaller institutions this could be an interesting alternative in future

Report from a foreign business trip

Name and surname of participant: Mgr. Andrea Fojtů (AF)

Workplace per organizational structure: 15 Department of Long-Term Preservation of Digital Data (ODODD)
Position: 152 Digital Repository Content Management Department
Reason for trip: Participation in the conference Aligning National Approaches to Digital Preservation
Place (city): Tallinn
Place (country): Estonia
Date (from-to): 22-26 May 2011
Detailed schedule:
Monday 23 May: registration at the conference, guided tour of the National Library; Keynote Address by Laura Campbell, Library of Congress (USA); Panel 1: Technical Alignment; Panel 2: Organizational Alignment
Tuesday 24 May: Keynote Address by Gunnar Sahlin, National Library of Sweden; Panel 3: Standards Alignment; Panel 4: Legal Alignment; Breakout Sessions for panels 3 & 4
Wednesday 25 May: Panel 5: Education Alignment; Panel 6: Economic Alignment; Breakout Sessions for panels 5 & 6; Synthesis; Closing remarks

Fellow travellers from the NK: PhDr. Bohdana Stoklasová (BS), Ing. Tomáš Svoboda (TS)

Funding: IOP NDK
Goals of the trip: Attendance at a conference with international participation; gaining contacts in the field of long-term preservation of digital documents and legal deposit of electronic publications; more detailed insight into the issues of long-term preservation of digital documents, especially in national libraries

Fulfilment of goals (specifically): The conclusions of the conference Aligning National Approaches to Digital Preservation largely copy the conclusions of the workshop The Future of the Past: Shaping new visions for EU research in digital preservation (report available at http://cordis.europa.eu/fp7/ict/telearn-digicult/future-of-the-past_en.pdf), e.g. regarding the missing economic model for the commercial sector, which would regard long-term preservation as an integral part of all its processes. Contact was established with staff of the national libraries of Estonia and Finland (staff responsible for web archiving and long-term preservation in general).

Programme and further details: The main aim of the conference was to align national approaches in long-term preservation of digital documents across all areas, from technical, organizational and educational through standardization to economic and financial.

Materials brought back: conference programme, leaflets of exhibiting companies (Tessella, Equella, Guardtime) + further materials, notes

Date of report submission: 8 June 2011
Signature of submitter:

Signature of superior:

Posted on intranet:
Received at the international department:

Attachment to this report: Conference notes in English

Attachment: Conference notes in English

Exploring What We Can Do Together: Strategic Alliance for International Collaboration (Laura Campbell)

- 185 digital preservation partners in more than 25 countries (education, research, LAM)
- strategic goals: National Content Stewardship Network (national digital collection, technical architecture, public policy, outreach)
- NDIIPP Content Domain Map: a mind map of geospatial, audiovisual, image & text, and web content
- then & now: cognitive surplus vs. digital libraries / digital preservation
- solution framework: actively working together, special interest groups, establishing a common index, international digital collection (freely available)

PANEL 1: Technical Alignment (The role of testing), Prof. Dr. Michael Seadle (Panel Head)

to collaborate on requiring and implementing rigorous and independent tests

DNB Contribution to the Tallinn Alignment (Sabine Schrimpf)

- key theme is infrastructure: a network of hardware and software that permits the operation of application software
- the question of interoperability is crucial (standards, technical specifications)
- SW elements / components of the DP infrastructure were compared to the pallets at railroads
  o Source: PARSE.Insight Roadmap 2020
- persistent ID resolvers, certification process
- kopal (ingest: koLibRI)
- nestor (German Network of Expertise)
- DP4lib: Digital Preservation as a Service; reduce dependency between components
  o redundant storage at different locations
  o koLibRI modules
- LuKII: set up as an economical LOCKSS network in Germany
- SHAMAN
- APARSEN: wants to bring coherence and cohesion to digital preservation research
  o trends in DP research projects
  o modular DP systems
  o distributed as SOA
  o elimination of technology dependencies

EDINA: The UK LOCKSS Alliance (Adam Rusbridge)

- EDINA offers underlying technical support & coordination
- threats to digital stewardship
  o failure (media, HW, SW, network, format obsolescence, natural disaster, economic/organizational failure)
  o attack (insider/outsider)
  o operator error
- source: Requirements for Digital Preservation
- projects PEPRS and PECAN help identify coverage and requirements for DP

Public testing (Michael Seadle)

- traditional physical archiving relies heavily on trusted institutions
- distrust, not trust, needs to be the basis of digital archiving; testing therefore plays a key role
- goals of testing: demonstrate functionality, reveal weaknesses, provide data for planning improvements
- key issues for testing: integrity, authenticity (can the origin or genuineness be shown), usability (can migration/emulation be demonstrated), access, financial integrity
- DrWho (drwho1com)
- bit-stream testing is the most important; authenticity and usability may be impaired
  o the type of storage media, the number of copies + frequency of checking and replacement get us to the relevant results
  o no reliable metrics exist, however (what is an acceptable loss etc.)
- without well documented, peer reviewed, publicly available test results, librarians are buying archiving systems on faith
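Bit-stream testing as described in the notes above comes down to regularly comparing checksums across several copies. The sketch below illustrates one simple way to do this, assuming three in-memory copies and a majority vote to decide which digest is the reference; both the vote and the site names are assumptions for illustration, not something the talk prescribes.

```python
import hashlib
from collections import Counter


def checksum(data: bytes) -> str:
    """Return the SHA-256 digest of a byte string."""
    return hashlib.sha256(data).hexdigest()


def audit_copies(copies):
    """Compare replicas of one object.

    copies maps a storage location to the bytes held there. Returns the
    digest shared by most copies and the sorted list of locations whose
    copy differs from it. A tie between digests is not resolved in any
    meaningful way by this sketch.
    """
    digests = {location: checksum(data) for location, data in copies.items()}
    majority, _count = Counter(digests.values()).most_common(1)[0]
    damaged = sorted(loc for loc, digest in digests.items() if digest != majority)
    return majority, damaged


# Hypothetical replicas; the copy at site-c has a flipped character.
copies = {
    "site-a": b"payload",
    "site-b": b"payload",
    "site-c": b"paylo4d",
}
majority, damaged = audit_copies(copies)
print(damaged)
```

How often such an audit runs and how many copies take part are exactly the parameters the talk names as decisive, and for which it says no agreed metrics exist yet.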

Presentation without a title (Andy Rauber)

- evaluation vs. testing vs. benchmarking
- DP testing and evaluation: evaluation rather than testing, far from benchmarking (few tests, but not near a definition of benchmarks)
  o existing evaluations are not repeatable
  o focus on the simple things
  o building the frameworks before having clear test scenarios
- necessary to move towards comparative benchmarking
- what is needed: commitment that we want a culture of benchmarking and comparative evaluation; understanding of what we want to benchmark; benchmark data + ground truth; measurement scales and measures that remain constant; knowledge bases to collect these

Organizing digital preservation on an international level (Michelle Gallinger, NDIIPP)

- focus on a national DP agenda
- community driven, action oriented (National Digital Stewardship Alliance)
  o present a distributed national digital collection for the benefit of citizens

The European Research Arena (David Giaretta)

- technologies: GEANT, EGEE/EGI
- EU research projects: TIMBUS, BLOG4EVER, SCAPE, ENSURE, APARSEN, ARCOMEM, WF4EVER
  o SCIDIP-ES (2011-2014)
- Alliance for Permanent Access (APA): formed as a legal entity 3 years ago
  o opportunities for networking
- ISO 16363: Audit and Certification of Trustworthy Digital Repositories
- ISO 16919: Requirements for Bodies Providing Audit and Certification

Observation from the MetaArchive Cooperative program (Martin Halbert)

- distributed DP programs differ from other programs
  o replication of content, distribution of these replicated copies to distinct geographical locations, and network organization to connect these replicated copies
- MetaArchive: established in 2003, funded by NDIIPP
  o seeks to foster broader awareness of digital preservation issues
- IIPC: members are all institutions that focus on web archiving
  o 39 members: national and university libraries + other organizations (Internet Archive)
  o ISO standard WARC format for web archives + Heritrix and NutchWAX
  o growing membership (Africa & South America)

DAY 2

Keynote address: International and National Collaboration in the Digital Age (Gunnar Sahlin)

- 2012: a new law for e-deposit
- Samsök (search together) in 2005 (upgrade, new system)
- SwePub and long-term preservation
- Consortium of the Swedish research libraries for licensing e-journals and databases (ICOLC)
- Open Access and e-publishing (all universities have their repositories for e-pubs)
- NL: aggregator for Europeana; TEL, Apres, Athena, EUscreen
  o common system for the preservation of digital materials
  o common search portal for materials from the Swedish National Library and Swedish National Archive

Raivo Ruusalepp

- standards: RAC, DSA, CIDOC (CRM), PAIMAS, ISAD(G), DDI
- use of information security standards for digital preservation
- information security: administrative and technical (physical = data security vs. IT = communication)
- company-implemented security measurement with typical cyber crime scenarios
- survey of security
  o provision for information security in national legislation and development plans (12 of the respondents: ISO 27000 series; only 2 formal audits, the rest are looking into it; ½ of the respondents do not use standards or formal measurements to control information security)
  o IT & disaster plan: 65 (data recovery from the off-site location tested: 0)
- alignment
  o better use of community standards for information security and preservation
  o agreement on security requirements

Standards-based approach to preservation planning (Matthew Woollard)

- ISO 27001: very expensive implementation, 100 000 pounds
- Basic: Data Seal of Approval Guidelines (helps understand your business better)
- Audit and Certification of Trustworthy Digital Repositories
- Memorandum of Understanding to Create a European Framework
- ISO 16363; external DSA or ISO 16363 self-audit

Best Practices & Standards (Bram van der Werf)

- self-assessment: trust
- audits/certification: trust
- ISO 30300 (draft): Record Management

PANEL 4: Legal Alignment

Legal Deposit & Web Archiving (Adrienne Muir)

- legal deposit provisions: purpose, scope, deposit mechanisms, roles & responsibilities & liability, access provisions, sanctions
- implementation: definitions, scope (offline/online, freely available/pay wall), technology neutral, incremental
- legal deposit vs. voluntary (interim/hybrid approach, model agreements and licenses, flexibility)
- other legal issues: intellectual property rights, preservation, access, unlawful material, privacy/data protection
- voluntary approaches have disadvantages but may be necessary and can be useful

Breakout Session

- standards bring along alignment by themselves, but only if you use them to the full, not halfway
- depends on community (users): enforceable or voluntary compliance (standards)
- next steps for standards alignment
  o corpora as benchmarks
  o export/import completeness
  o educational standards
  o validation tools
  o accredited training courses to accredit auditors
  o framework standards

DAY 3

PANEL 5: Educational Alignment

- key elements of the DCC Curation Lifecycle Model
- Framework for the Education Alignment (USA: grad programs for digital preservation, new models for grad programs, internship programs (diverse knowledge), workshops)
  o sharing tools
  o national programs
- related issues to digital preservation
  o nature of costs and business models
  o strategies for selection & appraisal
  o ground roles and responsibilities
  o effectiveness and demand for services
- focus of the panel: factors influencing the actual sustainability of a digital archive
- 2 considerations: collaboration + user demand
- challenges + gaps: span national boundaries, public + private funding, education, exportation, DP certification, competition, funding gaps, policy, selection criteria, roles & responsibilities, standards
- Magazzini Digitali: e-legal deposit in Italy
- PADI: a failed project (discontinued in 2010)

Presentation without a title (Neil Grindley)

- how much does it cost to manage information? what institutional financial strategies are required to facilitate effective preservation? what general economic frameworks are required to enable information to persist and be accessible?
- JISC 2010: Infrastructure for Education and Research Programme
- archival storage and preservation activities constitute a very small proportion of the overall costs: 1531, access 55, outreach, acquisition, ingest
  o approx. 333 Euros for a set of 1000 records
- KRDS2, p. 83: future tool development supporting automation of ingest

Sustainable Preservation in North America: ADPNet & Friends (Aaron Trehub)

- solution: distributed digital preservation (in at least 3 copies vs. LOCKSS 6 copies)
- DDP + LOCKSS = PLN (open SW developed at Stanford)
- MetaArchive, COPPUL (Canada), ADPNet (www.adpn.org)


Color in Digital Preservation (Robert Buckley, University of Rochester/NewMarket Imaging; Steven Puglia, National Archives and Records Administration; Michael Stelmach, Library of Congress, USA)

- colour gamut: AdobeRGB, ProPhoto RGB
- colour reproduction

Multispectral Image Archiving of Watermarks in Historical Papers (Peter Meinlschmidt, Wilhelm-Klauditz-Institut, Fraunhofer-Institute for Wood Research, and Volker, Braunschweig, Germany)

- reflected light
- transmitted light
- multispectral imaging

Implementing a Quality Assurance Program for Monitoring Scanner Performance (Michael J. Horsley and John T. Berezich, National Archives and Records Administration, USA)

- microfilm, scan
- NARA 2004 Guidelines: http://www.archives.gov/preservation/technical/guidelines.pdf
- Metamorfoze
- etc., see slides
- quantitative performance: see slides
- web-based database, SharePoint

Preservation in a Digital Age (Jay Verkler, FamilySearch, USA)

- with Tessella, for SDB
- data loss is intrinsic
- preservation as a service

Curation of the End-of-Term Web Archive: Classification and Metrics (Kathleen Murray, Lauren Ko and Mark Phillips, University of North Texas, USA)

- eotcd archive: http://research.library.unt.edu/eotcd/wiki/Main_Page
- 16 TB of data
- (not WARC files)

DAITSS Grows Up: Migrating to a Second Generation Preservation System (Focal) (Priscilla Caplan and Carol Chou, Florida Center for Library Automation, USA)

- DAITSS 1 could not be installed elsewhere
- instructions
- in processes
- PSP
- refresh function
- disseminate: do a refresh and export the new AIP as a DIP
- withdraw: remove the AIP from storage, retaining provenance
- 80 TB

A Community Driven Micro Services Architecture Supporting Long Term Digital Preservation (Mark Evans and Bill Steel, Tessella Inc., USA; Robert Sharpe, James Carr, Alan Gairey and Jonathan Tilbury, Tessella plc, UK)

- with NDIIPP, micro-services
- joined into a process
- workflow
- bit-stream
- cloud (S3, http://aws.amazon.com/s3), SaaS functionality; multi-tenancy soon
- functionality, policy etc.
- note from the USA: 44 TB SIP in testing (10 MB JP2 files)
- SDB community: founded 2008

System (Lisa LaPlant and Blake Edwards, US Government Printing Office, USA)

- FDsys, OAIS, MODS
- <part>2</part>
- DMD: data model definition
- http://www.gpo.gov/fdsys/search/pagedetails.action?st=prague&granuleId=&packageId=DCPD-200900228

Preservation Starts from the Beginning (Michael Wash, US Department of Transportation, USA)

- Autographic Kodak, 1916
- "following the No. 3A Autographic Kodak Special of 1916, which was the first rangefinder camera. It had a Kodak Anastigmat f/6.3 lens and a Kodamatic shutter with speeds from 1/2 to 1/200 sec plus bulb and time mode" (http://camerapedia.wikia.com/wiki/No._1A_Autographic_Kodak_Special)
- Kodak Advantix

Colorite: A Flexible Cross-Platform Software Solution for Automatic Image Quality Analysis Using Arbitrary Targets (Henrik Johansson, National Library of Sweden, Sweden)

  o The target is detected automatically
  o TIFF, JPEG, JP2, PNG
  o state-of-the-art feature-based image matching algorithm; ImageMagick
  o can be switched off in future to speed up the process for batch processing

What if the Image Quality Analysis Rates My D (Dietmar Wueller, Image Engineering, Germany)

- Golden Thread
- UTT target
- lens reflex

Establishing Resolution Requirements for Digitizing Transmissive Content: A Use Case Approach (Michael Stelmach, Library of Congress; Don Williams, Image Science Associates LLC; Steven Puglia, National Archives and Records Administration, USA)

- Based on the 10% SFR limiting resolution criterion: how much of the image information will be captured
  o 1st half of the 20th century: 1200-1600 PPI
  o 2nd half of the 20th century: up to 2800 PPI

Digitise More, Pay Less: Optimising the Workprocess for both Heritage Institute and Imaging Provider (Olaf Slijkhuis, Pictura Imaginis, the Netherlands)

- document, quality, parameters etc.
- image quality control

Debate with FamilySearch, 20 May 2011

- also on digital preservation issues
- will have to provide free of charge
- and synchronize
- an effort on their side to cooperate with archives and libraries


Business trip report
Project: Creation of the National Digital Library

CZ.1.06/1.1.00/07.06386

Name and surname of participant: Jan Hutař

Workplace per organizational structure: ODF 81

Position: head of division

Reason for trip: attending the iPRES 2011 conference

Place (city): Singapore

Place (country): Singapore

Date (from-to): 30 October - 5 November 2011

Detailed schedule:
30-31 October: flight Prague > Dubai > Singapore
1 November: start of the conference, tutorials
2-4 November: conference
4-5 November: return flight Singapore > Dubai > Prague

Fellow travellers from the NK: Mgr. Marek Melichar (funded from project 0136)

Funding: IOP, Creation of the National Digital Library

Relation to the project: obtaining new information on digital preservation and on projects in other libraries; consultations with colleagues and companies

Goals of the trip: see relation to the project; use all outputs for planning and running the NDK project and for future handling of digital preservation in the NK/NDK

Fulfilment of goals: fulfilled, see the detailed notes below and the proceedings on SPS


Further details: SUMMARY AND BENEFIT FOR THE NDK PROJECT

- Noticeable rise of emulation as an approach to long-term preservation (in previous years it was migration) > the two approaches will apparently complement each other
- Shift towards preservation of complex data, databases etc.; the NK does not address this yet
- Many contributions usable for the NK and NDK (web archiving and preservation at the national library of France, audits, emulation, information on the SDB system (Tessella) and on the RODA system, certification, see Rouchon, etc.)
- Information on problems and solutions when using tools such as JHOVE, PRONOM etc. in a real environment
- 2 contributions on backing up optical discs, a current problem at the NK too; ideally follow the procedures described in the proceedings
- Clear need for international cooperation and adherence to standards so that such cooperation is possible
- For more detail see below

Project publicity support: N/A

Related materials

Material | Storage location
conference proceedings | SPS, folder with business trip reports

Report submitted: 15 November 2011

Signature of submitter

Date / Signature

Signature of supervisor: 15 November 2011

Posted on the intranet

Received by the International Department

Seamus Ross: digital curation and preservation
- preserving data sets: use of statistical databases and research data vs. preservation of textual information
- personal data are no longer just photos in a box (Flickr etc.)
- banks hold a lot of personal data that historians will use in future; it must be kept, and institutions already do so
- see mckinsey.com, big data full report (PDF)
- why do DP at all (slide 16): future generations expect it; historians and scientists need sources, a record of the present for the future; an information ecosystem to enable storytelling
- the emphasis is shifting from preserving textual information to preserving complex databases

A capability model for DP, Ch. Becker et al.
- research within the SHAMAN and SCAPE projects
- SoS: systems of systems
- three kinds of systems (UML):
  - a DPS, with DP as a functional requirement
  - an SoS: a business system as a system within the system, from which data then flows into the DPS
  - a system where DP is not a functional requirement but which does it anyway (DP-ready system): a business system with DP functionality
- how to get DP into enterprise systems: a model for implementing DP in any system, developed within the SHAMAN project as a capability-based reference architecture
- governance, business and technical (operation) capabilities as a basis for decisions and for assessing the current state
- capability maturity model (CMM): process assessment and improvement, in connection with software development

Olivier Rouchon: certification and quality at CINES
- they store theses, digitised items, multimedia, documents and scientific data sets
- data centre for the whole of France
- they have experts on formats and XML; 11 people
- 15TB of data
- certification: under national law CINES is the national centre for DP of theses; it has a department, staff and money for this; for the procedure and preparation see below
- preparation for certification: testing DRAMBORA, DSA, TRAC, ISO 16363 and ISO 16919
- step: 2009 DRAMBORA audit; two risk reviews per year, checking how the risks are being resolved
- step: formalisation of business processes; 14 processes according to ISO 9001: management, operational and support processes (presented at iPRES 2010)
- 2009: external pre-audit, 2 people, 19 man-days, based on all available standards, carried out through documentation review and interviews
- 2010: SIAF audit, 4 months; the National Archives of France carries it out for every archive that stores public data; audits take place every 3 years; the report had 800 pages
- 2010: Data Seal of Approval, part of the EU framework for audit and certification of trusted repositories (MoU between the three certification activities)
- 2011: within the APARSEN project they also did ISO 16363; they went through that audit together with DANS and UKDA
- first internal, January and April 2011 (60 man-days), then external, 12 experts (KB, BL, NASA etc.) in June 2011 (3 days)

MoU: DSA > ISO 16363 as the second step (internal) > ISO 16363 extended
An audit gets faster the more of them you do, i.e. if it is done regularly it is not that time-consuming.

The National Library of New Zealand went through TRAC certification in autumn 2011.

Andreas Rauber: impact of preservation actions on the repository
- what happens to the repository itself; repository simulation with RepoSim
- intended for analysis and for testing migration: what happens when files grow, what if the repository holds more format types, etc.
- RepoSim: a flexible simulator, irregular patterns
- so far an internal version; Hibernate, Java, MySQL
- you can specify which formats the repository accepts and their description, ingest settings, hypothetical tools (mainly for migration) and rules for preservation actions (migration to which format, which versions, which files, how many times; rules + filters)
- a virtual migration can be run; it produces graphs showing how long it will take etc.
- what to migrate, how, to what and over what period: it all runs virtually and we see the result
- good for IT and HW planning
- good for planning different scenarios, comparing with the expected development, and planning HW growth and investments
- still to be added: deletion policies, reports, etc.
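The idea of a "virtual migration" can be illustrated with a very small sketch. This is not RepoSim (which is a Java/Hibernate/MySQL application); the class name, the rule parameters and all numbers below are assumptions made purely for illustration.

```python
from dataclasses import dataclass


@dataclass
class MigrationRule:
    """Hypothetical preservation rule: migrate one format into another."""
    source_format: str
    target_format: str
    size_factor: float     # assumed size change caused by the migration
    seconds_per_gb: float  # assumed throughput of the migration tool


def simulate_migration(holdings_gb, rules):
    """Apply the rules to a copy of the holdings without touching real data.

    holdings_gb maps a format name to the stored volume in GB.
    Returns the simulated holdings and the estimated duration in hours.
    """
    result = dict(holdings_gb)
    total_seconds = 0.0
    for rule in rules:
        size = result.pop(rule.source_format, 0.0)
        if size == 0.0:
            continue  # nothing stored in this format
        total_seconds += size * rule.seconds_per_gb
        migrated = size * rule.size_factor
        result[rule.target_format] = result.get(rule.target_format, 0.0) + migrated
    return result, total_seconds / 3600


# Made-up repository content and rules
holdings = {"TIFF": 1000.0, "DOC": 200.0, "PDF/A": 50.0}
rules = [
    MigrationRule("TIFF", "JP2", size_factor=0.5, seconds_per_gb=36.0),
    MigrationRule("DOC", "PDF/A", size_factor=1.2, seconds_per_gb=18.0),
]
new_holdings, hours = simulate_migration(holdings, rules)
print(new_holdings, hours)
```

Running several rule sets against the same holdings gives the kind of scenario comparison described in the talk: expected duration and the resulting storage volume per format.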

José Barateiro: Risk assessment in DP of e-science data and processes
- DP as risk management
- ISO 31000: definition of risk management
- similar to DRAMBORA
- there are many standards for risk management
- the ISO 31000 methodology is broken down into individual steps
- TIMBUS project, http://timbusproject.net; one of the partners is SAP (Germany)

mad talks
- open-source SW for LTP: RODA is back; it is being developed within the SCAPE project, with new features and plans for further development and for building a user community
- 4 posters on emulation: emulation within KEEP, emulation for library reading rooms, OPF ecosystem, registry
- TOTEM: metadata standard for describing the technical environment for emulation

ANDS, Ross Wilkinson
- data centres in Australia, at least 3, for different areas of life
- ANDS has existed for almost 3 years; it is funded by the Australian government
- huge volumes of data will never be used or read by a human, only by automated mining processes
- research data must be stored and preserved, because it may no longer be possible to recreate them; they must be kept so that they can be reused, so that new conclusions can be drawn from them and so that scientists have them available
- this must be done collaboratively; a single institution cannot do it alone
- who handles storage of scientific data in the Czech Republic: the Academy of Sciences, CESNET
- similar data centres also exist in the UK

Rob Sharpe: Considerations for High Throughput Digital Preservation
Presentation by Tessella: their performance testing of ingest into SDB at FamilySearch

- SDB has been developed since 2002, when the first customer was the National Archives (UK)
- new customer: the UK Parliament
- test with FamilySearch
- 20 TB ingested per day; scanned materials; workflow with antivirus, characterisation (PRONOM, JHOVE) etc.
- 1 package is roughly 1 GB, i.e. 20,000 packages per day
- 2 servers, Dell PowerEdge R710, total price at most 20,000 pounds
- the limiting factor turned out to be the read speed of the disks holding the data at the start of ingest; they therefore needed 130 parallel disks (50,000 pounds)
- writing to tape is also slow; they needed 8 parallel tape writes (30,000 pounds)
- storage costs 100 pounds per TB
- 7.3 PB per year
- conclusion: writing and reading are slow; tools such as JHOVE and PRONOM are fast enough; storage costs also turned out to be high
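The figures quoted in the talk can be cross-checked with simple arithmetic. The sketch below assumes decimal units (1 TB = 10^12 bytes, 1 TB = 1000 GB) and a 365-day year; the function name is only illustrative.

```python
TB = 10**12                      # decimal terabyte in bytes (assumption)
SECONDS_PER_DAY = 24 * 60 * 60


def ingest_figures(tb_per_day, package_gb, cost_per_tb):
    """Derive sustained throughput, package count and yearly totals."""
    bytes_per_day = tb_per_day * TB
    return {
        "mb_per_second": bytes_per_day / SECONDS_PER_DAY / 10**6,
        "packages_per_day": tb_per_day * 1000 / package_gb,
        "pb_per_year": tb_per_day * 365 / 1000,
        "storage_cost_per_year": tb_per_day * 365 * cost_per_tb,
    }


figures = ingest_figures(tb_per_day=20, package_gb=1, cost_per_tb=100)
for name, value in figures.items():
    print(f"{name}: {value:,.1f}")
```

Under these assumptions 20 TB per day corresponds to a sustained average of roughly 231 MB per second, 20,000 packages per day and 7.3 PB per year, which matches the numbers above; at 100 pounds per TB the yearly storage bill would be about 730,000 pounds.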

For the ingest of data from the FamilySearch project they had to sustain a throughput of 20 TB of data per day while keeping adequate procedures for processing the data according to the requirements of OAIS and of the client. The project was about identifying the bottlenecks when ingesting large volumes of data.

Processes such as generating or verifying hashes, identifying formats and extracting technical metadata usually require a fast storage system, especially at large volumes. In the FamilySearch project they want to ingest into SDB (content acquisition, content preparation, ingest, fixity check, content metadata integrity check, characterisation, i.e. format identification and validation and extraction of technical metadata) at up to 700 MB per second.

They looked at how to parallelise such a massive workflow efficiently while keeping costs to a minimum. According to their findings, parallelisation makes it possible to work around performance problems of tools such as DROID and JHOVE; contrary to their expectations, overall software performance was not a problem. The bigger problems are in the HW, which has to be able to write fast enough.

In other words, the bottleneck was in the HW and in moving data from place to place rather than in the performance of the digital preservation tools.

Benefit for NK

No need to worry about the performance of SW such as DROID or JHOVE.

Ross King: Evolving domains, problems and solutions for LT DP
- information on projects such as SCAPE
- Research on Digital Preservation within projects co-funded by the European Union in the ICT programme: http://cordis.europa.eu/fp7/ict/telearn-digicult/report-research-digital-preservation_en.pdf; Stephan Strodl, Petar Petrov, Andreas Rauber (all Vienna University of Technology, Austria). A nice "Timeline for preservation projects"; a whitepaper about the past of European DP
- funding spent on DP research is gradually growing; projects and funding will not solve anything
- ARCOMEM: web archiving, a socially driven web preservation model
  - social web analysis
  - archive enrichment
- ENSURE: evaluation between cost and value, automation of the preservation cycle; testbeds: healthcare, clinical trials, financial services
- SCAPE
  - preservation planning and action workflows, and how to make them scalable
  - creating an infrastructure for scalable preservation actions
  - development of a policy-based preservation planning tool with automated preservation watch
  - 3 testbeds: web archives, large-scale repositories, research data sets
- all the projects will produce prototype SW
- digital lifecycle approach
- preservation planning plays a role in all these projects, together with virtualisation
- slide with DP trends of recent years
- other projects listed:
  - Ensure
  - Scape
  - Wf4Ever, http://www.wf4ever-project.org/about
  - Timbus: SW is not enough; it focuses on the context of organisations; LTP is not only about objects but also about services etc.
  - Totem

Benefit for NK

Follow projects in the area of long-term preservation of digital data. The latest EU projects such as SCAPE will speed up the development of concrete tools for long-term preservation of digital data.

Record keeping in temporary command settings, Erik Borglund
- preservation of documentation of crisis situations arising from the work of the police etc.
- how to capture context: flipcharts, videos and notes can be kept; the context of analogue documents is not a problem, the problem is with digital items and conversations
- the national archive should take care of this, but it accepts only paper documents or, for example, photos from the meeting venue
- open question: is archiving records material the same as archiving the course of proceedings in digital form?

Web archiving session

BnF
- 200TB of web-archived data
- 15 million ARC files; they have to be characterised and validated, which is time-consuming
- stored in a shared repository
- SPAR (the LTP system of the National Library of France) has a capacity of 16PB
- they use JHOVE2 for characterisation and are building a module for ARC files
- they do not want to characterise and validate the content of ARC files, only to identify formats

- PREMIS in METS would be too long, so they will record only metadata at the level of the information package (AIP) that are the same for the whole package; i.e. a property is expressed once and then only the list of files it applies to is added, instead of repeating the information for every file
- they created a special metadata format for this
- they can therefore ask the LTP system "give me all information packages that contain format XY" etc.; there is no need to index the metadata of the contents, which would take a long time; they take the same approach for digitised books
- different DP policies and validation levels for different types of web archive data: complete harvests vs. thematic harvests
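The package-level approach (state a property once, then list the files it applies to) can be sketched as follows. This is only an illustration of the principle, not the BnF metadata format; the function names, package identifiers and file names are invented.

```python
from collections import defaultdict


def summarise_package(files):
    """Group files by format so each format is stated once per package.

    files is an iterable of (path, format) pairs.
    Returns a dict mapping each format to the sorted list of matching paths.
    """
    by_format = defaultdict(list)
    for path, fmt in files:
        by_format[fmt].append(path)
    return {fmt: sorted(paths) for fmt, paths in by_format.items()}


def packages_containing(packages, fmt):
    """Answer 'give me all information packages that contain format XY'."""
    return sorted(pid for pid, summary in packages.items() if fmt in summary)


# Hypothetical packages
packages = {
    "aip-001": summarise_package(
        [("a.html", "text/html"), ("b.html", "text/html"), ("c.png", "image/png")]
    ),
    "aip-002": summarise_package([("d.pdf", "application/pdf")]),
}
print(packages_containing(packages, "image/png"))
```

The query only touches the per-package summaries, so no per-file metadata index is needed.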

NLNZ
- 2 harvests, 20 TB in total
- they are dealing with metadata: how much metadata is too much and how much is too little
- library policy says that as much metadata as possible must be stored, but that would be a problem in terms of metadata size
- for selective web harvests they have a finished workflow (WCT); everything is catalogued

IA
- 16 billion URLs
- the oldest from 1996
- growth is 3TB per day, 1PB per year

Euan Cochrane, Dirk von Suchodoletz: Replicating Installed Application and Information Environments onto Emulated or Virtualized Hardware

- capturing and preserving a whole environment on emulated HW
- e.g. take the desktop environment of the prime minister and store it in the archive
- rendering problems
- computer forensics
- an option for preserving scientific data and records
- it is all about how to replicate an HDD and run the environment stored on it in a virtual environment
- solution: they pulled the HDDs out of several old PCs > identify the HW requirements (HDD analysis > estimate of requirements; this is automatic, the information is part of every PC environment) > select emulation/virtualisation SW (tool registry such as TOTEM from the KEEP project) > convert the HDD into a disk image suitable for emulation > try booting the disk image on the emulated HW > add drivers
- problems with licences, personal data protection and authenticity (20 % of things change, colours etc.)
- QEMU, SPARC processor emulator

Klaus Rechert: Remote Emulation for Migration Services in a Distributed Preservation Framework
- use of emulation as a tool for migration
- migration tools are often not available for certain formats
- the digital object is inserted into an emulated environment (virtual machine); we then see it inside the emulated system, can open it in the original or another suitable application, save it in a different format and store it again in the virtual machine

Bram Lohman: Emulation as a Business Solution: the Emulation Framework (KEEP project)

- emulation framework: 7 emulators, 6 platforms (x86, Amiga, etc.), 23 file formats
- a solution for managing emulation tools
- setup of emulation processes
- an environment that contains emulators; if we load an application or a file into it, it should run as in the original environment
- the environment also contains a tool that shows, based on PRONOM, what format a file is and what environment is needed to run it; the environment can then be prepared directly and the file run in it; the SW image is loaded from the OPF application database that is being built

Geoffrey Brown: Developing Virtual CD-ROM Collections: The Voyager Company Publications

- publications of one particular publisher on CD, interactive applications for Mac; of 200 published titles only about 50 are available today
- emulation on today's systems
- HDD snapshot directly in the emulator, i.e. one click and very fast
- SheepShaver emulator

Evaluation of Danish large migration project
- before 1998 they had no formats laid down by law
- between 2005 and 2008 they introduced standards
- the evaluation concerns the established standards and migration into them at the national archive
- the evaluation was done for the funder
- between 2005 and 2008 they spent 30 person-years on migration; they had 10-15 people on it; 190 thousand USD was invested; total costs 26 million USD

It is not much data; what they actually migrated was about 1777GB.

- various parts of the archive: tapes, population data, data on CD-R, registries and electronically filled-in data
- they could not read all files, especially on tapes
- 5 different types of tape
- some had to be rescued at great expense

Total cost of producing the preservation standard: 10 man-years, 12 thousand USD, including a manual and implementation recommendations

Pilot: project planning and management, and verifying information packages

The aim of the pilot was to obtain a better budget and project plan

Some problematic data in old formats, such as old databases, need smart people doing repetitive, long-running work; they needed good knowledge management to make this efficient

Migration method: they wrote requirements for a tool and a description of how manual migration should be done

Data preparation (restructure data and register metadata of IPs) and preparation of documentation for the migrated IPs

Software development: in-house development

They needed 50 person-years for 1

Conclusions

- migration of standard data is cheaper, migration from some standard tapes is cheaper, etc.
- most of the costs (80 %) fell on non-standardised data, in producing the migration software; developing a tool for migrating heterogeneous or non-standard data is the most expensive part
- what they learned: they had not analysed the old data sufficiently
- project management was loose; they lost money
- knowledge management: a good description of old data and of all their types, generations, locations, etc. does not exist at NK and we will have trouble with this; migration of old data at NK will be problematic

Angela Dappert: robust migration workflow for offline media
- what is an archival object (nice slide): a CD is not an archival object, for them it is a hand-held carrier; better is a bit-stable object, which can have backups etc., and from there an archival object, which has additional metadata (logical preservation)
- a CD is not searchable, cannot be replicated easily, has a large manual overhead, and its rendering technology becomes obsolete very quickly
- Endangered Archives project: optical disks, CD-R, external HDs, tapes; 67 terabytes in total
- the offline hand-held carriers in the Endangered Archives project were very variable; they contained data with DRM and under copyright, plus a number of similar problems
- options they decided between for each data source:
  - a disk image: one file that contains everything on the carrier
  - or extraction of only some files
- how important is the carrier itself? We need some information about it; it may hold traces of deleted data, which we may want to keep. They made disk images of everything possible: hybrid DVDs, audio discs that also held data, etc.
- which disk image format should be used? Not a single format for all data; a specific disk image format for each type
- they did it with disk copying robots; sometimes large-scale disk copying robots could not be used, because they are good at producing CDs but not at ripping data from CDs
- they built their own application with disk stacks and some smaller robots; they tried LIFO and FIFO and in the end used FIFO, because LIFO had problems with the CD lifter

At KB they thought through a fairly complex workflow, how to describe it, etc.

Each robot had its own PC.

They had problems with a number of things; see the presentation.

They did not find good SW for image management, only command-line tools, and non-technical staff would make a lot of mistakes with those.

It takes a lot of people before the content gets online.

The staff must be well educated and flexible, but also able to do tedious jobs: systematic, patient.

NOTE: important for NK, where the transfer of data from discs will also have to be addressed and has in part already been addressed.

KEEP project, Antonio Ciuffreda: towards integrated migration environment

- disk transfer tool: converts a disk into an image file; the result also contains additional metadata about the file system, the MD5 of the file, etc.
- KEEP is producing a Transfer Tool Framework
- magnetic media: disk transfer tools for floppy disks, commercial and open source
  - Disk2FDI: commercial DOS tool; very accurate floppy disk image; it takes 1 hour and the result is very large, ten times bigger than the floppy disk itself. About 2260 disks were tested; they tested emulation
  - Catweasel: commercial tool; it is a PCI card, runs on Linux and Win XP, has a GUI. High error rate but faster; image file quality was low
  - NibTools: free tool; G64 and D64; covers only C64; DOS, Win, Linux, but command line only. It needs a Commodore disk drive and special cables. They tested a few disks; about half then did not work in the emulator
- optical media: they used 5 transfer tools, all on the same CDs, DVDs and games
  1. Alcohol 120%: commercial; can bypass DRM etc.; supports Win systems. 12 of 13 worked; 2MB per second
  2. Daemon Tools: commercial; several image file types (ISO, MDS, MDF); supports Win; three of 13 did not work
  3. CloneCD: commercial; uses IMG or ISO; bypasses SafeDisc 3 protection; supports Win; has a GUI; 11 of 13 were OK
  4. BlindWrite: commercial; supports DVD, Blu-ray, WM, Xbox and other special discs. It generates ISO and some proprietary formats; one of the ISO images did not work; download speed
  5. ImgBurn: does not read subchannel information into the image file (a film cannot be seeked etc.); it is open source; generates DVD, BIN, CUE, IMG; Win and Linux; 4 did not work; it is fast

Conclusions

Magnetic media: commercial and non-commercial tools do not differ in performance; Disk2FDI is accurate but very slow. KEEP will use NibTools.

Optical media: copy protection must be considered; the tools have similar performance; there will always be errors in the images, between 30 and 10 per cent; BlindWrite can handle game discs (Xbox etc.). KEEP will use ImgBurn because it is open source.

For complex images BlindWrite is better.
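Recording a checksum alongside each disk image, as the transfer tools above do, can be sketched in a few lines. This is a minimal illustration using Python's standard `hashlib`, not the KEEP Transfer Tool Framework; the record layout and the temporary stand-in image are assumptions.

```python
import hashlib
import os
import tempfile


def describe_image(path, chunk_size=1024 * 1024):
    """Return basic fixity metadata for a disk image file."""
    md5 = hashlib.md5()
    with open(path, "rb") as image:
        # Read in chunks so that large images do not have to fit in memory
        for chunk in iter(lambda: image.read(chunk_size), b""):
            md5.update(chunk)
    return {
        "file": os.path.basename(path),
        "size_bytes": os.path.getsize(path),
        "md5": md5.hexdigest(),
    }


# Stand-in for a real image: 1 KiB of zero bytes in a temporary file
with tempfile.NamedTemporaryFile(suffix=".img", delete=False) as tmp:
    tmp.write(b"\x00" * 1024)
    image_path = tmp.name

record = describe_image(image_path)
os.remove(image_path)
print(record)
```

Recomputing the checksum later and comparing it with the stored value shows whether the image has changed since the transfer.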

Benefit for NK

Consider whether NK should really run a project to migrate the content of CDs and DVDs to online media. The talks present concrete experience with robotic processing and show what problems the teams had with designing the workflow, choosing the type of ISO image, etc.

BnF: web archiving
They have three layers:

- harvest definition (collection)
- harvest instance (crawling metadata)
- ARC files

A collection would be, for example, the selective "elections 2000" crawl, followed by the individual harvest instances and then the ARC files.
They collect logs, config and reports.
These are also stored in ARC: a special metadata ARC for each crawling instance.

PREMIS

Object, agent, event

Objects:
1. ARC files and metadata ARCs
2. harvest instances

Harvest event in PREMIS: event "creation of content files"

Events: reports as extensions of the event: host report and harvest report

Agents: administrator, SW, institutions, organisations that perform the harvest
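The relationship between the three layers and the PREMIS entities can be sketched as a small data model. This is only an illustration of the structure described above, not the BnF implementation or the PREMIS schema; all class names, field names and file names are assumptions.

```python
from dataclasses import dataclass, field


@dataclass
class Agent:
    name: str
    kind: str  # e.g. "administrator", "software", "organization"


@dataclass
class Event:
    event_type: str
    agents: list
    reports: dict = field(default_factory=dict)  # e.g. host report, harvest report


@dataclass
class HarvestInstance:
    collection: str    # harvest definition the instance belongs to
    arc_files: list    # content ARC files produced by the crawl
    metadata_arc: str  # ARC holding logs, config and reports
    events: list = field(default_factory=list)


crawler = Agent("crawler", "software")
library = Agent("library", "organization")

harvest = HarvestInstance(
    collection="elections-2000",
    arc_files=["harvest-0001.arc", "harvest-0002.arc"],
    metadata_arc="harvest-metadata.arc",
)
harvest.events.append(
    Event(
        "creation of content files",
        agents=[crawler, library],
        reports={"host report": "hosts.txt", "harvest report": "crawl.txt"},
    )
)
print(harvest)
```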

ContainerMD

http://bibnum.bnf.fr/containerMD-v1/documentation/containerMD-v1.html

special metadata for items from the web archive

http://bibnum.bnf.fr/containerMD-v1

Different SLAs for different types of material and for different data from different harvests; users of the shared repository expect different benefits; skills for different formats need not be duplicated within the institution.

Next year a JHOVE2 module for WARC should also exist.

Memento: metasearch

Benefit for NK

Their web archiving model could be used at NK.

Cost models: Danish National Library + TU Wien
- Stephan Strodl, TU Vienna: they have their own cost model, but apparently only for small-scale automated preservation action costs
- Danish National Library: they built their own model, which should be universal and usable anywhere
- they measured the cost of submission according to the PAIMAS standard
- when calculating costs they use OAIS and PAIMAS: they map activities onto these models and then estimate the cost of processes accordingly
- costmodelfordigitalpreservation.dk
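The principle of mapping activities onto a reference model and costing them can be shown with a toy calculation. This is not the Danish cost model; the activities, hours and hourly rate below are invented for illustration, and only the OAIS functional entity names are taken from the standard.

```python
def estimate_costs(activities, hourly_rate):
    """Sum the cost of activities per OAIS functional entity.

    activities is a list of (functional_entity, activity_name, hours).
    """
    totals = {}
    for entity, _name, hours in activities:
        totals[entity] = totals.get(entity, 0.0) + hours * hourly_rate
    return totals


# Hypothetical activities mapped to OAIS functional entities
activities = [
    ("Ingest", "receive submission", 2.0),
    ("Ingest", "quality assurance", 3.0),
    ("Archival Storage", "write to tape", 1.0),
    ("Preservation Planning", "format watch", 4.0),
]
costs = estimate_costs(activities, hourly_rate=50.0)
print(costs, sum(costs.values()))
```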

Benefit for NK

Relevant to project 0136, which dealt with options for estimating the costs of long-term storage.

Meet RODA, a Full-Fledged Digital Repository for Long-Term Preservation

- originally a project of the Portuguese national archive, which we have been following for several years. RODA is now supported by an independent company, which also partly develops it further. So far RODA supports only the archival metadata format (EAD), but further development should also cover library formats
- RODA is now part of the SCAPE project, where the system can be developed further and scaled for use in massive production
- http://redmine.keep.pt/projects/roda-public

Benefit for NK

Follow further development; possibly relevant for the INCAD + KNAV projects as well. For developing LTP for smaller institutions this could be an interesting alternative in future.

Foreign Business Trip Report

Name of trip participant: Mgr. Andrea Fojtů (AF)

Workplace (per organisational structure): 15 Department of Long-term Preservation of Digital Data (ODODD)
Position: 152 Digital Repository Content Management Department
Purpose of trip: participation in the conference Aligning National Approaches to Digital Preservation
Place (city): Tallinn
Place (country): Estonia
Dates (from-to): 22-26 May 2011
Detailed schedule:

Monday 23 May: registration for the conference, guided tour of the National Library; Keynote Address by Laura Campbell, Library of Congress (USA); Panel 1: Technical Alignment; Panel 2: Organizational Alignment

Tuesday 24 May: Keynote Address by Gunnar Sahlin, National Library of Sweden; Panel 3: Standards Alignment; Panel 4: Legal Alignment; Breakout Sessions for panels 3 & 4

Wednesday 25 May: Panel 5: Education Alignment; Panel 6: Economic Alignment; Breakout Sessions for panels 5 & 6; Synthesis; Closing remarks

Fellow travellers from NK: PhDr. Bohdana Stoklasová (BS), Ing. Tomáš Svoboda (TS)

Funding: IOP NDK
Trip goals: attendance at a conference with international participation; making contacts in the area of long-term preservation of digital documents and electronic legal deposit; a more detailed insight into long-term preservation of digital documents, especially in national libraries

Fulfilment of trip goals (specifically): The conclusions of the conference Aligning National Approaches to Digital Preservation largely mirror the conclusions of the workshop The Future of the Past: Shaping new visions for EU research in digital preservation (report available at http://cordis.europa.eu/fp7/ict/telearn-digicult/future-of-the-past_en.pdf), for example regarding the missing economic model for the commercial sector, which would treat long-term preservation as an integral part of all its processes. Contact was established with staff of the national libraries of Estonia and Finland (staff working on web archiving and on long-term preservation in general).

Programme and further details: The main goal of the conference was to align national approaches to long-term preservation of digital documents across all areas, from technical, organisational and educational to standardisation, economic and financial.

Materials brought back: conference programme, leaflets of exhibiting companies (Tessella, Equella, Guardtime) + other materials, notes

Report submitted: 8 June 2011
Signature of submitter

Signature of supervisor

Posted on the intranet
Received by the International Department

Appendix to this report: Conference notes in English

Appendix: Conference notes in English

Exploring What We Can Do Together: Strategic Alliance for International Collaboration, Laura Campbell

- 185 digital preservation partners in more than 25 countries (education, research, LAM)
- strategic goals: National Content Stewardship Network (national digital collection, technical architecture, public policy, outreach)
- NDIIPP Content Domain Map: a mind map of geospatial, audiovisual, image & text and web content
- then & now: cognitive surplus vs digital libraries/digital preservation
- solution framework: actively working together, special interest groups, establishing a common index, international digital collection (freely available)

PANEL 1: Technical Alignment (The role of testing), Prof. Dr. Michael Seadle (Panel Head)

- to collaborate on requiring and implementing rigorous and independent tests

DNB Contribution to the Tallinn Alignment, Sabine Schrimpf

- key theme is infrastructure
- network of hardware and software that permits operation of application SW
- question of interoperability is crucial (standards, technical specifications)
- SW elements
- components of the DP infrastructure were compared to pallets on railroads
  o source: PARSE.Insight Roadmap 2020
- persistent ID resolvers, certification process
- kopal (ingest, koLibRI)
- nestor (German Network of Expertise)
- DP4lib: Digital Preservation as a Service; reduce dependency between components
  o redundant storage at different locations
  o koLibRI modules
- LuKII: set up as an economical LOCKSS network in Germany
- SHAMAN
- APARSEN: wants to bring coherence and cohesion to digital preservation research
  o trends in DP research projects
  o modular DP systems
  o distributed as SOA
  o elimination of technology dependencies

EDINA THE UK LOCKSS Alliance Adam Russbridge

EDINA offers underlying technical support amp coordinationthreats to digital stewardship

o failure (media HW SW network format obsolesce natural disastereconomicorganization failure)

o attack (insideroutsider)o operator error

source Requirement to Digital Preservationprojects PEPRS and PECAN help identify coverage and requirements for DP

Public testing Michael Seadle

- traditional physical archiving relies heavily on trusted institutions
- distrust, not trust, needs to be the basis of digital archiving; testing therefore plays a key role
- goals of testing: demonstrate functionality, reveal weaknesses, provide data for planning improvements
- key issues for testing: integrity, authenticity (can the origin or genuineness be shown), usability (can migration/emulation be demonstrated), access, financial integrity
- DrWho (drwho1com)
- bit stream testing is the most important; authenticity and usability may be impaired
o the type of storage media, the number of copies + the frequency of checking and replacement get us to the relevant results
o no reliable metrics exist, however (what is an acceptable loss, etc.)
- without well documented, peer reviewed, publicly available test results, librarians are buying archiving systems on faith
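The bit stream testing mentioned above (several copies, checked at some frequency) can be illustrated with a minimal sketch. It assumes three in-memory copies and uses SHA-256 with a simple majority vote; the location names and the `audit_copies` helper are hypothetical and do not come from the talk or from any specific archiving system.

```python
import hashlib
from collections import Counter


def sha256_of(data: bytes) -> str:
    """Return the hex SHA-256 digest of a bit stream."""
    return hashlib.sha256(data).hexdigest()


def audit_copies(copies: dict[str, bytes]) -> dict[str, bool]:
    """Map each storage location to True if its copy matches the majority digest."""
    digests = {location: sha256_of(data) for location, data in copies.items()}
    # The digest shared by most copies is taken as the reference value.
    majority_digest, _ = Counter(digests.values()).most_common(1)[0]
    return {location: digest == majority_digest
            for location, digest in digests.items()}


# Hypothetical copies at three locations; the third has a simulated bit error.
copies = {
    "site_a": b"archived bit stream",
    "site_b": b"archived bit stream",
    "site_c": b"archived bit strean",
}
report = audit_copies(copies)
```

A real audit would read the copies from storage and log results over time, which is exactly the kind of published test data the talk asks for.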

Presentation without a title Andy Rauber

- evaluation vs. testing vs. benchmarking
- DP testing today is evaluation rather than testing, and far from benchmarking (a few tests, but nowhere near a definition of benchmarks)
o existing evaluations are not repeatable
o focus on the simple things
o building the frameworks before having clear test scenarios
- necessary to move towards comparative benchmarking
- what is needed: a commitment that we want a culture of benchmarking and comparative evaluation; an understanding of what we want to benchmark; benchmark data + ground truth; measurement scales and measures that remain constant; knowledge bases to collect these

Organizing digital preservation on an international level Michelle Gallinger (NDIIPP)

- focus on a national DP agenda
- community driven, action oriented (National Digital Stewardship Alliance)

o present a distributed national digital collection for the benefit of citizens

The European Research Arena David Giaretta

- technologies: GEANT, EGEE/EGI
- EU research projects: TIMBUS, BLOG4EVER, SCAPE, ENSURE, APARSEN, ARCOMEM, WF4EVER
o SCIDIP-ES (2011-2014)
- Alliance for Permanent Access (APA) formed as a legal entity 3 years ago
o opportunities for networking
- ISO 16363: Audit and Certification of Trustworthy Digital Repositories
- ISO 16919: Requirements for Bodies Providing Audit and Certification

Observation from the MetaArchive Cooperative program Martin Halbert

- distributed DP programs differ from other programs
o replication of content, distribution of these replicated copies to distinct geographical locations, and network organization to connect these replicated copies
- MetaArchive established in 2003, funded by NDIIPP
o seeks to foster broader awareness of digital preservation issues
- IIPC members are all institutions that focus on WA
o 39 members: national and university libraries + other organizations (Internet Archive)
o ISO standard WARC format for web archives + Heritrix and NutchWAX
o growing membership (Africa & South America)

DAY 2

Keynote address: International and National Collaboration in the Digital Age, Gunnar Sahlin

- 2012: a new law for e-deposit
- Samsök (search together) in 2005 (upgrade, new system)
- Swepub and long-term preservation
- consortium of the Swedish research libraries for licensing e-journals and databases (ICOLC)
- Open Access and e-publishing (all universities have their repositories for e-pubs)
- NL as aggregator for Europeana: TEL, Apres, Athena, EUscreen
o common system for the preservation of digital materials
o common search portal for materials from the Swedish National Library and the Swedish National Archive

Raivo Ruusalepp

- standards: RAC, DSA, CIDOC (CRM), PAIMAS, ISA (DG), DDI
- use of information security standards for digital preservation
- information security: administrative and technical (physical = data security vs. IT = communication)
- a company implemented security measurement with typical cyber crime scenarios
- survey of security
o provision for information security in national legislation and development plans (12 of the respondents: ISO 27000 series; only 2 formal audits, the rest are looking into it; ½ of the respondents do not use standards or formal measurements to control information security)
o IT & disaster plan: 65 (data recovery from the off-site location tested: 0)
- alignment
o better use of community standards for information security and preservation
o agreement on security requirements

Standards based approach to preservation planning, Matthew Woollard

- ISO 27001: very expensive implementation, 100 000 pounds
- Basic: Data Seal of Approval Guidelines (helps you understand your business better)
- Audit and Certification of Trustworthy Digital Repositories
- Memorandum of Understanding to Create a European Framework
- ISO 16363 external, DSA or ISO 16363 self audit

Best Practices amp Standards Bram van der Werf

- self assessment, trust
- audits/certification, trust
- ISO 30300 (draft): Records Management

PANEL 4 Legal Alignment

Legal deposit & Web Archiving, Adrienne Muir

- legal deposit provisions: purpose, scope, deposit mechanisms, roles & responsibilities & liability, access provisions, sanctions
- implementation: definitions, scope, offline/online, freely available/pay wall, technology neutral, incremental
- legal deposit vs. voluntary (interim/hybrid approach, model agreements and licenses, flexibility)
- other legal issues: intellectual property rights, preservation, access, unlawful material, privacy/data protection
- voluntary approaches have disadvantages but may be necessary and can be useful

Breakout Session

- standards bring along alignment by themselves, but only if you use them to the full, not halfway
- depends on the community (users): enforceable or voluntary compliance (standards)
- next steps for standards alignment
o corpora as benchmarks
o export/import completeness
o educational standards
o validation tools
o accredited training courses to accredit auditors
o framework standards

DAY 3

PANEL 5 Educational Alignment

- key elements of the DCC Curation Lifecycle Model
- Framework for the Education Alignment (USA: grad programs for digital preservation, new models for grad programs, internship programs (diverse knowledge), workshops)
o sharing tools
o national programs
- related issues to digital preservation
o nature of costs and business models
o strategies for selection & appraisal
o ground roles and responsibilities
o effectiveness and demand for services

- focus of the panel: factors influencing the actual sustainability of a digital archive
- 2 considerations: collaboration + user demand
- challenges + gaps: span national boundaries, public + private funding, education, exportation, DP certification, competition, funding gaps, policy, selection criteria, roles & responsibilities, standards
- Magazzini Digitali: e-legal deposit in Italy
- PADI: a failed project (discontinued in 2010)

Presentation without a title Neil Grindley

- how much does it cost to manage information? What institutional financial strategies are required to facilitate effective preservation? What general economic frameworks are required to enable information to persist and be accessible?
- JISC 2010 Infrastructure for Education and Research Programme
- archival storage and preservation activities constitute a very small proportion of the overall costs: 1531 access, 55 outreach, acquisition, ingest
o approx. 333 Euros for a set of 1000 records
- KRDS2, p. 83: future tool development supporting automation of ingest

Sustainable Preservation in North America ADPNet amp Friends Aaron Trehub

- solution: distributed digital preservation (in at least 3 copies vs. LOCKSS 6 copies)
- DPP + LOCKSS = PLN, open SW developed at Stanford
- MetaArchive, COPPUL (Canada), ADPNet (www.adpn.org)


Preservation in a Digital Age Jay Verkler FamilySearch (USA)

- Tessellou pro SDB budou k 1 - - - data loss is intrinsic gt - - - och

- preservation as a service

Curation of the End-of-Term Web Archive Classification and Metrics Kathleen Murray Lauren Ko and Mark Phillips University of North Texas (USA)

- EOTCD archive: http://research.library.unt.edu/eotcd/wiki/Main_Page
- 16 TB of data (not WARCs)

pak to pospojo

DAITSS Grows Up Migrating to a Second Generation Preservation System (Focal) Priscilla Caplan and Carol Chou Florida Center for Library Automation (USA)

- DAITSS 1 could not be installed elsewhere

instructions

- v procesech

- - PSP

- refresh function
- disseminate: do a refresh and export the new AIP as a DIP
- withdraw: remove the AIP from storage, retaining provenance

-

1

- - 80TB -

A Community Driven Micro Services Architecture Supporting Long Term Digital Preservation Mark Evans and Bill Steel Tessella Inc (USA) and Robert Sharpe James Carr Alan Gairey and Jonathan Tilbury Tessella plc (UK)

s NDIIPP v micro-services

spojeno do procesu- lze pak libovol

- - - -

-

workflow - -

- bit-streamu - cloudu (S3

httpawsamazoncoms3) SaaS funkcionalitu bude brzy ti tenancy

funkcionalitou policy apod

Note from the USA

44 TB of SIPs in the test (10 MB JP2 files)

SDB community: founded in 2008

System

Lisa LaPlant and Blake Edwards US Government Printing Office (USA) FDsys OAIS dig MODS - z

<part>2</part>

DMD data model definition z

httpwwwgpogovfdsyssearchpagedetailsactionst=pragueampgranuleId=amppackageId=DCPD-200900228

----------------------------------------- Preservation Starts from the Beginning Michael Wash US Department of Transportation (USA)

Autographic Kodak 1916 na druhou

following the No. 3A Autographic Kodak Special of 1916, which was the first rangefinder camera. It had a Kodak Anastigmat f/6.3 lens and a Kodamatic shutter with speeds from 1/2 to 1/200 sec, plus bulb and time mode. httpcamerapediawikiacomwikiNo_1A_Autographic_Kodak_Special

Kodak Advantix

zastavil

Colorite A Flexible Cross-Platform Software Solution for Automatic Image QualityAnalysis Using Arbitrary Targets Henrik Johansson National Library of Sweden (Sweden)

- a - -

o Target se detekuje automa se aby tento proce

o TIFF, JPEG, JP2, PNG
o state-of-the-art feature-based image matching algorithm, ImageMagick

o Alg

t o o v budoucnu vypnout aby se proces urychlil pro BATCH

proce o

Henrika Johanssona What if the Image Quality Analysis Rates My D Dietmar Wueller Image Engineering (Germany)

- Golden Thread
- UTT target

o

-lens reflex

Establishing Resolution Requirements for Digitizing Transmissive Content A Use Case Approach Michael Stelmach Library of Congress Don Williams Image Science Associates LLC and Steven Puglia National Archives and Records Administration (USA)

-

- Based on 10 SFR limiting resolution criteria how much the image information will be captured

o first half of the 20th century: 1200-1600 PPI
o second half of the 20th century: up to 2800 PPI

Digitise More Pay Less Optimising the Workprocess for both Heritage Institute and Imaging Provider Olaf Slijkhuis Pictura Imaginis (the Netherlands)

- ument kvalita parametry atd - -

- trolu

obrazu

Discussion with FamilySearch, 20. 5. 2011

also on Digital Preservation issues

will have to provide free of charge

and synchronize

an effort on their side to cooperate with archives and libraries

CO-FINANCED BY THE EU STRUCTURAL FUNDS (EUROPEAN REGIONAL DEVELOPMENT FUND) THROUGH THE IOP

Business trip report
Project: Creation of the National Digital Library

CZ 10611000706386

Name of trip participant: Jan Hutař

Workplace according to the organizational structure: ODF 81

Position: head of department

Reason for the trip: attending the iPRES 2011 conference

Place (city): Singapore

Place (country): Singapore

Date (from-to): 30. 10. - 5. 11. 2011

Detailed schedule: 30.-31. 10. flight Prague > Dubai > Singapore

1. 11. start of the conference, tutorials

2. 11. - 4. 11. conference

4.-5. 11. return flight Singapore > Dubai > Prague

Fellow traveller from NK: Mgr. Marek Melichar (funded from project 0136)

Funding: IOP, Creation of the National Digital Library

Relation to the project: obtaining new information on digital preservation issues and on projects in other libraries; consultations with colleagues and companies

Trip objectives: see relation to the project; use all outputs for planning and running the NDK project, and for the future handling of digital preservation in NK/NDK

Fulfilment of trip objectives: fulfilled; see the detailed notes below and the proceedings at SPS


Further detailed information: SUMMARY AND BENEFIT FOR THE NDK PROJECT

- a noticeable rise of long-term preservation solutions based on emulation (in previous years it was migration); the two approaches seem likely to complement each other
- a shift towards the preservation of complex data such as databases; NK does not address this yet
- many papers are usable for NK and NDK (web archiving and preservation at the national library of France, audits, emulation, information on the SDB system (Tessella) and on the RODA system, certification, see Rouchon, etc.)
- information on problems with, and solutions for, the use of tools such as JHOVE and PRONOM in a real environment
- 2 papers on backing up optical discs, a current problem at NK as well; ideally follow the procedures described in the proceedings
- a clear need for international cooperation and for adherence to standards, so that such cooperation is possible
- for details see below

Support of project publicity: N/A

Related materials

Material / Storage location

conference proceedings: SPS, folder with trip reports

Report submitted on: 15. 11. 2011

Signature of the report submitter

Date / Signature

Signature of the superior: 15. 11. 2011

Uploaded to the intranet


Received by the international department

Seamus Ross: digital curation and preservation
- preserving data sets: use of statistical databases and research data vs. preservation of textual information
- personal data: no longer just photos in a box (Flickr etc.)
- banks: a lot of personal data, future use by historians, must be preserved; the institutions are doing so as well
- see mckinsey.com, big data full report (PDF)
- why do DP (slide 16): future generations expect it; for historians and scientists, so that they have some sources, a record of the present for the future; an information ecosystem to enable storytelling
- emphasis moves from the preservation of textual information to the preservation of complex databases

A capability model for DP, Ch. Becker et al.
- research within the SHAMAN and SCAPE projects
- SoS: systems of systems
- 3 kinds of systems (UML)
- a DPS, with DP as a functional requirement
- an SoS: a business system, a system within a system; data then flow into the DPS
- a system where DP is not a functional requirement but which does it anyway (DP-ready system): a business system with DP functionality
- but how to get DP into enterprise systems? A model for implementing DP in any system, within the SHAMAN project: a capability-based reference architecture
- governance, business and technical (operation) capabilities: a basis for decisions and for assessing the current state
- capability maturity model (CMM): processes for assessment and improvement, taken from SW development

Olivier Rouchon: certification and quality at CINES
- they store theses, digitized items, multimedia, documents, scientific data sets
- data centre for the whole of France
- they have format experts (XML); 11 people
- 15 TB of data
- certification: by national law CINES is the national centre for DP of theses; they have a department, people and money for it; for the procedure and preparation see below
- preparation for certification: testing DRAMBORA, DSA, TRAC, ISO 16363 and ISO 16919
- step: 2009 DRAMBORA audit; 2 risk reviews per year on how their mitigation is progressing
- step: formalization of business processes, 14 processes according to ISO 9001
- management, operational and support processes (presented at iPRES 2010)
- 2009 external pre-audit: 2 people, 19 man-days, based on all available standards, by means of documentation review and interviews
- 2010 SIAF audit: 4 months; carried out by the National Archives of France for every archive that stores public data; they audit every 3 years; the report had 800 pages


- 2010 Data Seal of Approval: part of the EU framework for audit and certification of trusted repositories (MoU between the three certification activities)
- 2011 within the APARSEN project they also did ISO 16363; together with DANS and UKDA they went through that audit
- first internal, in January and April 2011 (60 man-days), then external, 12 experts (KB, BL, NASA, etc.) in June 2011 (3 days)

MoU: DSA > ISO 16363 as the second step (internal) > ISO 16363 extended
An audit gets faster the more of them you do, i.e. if it is done regularly it is not that time-consuming.

The National Library of New Zealand underwent TRAC certification in autumn 2011.

Andreas Rauber: impact of preservation actions on repositories
- what happens to the repository itself
- simulation of a repository: RepoSim
- for analysis and for testing migration: what happens when files grow, what if the repository holds more format types, etc.
- RepoSim: a flexible simulator, irregular patterns
- so far an internal version: Hibernate, Java, MySQL
- it is possible to specify which formats the repository accepts, their description, ingest settings, hypothetical tools (mainly for migration), and rules for preservation actions (migration into which format, which version, which files, how many times; rules + filters)
- a virtual migration can be run, producing graphs that show how long it will take, etc.
- what to migrate, how, into what and over what period is run virtually, and we see the result
- good for IT and HW planning
- good for planning different scenarios, comparison with the expected development, planning of HW development and investments
- still to be finished: the ability to enter deletion policies, reports, etc.
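The idea of a virtual migration can be sketched in a few lines. This is only an illustration under stated assumptions: the class names, the per-tool throughput figure and the sample holdings are hypothetical and are not taken from RepoSim, whose actual model was not described in detail.

```python
from dataclasses import dataclass


@dataclass
class FormatHolding:
    fmt: str            # format label, e.g. "TIFF"
    file_count: int     # number of files held in this format
    avg_size_mb: float  # assumed average file size in MB


@dataclass
class MigrationRule:
    source_fmt: str       # format to migrate from
    target_fmt: str       # format to migrate to
    tool_mb_per_s: float  # assumed throughput of a hypothetical migration tool


def simulate_migration(holdings, rules):
    """Estimate migration time in hours for each format that has a rule."""
    rule_by_source = {rule.source_fmt: rule for rule in rules}
    estimate_hours = {}
    for holding in holdings:
        rule = rule_by_source.get(holding.fmt)
        if rule is None:
            continue  # no preservation action defined for this format
        total_mb = holding.file_count * holding.avg_size_mb
        estimate_hours[holding.fmt] = total_mb / rule.tool_mb_per_s / 3600.0
    return estimate_hours


# Hypothetical repository content and one migration rule.
holdings = [FormatHolding("TIFF", 10_000, 36.0), FormatHolding("PDF", 5_000, 2.0)]
rules = [MigrationRule("TIFF", "JP2", tool_mb_per_s=10.0)]
hours = simulate_migration(holdings, rules)
```

Changing the file counts or sizes in `holdings` is the toy equivalent of asking what happens when files grow or more formats arrive.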

José Barateiro: Risk assessment in DP of e-science data and processes
- DP as risk management
- ISO 31000: definition of risk management
- similar to DRAMBORA
- there are many standards for risk management
- elaboration of the ISO 31000 methodology into individual steps
- TIMBUS project (http://timbusproject.net); one of the partners is SAP (Germany)

mad talks
- open source SW for LTP: RODA is back, being developed within the SCAPE project; new functions, development plans and the creation of a user community
- 4 posters on emulation: emulation within KEEP, emulation for library reading rooms, OPF ecosystem, registry
- TOTEM: a metadata standard for describing the technical environment for emulation


ANDS, Ross Wilkinson
- data centres in Australia, at least 3 for different areas of life
- ANDS has existed for almost 3 years, with money from the Australian government
- a huge amount of data will never be used or read by a human, only mined by automatic processes
- research data must be stored and preserved because it may no longer be possible to recreate them; so that they can be reused, so that new conclusions can be drawn from them, so that scientists have them available
- this must be done in cooperation; it cannot be done by a single institution alone
- who handles the storage of scientific data in the Czech Republic? The Academy of Sciences? CESNET?
- similar data centres exist in Great Britain as well

Rob Sharpe: Considerations for High Throughput Digital Preservation
Presentation by Tessella: their performance testing of ingest into SDB at FamilySearch

- SDB has been developed since 2002, when the first customer was the National Archives UK
- new customer: the UK Parliament
- test with FamilySearch
- 20 TB ingest per day, scanned materials, workflow with antivirus, characterization (PRONOM, JHOVE), etc.
- 1 package is roughly 1 GB; 20 thousand packages per day
- 2 Dell PowerEdge R710 servers, total price at most 20,000 pounds
- the limiting factor turned out to be the read speed of the disks on which the data are stored at the start of ingest; they therefore needed 130 parallel disks (50 thousand pounds)
- writing to tape is also slow; they therefore needed 8 parallel tape writes (30 thousand pounds)
- storage costs 100 pounds per TB
- 7.3 PB per year
- conclusion: writing and reading are slow; tools such as JHOVE and PRONOM are fast enough; storage costs also turned out to be high

For the ingest of data from the FamilySearch project they needed to ensure a throughput of 20 TB of data per day while keeping adequate procedures for processing the data according to the requirements of OAIS and of the customer. The project was about identifying the bottlenecks of ingesting large volumes of data.

Processes such as generating hashes or checking them, identifying formats and extracting technical metadata usually require, at large volumes, a fast storage system. In the FamilySearch project they want to ingest into SDB (content acquisition, content preparation, ingest, fixity check, content/metadata integrity check, characterization, i.e. identification and validation of formats and extraction of technical metadata) at most 700 MB per second.

In other words, the question is how to parallelize such a massive workflow efficiently while minimizing costs. According to their findings, parallelization makes it possible to get around performance problems of tools such as DROID and JHOVE; overall software performance was, contrary to their expectations, not a problem. The bigger problems are in the HW, which has to be able to write fast enough.

That is, the bottleneck was in the HW and in moving data from place to place rather than in the performance of digital preservation tools.
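The figures quoted from the talk can be cross-checked with simple arithmetic. The sketch below assumes decimal units (1 TB = 1,000,000 MB) and a constant load over 24 hours, neither of which is stated in the notes; the variable names are illustrative only.

```python
SECONDS_PER_DAY = 24 * 60 * 60

TB_PER_DAY = 20        # ingest target from the notes
PACKAGE_GB = 1         # approximate package size from the notes
COST_PER_TB_GBP = 100  # storage cost from the notes

# Average sustained rate needed to move 20 TB in one day (decimal units assumed).
mb_per_second = TB_PER_DAY * 1_000_000 / SECONDS_PER_DAY

# Number of 1 GB packages that make up 20 TB.
packages_per_day = TB_PER_DAY * 1000 // PACKAGE_GB

# Yearly volume and the storage bill it implies at 100 pounds per TB.
tb_per_year = TB_PER_DAY * 365
pb_per_year = tb_per_year / 1000
storage_cost_per_year_gbp = tb_per_year * COST_PER_TB_GBP

print(f"{mb_per_second:.1f} MB/s, {packages_per_day} packages/day, "
      f"{pb_per_year} PB/year, {storage_cost_per_year_gbp} GBP/year")
```

Under these assumptions the daily target corresponds to an average of roughly 231 MB/s, well below the 700 MB per second maximum mentioned in the notes, while the yearly volume matches the 7.3 PB figure and implies storage costs of 730,000 pounds per year.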

Benefit for NK

Do not be afraid of the performance of SW such as DROID or JHOVE.

Ross King: Evolving domains, problems and solutions for LT DP
- information on projects such as SCAPE
- programme: http://cordis.europa.eu/fp7/ict/telearn-digicult/report-research-digital-preservation_en.pdf; Stephan Strodl, Petar Petrov, Andreas Rauber (Vienna University of Technology, Austria); a nice timeline of preservation projects, a white paper about the past of European DP
- funding spent on DP research is gradually increasing. Projects and funding alone will not solve anything
- ARCOMEM: archiving for web archives, socially driven web preservation model
- social web analysis
- archive enrichment
- ENSURE: evaluation between cost and value, automation of the preservation cycle; testbeds: healthcare, clinical trials, financial services
- SCAPE
- preservation planning and action workflows: how to make them scalable
- creating an infrastructure for scalable preservation actions
- development of a policy-based preservation planning tool with automatic preservation watch
- 3 testbeds: WA, large-scale repositories, research data sets
- all projects will produce prototype SW
- digital lifecycle approach
- preservation planning plays a role in all these projects, together with virtualization
- slide with trends in DP over recent years
- Research on Digital Preservation within projects co-funded by the European Union in the ICT
- Ensure
- Scape
- Wf4Ever: http://www.wf4ever-project.org/about
- Timbus: SW is not enough; it focuses on the context of organizations; LTP is not only about objects but also about services, etc.
- Totem

Benefit for NK

Follow projects in the area of long-term preservation of digital data. The latest EU projects such as SCAPE will speed up the development of concrete tools for the long-term preservation of digital data.

Record keeping in temporary command settings, Erik Borglund
- preservation of documentation of crisis situations arising from the activities of the police etc.
- how to capture context: flipcharts, videos and notes can be kept; the context of analogue documents is not a problem, the problem is with digital items and conversations
- the national archive should take care of this, but it accepts only paper documents or, e.g., photos from the scene
- the question is whether archiving the file material is the same as archiving the course of the proceedings in digital form


webarchiving session

BnF
- 200 TB of web-archived data
- 15 million ARCs; they have to characterize and validate them, which is time-consuming
- they store them in a shared repository
- SPAR (the LTP system of the French national library) has a capacity of 16PB
- they use JHOVE2 for characterization and are creating a module for ARCs
- they do not want to characterize and validate the content of the ARCs, only identify the formats
- PREMIS in METS would be too long, so they will record only metadata at the level of the information package (AIP) that are the same for the whole package; that is, a property is stated once and then only the information about which files it applies to is added, instead of repeating the information for every file
- they created a special metadata format
- i.e. they are able to ask the LTP system "give me all information packages that contain format XY" etc.; there is no need to index the metadata of the contents, which would take a long time; they take the same approach for digitized books
- different DP policies and validation levels for different types of WA data: complete crawls vs. thematic crawls
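The package-level approach described for BnF (state a property once and list the files it applies to) can be sketched as follows. This is a minimal illustration, not BnF's metadata format: the function names, package identifiers and MIME-type labels are hypothetical.

```python
from collections import defaultdict


def summarize_package(files: dict[str, str]) -> dict[str, list[str]]:
    """Turn a file -> format mapping into format -> files, so each format is stated once."""
    by_format = defaultdict(list)
    for name, fmt in files.items():
        by_format[fmt].append(name)
    return {fmt: sorted(names) for fmt, names in by_format.items()}


def packages_with_format(summaries: dict[str, dict[str, list[str]]], fmt: str) -> list[str]:
    """Answer 'which information packages contain format X' from package-level summaries."""
    return sorted(pid for pid, summary in summaries.items() if fmt in summary)


# Hypothetical packages with per-file format identification results.
raw_packages = {
    "aip-001": {"a.html": "text/html", "b.html": "text/html", "c.gif": "image/gif"},
    "aip-002": {"d.pdf": "application/pdf"},
}
summaries = {pid: summarize_package(files) for pid, files in raw_packages.items()}
hits = packages_with_format(summaries, "text/html")
```

The query only touches the per-package summaries, which mirrors the point in the notes that the metadata of the individual contents need not be indexed.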

NL NZ
- 2 crawls, 20 TB in total
- they deal with metadata: how much metadata is a lot and how much is too little
- the library's policy says that as much metadata as possible must be stored; that would, however, be a problem in terms of metadata size
- for selective web harvesting they have a finished workflow in WCT; everything is catalogued

IA
- 16 billion URLs
- the oldest from 1996
- growth is 3 TB per day, 1 PB per year

Euan Cochrane, Dirk von Suchodoletz: Replicating Installed Application and Information Environments onto Emulated or Virtualized Hardware

- capturing and preserving a whole environment on emulated HW
- e.g. take the desktop environment of the prime minister and store it in an archive
- problems with rendering
- computer forensics
- an option for the preservation of scientific data and records
- it is all about how to replicate an HDD and run the environment on it in a virtual environment
- solution
- they pulled the HDDs out of several old PCs > identify the HW requirements (HDD analysis > estimate the requirements automatically; this is part of every PC environment) > choose emulation/virtualization SW (a tool registry such as TOTEM from the KEEP project) > convert the HDD into a disk image suitable for emulation > try to boot the disk image on the emulated HW > add drivers
- problems with licences, personal data protection, authenticity (20 of things change, colours etc.)
- QEMU, SPARC processor emulator


Klaus Rechert: Remote Emulation for Migration Services in a Distributed Preservation Framework
- use of emulation as a tool for migration
- tools for migrating certain formats are often not available
- we put the digital object into an emulated environment (virtual machine); we then see it in the environment of the emulated system, can open it in the original or another suitable application, save it in a different format and store it back in the virtual machine

Bram Lohman: Emulation as a Business Solution: the Emulation Framework, KEEP project

- emulation framework: 7 emulators, 6 platforms (x86, Amiga, etc.), 23 file formats
- a solution for managing emulation tools
- setup of emulation processes
- an environment that contains emulators; if we load an application or a file into it, it should run as in the original environment
- the environment also contains a tool that shows, for a file, what format it is and what environment is needed to run it; based on PRONOM that environment can be prepared directly and the file run in it; it loads the SW image from the OPF application database, which is being built

Geoffrey Brown: Developing Virtual CD-ROM Collections: The Voyager Company Publications

- publications of a particular publisher on CD, interactive applications for Mac; of the 200 published, only about 50 are available now
- emulation on today's systems
- HDD snapshot directly in the emulator, i.e. it takes one click and is very fast
- SheepShaver emulator

Evaluation of the Danish large migration project
- before 1998 they had no formats set by law
- between 2005 and 2008 they introduced standards
- the evaluation concerns the established standards and the migration into them in the national archive
- the evaluation was done for the funder
- between 2005 and 2008 they spent 30 person-years on migration, had 10-15 people on it, invested 190 thousand USD; total cost 26 million USD

It is not much data; what they actually migrated was about 1777GB

Different parts of the archive: tapes, population data, data on CD-R, registries and data elektronicky pln na
- they could not read all the files, especially on tapes
- 5 different types of tape
- some had to be rescued at great expense


Total cost of producing the preservation standard: 10 man-years, 12 thousand USD, including the manual and implementation recommendations

Pilot: project planning and management, and verification of the information packages

The aim of the pilot was to obtain a better budget and project plan

Some problematic data in old formats, such as old databases, need smart people who do repetitive, long-lasting work; they needed good knowledge management to make it efficient

Migration method: they wrote the requirements for the tool and a description of how manual migration should be done

Data preparation (restructure the data and register the metadata of IPs) and preparation of documentation for the migrated IPs

Software development: in-house development

They needed 50 person-years for 1

Conclusions

Migrating standard data is cheaper; migrating from some standard tapes is cheaper, etc.

Most of the cost (80) fell on non-standardized data, in producing the migration software. Developing a tool for migrating heterogeneous or non-standard data is the most expensive part

What they learned: the old data had not been analysed sufficiently

Project management was loose; they lost money

Knowledge management: a good description of old data and of all their types, generations, locations, etc. does not exist at our institution and we will have trouble with it; the migration of old data at NK will be problematic

Angela Dappert: robust migration workflow for offline media
- what is an archival object (nice slide): a CD is not an archival object, for them it is a hand-held carrier; better is a bit-stable object, which can have a backup etc., leading to an archival object that has further metadata, logical preservation
- a CD is not searchable, cannot be replicated easily, has a large manual overhead, and the rendering technology becomes obsolete very quickly
- Endangered Archives project: optical discs, CD-R, external HDs, tapes; 67 terabytes in total
- OFFLINE hand-held carriers in the Endangered Archives project were very varied; they contained data with DRM, under copyright, and with a number of such problems
- the options they chose between for each data source
- disk image: one file that contains everything on the carrier
- or extraction of only some files
- how important is the carrier itself? We need some information about it; there may be traces of deleted data and we may want to have them. They made disk images of everything possible: hybrid DVDs, audio discs that also held data, etc.
- which disk image should they use? Not a single disk image format for all data; a special disk image format for each
- they did it with robots, disk copying robots; some large-scale disk copying robots could not be used: they are good at producing CDs but not at ripping data from CDs

CO-FINANCED BY THE EU STRUCTURAL FUNDS (EUROPEAN REGIONAL DEVELOPMENT FUND) THROUGH THE IOP


- They built their own application with disk stacks and some smaller robots; they tried LIFO and FIFO and ended up with FIFO, because LIFO had problems with the CD lifter.

At the KB they thought through a fairly complex workflow, how to describe it, etc.

Each robot had its own PC.

They had problems with a number of things; see the presentation.

They found no good software for managing the images, only command-line tools, with which non-technical staff would make many mistakes.

It takes a lot of people before the content gets online.

Staff must be well trained and flexible, but also able to do tedious jobs: systematic and patient.

NOTE: important for the NK, where transferring data from discs will also have to be addressed, and already has been.


KEEP project, Antonio Cuiffreda: towards an integrated migration environment

- Disk transfer tool: converts a disk into an image file. The image also carries further metadata about the file system, the MD5 of the file, etc.
- KEEP is building a Transfer Tool Framework.
- Magnetic media: disk transfer tools for floppy disks, commercial and open source.
- Disk2FDI: commercial DOS tool; produces a very accurate floppy disk image, but it takes 1 hour and the result is very large, ten times larger than the floppy disk itself. About 2260 disks were tested; they also tested emulation.
- Catweasel: commercial tool; it is a PCI card, runs on Linux and Windows XP, has a GUI. High error rate but faster; image file quality was low.
- Nibtools: free tool; G64 and D64; covers only the C64; DOS, Windows, Linux, but command line only. Needs a Commodore disk drive and special cables. They tested a few disks; about half of them then did not work in the emulator.
- Optical media: they used 5 transfer tools, all on the same set of CDs, DVDs and games:
1. Alcohol 120%: commercial; can bypass DRM etc.; supports Windows systems. 12 of 13 discs worked; 2 MB per second.
2. Daemon Tools: commercial; several image file types (ISO, MDS, MDF); supports Windows; three of 13 discs did not work.
3. CloneCD: commercial; uses IMG or ISO; bypasses SafeDisc 3 protection; supports Windows; has a GUI; 11 of 13 discs were OK.
4. BlindWrite: commercial; supports DVD, Blu-ray, WM, Xbox and other special discs. Generates ISO and some proprietary ISO image formats; one disc did not work. Transfer speed:
5. ImgBurn: does not read subchannel information into the image file (a film cannot be seeked, etc.); open source; generates DVD, BIN, CUE, IMG; Windows and Linux; 4 discs did not work; it is fast.
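The notes above mention that a disk transfer tool stores an MD5 checksum and other metadata alongside the image file. A minimal Python sketch of that idea follows; the function name `describe_image` and the fields of the returned record are illustrative assumptions, not the format used by the KEEP tools.

```python
import hashlib
from pathlib import Path


def describe_image(image_path, chunk_size=1024 * 1024):
    """Return a small metadata record for a disk image file.

    The record layout is hypothetical; it only illustrates keeping a
    checksum next to the image so later copies can be verified.
    """
    path = Path(image_path)
    digest = hashlib.md5()
    with path.open("rb") as handle:
        # Read in chunks so that large images need not fit in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return {
        "file_name": path.name,
        "size_bytes": path.stat().st_size,
        "md5": digest.hexdigest(),
    }
```

Recomputing the record after every copy or transfer and comparing the `md5` values is enough to detect a corrupted image; MD5 is used here only because the notes name it, and a stronger hash could be substituted.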

Conclusions

Magnetic media: commercial and non-commercial tools do not differ in performance; Disk2FDI is accurate but very slow. KEEP will use NibTools.

Optical media: copy protection must be taken into account; the tools perform similarly; there will always be errors in the images, between 30 and 10 percent; BlindWrite handles game discs (Xbox etc.). KEEP will use ImgBurn because it is open source.

For complex images BlindWrite is better.

Benefit for the NK

Consider whether the NK should really run a project to migrate the content of CDs and DVDs to online media. The speakers present concrete experience with robotic processing and show the problems they had in designing the workflow, choosing the type of ISO image, etc.

BNF web archiving. They have three layers:

Harvest definition collection

Harvest instance crawling metadata


ARC files

A collection is, for example, the selective "elections 2000" collection, followed by the individual harvest instances and then the ARC files. They collect logs, config and reports. These are also stored in ARC: a special metadata ARC for each crawl instance.

PREMIS

Object, agent, event

Objects: 1. ARC files and metadata ARCs; 2. harvest instances

Harvest event: in PREMIS, the event "creation of content files"

Events: reports as extensions of the event (host report and harvest report)

Agents: administrator, software, institutions, organizations that perform the harvest
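The object, agent and event breakdown noted above can be pictured as three linked record types. The sketch below is only an illustration in Python: the class and field names are assumptions made for this example and do not reproduce the PREMIS schema or the BnF implementation.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Agent:
    name: str
    kind: str  # e.g. "administrator", "software", "organization"


@dataclass
class PreservedObject:
    identifier: str
    kind: str  # "arc_file", "metadata_arc" or "harvest_instance"


@dataclass
class Event:
    event_type: str
    objects: List[PreservedObject] = field(default_factory=list)
    agents: List[Agent] = field(default_factory=list)
    reports: List[str] = field(default_factory=list)  # event extensions


# Hypothetical harvest: one crawl produced a content ARC and a metadata ARC.
crawler = Agent(name="crawler", kind="software")
library = Agent(name="harvesting institution", kind="organization")
content_arc = PreservedObject("example-0001.arc", "arc_file")
metadata_arc = PreservedObject("example-0001-metadata.arc", "metadata_arc")

harvest = Event(
    event_type="creation of content files",
    objects=[content_arc, metadata_arc],
    agents=[crawler, library],
    reports=["host report", "harvest report"],
)
```

The event is the hub: it ties the files produced by one crawl to the agents that performed it and carries the crawl reports as its extensions, which matches the layering described in the notes.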

ContainerMD

httpbibnumbnffrcontainerMD v1documentationcontainerMD v1html

special metadata for material from the web archive

httpbibnumbnffrcontainerMD v1

Different SLAs for different types of material and for data from different harvests; shared repository; they expect various benefits; skills for individual formats need not be duplicated within the institution.

Next year there should also be a JHOVE2 module for WARC.


Memento: meta search

Benefit for the NK

Their web-archiving model could be used at the NK.

Cost models: Danish national library + TU Wien

- Stephan Strodl, TU Wien: they have their own cost model, but only for small-scale automated preservation action costs, it seems
- The Danish national library built its own model, which is meant to be universal and usable anywhere
- They measured the cost of submission according to the PAIMAS standard
- When calculating costs they use OAIS and PAIMAS: they map activities onto these models and estimate process costs from that
- Costmodelfordigitalpreservation.dk

Benefit for the NK

Relevant to project 0136, which addressed options for estimating the costs of long-term storage.

Meet RODA: a Full-Fledged Digital Repository for Long-Term Preservation

- Originally a project of the Portuguese national archive; we have been following it for several years. The RODA system is now supported, and partly also further developed, by an independent company. So far RODA supports only the archival metadata format (EAD), but further development should include library formats as well.
- RODA is now part of the SCAPE project, where it will be possible to develop the system further and scale it for use in mass production.
- http://redmine.keep.pt/projects/roda-public

Benefit for the NK

Follow further development, possibly also for the INCAD + KNAV projects; for LTP development at smaller institutions this could be an interesting alternative in the future.

Report on a foreign business trip

Name of participant: Mgr. Andrea Fojtů (AF)

Workplace by organisational structure: 1.5 Department of Long-Term Preservation of Digital Data (ODODD)
Workplace assignment: 1.5.2 Digital Repository Content Management Department
Reason for the trip: attendance at the conference Aligning National Approaches to Digital Preservation
Place (city): Tallinn
Place (country): Estonia
Dates (from-to): 22-26 May 2011

Detailed schedule:
Monday 23 May: conference registration; guided tour of the National Library; Keynote Address by Laura Campbell, Library of Congress (USA); Panel 1: Technical Alignment; Panel 2: Organizational Alignment
Tuesday 24 May: Keynote Address by Gunnar Sahlin, National Library of Sweden; Panel 3: Standards Alignment; Panel 4: Legal Alignment; Breakout Sessions for panels 3 & 4
Wednesday 25 May: Panel 5: Education Alignment; Panel 6: Economic Alignment; Breakout Sessions for panels 5 & 6; Synthesis; Closing remarks

Fellow travellers from the NK: PhDr. Bohdana Stoklasová (BS), Ing. Tomáš Svoboda (TS)

Funding: IOP NDK

Goals of the trip: attendance at a conference with international participation; obtaining contacts in the areas of long-term preservation of digital documents and electronic legal deposit; a more detailed insight into the issues of long-term preservation of digital documents, (especially) in national libraries.

Fulfilment of the goals (specifically): the conclusions of the conference Aligning National Approaches to Digital Preservation largely mirror the conclusions of the workshop The Future of the Past: Shaping new visions for EU research in digital preservation (report available at http://cordis.europa.eu/fp7/ict/telearn-digicult/future-of-the-past_en.pdf), e.g. regarding the missing economic model for the commercial sector, which would treat long-term preservation as an integral part of all its processes. Contact was established with staff of the national libraries of Estonia and Finland (staff for web archiving and for long-term preservation in general).

Programme and further details: the main aim of the conference was to align national approaches to long-term preservation of digital documents across all areas, from technical, organisational and educational through to standardisation, economic and financial.

Materials brought back: conference programme, leaflets of exhibiting companies (Tessella, Equella, Guardtime) + other materials, notes

Date of submission of the report: 8 June 2011
Signature of the submitter:

Signature of the superior:

Posted on the intranet:
Received by the international department:

Appendix to this report: notes from the conference, in English

Appendix: notes from the conference, in English

Exploring What We Can Do Together: Strategic Alliance for International Collaboration, Laura Campbell

- 185 digital preservation partners in more than 25 countries (education, research, LAM)
- strategic goals: National Content Stewardship Network (national digital collection, technical architecture, public policy, outreach)
- NDIIPP Content Domain Map: a mind map of geospatial, audiovisual, image & text and web content
- then & now: cognitive surplus vs digital libraries / digital preservation
- solution framework: actively working together, special interest groups, establishing a common index, international digital collection (freely available)

PANEL 1: Technical Alignment (The role of testing), Prof. Dr. Michael Seadle (Panel Head)

- to collaborate on requiring and implementing rigorous and independent tests

DNB Contribution to the Tallinn Alignment, Sabine Schrimpf

- key theme is infrastructure: a network of hardware and software that permits the operation of application software
- the question of interoperability is crucial (standards, technical specifications)
- SW elements
- components of the DP infrastructure were compared to pallets on railroads
o source: PARSE.Insight Roadmap 2020
- persistent ID resolvers, certification process
- kopal (ingest, KoLibRI)
- nestor (German Network of Expertise)
- DP4Lib: Digital Preservation as a Service; reduce dependency between components
o redundant storage at different locations
o KoLibRI modules
- LuKII: set up as an economical LOCKSS network in Germany
- SHAMAN
- APARSEN: wants to bring coherence and cohesion to digital preservation research
o trends in DP research projects
o modular DP systems
o distributed as SOA
o elimination of technology dependencies

EDINA: The UK LOCKSS Alliance, Adam Rusbridge

- EDINA offers underlying technical support & coordination
- threats to digital stewardship
o failure (media, HW, SW, network, format obsolescence, natural disaster, economic/organizational failure)
o attack (insider/outsider)
o operator error
- source: Requirements for Digital Preservation
- projects PEPRS and PECAN help identify coverage and requirements for DP

Public testing, Michael Seadle

- traditional physical archiving relies heavily on trusted institutions
- distrust, not trust, needs to be the basis of digital archiving; testing therefore plays a key role
- goals of testing: demonstrate functionality, reveal weaknesses, provide data for planning improvements
- key issues for testing: integrity, authenticity (can the origin or genuineness be shown?), usability (can migration/emulation be demonstrated?), access, financial integrity
- DrWho (drwho1com)
- bit-stream testing is the most important; authenticity and usability may be impaired
o the type of storage media, the number of copies, and the frequency of checking and replacement get us to the relevant results
o no reliable metrics exist, however (what is an acceptable loss, etc.)
- without well-documented, peer-reviewed, publicly available test results, librarians are buying archiving systems on faith
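The point about bit-stream testing, number of copies and frequency of checking can be made concrete with a small fixity audit. The Python sketch below is an assumption-laden illustration, not a method presented at the panel: it compares SHA-256 digests of several replicas and reports the ones that disagree with the majority. The function names and the majority rule are hypothetical choices for this example.

```python
import hashlib
from collections import Counter


def sha256_of(data: bytes) -> str:
    return hashlib.sha256(data).hexdigest()


def audit_copies(copies):
    """Compare replicas of one object.

    copies: dict mapping a location name to the bytes stored there.
    Returns (majority_digest, damaged_locations). Raises ValueError when
    no digest is held by more than half of the copies, because then the
    audit cannot tell which copy is the good one.
    """
    digests = {loc: sha256_of(data) for loc, data in copies.items()}
    majority, votes = Counter(digests.values()).most_common(1)[0]
    if votes <= len(copies) / 2:
        raise ValueError("no majority among copies")
    damaged = sorted(loc for loc, d in digests.items() if d != majority)
    return majority, damaged
```

With three copies the audit can survive one silent corruption; with two copies a mismatch is detectable but not resolvable, which is one reason the number of copies and the checking interval matter together.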

Presentation without a title, Andy Rauber

- evaluation vs testing vs benchmarking
- DP testing: evaluation rather than testing, and far from benchmarking (a few tests, but nowhere near a definition of benchmarks)
o existing evaluations are not repeatable
o focus on the simple things
o building the frameworks before having clear test scenarios
- necessary to move towards comparative benchmarking
- what is needed: a commitment that we want a culture of benchmarking and comparative evaluation; an understanding of what we want to benchmark; benchmark data + ground truth; measurement scales and measures that remain constant; knowledge bases to collect these

Organizing digital preservation on an international level, Michelle Gallinger (NDIIPP)

- focus on a national DP agenda
- community-driven, action-oriented (National Digital Stewardship Alliance)
o present a distributed national digital collection for the benefit of citizens

The European Research Arena, David Giaretta

- technologies: GEANT, EGEE/EGI
- EU research projects: TIMBUS, BLOG4EVER, SCAPE, ENSURE, APARSEN, ARCOMEM, WF4EVER
o SCIDIP-ES (2011-2014)
- Alliance for Permanent Access (APA) formed as a legal entity 3 years ago
o opportunities for networking
- ISO 16363: Audit and Certification of Trustworthy Digital Repositories
- ISO 16919: Requirements for Bodies Providing Audit and Certification

Observations from the MetaArchive Cooperative program, Martin Halbert

- distributed DP programs are different from other programs
o replication of content, distribution of these replicated copies to distinct geographical locations, and network organization to connect these replicated copies
- MetaArchive established in 2003, funded by NDIIPP
o seeks to foster broader awareness of digital preservation issues
- IIPC members are all institutions that focus on WA
o 39 members: national and university libraries + other organizations (Internet Archive)
o ISO standard WARC format for web archives + Heritrix and NutchWAX
o growing membership (Africa & South America)

DAY 2

Keynote address: International and National Collaboration in the Digital Age, Gunnar Sahlin

- 2012: a new law for e-deposit
- Samsök (search together) in 2005 (upgrade, new system)
- Swepub and long-term preservation
- Consortium of the Swedish research libraries for licensing e-journals and databases (ICOLC)
- Open Access and e-publishing (all universities have their repositories for e-pubs)
- NL as aggregator for Europeana, TEL, Apres, Athena, EUscreen
o common system for the preservation of digital materials
o common search portal for materials from the Swedish National Library and Swedish National Archive

Raivo Ruusalepp

- standards: RAC, DSA, CIDOC (CRM), PAIMAS, ISA (DG), DDI
- use of information security standards for digital preservation
- information security: administrative and technical (physical = data security vs IT = communication)
- company-implemented security measurement with typical cyber-crime scenarios
- survey of security
o provision for information security in national legislation and development plans (12 of the respondents: ISO 27000 series; only 2 formal audits, the rest are looking into it; ½ of the respondents do not use standards or formal measurements to control information security)
o IT & disaster plan: 65 (data recovery from the off-site location tested: 0)
- alignment
o better use of community standards for information security and preservation
o agreement on security requirements

Standards-based approach to preservation planning, Matthew Woollard

- ISO 27001: very expensive implementation (100,000 pounds)
- Basic: Data Seal of Approval Guidelines (helps you understand your business better)
- Audit and Certification of Trustworthy Digital Repositories
- Memorandum of Understanding to Create a European Framework
- ISO 16363 external; DSA or ISO 16363 self-audit

Best Practices & Standards, Bram van der Werf

- self-assessment: trust; audits/certification: trust
- ISO 30300 (draft): Record Management

PANEL 4: Legal Alignment

Legal deposit & Web Archiving, Adrienne Muir

- legal deposit provisions: purpose, scope, deposit mechanisms, roles & responsibilities & liability, access provisions, sanctions
- implementation: definitions, scope (offline/online, freely available/pay wall), technology-neutral, incremental
- legal deposit vs voluntary (interim/hybrid approach, model agreements and licenses, flexibility)
- other legal issues: intellectual property rights, preservation, access, unlawful material, privacy/data protection
- voluntary approaches have disadvantages but may be necessary and can be useful

Breakout Session

- standards bring along alignment by themselves, but only if you use them to the full, not halfway
- depends on the community (users); enforceable or voluntary compliance (standards)
- next steps for standards alignment
o corpora as benchmarks
o export/import completeness
o educational standards
o validation tools
o accredited training courses to accredit auditors
o framework standards

DAY 3

PANEL 5: Educational Alignment

- key elements of the DCC Curation Lifecycle Model
- Framework for the Education Alignment (USA: grad programs for digital preservation, new models for grad programs, internship programs (diverse knowledge), workshops)
o sharing tools
o national programs
- issues related to digital preservation
o nature of costs and business models
o strategies for selection & appraisal
o ground roles and responsibilities
o effectiveness and demand for services

- focus of the panel: factors influencing the actual sustainability of a digital archive
- 2 considerations: collaboration + user demand
- challenges + gaps: span national boundaries, public + private funding, education, exportation, DP certification, competition, funding gaps, policy, selection criteria, roles & responsibilities, standards
- Magazzini Digitali: e-legal deposit in Italy
- PADI: a failed project (discontinued in 2010)

Presentation without a title, Neil Grindley

- how much does it cost to manage information; what institutional financial strategies are required to facilitate effective preservation; what general economic frameworks are required to enable information to persist and be accessible

- JISC 2010 Infrastructure for Education and Research Programme
- archival storage and preservation activities constitute a very small proportion of the overall costs: 15%; access 31%; outreach, acquisition, ingest 55%
o approx. 333 Euros for a set of 1000 records
- KRDS2, p. 83: future tool development supporting automation of ingest

Sustainable Preservation in North America: ADPNet & Friends, Aaron Trehub

- solution: distributed digital preservation (in at least 3 copies vs LOCKSS 6 copies)
- DDP + LOCKSS = PLN; open SW developed at Stanford
- MetaArchive, COPPUL (Canada), ADPNet (www.adpn.org)


- - 80TB -

A Community Driven Micro Services Architecture Supporting Long Term Digital Preservation Mark Evans and Bill Steel Tessella Inc (USA) and Robert Sharpe James Carr Alan Gairey and Jonathan Tilbury Tessella plc (UK)

s NDIIPP v micro-services

spojeno do procesu- lze pak libovol


workflow - -

- bit-streamu - cloudu (S3

httpawsamazoncoms3) SaaS funkcionalitu bude brzy ti tenancy

funkcionalitou policy apod

Pozn z USA

v 44TB SIP v testu (10MB JP2 soubory)

komunita SDB - - vznik 2008 - -

System

Lisa LaPlant and Blake Edwards US Government Printing Office (USA) FDsys OAIS dig MODS - z

<part>2</part>

DMD data model definition z

http://www.gpo.gov/fdsys/search/pagedetails.action?st=prague&granuleId=&packageId=DCPD-200900228

----------------------------------------- Preservation Starts from the Beginning Michael Wash US Department of Transportation (USA)

Autographic Kodak 1916 na druhou

following the No 3A Autographic Kodak Special of 1916 which was the first rangefinder camera It had a Kodak Anastigmat f63 lens and a Kodamatic shutter with speeds from 12 to 1200 sec plus bulb and time mode httpcamerapediawikiacomwikiNo_1A_Autographic_Kodak_Special Kodak Advantix

zastavil

Colorite A Flexible Cross-Platform Software Solution for Automatic Image QualityAnalysis Using Arbitrary Targets Henrik Johansson National Library of Sweden (Sweden)

- a - -

o Target se detekuje automa se aby tento proce

o TIFF JPEG JP2 PNG o o -of-art feature based image matching algoritm ImageMagic

o Alg

t o o v budoucnu vypnout aby se proces urychlil pro BATCH

proce o

Henrika Johanssona What if the Image Quality Analysis Rates My D Dietmar Wueller Image Engineering (Germany)

- Golden thread - UTT targer -

o

-lens reflex

Establishing Resolution Requirements for Digitizing Transmissive Content A Use Case Approach Michael Stelmach Library of Congress Don Williams Image Science Associates LLC and Steven Puglia National Archives and Records Administration (USA)

-

- Based on 10 SFR limiting resolution criteria how much the image information will be captured

o 1 polovina 20 stol 1200-1600 PPI o 2 polovina 20 stol up to 2800 PPI o o

Digitise More Pay Less Optimising the Workprocess for both Heritage Institute and Imaging Provider Olaf Slijkhuis Pictura Imaginis (the Netherlands)

- ument kvalita parametry atd - -

- trolu

obrazu

N debata s familysearch 2052011 -------

i na problematice Digital Preservation

bude muset poskytnout zdarma

a synchronizovat

snaha spolupracovat archivy a knihovnami jejich strany


Business trip report
Project: Creation of the National Digital Library

CZ.1.06/1.1.00/07.06386

Name of participant: Jan Hutař

Workplace by organisational structure: ODF 8.1

Position: head of division

Reason for the trip: attending the iPRES 2011 conference

Place (city): Singapore

Place (country): Singapore

Dates (from-to): 30 October - 5 November 2011

Detailed schedule:
30-31 October: flight Prague > Dubai > Singapore

1 November: start of the conference, tutorials

2-4 November: conference

4-5 November: return flight Singapore > Dubai > Prague

Fellow traveller from the NK: Mgr. Marek Melichar (funded from project 0136)

Funding: IOP, Creation of the National Digital Library

Relation to the project: obtaining new information on digital preservation and on projects in other libraries; consultations with colleagues and companies

Goals of the trip: see relation to the project; use all outputs for planning and running the NDK project, and for future solutions to digital preservation issues at the NK/NDK

Fulfilment of the goals: fulfilled; see the detailed notes below and the proceedings on SPS


Further details: SUMMARY AND BENEFIT FOR THE NDK PROJECT

A noticeable rise of emulation as an approach to long-term preservation (in previous years it was migration); the two approaches look set to complement each other.

A shift towards preserving complex data, databases, etc.; the NK does not address this yet.

Many contributions are applicable to the NK and the NDK (web archiving and preservation at the national library of France, audits, emulation, information on the SDB system (Tessella) and on the RODA system, certification: see Rouchon, etc.).

Information on the problems of, and solutions for, using tools such as JHOVE, PRONOM and others in a real environment.

2 contributions on backing up optical discs: a current problem at the NK as well; ideally follow the procedures described in the proceedings.

A clear need for international cooperation and for adherence to standards so that such cooperation is possible.

For more detail see below.

Support for project publicity: N/A

Related materials

Material: conference proceedings
Location: SPS, folder with business trip reports

Date of submission of the report: 15 November 2011

Signature of the submitter:

Date, signature:

Signature of the superior: 15 November 2011

Posted on the intranet:


Received by the international department:

Seamus Ross: digital curation and preservation

- preserving data sets: use of statistical databases and research data vs preservation of textual information
- personal data are no longer just photos in a box (Flickr etc.)
- banks: a lot of personal data, of future use to historians; it must be kept, and institutions do keep it
- see mckinsey.com, big data, full report (PDF)
- why do DP at all (slide 16): future generations expect it; so that historians and scientists have sources; a legacy about the present for the future; an information ecosystem to enable storytelling
- emphasis shifting from the preservation of textual information to the preservation of complex databases

A capability model for DP, Ch. Becker et al.

- research within the SHAMAN and SCAPE projects
- SoS: systems of systems
- 3 kinds of systems, UML
- DPS: DP as a functional requirement
- SoS: a business system, a system within a system; the data then flow into a DPS
- DPS where DP is not a functional requirement but the system does it anyway (DP-ready system): a business system with DP functionality
- but how to get DP into enterprise systems: a model for implementing DP in any system; within the SHAMAN project, a capability-based reference architecture
- governance, business and technical (operation) capabilities: a basis for decisions and for assessing the current state
- capability maturity model (CMM): processes, assessment and improvement, as in software development

Olivier Rouchon: certification and quality at CINES

- they store theses, digitised material, multimedia, documents, scientific data sets
- a data centre for the whole of France
- they have experts on formats, XML; 11 people
- 15 TB of data
- certification: by national law CINES is the national centre for DP of theses; they have a department, people and money for it; for the procedure and preparations see below
- preparation for certification: testing DRAMBORA, DSA, TRAC, ISO 16363 and ISO 16919
- step, 2009: DRAMBORA audit; 2 risk reviews per year, looking at how the risks are being dealt with
- step: formalisation of business processes; 14 processes according to ISO 9001
- management, operational and support processes (presented at iPRES 2010)
- 2009: external pre-audit, 2 people, 19 man-days, based on all available standards, by reviewing documentation and conducting interviews
- 2010: SIAF audit, 4 months; the national archives of France carry it out for every archive that stores public data; the audit is repeated every 3 years; the report had 800 pages


- 2010: Data Seal of Approval, part of the EU framework for audit and certification of trusted repositories (MoU between the three certification activities)
- 2011: within the APARSEN project they also did ISO 16363, going through the audit together with DANS and UKDA
- first internal, January and April 2011 (60 man-days), then external with 12 experts (KB, BL, NASA etc.) in June 2011 (3 days)

MoU: DSA > ISO 16363 as the second step (internal) > ISO 16363 extended
The more audits you do, the faster they become; if done regularly, an audit is not that time-consuming.

The National Library of New Zealand passed TRAC certification in autumn 2011.

Andreas Rauber: impact of preservation actions on repositories

- what happens to the repository itself
- repository simulation: RepoSim
- for analysis and for testing migration: what happens when files grow, when the repository holds more types of formats, etc.
- RepoSim: a flexible simulator, irregular patterns
- so far an internal version; Hibernate, Java, MySQL
- one can specify which formats are accepted, their description, ingest, settings, hypothetical tools (mainly for migration), and rules for preservation actions (migration to which format, which versions, which files, how many times; rules + filters)
- a virtual migration can be run; it produces graphs showing how long it will take, etc.
- what to migrate, how, to what and over what period: it runs virtually and we see the result
- good for IT and HW planning
- good for planning different scenarios, comparing with expected development, planning HW expansion and investment
- still to be added: the option to enter deletion policies, reports, etc.

José Barateiro: Risk assessment in DP of e-science data and processes

- DP as risk management
- ISO 31000: definition of risk management
- similar to DRAMBORA
- there are many standards for risk management
- elaboration of the ISO 31000 methodology into individual steps
- TIMBUS project, http://timbusproject.net; one of the partners is SAP (Germany)

mad talks

- open-source software for LTP: RODA is back and is being developed within the SCAPE project; new features, development plans and the building of a user community
- 4 posters on emulation: emulation within KEEP, emulation for library reading rooms, OPF eco system registry
- TOTEM: a metadata standard for describing the technical environment for emulation


ANDS, Ross Wilkinson

- data centres in Australia: at least 3, for different areas of life
- ANDS has existed for almost 3 years; funded by the Australian government
- huge volumes of data that will never be used or read by a human, only mined by automated processes
- research data must be stored and preserved because it may no longer be possible to recreate them; they must be kept so that they can be reused, so that new conclusions can be drawn from them, and so that scientists have them available
- this has to be done in cooperation; it cannot be done by a single institution alone
- who handles the storage of scientific data in the Czech Republic: the Academy of Sciences, CESNET?
- similar data centres also exist in Great Britain

Rob Sharpe: Considerations for High Throughput Digital Preservation

Presentation by Tessella: their performance testing of ingest into SDB at FamilySearch.

- SDB has been developed since 2002, when the first customer was the National Archives UK
- new customer: the UK Parliament
- test with FamilySearch
- 20 TB ingested per day; scanned materials; workflow with antivirus, characterisation (PRONOM, JHOVE), etc.
- 1 package is roughly 1 GB; 20 thousand packages per day
- 2 Dell PowerEdge R710 servers, costing at most 20,000 pounds together
- the limiting factor turned out to be the read speed of the disks on which the data sit at the start of ingest; they therefore needed 130 parallel disks (50 thousand pounds)
- storage is on tape, which is also slow; they therefore needed 8 parallel tape writes (30 thousand pounds)
- storage costs 100 pounds per TB
- 7.3 PB per year
- conclusion: writing and reading are slow; tools such as JHOVE and PRONOM are fast enough; storage costs also proved to be high

For ingesting data from the FamilySearch project they needed to guarantee a throughput of 20 TB of data per day while keeping adequate procedures for processing the data according to the requirements of OAIS and of the customer. The aim of the project was to identify the bottlenecks in ingesting large volumes of data.
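The figures quoted above can be cross-checked with simple arithmetic. The Python sketch below assumes decimal units (1 TB = 10^12 bytes) and an even load over 24 hours; neither assumption is stated in the notes, and the variable names are illustrative.

```python
TB = 10**12                       # decimal terabyte (assumption)
SECONDS_PER_DAY = 24 * 60 * 60

daily_ingest_tb = 20              # from the notes
package_size_gb = 1               # "roughly 1 GB" per package
storage_cost_gbp_per_tb = 100     # from the notes

# Average sustained rate needed to move 20 TB in one day.
avg_mb_per_s = daily_ingest_tb * TB / SECONDS_PER_DAY / 10**6

# Number of ~1 GB packages per day.
packages_per_day = daily_ingest_tb * 1000 // package_size_gb

# Volume and storage cost accumulated over one year.
yearly_pb = daily_ingest_tb * 365 / 1000
yearly_storage_cost_gbp = daily_ingest_tb * 365 * storage_cost_gbp_per_tb

print(f"{avg_mb_per_s:.0f} MB/s, {packages_per_day} packages/day, "
      f"{yearly_pb} PB/year, {yearly_storage_cost_gbp} GBP/year")
```

Under these assumptions the average rate is about 231 MB/s, the package count matches the 20 thousand per day in the notes, the yearly volume comes to 7.3 PB, and storage alone would cost about 730,000 pounds per year of ingest.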

Procesy jako generovaacuteniacute hash nebo jejich kontrola identifikace formaacutet a extrakce technickyacutechmetadat vyadujiacute obvykle velkyacute p i velkyacutech objemech rychlyacute storage systeacutem V projektu family searchcht jiacute do SDB ingestovat (content aquisition content preparation ingestfixity check contentmetadata integrity check charakterizace tj identifikace a validace formaacutet a extrakce tech MD) max700MB za sekundu

eili jak takoveacute masivniacute workflow efektivn paralelizovat p i minimalizaci naacuteklad Podle jejichzjit niacute paralelizace umo uje obejiacutet probleacutemy s vyacutekonem naacutestroj jako DROID a JHOVE celkovvyacutekon softwaru nebyl oproti jejich o ekaacutevaacuteniacute probleacutem V tiacute probleacutemy jsou v HW aby byl schopendostate n rychle zapisovat

Tj uacutezkeacute hrdlo bylo v HW a p esunech dat z miacutesta na miacutesto spi ne ve vyacutekonu naacutestroj pro digitalpreservation
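The figures quoted in the talk can be cross-checked with simple arithmetic. The following is only a sketch: it assumes decimal units (1 TB = 10^12 bytes) and constant load around the clock, and the function names are illustrative, not anything from Tessella's SDB.

```python
# Back-of-the-envelope check of the ingest figures quoted in the notes.
# Assumptions (not from the talk): decimal units and an even 24-hour load.
TB = 10**12
GB = 10**9
MB = 10**6
PB = 10**15
SECONDS_PER_DAY = 24 * 60 * 60


def sustained_rate_mb_s(bytes_per_day):
    """Average transfer rate in MB/s needed to move the daily volume."""
    return bytes_per_day / SECONDS_PER_DAY / MB


def packages_per_day(bytes_per_day, package_bytes):
    """Number of packages of the given size in the daily volume."""
    return bytes_per_day // package_bytes


def yearly_volume_pb(bytes_per_day, days=365):
    """Volume accumulated in a year, in PB."""
    return bytes_per_day * days / PB


def storage_cost_gbp(total_bytes, gbp_per_tb=100):
    """Storage cost at a flat price per TB (100 pounds per TB in the notes)."""
    return total_bytes / TB * gbp_per_tb


daily = 20 * TB
print(round(sustained_rate_mb_s(daily), 1))   # average MB/s for 20 TB/day
print(packages_per_day(daily, 1 * GB))        # packages of about 1 GB
print(yearly_volume_pb(daily))                # PB per year
print(storage_cost_gbp(daily * 365))          # pounds per year of ingest
```

Under these assumptions 20 TB per day is an average of roughly 230 MB/s, well below the 700 MB/s peak mentioned above, and it adds up to 7.3 PB per year.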

Benefit for NK

CO-FINANCED BY THE EU STRUCTURAL FUNDS (EUROPEAN REGIONAL DEVELOPMENT FUND) THROUGH THE IOP

There is no need to worry about the performance of software such as DROID or JHOVE.

Ross King: Evolving domains, problems and solutions for LT DP
- information about projects such as SCAPE
- programme: http://cordis.europa.eu/fp7/ict/telearn-digicult/report-research-digital-preservation_en.pdf (Stephan Strodl, Petar Petrov and Andreas Rauber, all Vienna University of Technology, Austria); a nice timeline of preservation projects, a whitepaper about the past of European DP
- funding spent on DP research is growing steadily; projects and funding alone will not solve anything
- ARCOMEM: web archiving, socially driven web preservation model
- social web analysis
- archive enrichment
- ENSURE: evaluation between cost and value, automation of the preservation cycle; testbeds: healthcare, clinical trials, financial services
- SCAPE
- preservation planning and action workflows, and how to make them scalable
- creating an infrastructure for scalable preservation actions
- developing policy-based preservation planning tools with automated preservation watch
- 3 testbeds: web archives, large-scale repositories, research data sets
- all the projects will produce prototype software
- digital lifecycle approach
- preservation planning plays a role in all of these projects, together with virtualisation
- slide with trends in DP over recent years
- Research on Digital Preservation within projects co-funded by the European Union in the ICT
- Ensure
- Scape
- Wf4Ever: http://www.wf4ever-project.org/about
- Timbus: software is not enough; the project focuses on the context of organisations; LTP is not only about objects but also about services etc.
- Totem

Benefit for NK

Follow projects in the area of long-term preservation of digital data. Recent EU projects such as SCAPE will speed up the development of concrete tools for long-term preservation of digital data.

Record keeping in temporary command settings, Erik Borglund
- preserving documentation of crisis situations arising from the work of the police and similar bodies
- how to capture context: flipcharts, videos and minutes can be kept
- the context of analogue documents is not a problem; the problem lies with digital material and conversations
- the national archive should take care of this, but it accepts only paper documents or, for example, photos from the scene
- open question: is archiving records the same thing as archiving the course of the proceedings in digital form?

Web archiving session

BnF
- 200 TB of web-archived data, 15 million ARC files
- they have to characterise and validate them, which is time-consuming
- they store them in a shared repository
- SPAR (the LTP system of the French national library) has a capacity of 16PB
- they use JHOVE2 for characterisation and are building a module for ARC files
- they do not want to characterise and validate the content of the ARC files, only identify formats
- PREMIS inside METS would be too long, so they will record only metadata at the level of the information package (AIP) that are the same for the whole package; that is, a property is expressed once and then only the information about which files it applies to is added, instead of repeating the information for every file
- they created a special metadata format for this
- they can therefore ask the LTP system, for example, "give me all information packages that contain format XY"; there is no need to index the metadata of the contents, which would take a long time
- they take the same approach for digitised books
- different DP policies and levels of validation for different types of web archive data: complete harvests vs thematic harvests
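The package-level approach described for BnF (state a property once, then list the files it applies to) can be illustrated with a small data structure. This is only a sketch: the class names, field names and example identifiers are hypothetical and do not reproduce BnF's actual metadata format.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class PropertyGroup:
    """One property stated once, plus the files it applies to (hypothetical model)."""
    name: str
    value: str
    files: List[str] = field(default_factory=list)


@dataclass
class Package:
    """An information package carrying only package-level property groups."""
    package_id: str
    groups: List[PropertyGroup] = field(default_factory=list)

    def has(self, name, value):
        """True if at least one file in the package carries the given property."""
        return any(
            g.name == name and g.value == value and g.files
            for g in self.groups
        )


def packages_with_format(packages, fmt):
    """Answer 'give me all packages that contain format XY' without per-file indexing."""
    return [p.package_id for p in packages if p.has("format", fmt)]


# Illustrative data only.
aip1 = Package("aip-1", [PropertyGroup("format", "text/html", ["a.arc#1", "a.arc#2"])])
aip2 = Package("aip-2", [PropertyGroup("format", "image/gif", ["b.arc#1"])])
print(packages_with_format([aip1, aip2], "text/html"))
```

The query only touches one record per property and package, which is the point made in the notes: the per-file content metadata never has to be indexed.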

NL NZ
- 2 harvests, 20 TB in total
- they are working on metadata: how much metadata is a lot and how much is too little
- library policy says that as much metadata as possible must be stored, but that would be a problem in terms of the size of the metadata
- for selective web harvests they have a finished workflow in WCT; everything is catalogued

IA
- 16 billion URLs
- the oldest from 1996
- growth is 3 TB per day, 1 PB per year

Euan Cochrane, Dirk von Suchodoletz: Replicating Installed Application and Information Environments onto Emulated or Virtualized Hardware

- capturing and preserving an entire environment on emulated hardware
- e.g. taking the desktop environment of the prime minister and storing it in the archive
- problems with rendering
- computer forensics
- an option for preserving scientific data and records
- the whole thing is about how to replicate an HDD and run the environment it holds in a virtual environment
- solution: they pulled the HDDs out of several old PCs > identify the hardware requirements (analysis of the HDD > automatic estimate of the requirements; this information is part of every PC environment) > choose emulation/virtualisation software (a tool registry such as TOTEM from the KEEP project) > convert the HDD into a disk image suitable for emulation > try to boot the disk image on the emulated hardware > add drivers
- problems with licences, personal data protection and authenticity (20% of things change, colours etc.)
- QEMU, SPARC processor emulator

Klaus Rechert: Remote Emulation for Migration Services in a Distributed Preservation Framework
Using emulation as a tool for migration.

Tools for migrating certain formats are often unavailable. The digital object is placed into an emulated environment (a virtual machine), where it becomes visible inside the emulated system; it can be opened in the original or another suitable application, saved in a different format and stored again in the virtual machine.

Bram Lohman: Emulation as a Business Solution, the Emulation Framework (KEEP project)

- emulation framework: 7 emulators, 6 platforms (x86, Amiga and others), 23 file formats
- a solution for managing emulation tools and setting up emulation processes
- an environment that contains emulators; if an application or file is loaded into it, it should run as in its original environment
- the environment also contains a tool that shows, for a given file, what format it is and what environment is needed to run it, based on PRONOM; that environment can then be prepared directly and the file run in it
- it loads the software image from an OPF application database, which is being built

Geoffrey Brown: Developing Virtual CD ROM Collections, The Voyager Company Publications

- publications of one particular publisher on CD, interactive applications for the Mac; of 200 published titles only about 50 are available today
- emulation on present-day systems
- HDD snapshot directly in the emulator, i.e. it takes one click and is very fast
- SheepShaver emulator

Evaluation of a large Danish migration project
- before 1998 they had no formats prescribed by law
- between 2005 and 2008 they introduced standards
- the evaluation concerns the prescribed standards and the migration into them at the national archive
- the evaluation was carried out for the funder
- between 2005 and 2008 they spent 30 person-years on the migration with 10 to 15 people working on it; 190 thousand USD was invested, total costs 26 million USD

It is not much data: what they actually migrated was about 1777 GB.

- various parts of the archive: tapes, population data, data on CD-R, registries and electronically supplied data
- they could not read all the files, especially on the tapes
- 5 different types of tape
- some had to be rescued at great expense

Total cost of producing the preservation standard: 10 man-years, 12 thousand USD, including the manual and implementation recommendations.

Pilot: project planning and management, and verification of the information packages.

The aim of the pilot was to obtain a better budget and project plan.

Some problematic data in old formats, such as old databases, need smart people doing repetitive, long-running work; good knowledge management was needed to make this efficient.

Migration method: they wrote requirements for a tool and a description of how manual migration should be done.

Data preparation (restructuring the data and registering the metadata of the IPs) and preparation of documentation for the migrated IPs.

Software development: in-house development.

They needed 50 person-years for 1

Conclusions

Migrating standard data is cheaper, migrating from some standard tapes is cheaper, etc.

Most of the costs, 80%, fell on non-standardised data, in building the migration software. Developing a tool for migrating heterogeneous or non-standard data is the most expensive part.

What they learned: the old data had not been analysed sufficiently.

Project management was loose; they lost money.

Knowledge management: a good description of the old data and of all their types, generations, locations, etc. does not exist at NK and we will have difficulties with it; migration of old data at NK will be problematic.

Angela Dappert: robust migration workflow for offline media
- What is an archival object? A nice slide: a CD is not an archival object; for them it is a hand-held carrier. Better is a bit-stable object, which can have a backup etc., and on top of it the archival object, which carries additional metadata (logical preservation)
- A CD is not searchable, cannot be replicated easily, has a large manual overhead, and its rendering technology becomes obsolete very quickly
- Endangered Archives project: optical discs, CD-R, external HDs, tapes; 67 terabytes in total
- the offline hand-held carriers in the Endangered Archives project were very varied; they contained data with DRM, under copyright, and a number of similar problems
- options they chose between for each data source:
- disk image: a single file that contains everything on the carrier
- or extraction of only some of the files
- How important is the carrier itself? We need some information about it; it may hold traces of deleted data, which we may want to keep. They made disk images of everything possible: hybrid DVDs, audio discs that also held data, etc.
- Which disk image format should be used? Not a single disk image format for all data; each special case gets its own disk image format
- They used disk copying robots; in some cases large-scale disk copying robots could not be used, since they are good at producing CDs but not at ripping data from CDs

- They built their own application with disk stacks and some smaller robots; they tried LIFO and FIFO and ended up using FIFO, because LIFO had problems with the CD lifter

At the KB they thought through a fairly complex workflow, how to describe it, etc.

Each robot had its own PC.

They had problems with a number of things; see the presentation.

They did not find good software for managing the images, only command-line tools, and non-technical staff would make a lot of mistakes with those.

It takes a lot of people before the content gets online.

Staff must be well trained and flexible, but also able to do tedious jobs: systematic and patient.

NOTE: important for NK, where the transfer of data from discs will also have to be addressed, and already has been.

KEEP project, Antonio Ciuffreda: towards an integrated migration environment

- Disk transfer tool: converts a disk into an image file. The image also carries additional metadata about the file system, the MD5 of the file, etc.
- KEEP is producing a Transfer Tool Framework
- Magnetic media: disk transfer tools for floppy disks, commercial and open source
- Disk2FDI: commercial DOS tool, produces a very accurate image of a floppy disk, but takes 1 hour and the result is very large, ten times larger than the floppy disk itself. They tested about 2260 disks and tested them in emulation
- Catweasel: commercial tool, a PCI card, runs on Linux and Windows XP, has a GUI. High error rate but faster; image file quality was low
- Nibtools: free tool, G64 and D64 formats, covers only the C64; DOS, Windows, Linux, but command line only. Needs a Commodore disk drive and special cables. They tested a few disks; about half of them then did not work in the emulator
- Optical media: they used 5 transfer tools, each with the same CDs, DVDs and games
1 Alcohol 120: commercial, can bypass DRM etc., supports Windows systems. 12 out of 13 worked, 2 MB per second
2 Daemon Tools: commercial, several types of image files (ISO, MDS, MDF), supports Windows; three out of 13 did not work
3 CloneCD: commercial, uses IMG or ISO, bypasses SafeDisc 3 protection, supports Windows, has a GUI; 11 out of 13 were OK
4 Blindwrite: commercial, supports DVD, Blu-ray, WM, Xbox and other special discs. Generates ISO and some proprietary ISO image formats; one did not work; transfer speed
5 ImgBurn: does not read subchannel information into the image file (a film cannot be scrubbed through, etc.); open source; generates DVD, BIN, CUE, IMG; Windows and Linux; 4 did not work; fast

Conclusions

For magnetic media, the performance of commercial and non-commercial tools does not differ; Disk2FDI is accurate but very slow. KEEP will use NibTools.

Optical media: copy protection has to be taken into account; the tools have similar performance; there will always be errors in the images, between 30 and 10 percent; Blindwrite handles game discs, Xbox, etc. KEEP will use ImgBurn because it is open source.

For complex images Blindwrite is better.
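The disk transfer step described in this talk (an image file accompanied by file-system metadata and an MD5 checksum) can be sketched as follows. This is an illustration only, not the KEEP Transfer Tool Framework: the function names and the layout of the metadata record are assumptions, and the image file is assumed to have been produced already by one of the tools above.

```python
import hashlib
import json
import os


def md5_of_file(path, chunk_size=1 << 20):
    """Compute the MD5 of a file in chunks, so large disk images fit in memory."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def describe_image(path, file_system):
    """Build a small metadata record for a disk image (hypothetical layout)."""
    return {
        "image": os.path.basename(path),
        "size_bytes": os.path.getsize(path),
        "file_system": file_system,  # supplied by the operator, not detected
        "md5": md5_of_file(path),
    }


def write_sidecar(path, file_system):
    """Store the record next to the image as <image>.json and return its path."""
    sidecar = path + ".json"
    with open(sidecar, "w", encoding="utf-8") as handle:
        json.dump(describe_image(path, file_system), handle, indent=2)
    return sidecar
```

Recomputing `md5_of_file` later and comparing it with the stored value gives a basic fixity check on the image.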

Benefit for NK

Consider whether NK should in fact run a project to migrate the content of CDs and DVDs to online media. The speakers present concrete experience with robotic processing and show what problems they had in designing the workflow, choosing the type of ISO image, etc.

BnF web archiving
They have three layers:

Harvest definition: collection

Harvest instance: crawling metadata

ARC files

A collection will be, for example, the selective harvest for the 2000 elections, then come the individual harvest instances and then the ARC files. They collect logs, config and reports. These are also stored in ARC, as special metadata ARC files for each crawling instance.

PREMIS

Object, agent, event

Objects: 1 ARC files and metadata ARC files, 2 harvest instances

Harvest event in PREMIS: the event is the creation of the content files

Events: reports as extensions of the event, the host report and the harvest report

Agents: administrator, software, institutions and organisations that perform the harvest
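The mapping of harvest data onto the three PREMIS entities listed above (objects, events, agents) can be pictured with a few plain data classes. This is a heavily simplified sketch under stated assumptions: it is not PREMIS XML and not BnF's implementation, and all class names, field names and example values are hypothetical.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Agent:
    """Who performed the harvest: an administrator, software or an organisation."""
    name: str
    agent_type: str


@dataclass
class Event:
    """A harvest event; the reports stand in for the event extensions in the notes."""
    event_type: str
    agents: List[Agent] = field(default_factory=list)
    reports: List[str] = field(default_factory=list)


@dataclass
class HarvestObject:
    """An ARC file, a metadata ARC file, or a whole harvest instance."""
    identifier: str
    object_type: str
    events: List[Event] = field(default_factory=list)


# Illustrative values only.
crawler = Agent("crawler software", "software")
harvest = Event("creation of content files", [crawler], ["host report", "harvest report"])
arc = HarvestObject("example-0001.arc", "ARC file", [harvest])
print(arc.events[0].reports)
```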

ContainerMD

httpbibnumbnffrcontainerMD v1documentationcontainerMD v1html

special metadata for material from the web archive

httpbibnumbnffrcontainerMD v1

Different SLAs for different types of material and for different data from different harvests; they expect various benefits from the shared repository; skills for different formats need not be duplicated within the institution.

Next year a JHOVE2 module for WARC should also exist.

Memento: a meta search engine

Benefit for NK

Their web archiving model could be used at NK.

Cost models: Danish national library + TU Wien
- Stephan Strodl, TU Vienna: they have their own cost model, but it seems to cover only the cost of small-scale automated preservation actions
- the Danish national library built its own model, which is meant to be universal and usable anywhere
- they measured the cost of submission according to the PAIMAS standard
- when calculating costs they use OAIS and PAIMAS, map activities onto these models and then estimate process costs accordingly
- costmodelfordigitalpreservation.dk

Benefit for NK

Relevant to project 0136: the session dealt with ways of estimating the costs of long-term storage.

Meet RODA, a Full-Fledged Digital Repository for Long-Term Preservation

- Originally a project of the Portuguese national archive, which we have been following for several years. RODA is now supported by an independent company, which also partly continues its development. So far RODA supports only the archival metadata format (EAD), but further development should also include library formats
- RODA is now part of the SCAPE project, where the system can be developed further and scaled for use in mass production
- http://redmine.keep.pt/projects/roda-public

Benefit for NK

Follow further development, possibly also for the INCAD + KNAV projects; for developing LTP for smaller institutions this could be an interesting alternative in the future.

Report from a foreign business trip

Name and surname of the participant: Mgr. Andrea Fojtů (AF)

Workplace according to the organisational structure: 1.5 Department of Long-term Preservation of Digital Data (ODODD)
Workplace, position: 1.5.2 Department of Digital Repository Content Management
Reason for the trip: participation in the conference Aligning National Approaches to Digital Preservation
Place, city: Tallinn
Place, country: Estonia
Date (from-to): 22-26 May 2011

Detailed schedule:
Monday 23 May: registration at the conference, guided tour of the National Library, Keynote Address by Laura Campbell (Library of Congress, USA), Panel 1 Technical Alignment, Panel 2 Organizational Alignment
Tuesday 24 May: Keynote Address by Gunnar Sahlin (National Library of Sweden), Panel 3 Standards Alignment, Panel 4 Legal Alignment, Breakout Sessions for panels 3 & 4
Wednesday 25 May: Panel 5 Education Alignment, Panel 6 Economic Alignment, Breakout Sessions for panels 5 & 6, Synthesis, Closing remarks

Fellow travellers from NK: PhDr. Bohdana Stoklasová (BS), Ing. Tomáš Svoboda (TS)

Funding: IOP NDK
Goals of the trip: attendance at a conference with international participation, making contacts in the area of long-term preservation of digital documents and electronic legal deposit, gaining a more detailed insight into long-term preservation of digital documents, especially in national libraries

Fulfilment of the goals of the trip (specifically): The conclusions of the conference Aligning National Approaches to Digital Preservation largely repeat the conclusions of the workshop The Future of the Past: Shaping new visions for EU research in digital preservation (report available at http://cordis.europa.eu/fp7/ict/telearn-digicult/future-of-the-past_en.pdf), for example regarding the missing economic model for the commercial sector, which would treat long-term preservation as an integral part of all its processes. Contact was established with staff of the national libraries of Estonia and Finland (staff working on web archiving and on long-term preservation in general).

Programme and further details: The main aim of the conference was to align national approaches to long-term preservation of digital documents across all areas, from technical, organisational and educational to standardisation, economic and financial.

Materials brought back: conference programme, leaflets of exhibiting companies (Tessella, Equella, Guardtime) + further materials, notes

Date of submission of the report: 8 June 2011
Signature of the person submitting the report:

Signature of the supervisor:

Posted on the Intranet:
Received by the international department:

Attachment to this report: conference notes in English

Attachment: conference notes in English

Exploring What We Can Do Together: Strategic Alliance for International Collaboration, Laura Campbell

185 digital preservation partners in more than 25 countries (education, research, LAM)
strategic goals: National Content Stewardship Network (national digital collection, technical architecture, public policy, outreach)
NDIIPP Content Domain Map: a mind map of geospatial, audiovisual, image & text and web content
then & now: cognitive surplus vs digital libraries/digital preservation
solution framework: actively working together, special interest groups, establishing a common index, international digital collection (freely available)

PANEL 1: Technical Alignment (The role of testing), Prof. Dr. Michael Seadle (Panel Head)

to collaborate on requiring and implementing rigorous and independent tests

DNB Contribution to the Tallinn Alignment, Sabine Schrimpf

key theme is infrastructure
network of hardware and software that permits operation of application software
question of interoperability is crucial (standards, technical specifications)
software elements
components of the DP infrastructure were compared to the pallets at railroads
o Source: PARSE.Insight Roadmap 2020
persistent ID resolvers, certification process
kopal (ingest, koLibRI)
nestor (German Network of Expertise)
DP4Lib: Digital Preservation as a Service, reduce dependency between components
o redundant storage at different locations
o koLibRI modules
LUKII: set up as an economical LOCKSS network in Germany
SHAMAN
APARSEN: wants to bring coherence and cohesion to digital preservation research
o trends in DP research projects
o modular DP systems
o distributed as SOA
o elimination of technology dependencies

EDINA: The UK LOCKSS Alliance, Adam Rusbridge

EDINA offers underlying technical support & coordination
threats to digital stewardship
o failure (media, HW, SW, network, format obsolescence, natural disaster, economic/organization failure)
o attack (insider/outsider)
o operator error
source: Requirement to Digital Preservation
projects PEPRS and PECAN help identify coverage and requirements for DP

Public testing, Michael Seadle

traditional physical archiving relies heavily on trusted institutions
distrust, not trust, needs to be the basis of digital archiving; testing therefore plays a key role
goals of testing: demonstrate functionality, reveal weaknesses, provide data for planning improvements
key issues for testing: integrity, authenticity (can the origin or genuineness be shown), usability (can migration/emulation be demonstrated), access, financial integrity
DrWho (drwho1com)
bit stream testing is the most important; authenticity and usability may be impaired
o the type of storage media, the number of copies + frequencies of checking and replacement get us to the relevant results
o no reliable metrics exist, however (what is an acceptable loss etc.)
without well documented, peer reviewed, publicly available test results, librarians are buying archiving systems on faith

Presentation without a title, Andy Rauber

evaluation vs testing vs benchmarking
DP testing: evaluation rather than testing, far from benchmarking (few tests, but not near a definition of benchmarks)
o existing evaluations are not repeatable
o focus on the simple things
o building the frameworks before having clear test scenarios
necessary to move towards comparative benchmarking
what is needed: a commitment that we want a culture of benchmarking and comparative evaluation, understanding of what we want to benchmark, benchmark data + ground truth, measurement scales and measures that remain constant, knowledge bases to collect these

Organizing digital preservation on an international level, Michelle Gallinger (NDIIPP)

focus on a national DP agenda
community driven, action oriented (National Digital Stewardship Alliance)
o present a distributed national digital collection for the benefit of citizens

The European Research Arena, David Giaretta

technologies: GEANT, EGEE/EGI
EU research projects: TIMBUS, BLOG4EVER, SCAPE, ENSURE, APARSEN, ARCOMEM, WF4EVER
o SCIDIP-ES (2011-2014)
Alliance for Permanent Access (APA) formed as a legal entity 3 years ago
o opportunities for networking
ISO 16363 Audit and Certification of Trustworthy Digital Repositories
ISO 16919 Requirements for Bodies Providing Audit and Certification

Observation from the MetaArchive Cooperative program, Martin Halbert

distributed DP programs differ from other programs
o replication of content, distribution of these replicated copies to distinct geographical locations, and network organization to connect these replicated copies
MetaArchive established in 2003, funded by NDIIPP
o seeks to foster broader awareness of digital preservation issues
IIPC members are all institutions that focus on WA
o 39 members: national and university libraries + other organizations (Internet Archive)
o ISO standard WARC format for web archives + Heritrix and NutchWAX
o growing membership (Africa & South America)

DAY 2

Keynote address
International and National Collaboration in the digital age, Gunnar Sahlin

2012: a new law for e-deposit
Samsök (search together) in 2005 (upgrade, new system)
Swepub and long term preservation
Consortium of the Swedish research libraries for licensing e-journals and databases (ICOLC)
Open Access and e-publishing (all universities have their repositories for e-pubs)
NL aggregator for the Europeana: TEL, Apres, Athena, EUscreen
o common system for the preservation of digital materials
o common search portal for materials from the Swedish National Library and Swedish National Archive

Raivo Ruusalepp

standards: RAC, DSA, CIDOC (CRM), PAIMAS, ISA (DG), DDI
use of information security standards for digital preservation
information security: administrative and technical (physical = data security vs IT = communication)
company implemented security measurement with typical cyber crime scenarios
survey of security
o provision for information security in national legislation and development plans (12 of the respondents ISO 27000 series, only 2 formal audits, the rest are looking into it; ½ of the respondents do not use standards or formal measurements to control information security)
o IT & disaster plan 65 (data recovery from the off-site location tested 0)
alignment
o better use of community standards for information security and preservation
o agreement on security requirements

Standards based approach to preservation planning, Matthew Woollard

ISO 27001: very expensive implementation, 100 000 pounds
Basic Data Seal of Approval Guidelines (helps understand your business better)
Audit and Certification of Trustworthy Digital Repositories
Memorandum of Understanding to Create a European Framework
ISO 16363 external, DSA or ISO 16363 self audit

Best Practices & Standards, Bram van der Werf

self assessment, trust audits
certification, trust
ISO 30300 (draft) Record Management

PANEL 4: Legal Alignment

Legal deposit & Web Archiving, Adrienne Muir
legal deposit provisions: purpose, scope, deposit mechanisms, roles & responsibilities & liability, access provisions, sanctions
implementation: definitions, scope, offline/online, freely available/paywall, technology neutral, incremental
legal deposit vs voluntary (interim/hybrid approach, model agreements and licenses, flexibility)
other legal issues: intellectual property rights, preservation, access, unlawful material, privacy/data protection
voluntary approaches have disadvantages but may be necessary and can be useful

Breakout Session

standards bring along alignment by themselves, but only if you use them to the full, not halfway
depends on community (users), enforceable or voluntary compliance (standards)
next steps for standard alignment
o corpora as benchmarks
o export/import completeness
o educational standards
o validation tools
o accredited training courses to accredit auditors
o framework standards

DAY 3
PANEL 5: Educational Alignment

key elements of the DCC Curation Lifecycle Model
Framework for the Education Alignment (USA: grad programs for digital preservation, new models for grad programs, internship programs (diverse knowledge), workshops)
o sharing tools
o national programs
related issues to digital preservation
o nature of costs and business models
o strategies for selection & appraisal
o ground roles and responsibilities
o effectiveness and demand for services

focus of the panel: factors influencing the actual sustainability of a digital archive
2 considerations: collaboration + user demand
challenges + gaps: span national boundaries, public + private funding, education, exportation, DP certification, competition, funding gaps, policy, selection criteria, roles & responsibilities, standards
Magazzini Digitali: e-legal deposit in Italy
PADI: a failed project (discontinued in 2010)

Presentation without a title, Neil Grindley

how much does it cost to manage information; what institutional financial strategies are required to facilitate effective preservation; what general economic frameworks are required to enable information to persist and be accessible
JISC 2010 Infrastructure for Education and Research Programme
archival storage and preservation activities constitute a very small proportion of the overall costs: 1531 access 55 outreach acquisition ingest
o approx 333 Euros for a set of 1000 records
KRDS2 p 83: future tool development supporting automation of ingest

Sustainable Preservation in North America: ADPNet & Friends, Aaron Trehub

solution: distributed digital preservation (in at least 3 copies vs LOCKSS 6 copies)
DDP + LOCKSS = PLN, open SW developed at Stanford
MetaArchive, COPPUL (Canada), ADPNet www.adpn.org

Lisa LaPlant and Blake Edwards US Government Printing Office (USA) FDsys OAIS dig MODS - z

<part>2</part>

DMD data model definition z

http://www.gpo.gov/fdsys/search/pagedetails.action?st=prague&granuleId=&packageId=DCPD-200900228

----------------------------------------- Preservation Starts from the Beginning Michael Wash US Department of Transportation (USA)

Autographic Kodak 1916 na druhou

following the No 3A Autographic Kodak Special of 1916 which was the first rangefinder camera It had a Kodak Anastigmat f63 lens and a Kodamatic shutter with speeds from 12 to 1200 sec plus bulb and time mode httpcamerapediawikiacomwikiNo_1A_Autographic_Kodak_Special Kodak Advantix

zastavil

Colorite A Flexible Cross-Platform Software Solution for Automatic Image QualityAnalysis Using Arbitrary Targets Henrik Johansson National Library of Sweden (Sweden)

- a - -

o Target se detekuje automa se aby tento proce

o TIFF JPEG JP2 PNG o o -of-art feature based image matching algoritm ImageMagic

o Alg

t o o v budoucnu vypnout aby se proces urychlil pro BATCH

proce o

Henrika Johanssona What if the Image Quality Analysis Rates My D Dietmar Wueller Image Engineering (Germany)

- Golden thread - UTT targer -

o

-lens reflex

Establishing Resolution Requirements for Digitizing Transmissive Content A Use Case Approach Michael Stelmach Library of Congress Don Williams Image Science Associates LLC and Steven Puglia National Archives and Records Administration (USA)

-

- Based on 10% SFR limiting-resolution criteria: how much of the image information will be captured

o 1st half of the 20th century: 1200-1600 PPI
o 2nd half of the 20th century: up to 2800 PPI

Digitise More Pay Less Optimising the Workprocess for both Heritage Institute and Imaging Provider Olaf Slijkhuis Pictura Imaginis (the Netherlands)

- ument kvalita parametry atd - -

- trolu

obrazu

Debate with FamilySearch, 20. 5. 2011 -------

also on the topic of Digital Preservation

will have to provide free of charge

and synchronize

an effort on their side to cooperate with archives and libraries

CO-FINANCED BY THE EU STRUCTURAL FUNDS (EUROPEAN REGIONAL DEVELOPMENT FUND) THROUGH THE IOP

Business trip report. Project: Creation of the National Digital Library (Vytvoření Národní digitální knihovny)

CZ 10611000706386

Name of trip participant: Jan Hutař

Workplace according to organizational structure: ODF 81

Position: head of department

Reason for trip: attending the iPRES 2011 conference

Place (city): Singapore

Place (country): Singapore

Dates (from-to): 30. 10. - 5. 11. 2011

Detailed schedule: 30.-31. 10. flight Prague > Dubai > Singapore

1. 11. start of the conference, tutorials

2. 11. - 4. 11. conference

4.-5. 11. return flight Singapore > Dubai > Prague

Fellow travellers from NK: Mgr. Marek Melichar (funded from project 0136)

Funding: IOP, Creation of the National Digital Library

Relation to the project: obtaining new information on digital preservation and on projects in other libraries; consultations with colleagues and companies

Trip goals: see relation to the project; use all outputs for planning and running the NDK project and for future handling of digital preservation at NK/NDK

Fulfilment of trip goals: fulfilled; see the detailed notes below and the proceedings on the SPS

Further detailed information: SUMMARY AND BENEFIT FOR THE NDK PROJECT

- a noticeable rise of emulation as an approach to long-term preservation (in past years it was migration) > the two approaches seem likely to complement each other
- a shift towards preservation of complex data (databases etc.); NK does not address this yet
- many contributions are usable for NK and NDK (web archiving and preservation at the national library of France, audits, emulation, information on the SDB system (Tessella) and on the RODA system, certification, see Rouchon, etc.)
- information on problems and solutions when using tools such as JHOVE, PRONOM etc. in a real environment
- 2 contributions on backing up optical discs, a current problem at NK too; ideally follow the procedures described in the proceedings
- a clear need for international cooperation and for adherence to standards so that such cooperation is possible
- for details see below

Project publicity support: N/A

Related materials

Material / Storage location

conference proceedings / SPS, folder with business trip reports

Date of report submission: 15. 11. 2011

Signature of the submitter

Date / Signature

Signature of superior: 15. 11. 2011

Posted on the intranet

Received by the international department

Seamus Ross: digital curation and preservation
- preserving data sets: use of statistical databases and research data vs. preservation of textual information
- personal data are no longer just photos in a box (Flickr etc.)
- banks: lots of personal data, future use for historians; it must be preserved, and institutions already do this too
- see mckinsey.com, big data full report (pdf)
- why do DP at all (slide 16): future generations expect it; for historians and scientists, so that they have some sources, a legacy about the present for the future; an information ecosystem to enable storytelling
- the emphasis shifts from preservation of textual information to preservation of complex databases

A capability model for DP (Ch. Becker et al.)
- research within the SHAMAN and SCAPE projects
- SoS: systems of systems
- 3 kinds of systems (UML)
- DPS as a functional requirement
- SoS: a business system, a system within a system; data then flow into the DPS
- DPS where DP is not a functional requirement but the system does it anyway (DP-ready system): a business system with DP functionality
- but how to get DP into enterprise systems: a model for implementing DP in any system, within the SHAMAN project: a capability-based reference architecture
- governance, business and technical (operation) capabilities: a basis for decisions and for assessing the current state
- capability maturity model (CMM): process assessment and improvement, as in SW development

Olivier Rouchon: certification and quality at CINES
- they store theses, digitized items, multimedia, documents, scientific data sets
- a data centre for the whole of France
- they have experts on formats and XML; 11 people
- 15TB of data
- certification: by national law CINES is the national centre for DP of theses; they have a department, people and money for it; for the procedure and preparation see below
- preparation for certification: testing DRAMBORA, DSA, TRAC, ISO 16363 and ISO 16919
- step: 2009 DRAMBORA audit; 2 risk reviews per year, tracking how the risks are being addressed
- step: formalization of business processes, 14 processes according to ISO 9001
- management, operational and support processes (presented at iPRES 2010)
- 2009: external pre-audit, 2 people, 19 man-days, based on all available standards, by means of documentation review and interviews
- 2010: SIAF audit, 4 months; the National Archives of France do this for every archive that stores public data; they audit every 3 years; the report had 800 pages

- 2010: Data Seal of Approval, part of the EU framework for audit and certification of trusted repositories (MoU between the three certification activities)
- 2011: within the APARSEN project they also did ISO 16363; together with DANS and UKDA they went through that audit
- first internal, January and April 2011 (60 man-days), then external, 12 experts (KB, BL, NASA etc.) in June 2011 (3 days)

MoU: DSA > ISO 16363 as the second step (internal) > ISO 16363 extended
The audit gets faster the more of them you do, i.e. if it is done regularly it is not so time-consuming.

The National Library of New Zealand went through TRAC certification in autumn 2011.

Andreas Rauber: impact of preservation actions on the repository
- what happens to the repository itself
- repository simulation: RepoSim
- for analysis and for testing migration: what happens when files grow, what if the repository holds more format types, etc.
- RepoSim: a flexible simulator, irregular patterns
- so far an internal version: Hibernate, Java, MySQL
- one can specify which formats it accepts, their description, ingest settings, hypothetical tools (mainly for migration), and rules for preservation actions (migration to which format, which versions, which files, how many times; rules + filters)
- a virtual migration can be run; it produces graphs showing how long it will take etc.
- what to migrate, how, to what and for how long: it runs virtually and we see the result
- good for IT and HW planning
- good for planning different scenarios, comparing them with the expected development, and planning HW expansion and investment
- still to be finished: the option to enter deletion policies, reports etc.
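The idea of a "virtual migration" can be illustrated with a very small sketch. This is not RepoSim (which the notes describe only as an internal Java/Hibernate/MySQL tool); the class names, the holdings and the tool throughput figures below are all invented for illustration, and the model assumes a migration tool processes data at a constant rate.

```python
from dataclasses import dataclass


@dataclass
class FormatHolding:
    """How much content of one format the (hypothetical) repository holds."""
    name: str
    file_count: int
    avg_size_mb: float


@dataclass
class MigrationRule:
    """A preservation rule: migrate one format to another with an assumed tool speed."""
    source_format: str
    target_format: str
    throughput_mb_per_s: float  # assumption: constant tool throughput


def simulate_migration(holdings, rules):
    """Estimate hours needed per rule without touching any real files."""
    by_name = {holding.name: holding for holding in holdings}
    estimate = {}
    for rule in rules:
        holding = by_name.get(rule.source_format)
        if holding is None:
            continue  # the rule does not match anything in the repository
        total_mb = holding.file_count * holding.avg_size_mb
        seconds = total_mb / rule.throughput_mb_per_s
        estimate[(rule.source_format, rule.target_format)] = seconds / 3600
    return estimate


# Invented example figures, not data from the talk.
holdings = [FormatHolding("TIFF", 1_000_000, 36.0), FormatHolding("ARC", 15_000, 100.0)]
rules = [MigrationRule("TIFF", "JP2", 50.0), MigrationRule("ARC", "WARC", 100.0)]
hours = simulate_migration(holdings, rules)
```

Changing the holdings (larger files, more formats) and re-running gives the kind of what-if comparison the notes mention for IT and HW planning.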

José Barateiro: Risk assessment in DP of e-science data and processes
- DP as risk management
- ISO 31000: definition of risk management
- similar to DRAMBORA
- there are many standards for risk management
- the ISO 31000 methodology broken down into individual steps
- TIMBUS project httptimbusprojectnet; one of the partners is SAP (Germany)

mad talks
- open-source SW for LTP: RODA is back and is being developed within the SCAPE project; new functions, plans for development and for building a user community
- 4 posters on emulation: emulation within KEEP, emulation for reading rooms in libraries, OPF ecosystem registry
- TOTEM: a metadata standard for describing the technical environment for emulation

ANDS, Ross Wilkinson
- data centres in Australia, at least 3 for different areas of life
- ANDS has existed for almost 3 years; funding from the Australian government
- huge amounts of data will never be used or read by a human, only by automated processes (data mining)
- research data must be stored and preserved because it may no longer be possible to recreate them; so that they can be reused, so that new conclusions can be drawn from them, and so that scientists have them available
- this must be done cooperatively; it cannot be done by a single institution alone
- who handles the storage of scientific data in the Czech Republic? The Academy of Sciences? CESNET?
- similar data centres also exist in Great Britain

Rob Sharpe: Considerations for High Throughput Digital Preservation
Presentation by the company Tessella: their performance testing of ingest into SDB at FamilySearch

- SDB has been developed since 2002, when the first customer was the National Archives (UK)
- new customer: UK Parliament
- test with FamilySearch
- 20TB ingest per day; scanned materials; a workflow with antivirus, characterization (PRONOM, JHOVE) etc.
- 1 package is roughly 1GB; 20 thousand packages per day
- 2 Dell PowerEdge R710 servers, total price max. 20,000 pounds
- the limiting factor turned out to be the read speed of the disks on which the data sit at the start of ingest; they therefore needed 130 parallel disks (50 thousand pounds)
- stored on tapes, which is also slow; they therefore needed 8 parallel tape writes (30 thousand pounds)
- storage costs 100 pounds per TB
- 7.3 PB per year
- conclusion: writing and reading are slow; tools such as JHOVE and PRONOM are fast enough; storage costs also turned out to be high
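The figures quoted from the Tessella talk can be cross-checked with simple arithmetic. The sketch below assumes decimal units (1 TB = 1,000,000 MB) and a steady ingest spread evenly over 24 hours; the function names are made up for this note. Under those assumptions 20 TB per day is roughly 231 MB per second sustained, 20,000 packages of 1 GB, 7.3 PB per year, and 730,000 pounds per year at 100 pounds per TB.

```python
SECONDS_PER_DAY = 24 * 60 * 60


def sustained_mb_per_s(tb_per_day):
    """Average transfer rate needed to move tb_per_day in 24 hours (1 TB = 1,000,000 MB)."""
    return tb_per_day * 1_000_000 / SECONDS_PER_DAY


def packages_per_day(tb_per_day, package_gb):
    """Number of packages of package_gb gigabytes in the daily volume."""
    return tb_per_day * 1000 / package_gb


def yearly_volume_pb(tb_per_day):
    """Volume accumulated over 365 days, in petabytes."""
    return tb_per_day * 365 / 1000


def yearly_storage_cost(tb_per_day, cost_per_tb):
    """Cost of storing one year's ingest at a flat price per TB."""
    return tb_per_day * 365 * cost_per_tb


rate = sustained_mb_per_s(20)          # about 231.5 MB/s
packages = packages_per_day(20, 1.0)   # 20,000 packages
volume = yearly_volume_pb(20)          # 7.3 PB
cost = yearly_storage_cost(20, 100)    # 730,000 pounds
```

The average rate is well below the maximum of 700 MB per second quoted further down in these notes, which presumably refers to a peak rather than a sustained figure.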

For the ingest of data from the FamilySearch project they needed to ensure a throughput of 20TB of data per day while retaining adequate procedures for processing the data according to the requirements of OAIS and of the customer. The project aimed to identify the bottlenecks of ingesting large volumes of data.

Processes such as generating hashes or checking them, identifying formats and extracting technical metadata usually require, at large volumes, a large and fast storage system. In the FamilySearch project they want to ingest into SDB (content acquisition, content preparation, ingest, fixity check, content/metadata integrity check, characterization, i.e. identification and validation of formats and extraction of technical metadata) at most 700MB per second.

The question, then, is how to parallelize such a massive workflow efficiently while minimizing costs. According to their findings, parallelization makes it possible to work around performance problems of tools such as DROID and JHOVE; overall, software performance was, contrary to their expectations, not a problem. The bigger problems are in the HW, which must be able to write fast enough.

That is, the bottleneck was in the HW and in moving data from place to place rather than in the performance of the digital preservation tools.
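A minimal sketch of a parallel fixity step, using only the Python standard library. It is not how SDB does it (the notes do not say); the function names, the choice of SHA-256 and the worker count are assumptions. It does reflect the point of the talk: hashing is cheap to parallelize, so the disks feeding the workers tend to become the limit.

```python
import hashlib
from concurrent.futures import ThreadPoolExecutor


def file_digest(path, algorithm="sha256", chunk_size=1024 * 1024):
    """Hash one file in chunks so large files never have to fit in memory."""
    digest = hashlib.new(algorithm)
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def digest_many(paths, workers=8):
    """Hash several files concurrently; returns {path: hex digest}."""
    paths = list(paths)
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return dict(zip(paths, pool.map(file_digest, paths)))


def failed_fixity(expected, workers=8):
    """Return the paths whose current digest differs from the recorded one."""
    actual = digest_many(expected.keys(), workers)
    return [path for path, recorded in expected.items() if actual[path] != recorded]
```

Typical use would be to store the result of `digest_many` at ingest and call `failed_fixity` with that mapping during later integrity checks.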

Benefit for NK

There is no need to fear the performance of SW such as DROID or JHOVE.

Ross King: Evolving domains, problems and solutions for LT DP
- info on projects such as SCAPE
- programme httpcordiseuropaeufp7icttelearn digicultreport research digital preservation_enpdf (Stephan Strodl, Petar Petrov, Andreas Rauber, Vienna University of Technology, Austria): a nice timeline for preservation projects, a whitepaper about the past of European DP
- funding spent on DP research is gradually growing; projects and money alone will not solve anything
- ARCOMEM: archiving of web archives, a socially driven web preservation model
- social web analysis
- archive enrichment
- ENSURE: evaluation between cost and value, automation of the preservation cycle; testbeds: healthcare, clinical trials, financial services
- SCAPE
- preservation planning and action workflows: how to make them scalable
- creating an infrastructure for scalable preservation actions
- development of a policy-based preservation planning tool with automatic preservation watch
- 3 testbeds: web archives, large-scale repositories, research data sets
- all projects will produce prototype SW
- digital lifecycle approach
- preservation planning plays a role in all these projects, together with virtualization
- slide with trends in DP over recent years
- Research on Digital Preservation within projects co-funded by the European Union in the ICT
- Ensure
- Scape
- Wf4Ever httpwwwwf4ever projectorgabout
- Timbus: SW is not enough; it focuses on the context of organizations; LTP is not only about objects but also about services etc.
- Totem

Benefit for NK

Follow projects in the area of long-term preservation of digital data. The latest EU projects such as SCAPE will speed up the development of concrete tools for long-term preservation of digital data.

Record keeping in temporary command settings (Erik Borglund)
- preservation of documentation of crisis situations arising from the activities of the police etc.
- how to capture context: flipcharts, videos and notes can be kept; the context of analogue documents is not a problem, the problem is with digital items and conversations
- the national archive should take care of this, but it only accepts paper documents or, for example, photos from the scene
- the question: is archiving the file material the same as archiving the course of the proceedings in digital form?

webarchiving session

BnF
- 200TB of web-archived data
- 15 million ARC files; they must be characterized and validated, which is time-consuming
- they store them in a shared repository
- SPAR (the LTP system of the French national library) has a capacity of 16PB
- they use JHOVE2 for characterization and are building a module for ARCs
- they do not want to characterize and validate the content of the ARCs, only to identify formats
- PREMIS in METS would be too long, so they will record only metadata at the level of the information package (AIP) that are the same for the whole package; i.e. a property is expressed once and then only the information about which files it applies to is added, instead of repeating the information for every file
- they created a special metadata format
- i.e. they can ask the LTP system "give me all information packages that contain format XY" etc.; there is no need to index the metadata of the contents, which would take a long time; they use the same approach for digitized books
- different DP policies and validation levels for different types of web archive data: complete harvests vs. thematic harvests
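The package-level approach described for BnF (state a property once and list the files it applies to) can be sketched as follows. This is only an illustration of the idea, not the BnF metadata format; the function names, package identifiers and format labels are invented.

```python
from collections import defaultdict


def summarize_package(file_formats):
    """Turn {file name: format} into {format: [file names]}.

    Each format is recorded once per package instead of once per file.
    """
    grouped = defaultdict(list)
    for filename, fmt in sorted(file_formats.items()):
        grouped[fmt].append(filename)
    return dict(grouped)


def packages_containing(package_summaries, fmt):
    """Answer 'give me all information packages that contain format XY'."""
    return sorted(
        package_id
        for package_id, summary in package_summaries.items()
        if fmt in summary
    )


# Invented example packages.
summaries = {
    "aip-001": summarize_package({"a.html": "HTML", "b.html": "HTML", "c.gif": "GIF"}),
    "aip-002": summarize_package({"d.pdf": "PDF"}),
}
```

Because the query only inspects the per-package summaries, the contents of each container never have to be indexed file by file, which is the saving the notes point out.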

NL NZ
- 2 harvests, 20 TB altogether
- they deal with metadata: how much metadata is a lot and how much is too little
- the library's policy says that as much metadata as possible must be stored; that would, however, be a problem in terms of metadata size
- for selective web harvesting they have a finished workflow (WCT); everything is catalogued

IA
- 16 billion URLs
- the oldest from 1996
- growth is 3TB per day, 1PB per year

Euan Cochrane, Dirk von Suchodoletz: Replicating Installed Application and Information Environments onto Emulated or Virtualized Hardware

- capturing and preserving the entire environment on emulated HW
- e.g. take the desktop environment of the prime minister and store it in the archive
- display problems
- computer forensics
- an option for preserving scientific data and records
- it is all about how to replicate an HDD and run the environment on it in a virtual environment
- solution: they pulled HDDs out of several old PCs > identify HW requirements (HDD analysis > automatic estimate of requirements; this is part of every PC environment) > choose emulation/virtualization SW (a tool registry such as TOTEM from the KEEP project) > convert the HDD into a disk image suitable for emulation > try to boot the disk image on the emulated HW > add drivers
- problems with licences, personal data protection and authenticity (20% of things change: colours etc.)
- QEMU, SPARC processor emulator

Klaus Rechert: Remote Emulation for Migration Services in a Distributed Preservation Framework
- use of emulation as a tool for migration
- migration tools for certain formats are often unavailable
- the digital object is placed into the emulated environment (virtual machine); we then see it inside the emulated system, can open it in the original or another suitable application, save it in a different format and store it back in the virtual machine

Bram Lohman: Emulation as a Business Solution: the Emulation Framework (KEEP project)

- emulation framework: 7 emulators, 6 platforms (x86, Amiga etc.), 23 file formats
- a solution for managing emulation tools
- setup of emulation processes
- an environment that contains emulators; if we load an application or a file into it, it should run as in the original environment
- the environment also contains a tool that shows, for a file, what format it is and what environment is needed to run it, based on PRONOM; the environment can then be prepared directly and the file run in it
- it loads the SW image from the OPF application database, which is being built
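The "identify the format, then pick an environment" step can be sketched as a two-stage lookup. This is a toy stand-in, not the KEEP Emulation Framework API: real identification is signature-based (PRONOM), whereas this sketch guesses from the file extension, and every format identifier and environment name below is invented.

```python
from pathlib import Path

# Hypothetical lookup tables; identifiers and environment names are invented.
FORMAT_BY_EXTENSION = {
    ".wpd": "example-format/wordperfect",
    ".d64": "example-format/c64-disk",
}
ENVIRONMENTS_BY_FORMAT = {
    "example-format/wordperfect": ["x86 emulator + DOS + word processor"],
    "example-format/c64-disk": ["C64 emulator"],
}


def identify_format(path):
    """Stand-in for format identification: guess from the extension only."""
    return FORMAT_BY_EXTENSION.get(Path(path).suffix.lower())


def suggest_environments(path):
    """Return the emulation environments registered for the file's format."""
    fmt = identify_format(path)
    if fmt is None:
        return []
    return ENVIRONMENTS_BY_FORMAT.get(fmt, [])
```

Keeping the two tables separate mirrors the split in the notes between a format registry and a registry of environments and software images.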

Geoffrey Brown: Developing Virtual CD-ROM Collections: The Voyager Company Publications

- publications of one specific publisher on CD, interactive applications for Mac; of 200 published titles only about 50 are available today
- emulation on today's systems
- HDD snapshot directly in the emulator, i.e. it takes one click and is very fast
- SheepShaver emulator

Evaluation of the Danish large migration project
- before 1998 they had no formats laid down by law
- between 2005 and 2008 they introduced standards
- the evaluation concerns the established standards and migration to them in the national archive
- the evaluation was done for the funder
- between 2005 and 2008 they spent 30 person-years on migration; they had 10-15 people on it; 190 thousand USD was invested; total costs 26 million USD

It is not much data; what they actually migrated was about 1777GB.

Various parts of the archive: tapes, population data, data on CD-R, registries and electronically filled-in data. They could not read all the files, especially on tapes. There were 5 different types of tape. Some had to be rescued at great expense.

Total cost of producing the preservation standard: 10 man-years, 12 thousand USD, including a manual and implementation recommendations

Pilot: project planning and management, and verifying the information packages

The goal of the pilot was to obtain a better budget and project plan

Some problematic data in old formats, such as old databases, need smart people doing repetitive, long-running work; they needed good knowledge management for this to be efficient

Migration method: they wrote requirements for a tool and a description of how manual migration should be done

Data preparation (restructuring data and registering metadata of the IPs) and preparing documentation of the migrated IPs

Software development: in-house development

They needed 50 person-years for 1

Conclusions

migration of standard data is cheaper; migration from some standard tapes is cheaper, etc.

Most of the costs (80%) went on non-standardized data, in producing migration software. Developing a tool for migrating heterogeneous or non-standard data is the most expensive part.

What they learned: the old data had not been analysed sufficiently

Project management: it was loose and they lost money

Knowledge management: a good description of old data and all their types, generations, locations etc. does not exist at NK and we will have trouble with it; migration of old data at NK will be problematic.

Angela Dappert: robust migration workflow for offline media
- What is an archival object (nice slide): a CD is not an archival object, for them it is a hand-held carrier; better is a bit-stable object, which can have a backup etc., leading to an archival object that has additional metadata (logical preservation)
- A CD is not searchable, cannot be easily replicated, has a large manual overhead, and rendering technology becomes obsolete very quickly
- Endangered Archives project: optical disks, CD-R, external HDs, tapes; 67 terabytes in total
- OFFLINE hand-held carriers in the Endangered Archives project were very variable; they contained data with DRM, under copyright, and with a range of such problems
- Options they chose between for each data source:
- Disk image: a single file containing everything that is on the carrier
- Or extraction of only some files
- How important is the carrier itself? Do we need some information about it? There may be traces of deleted data and we may want to keep them. They made disk images of everything possible: hybrid DVDs, audio discs that also held data, etc.
- Which disk image format to use? Not a single disk image format for all data; a specific disk image format for each type
- They did it with robots (disk copying robots); some large-scale disk copying robots could not be used: they are good at producing CDs but not at ripping data from CDs

- They built their own application with disk stacks and some smaller robots; they used LIFO or FIFO; in the end they used FIFO, as LIFO had problems with the CD lifter

At the KB they devised a fairly complex workflow for how to describe it etc.

Each robot had its own PC

They had problems with a number of things, see the presentation

They did not find good SW for image management, only command-line tools, and non-technical staff would make a lot of mistakes with them

It takes a lot of people before it gets online

They must be well trained and flexible, but also able to do tedious jobs: systematic, patient

NOTE: important for NK, where the transfer of data from discs will also have to be addressed, and has already been addressed

KEEP project, Antonio Ciuffreda: towards an integrated migration environment

- Disk transfer tool: converts a disk into an image file. It also holds additional metadata about the file system, the MD5 of the file etc.
- KEEP is producing a Transfer Tool Framework
- Magnetic media: disk transfer tools for floppy disks, commercial and open source
- Disk2FDI: commercial DOS tool, very accurate floppy disk image; it takes 1 hour and the result is very large, ten times larger than the floppy disk itself. They tested about 2260 disks and tested emulation
- Catweasel: commercial tool, a PCI card, runs on Linux and Win XP, has a GUI. High error rate but faster; image file quality was low
- Nibtools: free tool, G64 and D64, covers only C64; DOS, Win, Linux, but command line. Needs a Commodore disk drive and special cables. They tested a few disks; about half of them then did not work in the emulator
- Optical media: they used 5 transfer tools, all on the same CDs, DVDs and games
1. Alcohol 120%: commercial, can bypass DRM etc., supports Win systems. Of 13, 12 worked; 2MB per second
2. Daemon Tools: commercial, several image file types (ISO, MDS, MDF), supports Win; three of 13 did not work
3. CloneCD: commercial, uses IMG or ISO, bypasses SafeDisc 3 protection, supports Win, has a GUI; 11 of 13 were OK
4. Blindwrite: commercial, supports DVD, Blu-ray, WM, Xbox and other special discs. Generates ISO and some proprietary image formats; one did not work; transfer speed
5. ImgBurn: does not read subchannel information into the image file (a film cannot be seeked etc.); it is open source; generates DVD, BIN, CUE, IMG; Win and Linux; 4 did not work; it is fast
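What a disk transfer tool does at its core (copy the carrier to an image file and record technical metadata such as an MD5 checksum) can be sketched in a few lines. This is a generic illustration, not the KEEP Transfer Tool Framework or any of the tools listed above; the function name and the metadata keys are invented, and it ignores copy protection, read errors and subchannel data, which are exactly the hard parts the talk reported.

```python
import hashlib


def create_image(source_path, image_path, chunk_size=1024 * 1024):
    """Copy source_path byte for byte to image_path and return basic metadata.

    source_path may be a regular file or, with sufficient permissions,
    a block device such as a floppy or optical drive.
    """
    md5 = hashlib.md5()
    size = 0
    with open(source_path, "rb") as source, open(image_path, "wb") as image:
        while True:
            chunk = source.read(chunk_size)
            if not chunk:
                break
            image.write(chunk)
            md5.update(chunk)
            size += len(chunk)
    return {
        "source": str(source_path),
        "image": str(image_path),
        "size_bytes": size,
        "md5": md5.hexdigest(),
    }
```

The returned dictionary could be stored next to the image as a sidecar record, so that later fixity checks can compare against the checksum taken at transfer time.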

Conclusions

For magnetic media, commercial and non-commercial: performance does not differ; Disk2FDI is accurate but very slow. KEEP will use NibTools.

Optical: copy protection must be considered; the tools have similar performance; there will always be errors in the images, between 30 and 10 percent; BlindWrite can handle game discs (Xbox etc.). KEEP will use ImgBurn because it is open source.

For complex images BlindWrite is better.

Benefit for NK

Consider whether it would be appropriate for NK actually to run a project to migrate the content of CDs and DVDs to online media. Concrete experience with robotic processing is presented here, showing what problems they had with designing the workflow, choosing the ISO image type, etc.

BnF: web archiving
They have three layers:

Harvest definition: collection

Harvest instance: crawling metadata

ARC files

A collection will be, for example, the selective crawl "elections 2000", then the individual harvest instances, and then the ARCs. They collect logs, config and reports. These are also stored in ARC: a special metadata ARC for each crawling instance.

PREMIS

Object, agent, event

Objects: 1. ARC files and metadata ARCs; 2. harvest instances

Harvest event in PREMIS: event = creation of content files

Events: reports as event extensions (host report and harvest report)

Agents: administrator, SW, institutions, organizations that perform the harvest
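The object/event/agent structure noted above can be written down as a small data model. This is only a reading aid built from these notes, not the PREMIS schema or BnF's implementation; the class and field names, and the example values, are assumptions.

```python
from dataclasses import dataclass, field


@dataclass
class Agent:
    """Who or what performed the harvest (administrator, software, institution)."""
    name: str
    role: str


@dataclass
class HarvestEvent:
    """The event that created the content files, with reports attached as extensions."""
    event_type: str
    agents: list = field(default_factory=list)
    reports: dict = field(default_factory=dict)  # e.g. host report, harvest report


@dataclass
class HarvestInstance:
    """One crawl: its ARC files, its metadata ARCs (logs, config, reports) and its events."""
    identifier: str
    arc_files: list = field(default_factory=list)
    metadata_arcs: list = field(default_factory=list)
    events: list = field(default_factory=list)


# Invented example values.
crawler = Agent(name="crawler software", role="software")
event = HarvestEvent(
    event_type="creation of content files",
    agents=[crawler],
    reports={"host report": "hosts.txt", "harvest report": "crawl.txt"},
)
instance = HarvestInstance(
    identifier="harvest-0001",
    arc_files=["content-0001.arc"],
    metadata_arcs=["metadata-0001.arc"],
    events=[event],
)
```

A collection (harvest definition) would then simply be a named list of such harvest instances, matching the three layers described earlier in the talk.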

ContainerMD

httpbibnumbnffrcontainerMD v1documentationcontainerMD v1html

special metadata for items from the web archive

httpbibnumbnffrcontainerMD v1

Different SLAs for different types of material and for different data from different harvests; in a shared repository the participants expect different benefits; skills for different formats do not need to be duplicated within the institution

Next year a JHOVE2 module for WARC should also exist

Memento: meta search engine

Benefit for NK

Their web archiving model could be used at NK

Cost models: Danish national library + TU Wien
- Stephan Strodl, TU Wien: they have their own cost model, but only small scale; it seems to cover automated preservation action cost
- Danish national library: they made their own model, which should be universal and usable anywhere
- They measured the cost of submission according to the PAIMAS standard
- When calculating costs they use OAIS and PAIMAS, map activities onto these models and estimate process costs accordingly
- Costmodelfordigitalpreservationdk

Benefit for NK

For project 0136: options for estimating the costs of long-term storage were addressed there

Meet RODA, a Full-Fledged Digital Repository for Long-Term Preservation

- Originally a project of the Portuguese national archive; we have been following it for several years. The RODA system is now supported by an independent company, which also partly develops it further. So far RODA supports only an archival metadata format (EAD), but further development should include library formats as well
- RODA is now part of the SCAPE project, where it will be possible to develop the system further and scale it for use in mass production
- httpredminekeepptprojectsroda public

Benefit for NK

Follow further development, possibly also for the INCAD + KNAV projects; for developing LTP for smaller institutions this could be an interesting alternative in the future

Report from a business trip abroad

Name of trip participant: Mgr. Andrea Fojtů (AF)

Workplace according to organizational structure: 15 Department of Long-term Preservation of Digital Data (ODODD)
Position: 152 Digital Repository Content Management Department
Reason for trip: participation in the conference Aligning National Approaches to Digital Preservation
Place (city): Tallinn
Place (country): Estonia
Dates (from-to): 22.-26. 5. 2011

Detailed schedule:

Monday 23. 5.: conference registration, guided tour of the National Library
Keynote Address by Laura Campbell, Library of Congress, USA
Panel 1: Technical Alignment
Panel 2: Organizational Alignment

Tuesday 24. 5.: Keynote Address by Gunnar Sahlin, National Library of Sweden
Panel 3: Standards Alignment
Panel 4: Legal Alignment
Breakout Sessions for panels 3 & 4

Wednesday 25. 5.:
Panel 5: Education Alignment
Panel 6: Economic Alignment
Breakout Sessions for panels 5 & 6
Synthesis
Closing remarks

Fellow travellers from NK: PhDr. Bohdana Stoklasová (BS), Ing. Tomáš Svoboda (TS)

Funding: IOP NDK
Trip goals: attending a conference with international participation; gaining contacts in the field of long-term preservation of digital documents and electronic legal deposit; a more detailed insight into long-term preservation of digital documents (especially) in national libraries

Fulfilment of trip goals (specifically): The conclusions of the conference Aligning National Approaches to Digital Preservation largely mirror the conclusions of the workshop The Future of the Past: Shaping new visions for EU research in digital preservation (report available at httpcordiseuropaeufp7icttelearn digicultfuture of thepast_enpdf), e.g. regarding the missing economic model for the commercial sector, which would treat long-term preservation as an integral part of all its processes. Contact was established with staff of the national libraries of Estonia and Finland (staff working on web archiving and long-term preservation in general).

Programme and further details: The main goal of the conference was to align national approaches to long-term preservation of digital documents across all areas, from technical, organizational and educational through to standardization, economic and financial.

Materials brought back: conference programme, leaflets of exhibiting firms (Tessella, Equella, Guardtime) + further materials, notes

Date of report submission: 8. 6. 2011
Signature of the submitter

Signature of superior

Posted on the intranet
Received by the international department

Appendix to this report: conference notes in English

Appendix: conference notes in English

Exploring What We Can Do Together: Strategic Alliance for International Collaboration (Laura Campbell)

- 185 digital preservation partners in more than 25 countries (education, research, LAM)
- strategic goals: National Content Stewardship Network (national digital collection, technical architecture, public policy, outreach)
- NDIIPP Content Domain Map: a mind map of geospatial, audiovisual, image & text, and web content
- then & now: cognitive surplus vs. digital libraries / digital preservation
- solution framework: actively working together, special interest groups, establishing a common index, international digital collection (freely available)

PANEL 1: Technical Alignment (The role of testing), Prof. Dr. Michael Seadle (Panel Head)

- to collaborate on requiring and implementing rigorous and independent tests

DNB Contribution to the Tallinn Alignment (Sabine Schrimpf)

- key theme is infrastructure: a network of hardware and software that permits the operation of software applications
- the question of interoperability is crucial (standards, technical specifications)
- SW elements / components of the DP infrastructure were compared to pallets on railroads
o source: PARSE.Insight Roadmap 2020
- persistent ID resolvers, certification process
- kopal (ingest: koLibRI)
- nestor (German Network of Expertise)
- DP4lib (Digital Preservation as a Service): reduce dependency between components
o redundant storage at different locations
o koLibRI modules
- LuKII: set up as an economical LOCKSS network in Germany
- SHAMAN
- APARSEN: wants to bring coherence and cohesion to digital preservation research
o trends in DP research projects
o modular DP systems
o distributed as SOA
o elimination of technology dependencies

EDINA: The UK LOCKSS Alliance (Adam Rusbridge)

- EDINA offers underlying technical support & coordination
- threats to digital stewardship:
o failure (media, HW, SW, network, format obsolescence, natural disaster, economic/organizational failure)
o attack (insider/outsider)
o operator error
- source: Requirement to Digital Preservation
- projects PEPRS and PECAN help identify coverage and requirements for DP

Public testing (Michael Seadle)

- traditional physical archiving relies heavily on trusted institutions
- distrust, not trust, needs to be the basis of digital archiving; testing therefore plays a key role
- goals of testing: demonstrate functionality, reveal weaknesses, provide data for planning improvements
- key issues for testing: integrity, authenticity (can the origin or genuineness be shown?), usability (can migration/emulation be demonstrated?), access, financial integrity
- DrWho (drwho1com)
- bit-stream testing is the most important; authenticity and usability may be impaired
o the type of storage media, the number of copies + the frequency of checking and replacement get us to the relevant results
o no reliable metrics exist, however (what is an acceptable loss, etc.)
- without well-documented, peer-reviewed, publicly available test results, librarians are buying archiving systems on faith

Presentation without a title (Andy Rauber)

- evaluation vs. testing vs. benchmarking
- DP testing: evaluation rather than testing, and far from benchmarking (a few tests, but nowhere near a definition of benchmarks)
o existing evaluations are not repeatable
o focus on the simple things
o building the frameworks before having clear test scenarios
- necessary to move towards comparative benchmarking
- what is needed: a commitment that we want a culture of benchmarking and comparative evaluation; an understanding of what we want to benchmark; benchmark data + ground truth; measurement scales and measures that remain constant; knowledge bases to collect these

Organizing digital preservation on an international level (Michelle Gallinger, NDIIPP)

- focus on a national DP agenda
- community driven, action oriented (National Digital Stewardship Alliance)
o present a distributed national digital collection for the benefit of citizens

The European Research Arena (David Giaretta)

- technologies: GEANT, EGEE/EGI
- EU research projects: TIMBUS, BLOG4EVER, SCAPE, ENSURE, APARSEN, ARCOMEM, WF4EVER
o SCIDIP-ES (2011-2014)
- Alliance for Permanent Access (APA): formed as a legal entity 3 years ago
o opportunities for networking
- ISO 16363: Audit and Certification of Trustworthy Digital Repositories
- ISO 16919: Requirements for Bodies Providing Audit and Certification

Observation from the MetaArchive Cooperative program (Martin Halbert)

- distributed DP programs are different from other programs
o replication of content, distribution of these replicated copies to distinct geographical locations, and a network organization to connect these replicated copies
- MetaArchive: established in 2003, funded by NDIIPP
o seeks to foster broader awareness of digital preservation issues
- IIPC: members are all institutions that focus on WA
o 39 members: national and university libraries + other organizations (Internet Archive)
o ISO standard WARC format for web archives + Heritrix and NutchWAX
o growing membership (Africa & South America)

DAY 2

Keynote address: International and National Collaboration in the digital age (Gunnar Sahlin)

- 2012: a new law for e-deposit
- Samsök (search together) in 2005 (upgrade, new system)
- Swepub and long-term preservation
- Consortium of the Swedish research libraries for licensing e-journals and databases (ICOLC)
- Open Access and e-publishing (all universities have their repositories for e-pubs)
- NL: aggregator for Europeana, TEL, Apres, Athena, EUscreen
o common system for the preservation of digital materials
o common search portal for materials from the Swedish National Library and the Swedish National Archive

Raivo Ruusalepp

- standards: RAC, DSA, CIDOC (CRM), PAIMAS, ISA (DG), DDI
- use of information security standards for digital preservation
- information security: administrative and technical (physical = data security vs. IT = communication)
- company-implemented security measurement with typical cyber-crime scenarios
- survey of security:
o provision for information security in national legislation and development plans (12 of the respondents: ISO 27000 series; only 2 formal audits, the rest are looking into it; ½ of the respondents do not use standards or formal measurements to control information security)
o IT & disaster plan: 65 (data recovery from the off-site location tested: 0)
- alignment:
o better use of community standards for information security and preservation
o agreement on security requirements

Standards-based approach to preservation planning (Matthew Woollard)

- ISO 27001: very expensive implementation (100 000 pounds)
- Basic: Data Seal of Approval Guidelines (helps you understand your business better)
- Audit and Certification of Trustworthy Digital Repositories
- Memorandum of Understanding to Create a European Framework
- ISO 16363 external; DSA or ISO 16363 self-audit

Best Practices & Standards (Bram van der Werf)

- self-assessment: trust; audits/certification: trust
- ISO 30300 (draft): Record Management

PANEL 4: Legal Alignment

Legal deposit & Web Archiving (Adrienne Muir)

- legal deposit provisions: purpose, scope, deposit mechanisms, roles & responsibilities & liability, access provisions, sanctions
- implementation: definitions, scope (offline/online, freely available/pay wall), technology neutral, incremental
- legal deposit vs. voluntary (interim/hybrid approach, model agreements and licenses, flexibility)
- other legal issues: intellectual property rights, preservation, access, unlawful material, privacy/data protection
- voluntary approaches have disadvantages but may be necessary and can be useful

Breakout Session

- standards bring along alignment by themselves, but only if you use them to the full, not halfway
- depends on the community (users); enforceable or voluntary compliance (standards)
- next steps for standards alignment:
o corpora as benchmarks
o export/import completeness
o educational standards
o validation tools
o accredited training courses to accredit auditors
o framework standards

DAY 3

PANEL 5: Educational Alignment

- key elements of the DCC Curation Lifecycle Model
- Framework for the Education Alignment (USA: grad programs for digital preservation, new models for grad programs, internship programs (diverse knowledge), workshops)
o sharing tools
o national programs
- related issues to digital preservation:
o nature of costs and business models
o strategies for selection & appraisal
o ground roles and responsibilities
o effectiveness and demand for services
- focus of the panel: factors influencing the actual sustainability of a digital archive
- 2 considerations: collaboration + user demand
- challenges + gaps: span national boundaries, public + private funding, education, exportation, DP certification, competition, funding gaps, policy, selection criteria, roles & responsibilities, standards
- Magazzini Digitali: e-legal deposit in Italy
- PADI: a failed project (discontinued in 2010)

Presentation without a title (Neil Grindley)

- how much does it cost to manage information? what institutional financial strategies are required to facilitate effective preservation? what general economic frameworks are required to enable information to persist and be accessible?
- JISC 2010: Infrastructure for Education and Research Programme
- archival storage and preservation activities constitute a very small proportion of the overall costs (1531 access 55 outreach acquisition ingest)
o approx 333 Euros for a set of 1000 records
- KRDS2, p. 83: future tool development, supporting automation of ingest

Sustainable Preservation in North America: ADPNet & Friends (Aaron Trehub)

- solution: distributed digital preservation (in at least 3 copies vs. LOCKSS 6 copies)
- DDP + LOCKSS = PLN; open SW developed at Stanford
- MetaArchive, COPPUL (Canada), ADPNet (www.adpn.org)

Colorite A Flexible Cross-Platform Software Solution for Automatic Image QualityAnalysis Using Arbitrary Targets Henrik Johansson National Library of Sweden (Sweden)

- a - -

o the target is detected automatically
o TIFF, JPEG, JP2, PNG
o state-of-the-art feature-based image matching algorithm; ImageMagick
o can be switched off in the future to speed up the process for BATCH processing

Henrika Johanssona What if the Image Quality Analysis Rates My D Dietmar Wueller Image Engineering (Germany)

- Golden thread - UTT targer -

o

-lens reflex

Establishing Resolution Requirements for Digitizing Transmissive Content A Use Case Approach Michael Stelmach Library of Congress Don Williams Image Science Associates LLC and Steven Puglia National Archives and Records Administration (USA)

-

- Based on the 10% SFR limiting-resolution criterion: how much of the image information will be captured

o first half of the 20th century: 1200-1600 PPI
o second half of the 20th century: up to 2800 PPI

Digitise More Pay Less Optimising the Workprocess for both Heritage Institute and Imaging Provider Olaf Slijkhuis Pictura Imaginis (the Netherlands)

- ument kvalita parametry atd - -

- trolu

obrazu

Debate with FamilySearch, 20 May 2011

- also on Digital Preservation issues
- will have to provide free of charge
- and synchronise
- an effort on their part to cooperate with archives and libraries

CO-FINANCED FROM THE EU STRUCTURAL FUNDS (EUROPEAN REGIONAL DEVELOPMENT FUND) THROUGH THE IOP

Business Trip Report
Project: Creation of the National Digital Library (Vytvoření Národní digitální knihovny)

CZ.1.06/1.1.00/07.06386

Name and surname of the traveller: Jan Hutař
Workplace according to the organisational structure: ODF 81
Position: head of department
Reason for the trip: attending the iPRES 2011 conference
Place (city): Singapore
Place (country): Singapore
Date (from-to): 30 October to 5 November 2011

Detailed schedule:
30-31 October: flight Prague > Dubai > Singapore
1 November: start of the conference, tutorials
2-4 November: conference
4-5 November: return flight Singapore > Dubai > Prague

Fellow traveller from the NK: Mgr. Marek Melichar (funded from project 0136)
Funding: IOP, Creation of the National Digital Library
Relation to the project: gaining new information on digital preservation and on projects in other libraries; consultations with colleagues and companies
Trip objectives: see Relation to the project; use all outputs for planning and running the NDK project, and for future handling of digital preservation in the NK/NDK
Fulfilment of trip objectives: fulfilled; see the detailed notes below and the proceedings at SPS

Further details: SUMMARY AND BENEFIT TO THE NDK PROJECT

- a noticeable rise of emulation as an approach to long-term preservation (in previous years it was migration) > the two approaches look set to complement each other
- a shift towards preserving complex data, databases, etc.; the NK does not address this yet
- many contributions are usable in the NK and the NDK (web archiving and preservation at the National Library of France, audits, emulation, information on the SDB system (Tessella) and on the RODA system, certification, see Rouchon, etc.)
- information on problems and solutions when using tools such as JHOVE, PRONOM, etc. in a real environment
- 2 contributions on backing up optical discs, a current problem at the NK too; ideally follow the procedures described in the proceedings
- a clear need for international cooperation and for adherence to standards so that such cooperation is possible
- for more detail see below

Support of project publicity: N/A

Related materials

Material: Storage location
conference proceedings: SPS, folder with business trip reports

Date of report submission: 15 November 2011
Signature of the person submitting the report:
Date / Signature:
Signature of the supervisor: 15 November 2011
Posted on the intranet:

Received by the international department:

Seamus Ross: digital curation and preservation
- preserving data sets: use of statistical databases and research data vs. preservation of textual information
- personal data are no longer just photos in a box (Flickr etc.)
- banks: a lot of personal data, of future use to historians; it must be kept, and the institutions do keep it
- see mckinsey.com, big data full report (PDF)
- so why do DP (slide 16): future generations expect it; so that historians and scientists have some sources, a record of the present for the future; an information ecosystem to enable storytelling
- the emphasis shifts from preserving textual information to preserving complex databases

A capability model for DP (Ch. Becker et al.)
- research within the SHAMAN and SCAPE projects
- SoS: systems of systems
- 3 kinds of systems (UML):
o DPS with DP as a functional requirement
o SoS: a business system, a system within a system; the data then flow into a DPS
o a DPS where DP is not a functional requirement but the system does it anyway (DP-ready system): a business system with DP functionality
- but how to get DP into enterprise systems: a model for implementing DP in any system, developed within the SHAMAN project as a capability-based reference architecture
- governance, business and technical (operation) capability: a basis for decisions and for assessing the current state
- capability maturity model (CMM): processes for assessment and improvement, taken from SW development

Olivier Rouchon: certification and quality at CINES
- they store theses, digitised items, multimedia, documents, scientific data sets
- a data centre for the whole of France
- they have experts on formats and XML; 11 people
- 15 TB of data
- certification: by national law, CINES is the national centre for DP of theses; they have a department, people and money for it; for the procedure and preparations see below
- preparation for certification: testing DRAMBORA, DSA, TRAC, ISO 16363 and ISO 16919
- step (2009): DRAMBORA audit; 2 risk reviews per year on how risk mitigation is progressing
- step: formalisation of business processes, 14 processes according to ISO 9001: management, operational and support processes (presented at iPRES 2010)
- 2009: external pre-audit, 2 people, 19 man-days, based on all available standards, by means of documentation review and interviews
- 2010: SIAF audit, 4 months; the National Archives of France do this for every archive that stores public data; they audit every 3 years; the report had 800 pages

- 2010: Data Seal of Approval, part of the EU framework for audit and certification of trusted repositories (MoU between the three certification activities)
- 2011: within the APARSEN project they also did ISO 16363; they went through that audit together with DANS and UKDA
- first internal, January and April 2011 (60 man-days), then external, 12 experts (KB, BL, NASA etc.) in June 2011 (3 days)

MoU: DSA > ISO 16363 as the second step (internal) > ISO 16363 extended
The audit gets faster the more of them you do, i.e. if it is done regularly it is not so time-consuming.

The National Library of New Zealand passed TRAC certification in autumn 2011.

Andreas Rauber: the impact of preservation actions on the repository
- what happens to the repository itself
- repository simulation: RepoSim
- for analysis and for testing migration: what happens when files grow, what if the repository holds more format types, etc.
- RepoSim: a flexible simulator, irregular patterns
- so far an internal version (Hibernate, Java, MySQL)
- one can specify which formats the repository accepts and their description, ingest, settings for hypothetical tools (mainly for migration), and rules for preservation actions (migration to which format and which version, which files, how many times; rules + filters)
- a virtual migration can be run; it produces graphs showing how long it will take, etc.
- what to migrate, how, to what and over what period: it runs virtually and we see the result
- good for IT and HW planning
- good for planning various scenarios, comparison with the expected development, planning HW development and investments
- still to be added: the option to specify deletion policies, reports, etc.
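
The kind of question such a simulator answers can be illustrated with a very small sketch. This is not RepoSim's model (the notes do not describe its internals); the `Holding` class, the constant throughput and the compound growth rate are assumptions made only for illustration.

```python
from dataclasses import dataclass


@dataclass
class Holding:
    """One group of files in a hypothetical repository."""
    file_format: str
    file_count: int
    avg_size_mb: float


def migration_hours(holdings, source_format, throughput_mb_s):
    """Estimate hours needed to migrate every file of source_format,
    assuming a constant migration throughput (an assumption)."""
    total_mb = sum(
        h.file_count * h.avg_size_mb
        for h in holdings
        if h.file_format == source_format
    )
    return total_mb / throughput_mb_s / 3600


def project(holdings, yearly_growth, years):
    """Return holdings after compound growth of the file counts."""
    factor = (1 + yearly_growth) ** years
    return [
        Holding(h.file_format, round(h.file_count * factor), h.avg_size_mb)
        for h in holdings
    ]


holdings = [Holding("TIFF", 1000, 36.0), Holding("PDF", 500, 2.0)]
now = migration_hours(holdings, "TIFF", throughput_mb_s=10.0)
later = migration_hours(project(holdings, 1.0, 1), "TIFF", throughput_mb_s=10.0)
print(now, later)
```

Running the same estimate on current and projected holdings gives the comparison "what if the files keep growing" that the notes mention as the use case for planning HW and investments.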

José Barateiro: Risk assessment in DP of e-science data and processes
- DP as risk management
- ISO 31000: definition of risk management
- similar to DRAMBORA
- there are many standards for risk management
- the ISO 31000 methodology broken down into individual steps
- TIMBUS project (http://timbusproject.net); one of the partners is SAP (Germany)

mad talks
- open source SW for LTP: RODA is back and is being developed within the SCAPE project; new functions, plans for further development and for building a user community
- 4 posters on emulation: emulation within KEEP, emulation for library reading rooms, OPF ecosystem, registry
- TOTEM: a metadata standard for describing the technical environment for emulation

ANDS (Ross Wilkinson)
- data centres in Australia: at least 3, for various areas of life
- ANDS has existed for almost 3 years; funded by the Australian government
- huge amounts of data will never be used or read by a human, only by automated processes and data mining
- research data must be stored and preserved, because it may no longer be possible to recreate them; preserved so that they can be reused, so that new conclusions can be drawn from them, and so that scientists have them available
- it has to be done in cooperation; a single institution cannot do it alone
- who handles the storage of scientific data in the Czech Republic: the Academy of Sciences? CESNET?
- similar data centres exist in the UK too

Rob Sharpe: Considerations for High Throughput Digital Preservation
A presentation by Tessella on their performance testing of ingest into SDB at FamilySearch.

- SDB has been developed since 2002, when the first customer was the National Archives (UK)
- new customer: the UK Parliament
- test with FamilySearch
- 20 TB ingest per day: scanned materials, workflow with antivirus, characterisation (PRONOM, JHOVE), etc.
- 1 package is about 1 GB; 20 thousand packages per day
- 2 Dell PowerEdge R710 servers, total price max. 20,000 pounds
- the limiting factor turned out to be the read speed of the disks on which the data are stored at the start of ingest; they therefore needed 130 parallel disks (50 thousand pounds)
- storing to tape is also slow; they therefore needed 8 parallel tape writes (30 thousand pounds)
- storage costs 100 pounds per TB
- 7.3 PB per year
- conclusion: writing and reading are slow; tools such as JHOVE and PRONOM are fast enough; the storage costs also proved to be high
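
The figures in these notes can be cross-checked with a few lines of arithmetic. The sketch below assumes decimal units (1 TB = 1,000,000 MB, 1 PB = 1,000 TB) and a steady ingest on all 365 days of the year; the function names are made up for this illustration.

```python
SECONDS_PER_DAY = 24 * 60 * 60


def sustained_rate_mb_s(tb_per_day):
    """Average MB/s needed to move tb_per_day (decimal units assumed)."""
    return tb_per_day * 1_000_000 / SECONDS_PER_DAY


def yearly_volume_pb(tb_per_day, days=365):
    """Volume accumulated over a year, in PB."""
    return tb_per_day * days / 1000


def yearly_storage_cost_gbp(tb_per_day, price_per_tb_gbp, days=365):
    """Cost of storing one year of ingest at a flat price per TB."""
    return tb_per_day * days * price_per_tb_gbp


rate = sustained_rate_mb_s(20)           # average rate for 20 TB per day
volume = yearly_volume_pb(20)            # PB per year
cost = yearly_storage_cost_gbp(20, 100)  # pounds per year at 100 pounds per TB
packages = 20 * 1000 // 1                # 1 GB packages per day
print(round(rate, 1), volume, cost, packages)
```

Under these assumptions 20 TB per day corresponds to an average of roughly 231 MB/s, to 7.3 PB per year, and to 20,000 packages of 1 GB per day, which agrees with the numbers noted above; one year of ingest at 100 pounds per TB comes to 730,000 pounds.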

To ingest the data from the FamilySearch project they needed to guarantee a throughput of 20 TB of data per day while keeping adequate procedures for processing the data according to the requirements of OAIS and of the client. The project was about identifying the bottlenecks of ingesting large volumes of data.

Processes such as generating or checking hashes, identifying formats and extracting technical metadata usually require a fast storage system when volumes are large. In the FamilySearch project they want to ingest into SDB (content acquisition, content preparation, ingest: fixity check, content/metadata integrity check, characterisation, i.e. format identification and validation and extraction of technical metadata) at up to 700 MB per second.
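
A fixity check of the kind mentioned here is, at its core, a checksum comparison. The following is a minimal sketch using Python's standard `hashlib`; it is not how SDB implements the step, and the choice of SHA-256 and the 1 MiB chunk size are assumptions.

```python
import hashlib


def file_digest(path, algorithm="sha256", chunk_size=1024 * 1024):
    """Compute a checksum by reading the file in chunks, so that large
    files never have to fit into memory."""
    digest = hashlib.new(algorithm)
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def fixity_ok(path, expected_hex, algorithm="sha256"):
    """Return True when the stored checksum still matches the file."""
    return file_digest(path, algorithm) == expected_hex.lower()
```

Every byte has to be read from storage to compute the digest, which is consistent with the observation in these notes that disk read speed, not the tools, limited the ingest.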

They looked at how to parallelise such a massive workflow efficiently while minimising costs. According to their findings, parallelisation makes it possible to get around the performance problems of tools such as DROID and JHOVE; contrary to their expectations, overall software performance was not a problem. The bigger problems are in the HW, in making it write fast enough.

That is, the bottleneck was in the HW and in moving data from place to place rather than in the performance of the digital preservation tools.
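
The parallelisation idea can be sketched with a worker pool. The `identify` function below is only a stand-in for a real identification tool such as DROID (it checks a handful of well-known file signatures and nothing more), and the pool size is an arbitrary assumption.

```python
from concurrent.futures import ThreadPoolExecutor

# A few well-known file signatures; a real tool uses a full registry.
MAGIC = [
    (b"%PDF", "PDF"),
    (b"\x89PNG\r\n\x1a\n", "PNG"),
    (b"\xff\xd8\xff", "JPEG"),
    (b"II*\x00", "TIFF"),
    (b"MM\x00*", "TIFF"),
]


def identify(payload):
    """Stand-in for a characterisation step: match leading bytes."""
    for prefix, name in MAGIC:
        if payload.startswith(prefix):
            return name
    return "unknown"


def identify_all(payloads, workers=4):
    """Run the step over many items in parallel; results keep input order."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(identify, payloads))


print(identify_all([b"%PDF-1.4 ...", b"\x89PNG\r\n\x1a\n...", b"hello"]))
```

In a real workflow each worker would call an external tool or read from storage, so adding workers helps only until the disks or tapes are saturated, which is the bottleneck the notes describe.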

Benefit for the NK

There is no need to fear the performance of SW such as DROID or JHOVE.

Ross King: Evolving domains, problems and solutions for LT DP
- information about projects such as SCAPE
- programme: http://cordis.europa.eu/fp7/ict/telearn-digicult/report-research-digital-preservation_en.pdf (Stephan Strodl, Petar Petrov, Andreas Rauber, all Vienna University of Technology, Austria); a nice timeline of preservation projects, a white paper about the past of European DP
- funding spent on DP research is gradually growing; projects and funding alone will not solve anything
- ARCOMEM: archiving for web archives, socially driven web preservation model
- social web analysis
- archive enrichment
- ENSURE: evaluation between cost and value, automation of the preservation cycle; testbeds: healthcare, clinical trials, financial services
- SCAPE
- preservation planning and action workflows: how to make them scalable
- creating an infrastructure for scalable preservation actions
- developing a policy-based preservation planning tool with automatic preservation watch
- 3 testbeds: WA, large-scale repositories, research data sets
- all the projects will produce prototype SW
- digital lifecycle approach
- preservation planning plays a role in all these projects, together with virtualisation
- a slide with trends in DP over recent years
- Research on Digital Preservation within projects co-funded by the European Union in the ICT
- Ensure
- Scape
- Wf4Ever: http://www.wf4ever-project.org/about
- Timbus: SW is not enough; it focuses on the context of organisations; LTP is not only about objects but also about services, etc.
- Totem

Benefit for the NK

Follow projects in the field of long-term preservation of digital data. The latest EU projects such as SCAPE will speed up the development of concrete tools for long-term preservation of digital data.

Record keeping in temporary command settings (Erik Borglund)
- preserving documentation of crisis situations arising from the work of the police etc.
- how to capture context: flipcharts, videos and written notes can be kept; the context of analogue documents is not a problem, the problem lies with digital items and conversations
- the national archive should take care of this, but it accepts only paper documents or e.g. photos from the scene of the meeting; the open question is whether archiving the file material is the same as archiving the course of the meeting in digital form

webarchiving session

BnF
- 200 TB of web-archived data
- 15 million ARC files
- they have to characterise and validate them, which is time-consuming
- they store them in a shared repository
- SPAR (the LTP system of the National Library of France) has a capacity of 16 PB
- they use JHOVE2 for characterisation and are creating a module for ARC files
- they do not want to characterise and validate the content of the ARC files, only identify the formats

- PREMIS in METS would be too long, so they will record only metadata at the level of the information package (AIP) that are the same for the whole package; that is, a property is expressed once and then only the information about which files it applies to is added, instead of repeating the information for every file
- they created a special metadata format
- i.e. they can ask the LTP system "give me all information packages that contain format XY" and the like; there is no need to index the metadata of the contents, which would take a long time; they take the same approach for digitised books
- different DP policies and validation levels for different types of WA data (complete harvests vs. thematic harvests)
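
The "state a property once, then list the files it applies to" idea can be sketched as a simple regrouping of per-file metadata. The data layout and function names below are assumptions for illustration only; they are not the BnF metadata format, which the notes do not describe.

```python
from collections import defaultdict


def group_by_property(per_file):
    """Turn {filename: {property: value}} into
    {(property, value): [filenames]} so each value is stated once."""
    grouped = defaultdict(list)
    for name in sorted(per_file):
        for prop, value in per_file[name].items():
            grouped[(prop, value)].append(name)
    return dict(grouped)


def packages_with_format(packages, fmt):
    """Answer 'which packages contain format X' from the package-level
    summaries alone, without touching per-file records."""
    return [pid for pid, grouped in packages.items() if ("format", fmt) in grouped]


per_file = {
    "a.html": {"format": "text/html"},
    "b.html": {"format": "text/html"},
    "c.png": {"format": "image/png"},
}
summary = group_by_property(per_file)
print(summary[("format", "text/html")])
print(packages_with_format({"aip-1": summary}, "image/png"))
```

The package-level summary grows with the number of distinct property values rather than with the number of files, which is why it stays small even for very large ARC containers.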

NL NZ
- 2 harvests, 20 TB in total
- they are working out metadata: how much metadata is a lot and how much is too little
- library policy says that as much metadata as possible must be stored; in terms of metadata size, however, that would be a problem
- for selective web harvests they have a finished workflow (WCT); everything is catalogued

IA
- 16 billion URLs
- the oldest from 1996
- growth is 3 TB per day, 1 PB per year

Euan Cochrane, Dirk von Suchodoletz: Replicating Installed Application and Information Environments onto Emulated or Virtualized Hardware

- capturing and preserving an entire environment on emulated HW
- e.g. take the desktop environment of the prime minister and store it in an archive
- problems with rendering
- computer forensics
- an option for preserving scientific data and records
- it is all about how to replicate an HDD and run the environment on it in a virtual environment
- solution:
- they pulled the HDDs out of several old PCs > identify the HW requirements (analysis of the HDD > automatic estimate of requirements; this is part of every PC environment) > select the emulation/virtualisation SW (a tool registry such as TOTEM from the KEEP project) > convert the HDD into a disk image suitable for emulation > try to boot the disk image on the emulated HW > add drivers
- problems with licences, personal data protection, authenticity (20 things change, colours etc.)
- QEMU, SPARC processor emulator

Klaus Rechert: Remote Emulation for Migration Services in a Distributed Preservation Framework
- use of emulation as a tool for migration
- migration tools for certain formats are often not available
- we put the digital object into an emulated environment (a virtual machine); we then see it within the emulated system, can open it in the original or another suitable application, save it in a different format and store it back in the virtual machine

Bram Lohman: Emulation as a Business Solution: the Emulation Framework (KEEP project)

- emulation framework: 7 emulators, 6 platforms (x86, Amiga, etc.), 23 file formats
- a solution for managing emulation tools
- setup of emulation processes
- an environment that contains emulators; if we load an application or a file into it, it should run as in the original environment
- the environment also contains a tool that shows, for a given file, what format it is and what environment is needed to run it, based on PRONOM; that environment can then be prepared directly and the file run in it
- it loads the SW image from an application database that the OPF is building
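
The step "identify the format, then find the environment needed to run it" amounts to a registry lookup. The sketch below is not the KEEP Emulation Framework API; the format identifiers, the `ENVIRONMENTS` table and the image names are hypothetical placeholders.

```python
# Hypothetical registry: format identifier -> environment needed to render it.
ENVIRONMENTS = {
    "example/word-processor-doc": {
        "platform": "x86",
        "images": ["legacy-os.img", "word-processor.img"],
    },
    "example/amiga-disk": {
        "platform": "Amiga",
        "images": ["workbench.img"],
    },
}


def environment_for(format_id):
    """Return the registered environment for a format, or None."""
    return ENVIRONMENTS.get(format_id)


def launch_plan(filename, format_id):
    """Describe what would be started for a file; nothing is executed."""
    env = environment_for(format_id)
    if env is None:
        return f"{filename}: no emulation environment registered for {format_id}"
    images = ", ".join(env["images"])
    return f"{filename}: emulate {env['platform']} with {images}"


print(launch_plan("report.doc", "example/word-processor-doc"))
print(launch_plan("data.xyz", "example/unknown"))
```

In the framework described in the notes, the identifier would come from a PRONOM-based identification step and the images from the software archive rather than from a hard-coded table.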

Geoffrey Brown: Developing Virtual CD-ROM Collections: The Voyager Company Publications

- publications of one particular publisher on CD, interactive applications for Mac; of 200 published titles only about 50 are available today
- emulation on today's systems
- HDD snapshot directly in the emulator, i.e. it takes one click and is very fast
- SheepShaver emulator

Evaluation of the Danish large migration project
- before 1998 the formats were not set by law
- between 2005 and 2008 they introduced standards
- the evaluation concerns the standards that were set and the migration to them at the national archive
- the evaluation was done for the funder
- between 2005 and 2008 they spent 30 person-years on migration; they had 10-15 people on it; 190 thousand USD invested, total costs 26 million USD
- it is not much data; what they actually migrated was about 1777 GB
- various parts of the archive: tapes, population data, data on CD-R, registries and electronically filled-in data
- they could not read all the files, especially on tapes
- 5 different types of tape
- some had to be rescued at great expense

Total cost of producing the preservation standard: 10 man-years, 12 thousand USD, including the manual and implementation recommendations.

Pilot: project planning and management, and verification of the information packages.

The aim of the pilot was to obtain a better budget and project plan.

Some problematic data in old formats, such as old databases, need smart people doing repetitive, long-running work; they needed good knowledge management to make this efficient.

Migration method: they wrote requirements for the tool and a description of how manual migration should be done.

Data preparation (restructuring the data and registering the metadata of the IPs) and preparation of documentation for the migrated IPs.

Software development: in-house development.

They needed 50 person-years for 1

Conclusions

Migration of standard data is cheaper, migration from certain standard tapes is cheaper, etc.

Most of the costs (80%) went on non-standardised data, in producing the migration software. Developing a tool for migrating heterogeneous or non-standard data is the most expensive part.

What they learned: they had not analysed the old data sufficiently.

Project management was loose; they lost money.

Knowledge management: a good description of old data and of all their types, generations, locations, etc. does not exist at our institution and we will have difficulties with this; migration of old data at the NK will be problematic.

Angela Dappert: robust migration workflow for offline media

- What is an archival object (nice slide): a CD is not an archival object; for them it is a hand-held carrier. Better is a bit-stable object, which can have a backup etc., leading to an archival object that has additional metadata (logical preservation).
- A CD is not searchable, cannot easily be replicated, has a large manual overhead, and its rendering technology becomes obsolete very quickly.
- Endangered Archives project: optical disks, CD-R, external HDs, tapes; 67 terabytes in total.
- The offline hand-held carriers in the Endangered Archives project were very variable; they contained data with DRM, data under copyright, and a number of similar problems.
- Options they chose between for each data source:
- Disk image: a single file containing everything that is on the carrier.
- Or extraction of only some files.
- How important is the carrier itself? We need some information about it; it may hold traces of deleted data, and we may want to keep them. They made disk images of everything possible: hybrid DVDs, audio discs that also held data, etc.
- Which disk image format should they use? Not one disk image format for all data; a disk image format for each special case.
- They used robots (disk-copying robots); sometimes large-scale disk-copying robots could not be used: they are good at producing CDs, but not at ripping data from CDs.


- They built their own application with disk stacks and some smaller robots; they tried LIFO and FIFO and ended up using FIFO, because LIFO had problems with the CD lifter.

At KB they thought through a fairly complex workflow, how to describe it, etc.

Each robot had its own PC.

They had problems with a number of things; see the presentation.

They did not find good software for managing the images, only command-line tools, and non-technical staff would make a lot of mistakes with those.

It takes a lot of people before the data gets online.

The staff must be well trained and flexible, but also able to do tedious jobs: systematic, patient.

NOTE: important for NK, where the transfer of data from disks will also have to be addressed, and has already been addressed in part.


KEEP project, Antonio Ciuffreda: towards an integrated migration environment

- Disk transfer tool: converts a disk into an image file. The result also contains additional metadata about the file system, MD5 checksums of the files, etc.
- KEEP is building a Transfer Tool Framework.
- Magnetic media: disk transfer tools for floppy disks, commercial and open source.
- Disk2FDI: commercial DOS tool; very accurate floppy disk image; it takes 1 hour and the result is very large, ten times bigger than the floppy disk itself. He tested about 2260 disks; they tested emulation.
- Catweasel: commercial tool; a PCI card; runs on Linux and Win XP; has a GUI. High error rate, but faster; image file quality was low.
- Nibtools: free tool; G64 and D64; covers only the C64; DOS, Win, Linux, but command line only. Requires a Commodore disk drive and special cables. They tested a few disks; about half of them then did not work in the emulator.
- Optical media: they used 5 transfer tools, with the same CDs, DVDs and games for all of them.
1. Alcohol 120%: commercial; can bypass DRM etc.; supports Windows systems. 12 of 13 worked; 2 MB per second.
2. Daemon Tools: commercial; several image file types (ISO, MDS, MDF); supports Windows; three of 13 did not work.
3. CloneCD: commercial; uses IMG or ISO; bypasses SafeDisc 3 protection; supports Windows; has a GUI; 11 of 13 were OK.
4. BlindWrite: commercial; supports DVD, Blu-ray, WM, Xbox and other special discs. Generates ISO and some proprietary formats; one ISO image did not work; transfer speed
5. ImgBurn: does not read subchannel information into the image file (a film cannot be seeked, etc.); it is open source; generates DVD, BIN, CUE, IMG; Windows and Linux; 4 did not work; it is fast.

Conclusions

For magnetic media: the performance of commercial and non-commercial tools does not differ; Disk2FDI is accurate but very slow. KEEP will use NibTools.

Optical media: think about copy protection; the tools have similar performance; there will always be errors in the images, between 30 and 10 percent; BlindWrite handles game discs (Xbox etc.). KEEP will use ImgBurn because it is open source.

For complex images BlindWrite is better.
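The disk transfer step described above (an image file plus metadata about the file system and MD5 checksums) can be sketched roughly as follows. This is only a minimal illustration in Python, not the KEEP transfer tool: the function names, the record layout and the `file_system` field are assumptions made for the example.

```python
import hashlib
from pathlib import Path


def md5_of_file(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the MD5 hex digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def describe_image(image_path: Path, file_system: str) -> dict:
    """Build a small metadata record for one disk image (hypothetical layout)."""
    return {
        "image_file": image_path.name,
        "size_bytes": image_path.stat().st_size,
        "file_system": file_system,  # supplied by the operator, not detected here
        "md5": md5_of_file(image_path),
    }
```

Storing such a record next to each image would let a later check confirm that the image has not changed since the transfer.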

Benefit for NK

Consider whether NK should indeed run a project to migrate the content of CDs and DVDs to online media. The presenters share concrete experience with robotic processing and show what problems they had with designing the workflow, choosing the type of ISO image, etc.

BnF web archiving

They have three layers:

Harvest definition: collection

Harvest instance: crawling metadata


ARC files

A collection would be, for example, the selective "elections 2000" harvest, then the individual harvest instances, and then the ARCs. They collect logs, config and reports. These are also stored in an ARC: a special metadata ARC for each crawling instance.

PREMIS

Object, agent, event

Objects: 1. ARC files and metadata ARCs; 2. harvest instances

Harvest event in PREMIS: event "creation of content files"

Events: reports as extensions of the event (host report and harvest report)

Agents: administrator, software, institutions, organizations that perform the harvest
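The object / agent / event split described above can be illustrated with a small data model. This is a sketch under stated assumptions, not BnF's implementation and not the PREMIS XML schema: the class names, field names and example identifiers are all hypothetical.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Agent:
    name: str
    agent_type: str  # e.g. "software", "organization", "person"


@dataclass
class PreservationObject:
    identifier: str
    object_type: str  # "arc_file", "metadata_arc" or "harvest_instance"


@dataclass
class HarvestEvent:
    event_type: str
    objects: List[PreservationObject] = field(default_factory=list)
    agents: List[Agent] = field(default_factory=list)
    reports: List[str] = field(default_factory=list)  # event extensions


def build_example_event() -> HarvestEvent:
    """Assemble one harvest event with invented example values."""
    return HarvestEvent(
        event_type="creation of content files",
        objects=[
            PreservationObject("example-0001.arc", "arc_file"),
            PreservationObject("example-0001-metadata.arc", "metadata_arc"),
            PreservationObject("harvest-instance-1", "harvest_instance"),
        ],
        agents=[
            Agent("crawler software", "software"),
            Agent("harvesting institution", "organization"),
        ],
        reports=["host report", "harvest report"],
    )
```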

ContainerMD

http://bibnum.bnf.fr/containerMD-v1/documentation/containerMD-v1.html

Special metadata for material from the web archive.

http://bibnum.bnf.fr/containerMD-v1

Different SLAs for different types of material and for different data from different harvests. Shared repository: they expect various benefits; skills for different formats do not have to be duplicated within the institution.

Next year there should also be a JHOVE2 module for WARC.


Memento: meta search engine

Benefit for NK

Their web archiving model could be used at NK.

Cost models: Danish national library + TU Wien

- Stephan Strodl, TU Wien: they have their own cost model, but only small scale; it seems to cover the cost of automated preservation actions.
- Danish national library: they made their own model, which should be universal and usable anywhere.
- They measured the cost of submission according to the PAIMAS standard.
- When calculating costs they use OAIS and PAIMAS: they map activities onto these models and then estimate the costs of the processes accordingly.
- costmodelfordigitalpreservation.dk
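The approach of mapping activities onto a reference model and then pricing them can be sketched as a simple aggregation. This is only an illustration, not the Danish cost model: the activities, their assignment to OAIS functional entities or PAIMAS phases, the hours and the hourly rates are all invented for the example.

```python
from collections import defaultdict

# Hypothetical activities: (name, model function, person-hours, hourly rate)
ACTIVITIES = [
    ("negotiate submission agreement", "PAIMAS: Formal Definition", 40, 50.0),
    ("validate received packages", "OAIS: Ingest", 25, 50.0),
    ("plan format migration", "OAIS: Preservation Planning", 60, 60.0),
    ("refresh storage media", "OAIS: Archival Storage", 30, 45.0),
]


def cost_per_function(activities):
    """Sum hours * rate for every model function the activities map to."""
    totals = defaultdict(float)
    for _name, function, hours, rate in activities:
        totals[function] += hours * rate
    return dict(totals)


def total_cost(activities):
    """Total estimated cost over all activities."""
    return sum(cost_per_function(activities).values())
```

With the invented figures above, `total_cost(ACTIVITIES)` comes to 8200.0, of which 3600.0 falls under preservation planning.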

Benefit for NK

Relevant to project 0136: the session addressed options for estimating the costs of long-term storage.

Meet RODA: a Full-Fledged Digital Repository for Long-Term Preservation

- Originally a project of the Portuguese national archive; we have been following it for several years. The RODA system is now supported by an independent company, which also partly continues its development. So far RODA supports only the archival metadata format (EAD), but further development should also include library formats.

- RODA is now part of the SCAPE project, where it will be possible to develop the system further and scale it for use in mass production.

- http://redmine.keep.pt/projects/roda-public

Benefit for NK

Follow further development, possibly also for the INCAD + KNAV projects; for LTP development at smaller institutions this could be an interesting alternative in the future.

Report from a foreign business trip

Name and surname of trip participant: Mgr. Andrea Fojtů (AF)

Workplace according to organizational structure: 15 Department of Long-Term Preservation of Digital Data (ODODD)
Position: 152 Digital Repository Content Management Department
Reason for trip: participation in the conference Aligning National Approaches to Digital Preservation
Place (city): Tallinn
Place (country): Estonia
Date (from-to): 22-26 May 2011

Detailed schedule:

Monday 23 May: conference registration, guided tour of the National Library; Keynote Address by Laura Campbell, Library of Congress (USA); Panel 1: Technical Alignment; Panel 2: Organizational Alignment

Tuesday 24 May: Keynote Address by Gunnar Sahlin, National Library of Sweden; Panel 3: Standards Alignment; Panel 4: Legal Alignment; Breakout Sessions for panels 3 & 4

Wednesday 25 May: Panel 5: Education Alignment; Panel 6: Economic Alignment; Breakout Sessions for panels 5 & 6; Synthesis; Closing remarks

Fellow travellers from NK: PhDr. Bohdana Stoklasová (BS), Ing. Tomáš Svoboda (TS)

Funding: IOP NDK

Trip goals: attendance at a conference with international participation; gaining contacts in the field of long-term preservation of digital documents and electronic legal deposit; a more detailed insight into long-term preservation of digital documents, (especially) in national libraries

Fulfilment of trip goals (specifically): The conclusions of the conference Aligning National Approaches to Digital Preservation largely mirror the conclusions of the workshop The Future of the Past: Shaping new visions for EU research in digital preservation (report available at http://cordis.europa.eu/fp7/ict/telearn-digicult/future-of-the-past_en.pdf), e.g. regarding the missing economic model for the commercial sector, which would treat long-term preservation as an integral part of all its processes. Contact was established with staff of the national libraries of Estonia and Finland (staff responsible for web archiving and for long-term preservation in general).

Programme and further details: The main goal of the conference was to align national approaches to long-term preservation of digital documents across all areas: technical, organizational, educational, standardization, economic and financial.

Materials brought back: conference programme, leaflets of exhibiting companies (Tessella, Equella, Guardtime) + other materials, notes

Date of report submission: 8 June 2011
Signature of the submitter:

Signature of the supervisor:

Posted on the intranet:
Received by the international department:

Attachment to this report: conference notes in English

Attachment: conference notes in English

Exploring What We Can Do Together: Strategic Alliance for International Collaboration, Laura Campbell

- 185 digital preservation partners in more than 25 countries (education, research, LAM)
- strategic goals: National Content Stewardship Network (national digital collection, technical architecture, public policy, outreach)
- NDIIPP Content Domain Map: a mind map of geospatial, audiovisual, image & text, and web content
- then & now: cognitive surplus vs digital libraries/digital preservation
- solution framework: actively working together, special interest groups, establishing a common index, international digital collection (freely available)

PANEL 1: Technical Alignment (The role of testing), Prof. Dr. Michael Seadle (Panel Head)

to collaborate on requiring and implementing rigorous and independent tests

DNB Contribution to the Tallinn Alignment Sabine Schrimpf

- key theme is infrastructure: a network of hardware and software that permits the operation of application software
- question of interoperability is crucial (standards, technical specifications)
- SW elements
- components of the DP infrastructure were compared to the pallets at railroads
o Source: PARSE.Insight Roadmap 2020
- persistent ID resolvers, certification process
- kopal (ingest: koLibRI)
- nestor (German Network of Expertise)
- DP4Lib: Digital Preservation as a Service; reduce dependency between components
o redundant storage at different locations
o koLibRI modules
- LuKII: set up as an economical LOCKSS network in Germany
- SHAMAN
- APARSEN: wants to bring coherence and cohesion to digital preservation research
o trends in DP research projects
o modular DP systems
o distributed as SOA
o elimination of technology dependencies

EDINA: The UK LOCKSS Alliance, Adam Rusbridge

- EDINA offers underlying technical support & coordination
- threats to digital stewardship:
o failure (media, HW, SW, network, format obsolescence, natural disaster, economic/organizational failure)
o attack (insider/outsider)
o operator error
- source: Requirement to Digital Preservation
- projects PEPRS and PECAN help identify coverage and requirements for DP

Public testing Michael Seadle

- traditional physical archiving relies heavily on trusted institutions
- distrust, not trust, needs to be the basis of digital archiving; testing therefore plays a key role
- goals of testing: demonstrate functionality, reveal weaknesses, provide data for planning improvements
- key issues for testing: integrity, authenticity (can the origin or genuineness be shown), usability (can migration/emulation be demonstrated), access, financial integrity
- DrWho (drwho1com)
- bit stream testing is the most important; authenticity and usability may be impaired
o the type of storage media, the number of copies + the frequency of checking and replacement get us to the relevant results
o no reliable metrics exist, however (what is an acceptable loss, etc.)
- without well-documented, peer-reviewed, publicly available test results, librarians are buying archiving systems on faith
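A bit stream test over several copies can be as simple as comparing checksums. The sketch below assumes at least three replicas held in memory and flags the copies that disagree with the majority digest; the function name and the majority rule are assumptions for illustration, not a method presented in the talk.

```python
import hashlib
from collections import Counter
from typing import Dict, List


def find_damaged_copies(copies: Dict[str, bytes]) -> List[str]:
    """Return names of copies whose SHA-256 differs from the majority digest."""
    digests = {
        name: hashlib.sha256(data).hexdigest() for name, data in copies.items()
    }
    # The digest shared by most copies is taken as the reference.
    majority_digest, _count = Counter(digests.values()).most_common(1)[0]
    return sorted(name for name, d in digests.items() if d != majority_digest)
```

With two identical copies and one corrupted one, only the corrupted copy is reported. With a tie (for example, two copies that differ) the result is not meaningful, which is one reason the number of copies matters.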

Presentation without a title, Andy Rauber

- evaluation vs testing vs benchmarking
- DP testing is evaluation rather than testing, and far from benchmarking (a few tests, but nowhere near a definition of benchmarks)
o existing evaluations are not repeatable
o focus on the simple things
o building the frameworks before having clear test scenarios
- necessary to move towards comparative benchmarking
- what is needed: a commitment that we want a culture of benchmarking and comparative evaluation; an understanding of what we want to benchmark; benchmark data + ground truth; measurement scales and measures that remain constant; knowledge bases to collect these

Organizing digital preservation on an international level, Michelle Gallinger (NDIIPP)

- focus on a national DP agenda
- community driven, action oriented (National Digital Stewardship Alliance)

o present a distributed national digital collection for the benefit of citizens

The European Research Arena, David Giaretta

- technologies: GEANT, EGEE/EGI
- EU research projects: TIMBUS, BLOG4EVER, SCAPE, ENSURE, APARSEN, ARCOMEM, WF4EVER
o SCIDIP-ES (2011-2014)
- Alliance for Permanent Access (APA) formed as a legal entity 3 years ago
o opportunities for networking
- ISO 16363: Audit and Certification of Trustworthy Digital Repositories
- ISO 16919: Requirements for Bodies Providing Audit and Certification

Observation from the MetaArchive Cooperative program, Martin Halbert

- distributed DP programs differ from other programs
o replication of content, distribution of these replicated copies to distinct geographical locations, and network organization to connect these replicated copies
- MetaArchive established in 2003, funded by NDIIPP
o seeks to foster broader awareness of digital preservation issues
- IIPC members are all institutions that focus on WA
o 39 members: national and university libraries + other organizations (Internet Archive)
o ISO standard WARC format for web archives + Heritrix and NutchWAX
o growing membership (Africa & South America)

DAY 2

Keynote address: International and National Collaboration in the Digital Age, Gunnar Sahlin

- 2012: a new law for e-deposit
- Samsök (search together) in 2005 (upgrade, new system)
- SwePub and long-term preservation
- Consortium of the Swedish research libraries for licensing e-journals and databases (ICOLC)
- Open Access and e-publishing (all universities have their repositories for e-pubs)
- NL: aggregator for Europeana; TEL, Apres, Athena, EUscreen
o common system for the preservation of digital materials
o common search portal for materials from the Swedish National Library and Swedish National Archive

Raivo Ruusalepp

- standards: RAC, DSA, CIDOC (CRM), PAIMAS, ISA (DG), DDI
- use of information security standards for digital preservation
- information security: administrative and technical (physical = data security vs IT = communication)
- company-implemented security measurement with typical cyber crime scenarios
- survey of security
o provision for information security in national legislation and development plans (12 of the respondents: ISO 27000 series; only 2 formal audits, the rest are looking into it; ½ of the respondents do not use standards or formal measurements to control information security)
o IT & disaster plan: 65 (data recovery from the off-site location tested: 0)
- alignment
o better use of community standards for information security and preservation
o agreement on security requirements

Standards-based approach to preservation planning, Matthew Woollard

- ISO 27001: very expensive implementation, 100 000 pounds
- Basic: Data Seal of Approval Guidelines (helps you understand your business better)
- Audit and Certification of Trustworthy Digital Repositories
- Memorandum of Understanding to Create a European Framework
- ISO 16363 external; DSA or ISO 16363 self-audit

Best Practices & Standards, Bram van der Werf

- self-assessment trust, audits/certification trust
- ISO 30300 (draft): Records Management

PANEL 4: Legal Alignment

Legal deposit & Web Archiving, Adrienne Muir

- legal deposit provisions: purpose, scope, deposit mechanisms, roles & responsibilities & liability, access provisions, sanctions
- implementation: definitions, scope, offline/online, freely available/pay wall, technology neutral, incremental
- legal deposit vs voluntary (interim/hybrid approach, model agreements and licenses, flexibility)
- other legal issues: intellectual property rights, preservation, access, unlawful material, privacy/data protection
- voluntary approaches have disadvantages but may be necessary and can be useful

Breakout Session

- standards bring along alignment by themselves, but only if you use them to the full, not halfway
- depends on community (users): enforceable or voluntary compliance (standards)
- next step for standard alignment:
o corpora as benchmarks
o export/import completeness
o educational standards
o validation tools
o accredited training courses to accredit auditors
o framework standards

DAY 3

PANEL 5: Educational Alignment

- key elements of the DCC Curation Lifecycle Model
- Framework for the Education Alignment (USA: grad programs for digital preservation, new models for grad programs, internship programs (diverse knowledge), workshops)
o sharing tools
o national programs
- related issues to digital preservation:
o nature of costs and business models
o strategies for selection & appraisal
o ground roles and responsibilities
o effectiveness and demand for services
- focus of the panel: factors influencing the actual sustainability of a digital archive
- 2 considerations: collaboration + user demand
- challenges + gaps: span national boundaries, public + private funding, education, exportation, DP certification, competition, funding gaps, policy, selection criteria, roles & responsibilities, standards
- Magazzini Digitali: e-legal deposit in Italy
- PADI: a failed project (discontinued in 2010)

Presentation without a title, Neil Grindley

- how much does it cost to manage information; what institutional financial strategies are required to facilitate effective preservation; what general economic frameworks are required to enable information to persist and be accessible
- JISC 2010: Infrastructure for Education and Research Programme
- archival storage and preservation activities constitute a very small proportion of the overall costs: 1531, access 55, outreach, acquisition, ingest
o approx. 333 Euros for a set of 1000 records
- KRDS2 p. 83: future tool development supporting automation of ingest

Sustainable Preservation in North America: ADPNet & Friends, Aaron Trehub

- solution: distributed digital preservation (in at least 3 copies vs LOCKSS 6 copies)
- DDP + LOCKSS = PLN; open SW developed at Stanford
- MetaArchive, COPPUL (Canada), ADPNet: www.adpn.org


- ument kvalita parametry atd - -

- trolu

obrazu

N debata s familysearch 2052011 -------

i na problematice Digital Preservation

bude muset poskytnout zdarma

a synchronizovat

snaha spolupracovat archivy a knihovnami jejich strany


Business trip report
Project: Creation of the National Digital Library

CZ.1.06/1.1.00/07.06386

Name and surname of trip participant: Jan Hutař

Workplace according to organizational structure: ODF 81

Position: head of division

Reason for trip: attending the iPRES 2011 conference

Place (city): Singapore

Place (country): Singapore

Date (from-to): 30 October - 5 November 2011

Detailed schedule: 30-31 October: flight Prague > Dubai > Singapore

1 November: start of the conference, tutorials

2-4 November: conference

4-5 November: return flight Singapore > Dubai > Prague

Fellow travellers from NK: Mgr. Marek Melichar (funded from project 0136)

Funding: IOP, Creation of the National Digital Library

Relation to the project: gaining new information about digital preservation and about projects in other libraries; consultations with colleagues and companies

Trip goals: see relation to the project; use all outputs for planning and running the NDK project; use them for future handling of digital preservation at NK/NDK

Fulfilment of trip goals: fulfilled; see the detailed notes below and the proceedings on SPS


Further details: SUMMARY AND BENEFIT FOR THE NDK PROJECT

- noticeable rise of emulation as a solution for long-term preservation (migration in previous years) > both approaches will apparently complement each other

- shift towards preserving complex data, databases, etc.; NK does not address this yet

- many contributions are also applicable to NK and NDK (web archiving and preservation at the National Library of France, audits, emulation, information on the SDB system (Tessella) and on the RODA system, certification (see Rouchon), etc.)

- information on problems and solutions when using tools such as JHOVE, PRONOM, etc. in a real environment

- 2 contributions on backing up optical discs: a current problem at NK too; ideally follow the procedures described in the proceedings

- clear need for international cooperation and adherence to standards, so that such cooperation is possible

- for details see below

Support of project publicity: N/A

Related materials

Material: conference proceedings
Storage location: SPS, folder with business trip reports

Date of report submission: 15 November 2011

Signature of the submitter:

Date, signature:

Signature of the supervisor: 15 November 2011

Posted on the intranet:


Received by the international department:

Seamus Ross: digital curation and preservation

- preserving data sets: use of statistical databases and research data vs preservation of textual information
- personal data: these are no longer just photos in a box (Flickr etc.)
- banks: lots of personal data, future use by historians; it must be preserved, and institutions are also doing so
- see mckinsey.com, big data full report (PDF)
- why do DP at all (slide 16): future generations expect it; so that historians and scientists have some sources; a legacy of the present for the future; an information ecosystem to enable storytelling
- emphasis shifts from preserving textual information to preserving complex databases

A capability model for DP, Ch. Becker et al.

- research within the SHAMAN and SCAPE projects
- SoS: systems of systems
- 3 kinds of systems (UML)
- DPS where DP is a functional requirement
- SoS: a business system, a system within a system; data then flow into the DPS
- DPS where DP is not a functional requirement but the system does it anyway (DP-ready system): a business system with DP functionality
- but how to get DP into enterprise systems: a model for implementing DP in any system; within the SHAMAN project, a capability-based reference architecture
- governance, business and technical (operation) capability: a basis for decisions and for assessing the current state
- capability maturity model (CMM): processes, assessment and improvement, as in SW development

Olivier Rouchon: certification and quality at CINES

- they store theses, digitized items, multimedia, documents, scientific data sets
- data centre for the whole of France
- they have format experts (XML); 11 people
- 15TB of data
- certification: national law; CINES is the national centre for DP of theses; they have a department, people and money for it; for the procedure and preparation see below
- preparation for certification: testing DRAMBORA, DSA, TRAC, ISO 16363 and ISO 16919
- step: 2009 DRAMBORA audit; 2 risk reviews per year on how their resolution is progressing
- step: formalization of business processes; 14 processes according to ISO 9001
- management, operational and support processes (presented at iPRES 2010)
- 2009: external pre-audit; 2 people, 19 man-days; based on all available standards, by checking documentation and by interviews
- 2010: SIAF audit, 4 months; the National Archives of France does this for every archive that stores public data; they audit every 3 years; the report had 800 pages


- 2010: Data Seal of Approval, part of the EU framework for audit and certification of trusted repositories (MoU between the three certification activities)
- 2011: within the APARSEN project they also did ISO 16363; together with DANS and UKDA they went through that audit
- first internal, in January and April 2011 (60 man-days), then external, with 12 experts (KB, BL, NASA, etc.) in June 2011 (3 days)

MoU: DSA > ISO 16363 as the second step (internal) > ISO 16363 extended. The audit gets faster the more of them you do, i.e. if it is done regularly it is not so time-consuming.

The National Library of New Zealand passed TRAC certification in autumn 2011.

Andreas Rauber: impact of preservation actions on the repository

- what happens to the repository itself
- repository simulation: RepoSim
- for analysing and testing migration: what happens when files grow, what if the repository holds more types of formats, etc.
- RepoSim: a flexible simulator, irregular patterns
- so far an internal version; Hibernate, Java, MySQL
- it is possible to specify which formats the repository accepts, their description, ingest, settings, hypothetical tools (mainly for migration), and rules for preservation activities (migration to which format, which version, which files, how many times; rules + filters)
- it is possible to run a virtual migration; graphs are produced that show how long it will take, etc.
- what to migrate, how, to what and for how long is run virtually and we see the result
- good for planning IT and HW
- good for planning different scenarios, comparison with the expected development, planning HW growth and investments
- they still have to add the option to specify deletion policies, reports, etc.
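The idea of a virtual migration can be illustrated with a very small estimate of duration and resulting volume. This is not RepoSim and does not reflect its design; the classes, parameter names and figures below are assumptions made only to show the kind of calculation such a simulator performs.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class FormatHolding:
    fmt: str
    file_count: int
    avg_size_mb: float


@dataclass
class MigrationRule:
    source_fmt: str
    target_fmt: str
    throughput_mb_per_s: float  # assumed speed of a hypothetical tool
    size_factor: float  # expected target size / source size


def simulate(holdings: List[FormatHolding],
             rules: List[MigrationRule]) -> Dict[str, Dict[str, float]]:
    """Estimate hours needed and resulting volume for each migrated format."""
    rule_by_fmt = {rule.source_fmt: rule for rule in rules}
    results = {}
    for holding in holdings:
        rule = rule_by_fmt.get(holding.fmt)
        if rule is None:
            continue  # no rule: this format is left untouched
        volume_mb = holding.file_count * holding.avg_size_mb
        results[holding.fmt] = {
            "hours": volume_mb / rule.throughput_mb_per_s / 3600,
            "new_volume_mb": volume_mb * rule.size_factor,
        }
    return results
```

Running the same holdings against several rule sets gives a rough basis for comparing scenarios before any real migration is started.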

José Barateiro: Risk assessment in DP of e-science data and processes

- DP as risk management
- ISO 31000: definition of risk management
- similar to DRAMBORA
- there are many standards for risk management
- the ISO 31000 methodology broken down into individual steps
- TIMBUS project: http://timbusproject.net; one of the partners is SAP (Germany)

mad talks

- open source SW for LTP: RODA is back and is being developed within the SCAPE project; new functions, plans for further development and for building a user community
- 4 posters on emulation: emulation within KEEP, emulation for library reading rooms, OPF ecosystem, registry
- TOTEM: metadata standard for describing the technical environment for emulation


ANDS, Ross Wilkinson

- data centres in Australia, at least 3, for different areas of life
- ANDS has existed for almost 3 years; funded by the Australian government
- huge amounts of data will never be used or read by a human, only by automated processes (mining)
- research data must be stored and preserved because it may no longer be possible to recreate them; so that they can be reused, so that new conclusions can be drawn from them, and so that scientists have them available
- this must be done in cooperation; a single institution cannot do it alone
- who handles the storage of scientific data in the Czech Republic? The Academy of Sciences? CESNET?
- similar data centres also exist in Great Britain

Rob Sharpe: Considerations for High Throughput Digital Preservation

Presentation by Tessella: their performance testing of ingest into SDB at FamilySearch.

- SDB has been developed since 2002, when the first customer was the National Archives UK
- new customer: UK Parliament
- test with FamilySearch
- 20TB ingest per day; scanned materials; workflow with antivirus, characterization (PRONOM, JHOVE), etc.
- 1 package is roughly 1GB; 20 thousand packages per day
- 2 servers, Dell PowerEdge R710, total price max. 20 000 pounds
- it turned out that the limiting factor is the read speed of the disks on which the data are stored at the start of ingest; they therefore needed 130 parallel disks (50 thousand pounds)
- storing to tape is also slow; they therefore needed 8 parallel tape writes (30 thousand pounds)
- storage costs 100 pounds per TB
- 7.3 PB per year
- conclusion: writing and reading are slow; tools such as JHOVE and PRONOM are fast enough; storage costs also turned out to be high
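The figures above can be cross-checked with a few lines of arithmetic. The sketch below uses only the numbers from the notes (20 TB per day, roughly 1 GB per package, 100 pounds per TB) and assumes decimal units (1 TB = 10^12 bytes); the variable names are chosen just for this example.

```python
TB = 10**12  # decimal terabyte, an assumption about the units used
MB = 10**6
SECONDS_PER_DAY = 86_400

daily_ingest_bytes = 20 * TB
package_bytes = 10**9  # "roughly 1 GB" per package
storage_cost_per_tb = 100  # pounds

# Sustained rate needed to ingest 20 TB in 24 hours (about 231 MB/s).
required_mb_per_s = daily_ingest_bytes / SECONDS_PER_DAY / MB

# Number of packages per day at roughly 1 GB each.
packages_per_day = daily_ingest_bytes // package_bytes

# Yearly volume in TB and the storage cost it implies.
yearly_tb = 20 * 365
yearly_storage_cost = yearly_tb * storage_cost_per_tb
```

The yearly volume of 7300 TB matches the 7.3 PB per year in the notes, and at 100 pounds per TB it corresponds to 730 000 pounds of storage per year.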

For the ingest of data from the FamilySearch project they needed to ensure a throughput of 20TB of data per day while keeping adequate procedures for processing the data according to the requirements of OAIS and of the customer. The project was about identifying the bottlenecks of ingesting large volumes of data.

Processes such as generating hashes or checking them, identifying formats and extracting technical metadata usually require, at large volumes, a fast storage system. In the FamilySearch project they want to ingest into SDB (content acquisition, content preparation, ingest, fixity check, content/metadata integrity check, characterization, i.e. identification and validation of formats and extraction of technical metadata) at most 700MB per second.

They looked at how to parallelize such a massive workflow efficiently while minimizing costs. According to their findings, parallelization makes it possible to get around performance problems of tools such as DROID and JHOVE; overall, software performance was not a problem, contrary to their expectations. The bigger problems are in the HW, which has to be able to write fast enough.

That is, the bottleneck was in the HW and in moving data from place to place rather than in the performance of the digital preservation tools.
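Parallelizing a fixity step over many files can be sketched as follows. This is a generic Python illustration, not how SDB is implemented; the function names and the default of 8 workers are assumptions. As the notes say, on real hardware the disks rather than the hashing are likely to set the limit.

```python
import hashlib
from concurrent.futures import ThreadPoolExecutor
from pathlib import Path
from typing import Dict, Iterable


def sha256_of_file(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the SHA-256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def parallel_fixity(paths: Iterable[Path], workers: int = 8) -> Dict[str, str]:
    """Compute digests for many files using a pool of worker threads."""
    path_list = list(paths)
    with ThreadPoolExecutor(max_workers=workers) as pool:
        # pool.map preserves the input order of the paths.
        digests = list(pool.map(sha256_of_file, path_list))
    return {str(path): digest for path, digest in zip(path_list, digests)}
```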

Benefit for NK

CO-FINANCED BY THE EU STRUCTURAL FUNDS (EUROPEAN REGIONAL DEVELOPMENT FUND) THROUGH THE IOP

There is no need to fear the performance of software such as DROID or JHOVE.

Ross King: Evolving domains, problems and solutions for LT DP
- information about projects such as SCAPE
- Stephan Strodl, Petar Petrov, Andreas Rauber (Vienna University of Technology, Austria): Research on Digital Preservation within projects co-funded by the European Union in the ICT programme, http://cordis.europa.eu/fp7/ict/telearn-digicult/report-research-digital-preservation_en.pdf; a nice timeline of preservation projects, a white paper about the past of European DP
- funding spent on DP research is gradually growing; projects and funding alone will not solve anything
- ARCOMEM: web archiving, socially driven web preservation model, social web analysis, archive enrichment
- ENSURE: evaluation between cost and value, automation of the preservation lifecycle; testbeds: healthcare, clinical trials, financial services
- SCAPE: preservation planning and action workflows and how to make them scalable; building an infrastructure for scalable preservation actions; development of policy-based preservation planning tools with automated preservation watch; 3 testbeds: web archives, large-scale repositories, research data sets
- all projects will produce prototype software
- digital lifecycle approach
- preservation planning plays a role in all of these projects, together with virtualization
- slide with DP trends over recent years
- Wf4Ever: http://www.wf4ever-project.org/about
- Timbus: software is not enough; focuses on the context of organizations; LTP is not only about objects but also about services etc.
- Totem

Benefit for NK

Follow projects in the area of long-term preservation of digital data. The latest EU projects such as SCAPE will speed up the development of concrete tools for long-term preservation of digital data.

Record keeping in temporary command settings, Erik Borglund
- preservation of documentation of crisis situations arising from the work of the police etc.
- how to capture context
- flipcharts, videos and written records can be kept; the context of analogue documents is not a problem, the problem lies with digital items and conversations
- the national archive should take care of this, but it only accepts paper documents or, for example, photos from the meeting place
- the question of archiving case files is the same as archiving the course of a meeting in digital form

Web archiving session

BnF
- 200 TB of web-archived data, 15 million ARC files
- they have to characterize and validate them, which is time-consuming
- they store them in a shared repository
- SPAR (the LTP system of the French national library) has a capacity of 16PB
- they use JHOVE2 for characterization and are building a module for ARC files
- they do not want to characterize and validate the content of the ARC files, only identify formats
- PREMIS in METS would be too long, so they will record only metadata at the level of the information package (AIP) that are the same for the whole package; that is, a property is expressed once and then only the information about which files it applies to is added, instead of repeating the information for every file
- they created a special metadata format
- they are thus able to ask the LTP system "give me all information packages that contain format XY" etc.; there is no need to index the metadata of the contents, which would take a long time
- they take the same approach for digitized books
- different DP policies and validation levels for different types of web archive data: complete harvests vs thematic harvests

NL NZ
- 2 harvests, 20 TB in total
- they are working on metadata: how much metadata is a lot and how much is too little
- library policy says that as much metadata as possible must be stored, but that would be a problem in terms of metadata size
- for selective web harvests they have a finished workflow in WCT; everything is catalogued

IA
- 16 billion URLs
- the oldest from 1996
- growth is 3 TB per day, 1 PB per year
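The package-level approach described above (state a format once, then list the files it applies to) can be sketched as a small data structure. This is only an illustration under assumptions: the function names and the format identifiers are hypothetical and do not reproduce BnF's actual metadata format.

```python
from collections import defaultdict


def package_level_formats(file_formats):
    """Collapse per-file format entries into one entry per format.

    `file_formats` maps file name -> format identifier.
    Returns format identifier -> sorted list of file names.
    """
    grouped = defaultdict(list)
    for name, fmt in file_formats.items():
        grouped[fmt].append(name)
    return {fmt: sorted(names) for fmt, names in grouped.items()}


def packages_containing(packages, fmt):
    """Answer 'give me all packages that contain format XY' from package-level metadata only.

    `packages` maps package id -> result of package_level_formats().
    """
    return sorted(pid for pid, formats in packages.items() if fmt in formats)
```

The query only touches one small dictionary per package, so the content of the packages never has to be indexed.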

Euan Cochrane, Dirk von Suchodoletz: Replicating Installed Application and Information Environments onto Emulated or Virtualized Hardware

- capturing and preserving a complete environment on emulated hardware
- for example, take the desktop environment of the prime minister and store it in the archive
- display problems
- computer forensics
- an option for preserving scientific data and records
- the whole thing is about how to replicate an HDD and run the environment on it in a virtual environment
- solution: they pulled the HDDs out of several old PCs > identify the hardware requirements (analysis of the HDD > automatic estimate of the requirements, this is part of every PC environment) > choose emulation/virtualization software (tool registry such as TOTEM from the KEEP project) > convert the HDD to a disk image suitable for emulation > try to boot the disk image on the emulated hardware > add drivers
- problems with licences, personal data protection and authenticity (for 20% of items the colours change etc.)
- QEMU, SPARC processor emulator
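The last two steps of that chain (convert the disk to an image, then try to boot it) can be sketched with QEMU's command-line tools. This is a minimal sketch, not the authors' tooling: the helper names, the file names, the memory size and the choice of `qemu-system-i386` are assumptions; the commands are only built here, not executed.

```python
def convert_command(raw_image, qcow2_image):
    """qemu-img call that converts a raw dump of the original HDD to qcow2."""
    return ["qemu-img", "convert", "-f", "raw", "-O", "qcow2", raw_image, qcow2_image]


def boot_command(qcow2_image, memory_mb=256, binary="qemu-system-i386"):
    """QEMU call that boots the converted image.

    -snapshot writes changes to a temporary file, so the archived image stays unmodified.
    """
    return [binary, "-m", str(memory_mb), "-hda", qcow2_image, "-snapshot"]


if __name__ == "__main__":
    # Hypothetical file names; pass the lists to subprocess.run(..., check=True) to execute.
    print(" ".join(convert_command("old-pc.raw", "old-pc.qcow2")))
    print(" ".join(boot_command("old-pc.qcow2")))
```

Booting with `-snapshot` is a deliberate choice here: a first boot of an old operating system usually writes to the disk, and the preserved image should not change.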

Klaus Rescher: Remote Emulation for Migration Services in a Distributed Preservation Framework

- use of emulation as a tool for migration
- migration tools for certain formats are often not available
- the digital object is placed into the emulated environment (a virtual machine); it is then visible inside the emulated system, where it can be opened in the original or another suitable application, saved in a different format and stored again in the virtual machine

Bram Lohman: Emulation as a Business Solution: the Emulation Framework (KEEP project)

- emulation framework: 7 emulators, 6 platforms (x86, Amiga etc.), 23 file formats
- a solution for managing emulation tools and setting up emulation processes
- an environment that contains emulators; if an application or a file is loaded into it, it should run as it did in its original environment
- the environment also contains a tool that shows, based on PRONOM, what format a file is and what environment is needed to run it; the environment can be prepared right away and the file run in it
- it loads a software image from the OPF application database, which is being built

Geofrey Brown: Developing Virtual CD-ROM Collections: The Voyager Company Publications

- publications of one particular publisher on CD, interactive applications for Mac; of the 200 published, only about 50 are available today
- emulation on today's systems
- HDD snapshot directly in the emulator, i.e. it takes one click and is very fast
- SheepShaver emulator

Evaluation of the large Danish migration project

- Before 1998 they had no formats prescribed by law.
- Between 2005 and 2008 they introduced standards.
- The evaluation concerns the prescribed standards and the migration to them in the national archive.
- The evaluation was done for the body that funded the work.
- Between 2005 and 2008 they spent 30 person-years on migration, with 10 to 15 people working on it; investment 190 thousand USD, total costs 26 million USD.
- In reality it is not much data: they migrated about 1777GB.
- Various parts of the archive: tapes, population data, data on CD-R, registries and electronically filled-in data.
- They could not read all the files, especially those on tape.
- 5 different types of tape.
- Some had to be rescued at great expense.

Total cost of producing the preservation standard: 10 man-years, 12 thousand USD, including the manual and implementation recommendations.

Pilot: project planning and management, and verification of the information packages.

The goal of the pilot was to obtain a better budget and project plan.

Some problematic data in old formats, such as old databases, need smart people doing repetitive, long-running work; they needed good knowledge management to make it efficient.

Migration method: they wrote the requirements for the tool and a description of how manual migration should be done.

Data preparation (restructure the data and register the metadata of the IP) and preparation of the documentation of the migrated IPs.

Software development: in-house development.

They needed 50 person-years for 1

Conclusions

Migration of standard data is cheaper, migration from some standard tapes is cheaper, etc.

Most of the costs, 80%, fell on non-standardized data, in building the migration software. Developing a tool for migrating heterogeneous or non-standard data is the most expensive part.

What they learned: the old data had not been analysed sufficiently.

Project management was loose, and they lost money.

Knowledge management: a good description of the old data and of all their types, generations, locations etc. does not exist at our institution and we will have difficulties with it; migration of old data at NK will be problematic.

Angela Dappert: robust migration workflow for offline media
- What is an archival object: nice slide. A CD is not an archival object; for them it is a hand-held carrier. A bit-stable object is better, as it can have a backup etc., and on top of it comes the archival object, which has additional metadata: logical preservation
- A CD is not searchable, cannot be replicated easily, has a large manual overhead, and the rendering technology becomes obsolete very quickly
- Endangered Archives project: optical disks, CD-R, external HD, tapes, 67 terabytes in total
- The offline hand-held carriers in the Endangered Archives project were highly variable; they contained data with DRM, under copyright, and with a range of such problems
- Options they chose between for each data source:
- Disk image: a single file that contains everything on the carrier
- Or extraction of only some files
- How important is the carrier itself? We need to have some information about it; it may hold traces of deleted data, and we may want to keep them. They made disk images of everything possible: hybrid DVDs, audio discs that also held data, etc.
- Which disk image format should they use? Not a single disk image format for all data, but a specific disk image format for each case
- They did it with robots, disk copying robots; sometimes large-scale disk copying robots could not be used, because they are good at producing CDs but not at ripping data from CDs

- They built their own application with disk stacks and some smaller robots; the robots used LIFO or FIFO. In the end they used FIFO; LIFO had problems with the CD lifter

At the KB they thought through a fairly complex workflow, how to describe it, etc.

They had a PC at each robot.

They had problems with a number of things, see the presentation.

They did not find good software for managing the images, only command-line tools, and non-technical staff would make a lot of mistakes with those.

It takes a lot of people before the content gets online.

The staff must be well trained and flexible, but also able to do tedious jobs: systematic, patient.

NOTE: important for NK, where the transfer of data from disks will also have to be dealt with, and already has been.

KEEP project, Antonio Ciuffreda: Towards an integrated migration environment

- Disk transfer tool: converts a disk to an image file. It also holds additional metadata about the file system, the MD5 of the file etc.
- KEEP is building a Transfer Tool Framework
- Magnetic media: disk transfer tools for floppy disks, commercial and open source
- Disk2FDI: commercial DOS tool, very accurate floppy disk image; it takes 1 hour and the result is very large, ten times larger than the floppy disk itself. Tested on about 2260 disks; emulation was tested
- Catweasel: commercial tool, a PCI card, runs on Linux and Win XP, has a GUI. High error rate but faster; the quality of the image file was low
- Nibtools: free tool, G64 and D64, covers only the C64; DOS, Win, Linux, but command line only. Needs a Commodore disk drive and special cables. They tested a few disks; about half of them then did not work in the emulator
- Optical media: they used 5 transfer tools, with the same CDs, DVDs and games for all of them
1 Alcohol 120: commercial, can bypass DRM etc., supports Win systems. 12 out of 13 worked; 2 MB per second
2 Daemon Tools: commercial, several image file types (ISO, MDS, MDF), supports Win; three out of 13 did not work
3 CloneCD: commercial, uses IMG or ISO, bypasses SafeDisc 3 protection, supports Win, has a GUI; 11 out of 13 were OK
4 Blindwrite: commercial, supports DVD, Blu-ray, WM, Xbox and other special discs. Generates ISO and some proprietary ISO image formats; one did not work; transfer speed
5 ImgBurn: does not read subchannel information into the image file (a film cannot be seeked etc.); it is open source; generates DVD, BIN, CUE, IMG; Win and Linux; 4 did not work; it is fast
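The pass/fail counts reported for the five optical tools can be turned into failure rates for comparison. This is a sketch under one stated assumption: the notes give "out of 13" for three tools only, and the same 13 test discs are assumed here for Blindwrite and ImgBurn. The function name is illustrative.

```python
TESTED = 13  # discs per tool; stated for three tools, assumed for Blindwrite and ImgBurn

# Number of discs that did not work, as reported in the notes.
FAILED = {
    "Alcohol 120": 1,
    "Daemon Tools": 3,
    "CloneCD": 2,
    "Blindwrite": 1,
    "ImgBurn": 4,
}


def failure_rates(failed=FAILED, tested=TESTED):
    """Return the failure rate per tool in percent, rounded to one decimal place."""
    return {tool: round(100 * count / tested, 1) for tool, count in failed.items()}


if __name__ == "__main__":
    for tool, rate in sorted(failure_rates().items(), key=lambda item: item[1]):
        print(f"{tool}: {rate}% of discs failed")
```

Under that assumption the rates run from roughly 8% to 31%, which is in line with the error range quoted in the conclusions.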

Conclusions

For magnetic media: commercial and non-commercial tools do not differ in performance; Disk2FDI is accurate but very slow. KEEP will use NibTools.

Optical media: copy protection has to be kept in mind; the tools perform similarly; there will always be errors in the images, between 30 and 10 percent; Blindwrite handles game discs, Xbox etc. KEEP will use ImgBurn because it is open source.

For complex images Blindwrite is better.

Benefit for NK

Consider whether NK should actually run a project to migrate the content of CDs and DVDs to online media. The speakers present concrete experience with robotic processing and show what problems they had with designing the workflow, choosing the type of ISO image etc.

BnF: web archiving

They have three layers:

- Harvest definition: collection
- Harvest instance: crawling, metadata
- ARC files

A collection would be, for example, the selective harvest for the 2000 elections, followed by the individual harvest instances and then the ARC files. They collect logs, config and reports. These are also stored in ARC: special metadata ARC files for each crawling instance.

PREMIS

Object, agent, event

Objects: 1. ARC files and metadata ARC files, 2. harvest instances

Harvest event in PREMIS: event of creation of content files

Events: reports as extensions of the event, host report and harvest report

Agents: administrator, software, institutions, organizations that perform the harvest
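The three layers and the PREMIS entities from the notes can be pictured as a small object model. The sketch below is an illustration only: the class and field names are assumptions and do not reproduce BnF's actual PREMIS or containerMD schema.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Agent:
    name: str
    kind: str  # for example "administrator", "software", "institution"


@dataclass
class Event:
    event_type: str  # for example "harvest" (creation of content files)
    agents: List[Agent]
    reports: List[str] = field(default_factory=list)  # e.g. host report, harvest report


@dataclass
class ArcFile:
    name: str
    is_metadata_arc: bool = False  # logs, config and reports are stored in ARC as well


@dataclass
class HarvestInstance:
    identifier: str
    event: Event
    arc_files: List[ArcFile] = field(default_factory=list)


@dataclass
class HarvestDefinition:
    collection: str
    instances: List[HarvestInstance] = field(default_factory=list)

    def arc_names(self):
        """List every ARC file name that belongs to this collection."""
        return [arc.name for inst in self.instances for arc in inst.arc_files]
```

Keeping the event and its agents on the harvest instance, rather than on each ARC file, mirrors the idea of recording information once at the highest level where it applies.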

ContainerMD

http://bibnum.bnf.fr/containerMD-v1/documentation/containerMD-v1.html

Special metadata for material from the web archive

http://bibnum.bnf.fr/containerMD-v1

Different SLAs for different types of material and for different data from different harvests. Shared repository: they expect various benefits; skills for different formats do not need to be duplicated within the institution.

Next year there should also be a JHOVE2 module for WARC.

Memento: meta search engine

Benefit for NK

Their web archiving model could be used at NK.

Cost models: Danish national library + TU Wien
- Stephan Strodl, TU Wien: they have their own cost model, but it seems to cover only the cost of small-scale automated preservation actions
- The Danish national library built its own model, which should be universal and usable anywhere
- They measured the cost of submission according to the PAIMAS standard
- When calculating costs they use OAIS and PAIMAS: they map activities onto these models and then estimate the process costs accordingly
- Costmodelfordigitalpreservation.dk

Benefit for NK

Relevant to project 0136, which dealt with options for estimating the costs of long-term storage.

Meet RODA, a Full-Fledged Digital Repository for Long-Term Preservation

- Originally a project of the Portuguese national archive, which we have been following for several years. RODA is now supported, and partly further developed, by an independent company. So far RODA supports only an archival metadata format (EAD), but further development should also include library formats
- RODA is now part of the SCAPE project, where it will be possible to develop the system further and scale it for use in mass production
- http://redmine.keep.pt/projects/roda-public

Benefit for NK

Follow further development, possibly also for the INCAD + KNAV projects; for developing LTP for smaller institutions this could be an interesting alternative in the future.

Report on a business trip abroad

Name and surname of the trip participant: Mgr. Andrea Fojtů (AF)

Workplace according to the organizational structure: 1.5 Department of Long-Term Preservation of Digital Data (ODODD)
Workplace assignment: 1.5.2 Department of Digital Repository Content Management
Reason for the trip: Participation in the conference Aligning National Approaches to Digital Preservation
Place (city): Tallinn
Place (country): Estonia
Date (from-to): 22.-26. 5. 2011

Detailed schedule:
Monday 23. 5.: conference registration, guided tour of the National Library, Keynote Address by Laura Campbell (Library of Congress, USA), Panel 1: Technical Alignment, Panel 2: Organizational Alignment
Tuesday 24. 5.: Keynote Address by Gunnar Sahlin (National Library of Sweden), Panel 3: Standards Alignment, Panel 4: Legal Alignment, Breakout Sessions for panels 3 & 4
Wednesday 25. 5.: Panel 5: Education Alignment, Panel 6: Economic Alignment, Breakout Sessions for panels 5 & 6, Synthesis, Closing remarks

Fellow travellers from NK: PhDr. Bohdana Stoklasová (BS), Ing. Tomáš Svoboda (TS)

Funding: IOP NDK
Goals of the trip: Attendance at a conference with international participation, gaining contacts in the area of long-term preservation of digital documents and electronic legal deposit, a more detailed insight into the issues of long-term preservation of digital documents (especially) in national libraries

Fulfilment of the goals of the trip (specifically): The conclusions of the conference Aligning National Approaches to Digital Preservation largely mirror the conclusions of the workshop The Future of the Past: Shaping new visions for EU research in digital preservation (report available at http://cordis.europa.eu/fp7/ict/telearn-digicult/future-of-the-past_en.pdf), for example regarding the missing economic model for the commercial sector, which would treat long-term preservation as an integral part of all its processes. Contact was established with staff of the national libraries of Estonia and Finland (staff working on web archiving and on long-term preservation in general).

Programme and further details: The main goal of the conference was to align national approaches to long-term preservation of digital documents across all areas, from technical, organizational and educational through to standardization, economic and financial.

Materials brought back: conference programme, leaflets of exhibiting companies (Tessella, Equella, Guardtime) + other materials, notes

Date of submission of the report: 8. 6. 2011
Signature of the person submitting the report:

Signature of the superior:

Posted on the intranet:
Received in the international department:

Appendix to this report: Conference notes in English

Appendix: Conference notes in English

Exploring What We Can Do Together: Strategic Alliance for International Collaboration, Laura Campbell

- 185 digital preservation partners in more than 25 countries (education, research, LAM)
- strategic goals: National Content Stewardship Network (national digital collection, technical architecture, public policy, outreach)
- NDIIPP Content Domain Map: a mind map of geospatial, audiovisual, image & text, and web content
- then & now: cognitive surplus vs digital libraries/digital preservation
- solution framework: actively working together, special interest groups, establishing a common index, international digital collection (freely available)

PANEL 1: Technical Alignment (The role of testing), Prof. Dr. Michael Seadle (Panel Head)

- to collaborate on requiring and implementing rigorous and independent tests

DNB Contribution to the Tallinn Alignment, Sabine Schrimpf

- key theme is infrastructure
- network of hardware and software that permits the operation of application SW
- the question of interoperability is crucial (standards, technical specifications)
- SW elements
- components of the DP infrastructure were compared to the pallets at railroads
  o source: PARSE.Insight Roadmap 2020
- persistent ID resolvers, certification process
- kopal (ingest: koLibRI)
- nestor (German Network of Expertise)
- DP4Lib: Digital Preservation as a Service, reduce dependency between components
  o redundant storage at different locations
  o koLibRI modules
- LuKII: set up as an economical LOCKSS network in Germany
- SHAMAN
- APARSEN: wants to bring coherence and cohesion to digital preservation research
  o trends in DP research projects
  o modular DP systems
  o distributed as SOA
  o elimination of technology dependencies

EDINA: The UK LOCKSS Alliance, Adam Rusbridge

- EDINA offers underlying technical support & coordination
- threats to digital stewardship
  o failure (media, HW, SW, network, format obsolescence, natural disaster, economic/organizational failure)
  o attack (insider/outsider)
  o operator error
- source: Requirement to Digital Preservation
- projects PEPRS and PECAN help identify coverage and requirements for DP

Public testing, Michael Seadle

- traditional physical archiving relies heavily on trusted institutions
- distrust, not trust, needs to be the basis of digital archiving; testing therefore plays a key role
- goals of testing: demonstrate functionality, reveal weaknesses, provide data for planning improvements
- key issues for testing: integrity, authenticity (can the origin or genuineness be shown), usability (can migration/emulation be demonstrated), access, financial integrity
- DrWho (drwho1com)
- bit stream testing is the most important; authenticity and usability may be impaired
  o the type of storage media, the number of copies + the frequency of checking and replacement get us to the relevant results
  o no reliable metrics exist, however (what is an acceptable loss etc.)
- without well documented, peer reviewed, publicly available test results, librarians are buying archiving systems on faith

Presentation without a title, Andy Rauber

- evaluation vs testing vs benchmarking
- DP today: evaluation rather than testing, and far from benchmarking (a few tests, but nowhere near a definition of benchmarks)
  o existing evaluations are not repeatable
  o focus on the simple things
  o building the frameworks before having clear test scenarios
- necessary to move towards comparative benchmarking
- what is needed: a commitment that we want a culture of benchmarking and comparative evaluation, an understanding of what we want to benchmark, benchmark data + ground truth, measurement scales and measures that remain constant, knowledge bases to collect these

Organizing digital preservation on an international level, Michelle Gallinger (NDIIPP)

- focus on a national DP agenda
- community driven, action oriented (National Digital Stewardship Alliance)
  o present a distributed national digital collection for the benefit of citizens

The European Research Arena, David Giaretta

- technologies: GEANT, EGEE/EGI
- EU research projects: TIMBUS, BLOG4EVER, SCAPE, ENSURE, APARSEN, ARCOMEM, WF4EVER
  o SCIDIP-ES (2011-2014)
- Alliance for Permanent Access (APA): formed as a legal entity 3 years ago
  o opportunities for networking
- ISO 16363: Audit and Certification of Trustworthy Digital Repositories
- ISO 16919: Requirements for Bodies Providing Audit and Certification

Observation from the MetaArchive Cooperative program, Martin Halbert

- distributed DP programs differ from other programs
  o replication of content, distribution of these replicated copies to distinct geographical locations, and network organization to connect these replicated copies
- MetaArchive: established in 2003, funded by NDIIPP
  o seeks to foster broader awareness of digital preservation issues
- IIPC: members are all institutions that focus on WA
  o 39 members: national and university libraries + other organizations (Internet Archive)
  o ISO standard WARC format for web archives + Heritrix and NutchWAX
  o growing membership (Africa & South America)

DAY 2

Keynote address: International and National Collaboration in the digital age, Gunnar Sahlin

- 2012: a new law for e-deposit
- Samsök (search together) in 2005 (upgrade, new system)
- Swepub and long-term preservation
- consortium of the Swedish research libraries for licensing e-journals and databases (ICOLC)
- Open Access and e-publishing (all universities have their repositories for e-pubs)
- NL as aggregator for Europeana: TEL, Apres, Athena, EU screen
  o common system for the preservation of digital materials
  o common search portal for materials from the Swedish National Library and the Swedish National Archive

Raivo Ruusalepp

- standards: RAC, DSA, CIDOC (CRM), PAIMAS, ISA (DG), DDI
- use of information security standards for digital preservation
- information security: administrative and technical (physical = data security vs IT = communication)
- a company implemented security measurement with typical cyber crime scenarios
- survey of security
  o provision for information security in national legislation and development plans (12 of the respondents use the ISO 27000 series, only 2 have had formal audits, the rest are looking into it; ½ of the respondents do not use standards or formal measurements to control information security)
  o IT & disaster plan: 65 (data recovery from the off-site location tested: 0)
- alignment
  o better use of community standards for information security and preservation
  o agreement on security requirements

Standards-based approach to preservation planning, Matthew Woollard

- ISO 27001: very expensive implementation, 100,000 pounds
- Basic: Data Seal of Approval Guidelines (helps you understand your business better)
- Audit and Certification of Trustworthy Digital Repositories
- Memorandum of Understanding to Create a European Framework
- ISO 16363: external; DSA or ISO 16363: self-audit

Best Practices & Standards, Bram van der Werf

- self-assessment: trust
- audits/certification: trust
- ISO 30300 (draft): Record Management

PANEL 4 Legal Alignment

Legal deposit & Web Archiving, Adrienne Muir

- legal deposit provisions: purpose, scope, deposit mechanisms, roles & responsibilities & liability, access provisions, sanctions
- implementation: definitions, scope (offline/online, freely available/pay), well technology-neutral, incremental
- legal deposit vs voluntary (interim/hybrid approach, model agreements and licenses, flexibility)
- other legal issues: intellectual property rights, preservation, access, unlawful material, privacy/data protection
- voluntary approaches have disadvantages but may be necessary and can be useful

Breakout Session

- standards bring alignment by themselves, but only if you use them to the full, not halfway
- depends on the community (users): enforceable or voluntary compliance (standards)
- next steps for standards alignment
  o corpora as benchmarks
  o export/import completeness
  o educational standards
  o validation tools
  o accredited training courses to accredit auditors
  o framework standards

DAY 3

PANEL 5: Educational Alignment

- key elements of the DCC Curation Lifecycle Model
- Framework for the Education Alignment (USA: grad programs for digital preservation, new models for grad programs, internship programs (diverse knowledge), workshops)
  o sharing tools
  o national programs
- issues related to digital preservation
  o nature of costs and business models
  o strategies for selection & appraisal
  o ground roles and responsibilities
  o effectiveness and demand for services
- focus of the panel: factors influencing the actual sustainability of a digital archive
- 2 considerations: collaboration + user demand
- challenges + gaps: span national boundaries, public + private funding, education, exportation, DP certification, competition, funding gaps, policy, selection criteria, roles & responsibilities, standards
- Magazzini Digitali: e-legal deposit in Italy
- PADI: a failed project (discontinued in 2010)

Presentation without a title, Neil Grindley

- how much does it cost to manage information; what institutional financial strategies are required to facilitate effective preservation; what general economic frameworks are required to enable information to persist and be accessible
- JISC 2010 Infrastructure for Education and Research Programme
- archival storage and preservation activities constitute a very small proportion of the overall costs: 15%, vs 31% for access and 55% for outreach, acquisition and ingest
  o approx. 333 Euros for a set of 1000 records
- KRDS2, p. 83: future tool development supporting automation of ingest

Sustainable Preservation in North America: ADPNet & Friends, Aaron Trehub

- solution: distributed digital preservation (in at least 3 copies vs LOCKSS 6 copies)
- DDP + LOCKSS = PLN; open SW developed at Stanford
- MetaArchive, COPPUL (Canada), ADPNet: www.adpn.org

Business trip report
Project: Creation of the National Digital Library

CZ.1.06/1.1.00/07.06386

Name and surname of the trip participant: Jan Hutař

Workplace according to the organizational structure: ODF 8.1

Workplace assignment: head of division

Reason for the trip: attending the iPRES 2011 conference

Place (city): Singapore

Place (country): Singapore

Date (from-to): 30. 10. - 5. 11. 2011

Detailed schedule:
30.-31. 10.: flight Prague > Dubai > Singapore

1. 11.: start of the conference, tutorials

2. 11. - 4. 11.: conference

4.-5. 11.: return flight Singapore > Dubai > Prague

Fellow travellers from NK: Mgr. Marek Melichar (funded from project 0136)

Funding: IOP, Creation of the National Digital Library

Relation to the project: gaining new information about digital preservation issues and about projects in other libraries, consultations with colleagues and companies

Goals of the trip: see relation to the project; use all outputs for planning and running the NDK project and for future solutions to digital preservation issues at NK/NDK

Fulfilment of the goals of the trip: fulfilled, see the detailed notes below and the proceedings on the SPS

Further details: SUMMARY AND BENEFIT FOR THE NDK PROJECT

- a noticeable rise of emulation as an approach to long-term preservation (migration dominated in previous years); the two approaches look set to complement each other
- a shift toward preserving complex data, databases and the like; NK does not address this yet
- many contributions are usable in NK and NDK (web archiving and preservation at the national library of France, audits, emulation, information on the SDB system (Tessella) and on the RODA system, certification, see Rouchon, etc.)
- information on the problems of, and solutions for, using tools such as JHOVE and PRONOM in a real environment
- 2 papers on backing up optical discs, a current problem in NK as well; ideally follow the procedures described in the proceedings
- a clear need for international cooperation and for adherence to standards that make such cooperation possible
- for more detail see below

Project publicity support: N/A

Related materials

Material: conference proceedings

Storage location: SPS, folder with business trip reports

Date of report submission: 15 November 2011

Signature of the submitter:

Date, signature:

Signature of the superior: 15 November 2011

Posted on the intranet:

Received by the international department:

Seamus Ross: digital curation and preservation
- preserving data sets: use of statistical databases and research data versus preservation of textual information
- personal data are no longer just photos in a box (Flickr etc.)
- banks hold a lot of personal data that historians will use in the future, so it has to be kept; institutions do the same
- see mckinsey.com, big data full report (PDF)
- why do DP at all (slide 16): future generations expect it; historians and scientists need sources, a reference about the present for the future; an information ecosystem to enable storytelling
- emphasis is moving from the preservation of textual information to the preservation of complex databases

A capability model for DP, Ch. Becker et al.
- research within the SHAMAN and SCAPE projects
- SoS: systems of systems
- 3 kinds of systems (UML):
- a DPS, where DP is a functional requirement
- an SoS, a business system as a system within a system, whose data then flow into a DPS
- a DPS where DP is not a functional requirement but is done anyway (a DP-ready system): a business system with DP functionality
- how to get DP into enterprise systems: a model for implementing DP in any system; within the SHAMAN project, a capability-based reference architecture
- governance, business and technical (operation) capability: a basis for decisions and for assessing the current state
- capability maturity model (CMM): process assessment and improvement, as in software development

Olivier Rouchon: certification and quality at CINES
- they store theses, digitized material, multimedia, documents and scientific data sets
- a data centre for the whole of France
- they have experts on formats and XML; 11 people
- 15 TB of data
- certification: by national law CINES is the national centre for the DP of theses; they have a department, people and money for it; for the procedure and preparations see below
- preparation for certification: testing DRAMBORA, DSA, TRAC, ISO 16363 and ISO 16919
- step: 2009 DRAMBORA audit, 2 risk reviews per year, how the resolution of the risks is progressing
- step: formalization of business processes, 14 processes according to ISO 9001
- management, operational and support processes (presented at iPRES 2010)
- 2009: external pre-audit, 2 people, 19 man-days, based on all available standards, carried out by checking documentation and by interviews
- 2010: SIAF audit, 4 months; the national archives of France do this for every archive that stores public data, and they audit every 3 years; the report had 800 pages

- 2010: Data Seal of Approval, part of the EU framework for audit and certification of trusted repositories (an MoU between the three certification activities)
- 2011: within the APARSEN project they also did ISO 16363; they went through that audit together with DANS and UKDA
- first internal, January and April 2011 (60 man-days), then external with 12 experts (KB, BL, NASA etc.) in June 2011 (3 days)

MoU: DSA > ISO 16363 as the second step (internal) > ISO 16363 extended. An audit gets faster the more of them you do; if audits are regular, they are not that time-consuming.

The National Library of New Zealand passed TRAC certification in autumn 2011.

Andreas Rauber: the impact of preservation actions on repositories
- what happens to the repository itself
- repository simulation: RepoSim
- built for analysis and for testing migration: what happens when files grow, what happens when the repository holds more format types, etc.
- RepoSim is a flexible simulator that handles irregular patterns
- so far an internal version: Hibernate, Java, MySQL
- it lets you specify which formats are accepted and their description, ingest, the settings of hypothetical tools (mainly for migration), and the rules for preservation actions (migration to which format, which version, which files, how many times; rules + filters)
- a virtual migration can be run; the resulting graphs show how long it will take etc.
- what to migrate, how, with what and for how long: it all runs virtually and we see the result
- good for IT and HW planning
- good for planning different scenarios, comparing them with the expected development, and planning HW growth and investments
- still to be added: the option to enter deletion policies, reports etc.
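The idea of a "virtual migration" can be illustrated with a very small back-of-the-envelope model. This is not RepoSim and does not reproduce its behaviour; the class `FormatHolding`, the function `simulate_migration`, and all figures (file counts, sizes, throughput, growth) are assumptions chosen only to show the kind of question such a simulator answers.

```python
from dataclasses import dataclass


@dataclass
class FormatHolding:
    """Hypothetical description of one format held in a repository."""
    name: str
    file_count: int
    avg_size_mb: float


def simulate_migration(holdings, source_format, throughput_mb_per_hour,
                       yearly_growth=0.0, years_until_start=0):
    """Estimate the volume and duration of migrating one format.

    The holdings are first grown by `yearly_growth` for `years_until_start`
    years; the resulting volume is then divided by the tool throughput.
    """
    growth = (1.0 + yearly_growth) ** years_until_start
    selected = [h for h in holdings if h.name == source_format]
    files = sum(round(h.file_count * growth) for h in selected)
    volume_mb = sum(round(h.file_count * growth) * h.avg_size_mb
                    for h in selected)
    return {
        "files": files,
        "volume_mb": volume_mb,
        "hours": volume_mb / throughput_mb_per_hour,
    }


# Illustrative figures only (assumed, not taken from the talk).
holdings = [
    FormatHolding("TIFF", file_count=1_000_000, avg_size_mb=30.0),
    FormatHolding("PDF", file_count=200_000, avg_size_mb=2.0),
]
plan = simulate_migration(holdings, "TIFF", throughput_mb_per_hour=50_000,
                          yearly_growth=0.10, years_until_start=2)
print(plan)
```

Running the same function with different growth rates or tool throughputs gives a crude comparison of scenarios, which is the planning use the talk describes for IT and hardware.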

José Barateiro: Risk assessment in DP of e-science data and processes
- DP as risk management
- ISO 31000: definition of risk management, similar to DRAMBORA
- there are many standards for risk management
- the ISO 31000 methodology broken down into individual steps
- TIMBUS project, http://timbusproject.net; one of the partners is SAP (Germany)

Mad talks
- open source SW for LTP: RODA is back and is being developed within the SCAPE project; new functions, plans for further development and for building a user community
- 4 posters on emulation: emulation within KEEP, emulation for library reading rooms, OPF ecosystem registry
- TOTEM: a metadata standard for describing the technical environment for emulation

ANDS, Ross Wilkinson
- data centres in Australia, at least 3 for different areas of life
- ANDS has existed for almost 3 years, funded by the Australian government
- a huge amount of data will never be used or read by a human, only by automated mining processes
- research data must be stored and preserved because it may no longer be possible to recreate them, so that they can be reused, new conclusions can be drawn from them, and scientists have them available
- this has to be done in cooperation; it cannot be done on the authority of a single institution
- who handles the storage of scientific data in the Czech Republic? The Academy of Sciences? CESNET?
- similar data centres also exist in Great Britain

Rob Sharpe: Considerations for High Throughput Digital Preservation
A presentation by Tessella on their performance testing of ingest into SDB at FamilySearch.

- SDB has been developed since 2002, when the first customer was the National Archives (UK)
- new customer: the UK Parliament
- test with FamilySearch
- 20 TB ingested per day, scanned material, a workflow with antivirus, characterization (PRONOM, JHOVE) etc.
- 1 package is roughly 1 GB, 20 thousand packages per day
- 2 Dell PowerEdge R710 servers, together costing at most GBP 20,000
- the limiting factor turned out to be the read speed of the disks on which the data sit at the start of ingest, so they needed 130 parallel disks (GBP 50 thousand)
- writing to tape is slow too, so they needed 8 parallel tape writes (GBP 30 thousand)
- storage costs GBP 100 per TB
- 7.3 PB per year
- conclusion: writing and reading are slow, tools such as JHOVE and PRONOM are fast enough, and storage costs also proved to be high
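The figures above can be checked against each other with a few lines of arithmetic. The sketch below assumes decimal units (1 TB = 10^12 bytes) and a constant ingest rate over 24 hours; both are assumptions, not statements from the talk.

```python
TB = 10**12  # decimal terabyte (assumption)
MB = 10**6   # decimal megabyte (assumption)
SECONDS_PER_DAY = 86_400

daily_ingest_tb = 20            # from the notes
package_size_gb = 1             # "roughly 1 GB" in the notes
storage_cost_gbp_per_tb = 100   # from the notes

# Sustained rate needed if ingest is spread evenly over the whole day.
sustained_mb_per_s = daily_ingest_tb * TB / MB / SECONDS_PER_DAY

# Number of ~1 GB packages per day.
packages_per_day = daily_ingest_tb * 1000 // package_size_gb

# Yearly growth of the archive and the matching storage bill.
yearly_volume_pb = daily_ingest_tb * 365 / 1000
yearly_storage_cost_gbp = daily_ingest_tb * 365 * storage_cost_gbp_per_tb

print(f"sustained rate: {sustained_mb_per_s:.1f} MB/s")
print(f"packages per day: {packages_per_day}")
print(f"yearly volume: {yearly_volume_pb} PB")
print(f"yearly storage cost: GBP {yearly_storage_cost_gbp}")
```

Under these assumptions the 20 TB per day, 20 thousand packages per day and 7.3 PB per year quoted in the notes are mutually consistent.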

For the ingest of data from the FamilySearch project they had to guarantee a throughput of 20 TB of data per day while keeping adequate procedures for processing the data as required by OAIS and by the customer. The point of the project was to identify the bottlenecks of ingesting large amounts of data.

Processes such as generating or checking hashes, identifying formats and extracting technical metadata usually require a fast storage system at large volumes. In the FamilySearch project they want to ingest into SDB (content acquisition, content preparation, ingest, fixity check, content metadata integrity check, characterization, i.e. format identification and validation and extraction of technical metadata) at most 700 MB per second.

The question is therefore how to parallelize such a massive workflow efficiently while minimizing costs. According to their findings, parallelization makes it possible to work around performance problems of tools such as DROID and JHOVE; overall, contrary to their expectations, software performance was not a problem. The bigger problems are in the hardware, which has to be able to write fast enough.

In other words, the bottleneck was in the hardware and in moving data from place to place rather than in the performance of the digital preservation tools.
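The parallelization described above can be sketched for one of the named steps, the fixity check. This is a minimal illustration using only the Python standard library and is not how SDB is implemented; the function names `file_sha256` and `check_fixity`, the choice of SHA-256 and the worker count are assumptions.

```python
import hashlib
from concurrent.futures import ThreadPoolExecutor


def file_sha256(path, chunk_size=1024 * 1024):
    """Return the SHA-256 hex digest of a file, read in fixed-size chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def check_fixity(expected, workers=4):
    """Check many files in parallel.

    `expected` maps a file path to its recorded hex digest.
    Returns the list of paths whose current digest does not match.
    """
    paths = list(expected)
    with ThreadPoolExecutor(max_workers=workers) as pool:
        actual = list(pool.map(file_sha256, paths))
    return [path for path, digest in zip(paths, actual)
            if digest != expected[path]]
```

Because each worker spends most of its time waiting for the disk, adding workers helps only until the storage system is saturated, which matches the finding that the bottleneck was hardware rather than the tools.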

Benefit for NK

There is no need to worry about the performance of software such as DROID or JHOVE.

Ross King: Evolving domains, problems and solutions for LT DP
- information on SCAPE and similar projects
- programme: http://cordis.europa.eu/fp7/ict/telearn-digicult/report-research-digital-preservation_en.pdf by Stephan Strodl, Petar Petrov and Andreas Rauber (all Vienna University of Technology, Austria). A nice timeline of preservation projects, a white paper about the past of European DP
- funding spent on DP research is gradually growing; projects and funding alone will not solve anything
- ARCOMEM: archiving for web archives, a socially driven web preservation model
- social web analysis
- archive enrichment
- ENSURE: evaluation between cost and value, automation of the preservation cycle; testbeds: healthcare, clinical trials, financial services
- SCAPE
- preservation planning and action workflows and how to make them scalable
- building an infrastructure for scalable preservation actions
- development of a policy-based preservation planning tool with automatic preservation watch
- 3 testbeds: web archives, large-scale repositories, research data sets
- all the projects will produce prototype software
- digital lifecycle approach
- preservation planning plays a role in all these projects, together with virtualization
- a slide with DP trends of recent years
- Research on Digital Preservation within projects co-funded by the European Union in the ICT
- ENSURE
- SCAPE
- Wf4Ever, http://www.wf4ever-project.org/about
- TIMBUS: software is not enough; they focus on the context of organizations; LTP is not only about objects but also about services etc.
- TOTEM

Benefit for NK

Follow projects in the area of long-term preservation of digital data. The latest EU projects such as SCAPE will speed up the development of concrete tools for long-term preservation of digital data.

Erik Borglund: Record keeping in temporary command settings
- preservation of documentation of crisis situations arising from the work of the police etc.
- how to capture context: flipcharts, videos and minutes can be kept, but what about the context? Analogue documents are not a problem; the problem is with digital material and conversations
- the national archive should take care of this, but it accepts only paper documents or, for example, photos from the meeting place
- question: is archiving file material the same as archiving the course of a meeting in digital form?

Web archiving session

BnF
- 200 TB of web-archived data
- 15 million ARC files that have to be characterized and validated, which is time-consuming
- stored in a shared repository
- SPAR (the LTP system of the national library of France) has a capacity of 16 PB
- they use JHOVE2 for characterization and are building a module for ARC files
- they do not want to characterize and validate the content of the ARC files, only identify formats

- PREMIS in METS would be too long, so they will record only metadata at the level of the information package (AIP) that are the same for the whole package; that is, a property is stated once and then only the information about which files it applies to is added, instead of repeating the property for every file
- they created a special metadata format
- so they can ask the LTP system "give me all information packages that contain format XY" etc.; there is no need to index the metadata of the contents, which would take a long time
- they take the same approach for digitized books
- different DP policies and validation levels for different types of web archive data, complete harvests vs thematic harvests
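The package-level approach (state a property once, then list the files it applies to) can be sketched as follows. This only illustrates the idea and is not BnF's metadata format; the function names, package identifiers and file names are hypothetical.

```python
from collections import defaultdict


def summarise_package(file_formats):
    """Turn per-file format records into one package-level summary.

    `file_formats` maps a file name to its identified format. The result
    states each format once and lists the files that have it.
    """
    by_format = defaultdict(list)
    for file_name, fmt in file_formats.items():
        by_format[fmt].append(file_name)
    return {fmt: sorted(names) for fmt, names in by_format.items()}


def packages_containing(package_summaries, fmt):
    """Answer "which packages contain format X" from the summaries alone."""
    return sorted(pid for pid, summary in package_summaries.items()
                  if fmt in summary)


# Hypothetical packages and files, for illustration only.
summaries = {
    "aip-001": summarise_package({
        "page1.html": "text/html",
        "page2.html": "text/html",
        "logo.gif": "image/gif",
    }),
    "aip-002": summarise_package({"report.pdf": "application/pdf"}),
}
print(summaries["aip-001"])
print(packages_containing(summaries, "text/html"))
```

The query runs over one short summary per package rather than over every file record, which is why the contents themselves do not need to be indexed.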

NL NZ
- 2 harvests, 20 TB in total
- they are dealing with metadata: how much metadata is a lot and how much is too little
- library policy says that as much metadata as possible must be stored, but that would be a problem because of the size of the metadata
- for selective web harvesting they have a finished workflow in WCT; everything is catalogued

IA
- 16 billion URLs
- the oldest from 1996
- the increase is 3 TB per day, 1 PB per year

Euan Cochrane, Dirk von Suchodoletz: Replicating Installed Application and Information Environments onto Emulated or Virtualized Hardware

- capturing and preserving a complete environment on emulated hardware
- e.g. take the prime minister's desktop environment and store it in an archive
- problems with rendering
- computer forensics
- an option for preserving scientific data and records
- the whole thing is about replicating an HDD and running the environment it holds in a virtual environment
- solution: they pulled the HDDs out of several old PCs > identify the HW requirements (HDD analysis > automatic estimate of the requirements, since this information is part of every PC environment) > choose the emulation/virtualization software (a tool registry such as TOTEM from the KEEP project) > convert the HDD to a disk image suitable for emulation > try to boot the disk image on the emulated HW > add drivers
- problems with licences, personal data protection and authenticity (20 things change, colours etc.)
- QEMU, SPARC processor emulator

Klaus Rechert: Remote Emulation for Migration Services in a Distributed Preservation Framework

- using emulation as a tool for migration
- migration tools for certain formats are often not available
- we insert the digital object into the emulated environment (a virtual machine), where it is then visible inside the emulated system; we can open it in the original or another suitable application, save it in a different format, and store the result again in the virtual machine

Bram Lohman: Emulation as a Business Solution: the Emulation Framework (KEEP project)

- emulation framework: 7 emulators, 6 platforms (x86, Amiga etc.), 23 file formats
- a solution for managing emulation tools
- setup of emulation processes
- an environment that contains the emulators; if we load an application or a file into it, it should run as in its original environment
- the environment also includes a tool that, based on PRONOM, shows what format a file is and what environment is needed to run it; the environment can then be prepared directly and the file run in it
- it loads the SW image from the OPF application database, which is being built

Geoffrey Brown: Developing Virtual CD-ROM Collections: The Voyager Company Publications

- publications of one particular publisher on CD, interactive applications for Mac; of the 200 published, only about 50 are available today
- emulation on today's systems
- HDD snapshot directly in the emulator, i.e. it takes one click and is very fast
- SheepShaver emulator

Evaluation of the Danish large migration project

- before 1998 they had no formats laid down by law
- between 2005 and 2008 they introduced standards
- the evaluation concerns the adopted standards and the migration to them in the national archive
- the evaluation was done for the body that funded the work
- between 2005 and 2008 they spent 30 person-years on migration, with 10 to 15 people working on it; 190 thousand USD was invested, total costs 26 million USD

In reality they did not migrate much data: about 1777 GB.

- different parts of the archive: tapes, population data, data on CD-R, registries and data filled in electronically
- they could not read all the files, especially those on tape
- 5 different types of tape
- some had to be rescued at great expense

Total cost of producing the preservation standard: 10 man-years, 12 thousand USD, including a manual and implementation recommendations.

Pilot: project planning and management, and verifying the information packages.

The aim of the pilot was to obtain a better budget and plan for the project.

Some problematic data in old formats, such as old databases, need clever people who do repetitive work that takes a long time; they needed good knowledge management to make this efficient.

Migration method: they wrote the requirements for the tool and a description of how manual migration should be done.

Data preparation (restructuring the data and registering the metadata of the IPs) and preparing the documentation of the migrated IPs.

Software development: in-house development.

They needed 50 person-years per 1

Conclusions

Migration of standard data is cheaper, migration from some standard tapes is cheaper, etc.

Most of the costs, 80 %, went on non-standardized data, in building the migration software. Developing a tool for migrating heterogeneous or non-standard data is the most expensive part.

What they learned: the old data had not been analysed sufficiently.

Their project management was loose and they lost money.

Knowledge management: a good description of the old data and of all their types, generations, locations etc. does not exist at our institution, and this will cause us difficulties; migration of old data in NK will be problematic.

Angela Dappert: a robust migration workflow for offline media
- what is an archival object (nice slide): a CD is not an archival object; for them it is a hand-held carrier. A bit-stable object is better, since it can have a backup etc.; on top of it comes the archival object, which has further metadata (logical preservation)
- a CD is not searchable, cannot easily be replicated, has a large manual overhead, and its rendering technology becomes obsolete very quickly
- Endangered Archives project: optical discs, CD-R, external HDs, tapes, 67 terabytes in total
- the offline hand-held carriers in the Endangered Archives project varied a great deal; they held data with DRM, under copyright, and with a number of similar problems
- options they chose between for each data source:
- a disk image: one file that contains everything on the carrier
- or extraction of only some files
- how important is the carrier itself? We need some information about it; it may hold traces of deleted data that we might want to keep. They made disk images of all sorts of things: hybrid DVDs, audio discs that also held data, etc.
- which disk image format should they use? Not a single disk image format for all data; each special kind of disc gets its own image format
- they used disk copying robots; sometimes large-scale disk copying robots could not be used, because they are good at producing CDs but not at ripping data from them

- they built their own application with disk stacks and some smaller robots; they tried LIFO and FIFO and in the end used FIFO, because LIFO had problems with the CD lifter

At the KB they thought through a fairly complex workflow for how to describe all this etc.

Each robot had its own PC.

They had problems with a number of things; see the presentation.

They did not find good software for managing the images, only command-line tools, and non-technical staff would make a lot of mistakes with those.

It takes a lot of people before the content gets online.

They have to be well trained and flexible, but also able to do tedious jobs: systematic, patient.

NOTE: important for NK, where converting data from discs will also have to be dealt with and has in part already been dealt with.

KEEP project, Antonio Ciuffreda: towards an integrated migration environment

- disk transfer tool: converts a disk to an image file. The image also carries further metadata about the file system, the MD5 of the file, etc.
- KEEP is producing a Transfer Tool Framework
- magnetic media: disk transfer tools for floppy disks, commercial and open source
- Disk2FDI: a commercial DOS tool that makes a very accurate image of a floppy disk; it takes 1 hour and the result is very large, ten times larger than the floppy disk itself. About 2260 disks were tested; they tested emulation
- Catweasel: a commercial tool, a PCI card; it runs on Linux and Windows XP and has a GUI. High error rate but faster; the quality of the image file was low
- NibTools: a free tool, G64 and D64, covers only the C64; DOS, Windows, Linux, but command line only. It needs a Commodore disk drive and special cables. They tested a few disks and about half of them then did not work in the emulator
- optical media: they used 5 transfer tools, all tested on the same CDs, DVDs and games

1. Alcohol 120%: commercial, can bypass DRM etc., supports Windows systems. Of 13 discs, 12 worked; 2 MB per second
2. Daemon Tools: commercial, several types of image file (ISO, MDS, MDF), supports Windows; three of 13 did not work
3. CloneCD: commercial, uses IMG or ISO, bypasses SafeDisc 3 protection, supports Windows, has a GUI; 11 of 13 were OK
4. BlindWrite: commercial, supports DVD, Blu-ray, WM, Xbox and other special discs. Generates ISO and some proprietary ISO image formats; one disc did not work; transfer speed
5. ImgBurn: does not read subchannel information into the image file (so a film cannot be scrubbed etc.); it is open source; generates DVD, BIN, CUE, IMG; Windows and Linux; 4 did not work; it is fast

Conclusions

Magnetic media: the performance of commercial and non-commercial tools does not differ; Disk2FDI is accurate but very slow. KEEP will use NibTools.

Optical media: copy protection has to be kept in mind; the tools perform similarly; there will always be errors in the images, between 30 and 10 per cent; BlindWrite can handle game discs, Xbox etc. KEEP will use ImgBurn because it is open source.

For complex images BlindWrite is better.
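What a disk transfer tool does at its simplest (copy the carrier to one image file and record a checksum alongside it) can be sketched in a few lines. This is not the KEEP Transfer Tool Framework or any of the tools listed above; the function `make_image`, the JSON sidecar layout and its field names are assumptions, and the sketch does not handle copy protection, read errors or subchannel data.

```python
import hashlib
import json
from pathlib import Path


def make_image(source_path, image_path, chunk_size=1024 * 1024):
    """Copy `source_path` byte for byte to `image_path` and record its MD5.

    `source_path` can be an ordinary file or, on systems that expose
    drives as files, a device node. A small JSON record with the size
    and checksum is written next to the image.
    """
    digest = hashlib.md5()
    size = 0
    with open(source_path, "rb") as src, open(image_path, "wb") as dst:
        for chunk in iter(lambda: src.read(chunk_size), b""):
            dst.write(chunk)
            digest.update(chunk)
            size += len(chunk)
    record = {
        "image": Path(image_path).name,
        "size_bytes": size,
        "md5": digest.hexdigest(),
    }
    Path(str(image_path) + ".json").write_text(json.dumps(record, indent=2))
    return record
```

Calling `make_image("disc.raw", "disc.img")` on a readable source would produce `disc.img` and `disc.img.json`; the recorded MD5 can later be recomputed to confirm that the image has not changed.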

Benefit for NK

Consider whether NK should really run a project to migrate the content of CDs and DVDs to online media. These papers present concrete experience with robotic processing and show what problems the authors had in designing the workflow, choosing the type of ISO image, etc.

BnF: web archiving

They have three layers:

Harvest definition: collection

Harvest instance: crawling metadata

ARC files

A collection would be, for example, the selective harvest "Elections 2000"; below it are the individual harvest instances and then the ARC files. They collect logs, config and reports. These are also stored in ARC, as special metadata ARC files for each crawling instance.

PREMIS

Object, agent, event

Objects: 1. ARC files and metadata ARC files, 2. harvest instances

Harvest event in PREMIS: an event of creation of content files

Events: reports as extensions of the event, host report and harvest report

Agents: administrator, software, institutions, organizations that perform the harvest
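The three layers and the PREMIS entities above can be pictured as a small data model. The sketch below is only an illustration of the relationships described in the notes, not BnF's schema and not a PREMIS serialization; all class names, field names and sample values are assumptions.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Agent:
    """Who or what performed the harvest (person, software, organization)."""
    name: str
    agent_type: str


@dataclass
class Event:
    """A harvest event, with its reports attached as extensions."""
    event_type: str
    agents: List[Agent] = field(default_factory=list)
    reports: List[str] = field(default_factory=list)


@dataclass
class HarvestInstance:
    """One crawl: content ARC files plus metadata ARC files (logs, config)."""
    identifier: str
    creation_event: Event
    arc_files: List[str] = field(default_factory=list)
    metadata_arcs: List[str] = field(default_factory=list)


@dataclass
class HarvestDefinition:
    """The collection level, e.g. one selective harvest."""
    name: str
    instances: List[HarvestInstance] = field(default_factory=list)

    def all_arc_files(self):
        return [arc for inst in self.instances for arc in inst.arc_files]


# Hypothetical sample data.
crawler = Agent("crawler software", "software")
event = Event("creation of content files", agents=[crawler],
              reports=["host report", "harvest report"])
instance = HarvestInstance("instance-1", event,
                           arc_files=["a.arc", "b.arc"],
                           metadata_arcs=["a-meta.arc"])
collection = HarvestDefinition("Elections 2000", instances=[instance])
print(collection.all_arc_files())
```

Keeping logs and reports in metadata ARC files attached to the instance, rather than to each content file, mirrors the package-level approach described in the web archiving session.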

containerMD

http://bibnum.bnf.fr/containerMD-v1/documentation/containerMD-v1.html

Separate metadata for material from the web archive

http://bibnum.bnf.fr/containerMD-v1

A different SLA for different types of material and for data from different harvests; shared repository; they expect different benefits; skills for different formats do not have to be duplicated within the institution.

Next year there should also be a JHOVE2 module for WARC.

Memento: a meta search engine

Benefit for NK

Their model of web archiving could be used in NK.

Cost models: the Danish national library + TU Wien
- Stephan Strodl, TU Wien: they have their own cost model, but only small scale; automated preservation action cost, it seems
- the Danish national library built its own model, which is meant to be universal and usable anywhere
- they measured the cost of submission according to the PAIMAS standard
- when calculating costs they use OAIS and PAIMAS: they map activities onto these models and then estimate the process costs accordingly
- costmodelfordigitalpreservation.dk

Benefit for NK

For project 0136, which dealt with options for estimating the costs of long-term storage.

Meet RODA, a Full-Fledged Digital Repository for Long-Term Preservation

- originally a project of the Portuguese national archive, which we have been following for several years. RODA is now supported by an independent company, which also partly continues its development. So far RODA supports only the archival metadata format (EAD), but further development should also cover library formats
- RODA is now part of the SCAPE project, where the system can be developed further and scaled for use in mass production
- http://redmine.keep.pt/projects/roda-public

Benefit for NK

Follow further development, possibly also for the INCAD + KNAV projects; for developing LTP for smaller institutions this could be an interesting alternative in the future.

Report from a business trip abroad

Name and surname of the participant: Mgr. Andrea Fojtů (AF)

Workplace according to the organizational structure: 15 Department of Long-Term Preservation of Digital Data (ODODD)
Workplace, assignment: 152 Department of Digital Repository Content Management
Purpose of the trip: participation in the conference Aligning National Approaches to Digital Preservation
Place (city): Tallinn
Place (country): Estonia
Date (from-to): 22-26 May 2011
Detailed schedule:

Monday 23 May: registration for the conference, guided tour of the National Library; Keynote Address by Laura Campbell, Library of Congress (USA); Panel 1: Technical Alignment; Panel 2: Organizational Alignment

Tuesday 24 May: Keynote Address by Gunnar Sahlin, National Library of Sweden; Panel 3: Standards Alignment; Panel 4: Legal Alignment; Breakout Sessions for panels 3 & 4

Wednesday 25 May: Panel 5: Education Alignment; Panel 6: Economic Alignment; Breakout Sessions for panels 5 & 6; Synthesis; Closing remarks

Fellow travellers from NK: PhDr. Bohdana Stoklasová (BS), Ing. Tomáš Svoboda (TS)

Funding: IOP NDK
Goals of the trip: attending a conference with international participation; gaining contacts in the areas of long-term preservation of digital documents and electronic legal deposit; a more detailed insight into long-term preservation of digital documents, (mainly) in national libraries

Fulfilment of the goals (specifically): The conclusions of the conference Aligning National Approaches to Digital Preservation largely repeat the conclusions of the workshop The Future of the Past: Shaping new visions for EU research in digital preservation (report available at http://cordis.europa.eu/fp7/ict/telearn-digicult/future-of-the-past_en.pdf), for example regarding the missing economic model for the commercial sector, which would treat long-term preservation as an inseparable part of all its processes. Contact was established with staff of the national libraries of Estonia and Finland (staff working on web archiving and on long-term preservation in general).

Programme and further details: The main aim of the conference was to align national approaches to long-term preservation of digital documents across all areas, from technical, organizational and educational to standardization, economic and financial.

Materials brought back: conference programme, leaflets of the exhibiting companies (Tessella, Equella, Guardtime) + further materials, notes

Date of report submission: 8 June 2011
Signature of the submitter:

Signature of the superior:

Posted on the intranet:
Received by the international department:

Annex to this report: Conference notes in English

Annex: Conference notes in English

Exploring What We Can Do Together: Strategic Alliance for International Collaboration, Laura Campbell

- 185 digital preservation partners in more than 25 countries (education, research, LAM)
- strategic goals: National Content Stewardship Network (national digital collection, technical architecture, public policy, outreach)
- NDIIPP Content Domain Map: a mind map of geospatial, audiovisual, image & text and web content
- then & now: cognitive surplus vs digital libraries / digital preservation
- solution framework: actively working together, special interest groups, establishing a common index, international digital collection (freely available)

PANEL 1: Technical Alignment (The role of testing), Prof. Dr. Michael Seadle (Panel Head)

- to collaborate on requiring and implementing rigorous and independent tests

DNB Contribution to the Tallinn Alignment, Sabine Schrimpf

- key theme is infrastructure
- a network of hardware and software that permits the operation of application software
- the question of interoperability is crucial (standards, technical specifications)
- SW elements
- the components of the DP infrastructure were compared to the pallets at railroads
  o source: PARSE.Insight Roadmap 2020
- persistent ID resolvers, certification process
- kopal (ingest: koLibRI)
- nestor (German Network of Expertise)
- DP4Lib: Digital Preservation as a Service, reducing dependency between components
  o redundant storage at different locations
  o koLibRI modules
- LuKII: set up as an economical LOCKSS network in Germany
- SHAMAN
- APARSEN wants to bring coherence and cohesion to digital preservation research
  o trends in DP research projects
  o modular DP systems
  o distributed as SOA
  o elimination of technology dependencies

EDINA: The UK LOCKSS Alliance, Adam Rusbridge

- EDINA offers underlying technical support & coordination
- threats to digital stewardship
  o failure (media, HW, SW, network, format obsolescence, natural disaster, economic/organizational failure)
  o attack (insider/outsider)
  o operator error
- source: Requirement to Digital Preservation
- the projects PEPRS and PECAN help identify coverage and requirements for DP

Public testing, Michael Seadle

- traditional physical archiving relies heavily on trusted institutions
- distrust, not trust, needs to be the basis of digital archiving; testing therefore plays a key role
- goals of testing: demonstrate functionality, reveal weaknesses, provide data for planning improvements
- key issues for testing: integrity, authenticity (can the origin or genuineness be shown), usability (can migration/emulation be demonstrated), access, financial integrity
- DrWho (drwho1com)
- bit stream testing is the most important; authenticity and usability may be impaired
  o the type of storage media, the number of copies + the frequency of checking and replacement get us to the relevant results
  o no reliable metrics exist, however (what is an acceptable loss, etc.)
- without well-documented, peer-reviewed, publicly available test results, librarians are buying archiving systems on faith

Presentation without a title Andy Rauber

- evaluation vs. testing vs. benchmarking
- DP today: evaluation rather than testing, and far from benchmarking (a few tests, but nowhere near a definition of benchmarks)
  o existing evaluations are not repeatable
  o focus on the simple things
  o building the frameworks before having clear test scenarios
- necessary to move towards comparative benchmarking
- what is needed: a commitment to a culture of benchmarking and comparative evaluation; an understanding of what we want to benchmark; benchmark data + ground truth; measurement scales and measures that remain constant; knowledge bases to collect these

Organizing digital preservation on an international level Michelle Gallinger (NDIIPP)

- focus on a national DP agenda
- community driven, action oriented (National Digital Stewardship Alliance)
  o present a distributed national digital collection for the benefit of citizens

The European Research Arena David Giaretta

- technologies: GEANT, EGEE/EGI
- EU research projects: TIMBUS, BLOG4EVER, SCAPE, ENSURE, APARSEN, ARCOMEM, WF4EVER
  o SCIDIP-ES (2011-2014)
- Alliance for Permanent Access (APA) formed as a legal entity 3 years ago
  o opportunities for networking
- ISO 16363: Audit and Certification of Trustworthy Digital Repositories
- ISO 16919: Requirements for Bodies Providing Audit and Certification

Observation from the MetaArchive Cooperative program Martin Halbert

- distributed DP programs differ from other programs
  o replication of content, distribution of these replicated copies to distinct geographical locations, and a network organization to connect these replicated copies
- MetaArchive established in 2003, funded by NDIIPP
  o seeks to foster broader awareness of digital preservation issues
- IIPC members are all institutions that focus on WA
  o 39 members: national and university libraries + other organizations (Internet Archive)
  o ISO standard WARC format for web archives + Heritrix and NutchWAX
  o growing membership (Africa & South America)

DAY 2

Keynote address: International and National Collaboration in the Digital Age, Gunnar Sahlin

- 2012: a new law for e-deposit
- Samsök (search together) in 2005 (upgrade, new system)
- Swepub and long-term preservation
- consortium of the Swedish research libraries for licensing e-journals and databases (ICOLC)
- Open Access and e-publishing (all universities have their repositories for e-pubs)
- NL as aggregator for Europeana: TEL, Apres, Athena, EU screen
  o common system for the preservation of digital materials
  o common search portal for materials from the Swedish National Library and the Swedish National Archive

Raivo Ruusalepp

- standards: RAC, DSA, CIDOC (CRM), PAIMAS, ISAD(G), DDI
- use of information security standards for digital preservation
- information security: administrative and technical (physical = data security vs. IT = communication)
- company implemented security measurement with typical cyber crime scenarios
- survey of security
  o provision for information security in national legislation and development plans (12 of the respondents use the ISO 27000 series, only 2 had formal audits, the rest are looking into it; ½ of the respondents do not use standards or formal measurements to control information security)
  o IT & disaster plan 65 (data recovery from the off-site location tested 0)
- alignment
  o better use of community standards for information security and preservation
  o agreement on security requirements

Standards based approach to preservation planning, Matthew Woollard

- ISO 27001: very expensive implementation, 100 000 pounds
- Basic: Data Seal of Approval Guidelines (helps you understand your business better)
- Audit and Certification of Trustworthy Digital Repositories
- Memorandum of Understanding to Create a European Framework
- ISO 16363 external, DSA or ISO 16363 self-audit

Best Practices amp Standards Bram van der Werf

- self-assessment: trust
- audits/certification: trust
- ISO 30300 (draft): Record Management

PANEL 4 Legal Alignment

Legal deposit & Web Archiving, Adrienne Muir

- legal deposit provisions: purpose, scope, deposit mechanisms, roles & responsibilities & liability, access provisions, sanctions
- implementation: definitions, scope (offline/online, freely available/pay wall), technology neutral, incremental
- legal deposit vs. voluntary (interim/hybrid approach, model agreements and licenses, flexibility)
- other legal issues: intellectual property rights, preservation, access, unlawful material, privacy/data protection
- voluntary approaches have disadvantages but may be necessary and can be useful

Breakout Session

- standards bring along alignment by themselves, but only if you use them to the full, not halfway
- depends on the community (users): enforceable or voluntary compliance (standards)
- next steps for standard alignment:
  o corpora as benchmarks
  o export/import completeness
  o educational standards
  o validation tools
  o accredited training courses to accredit auditors
  o framework standards

DAY 3

PANEL 5: Educational Alignment

- key elements of the DCC Curation Lifecycle Model
- Framework for the Education Alignment (USA: grad programs for digital preservation, new models for grad programs, internship programs (diverse knowledge), workshops)
  o sharing tools
  o national programs
- issues related to digital preservation:
  o nature of costs and business models
  o strategies for selection & appraisal
  o ground roles and responsibilities
  o effectiveness of and demand for services
- focus of the panel: factors influencing the actual sustainability of a digital archive
- 2 considerations: collaboration + user demand
- challenges + gaps: span national boundaries, public + private funding, education, exportation, DP certification, competition, funding gaps, policy, selection criteria, roles & responsibilities, standards
- Magazzini Digitali: e-legal deposit in Italy
- PADI: a failed project (discontinued in 2010)

Presentation without a title Neil Grindley

- how much does it cost to manage information?
- what institutional financial strategies are required to facilitate effective preservation?
- what general economic frameworks are required to enable information to persist and be accessible?

- JISC 2010 Infrastructure for Education and Research Programme
- archival storage and preservation activities constitute a very small proportion of the overall costs: 15 %, compared with 31 % for access and 55 % for outreach, acquisition and ingest
  o approx. 333 Euros for a set of 1000 records
- KRDS2, p. 83: future tool development supporting automation of ingest

Sustainable Preservation in North America ADPNet amp Friends Aaron Trehub

- solution: distributed digital preservation (in at least 3 copies vs. LOCKSS 6 copies)
- DDP + LOCKSS = PLN; open SW developed at Stanford
- MetaArchive, COPPUL (Canada), ADPNet (www.adpn.org)


CO-FINANCED FROM THE EU STRUCTURAL FUNDS (EUROPEAN REGIONAL DEVELOPMENT FUND) THROUGH THE IOP

Further detailed information: SUMMARY AND BENEFIT FOR THE NDK PROJECT

- a noticeable rise of emulation as an approach to long-term preservation (in previous years it was migration); the two approaches seem likely to complement each other
- a shift towards the preservation of complex data, databases etc.; the NK (National Library of the Czech Republic) does not address this yet
- many contributions are applicable to the NK and the NDK (web archiving and preservation at the national library of France, audits, emulation, information on the SDB system (Tessella) and on the RODA system, certification, see Rouchon, etc.)
- information on problems and solutions when using tools such as JHOVE, PRONOM and others in a real environment
- 2 contributions on backing up optical discs, a current problem at the NK as well; ideally follow the procedures described in the proceedings
- a clear need for international cooperation and for adherence to standards so that such cooperation is possible
- for details see below

Support for project publicity: N/A

Related materials

Material / Storage location

conference proceedings / SPS, folder with SC (business trip) reports

Report submission date: 15. 11. 2011

Signature of the report's submitter:

Date: / Signature:

Supervisor's signature: 15. 11. 2011

Posted on the intranet:


Received by the international department:

Seamus Ross: digital curation and preservation
- preserving data sets: use of statistical databases and research data vs. preservation of textual information
- personal data: it is no longer just photos in a box (Flickr etc.)
- banks: a lot of personal data with future use for historians; it must be preserved, and institutions already do so
- see mckinsey.com, big data, full report (pdf)
- why do DP at all (slide 16): future generations expect it; for historians and scientists, so that they have some sources; a record of the present for the future; an information ecosystem to enable storytelling
- the emphasis moves from the preservation of textual information to the preservation of complex databases

A capability model for DP, Ch. Becker et al.
- research within the SHAMAN and SCAPE projects
- SoS: systems of systems
- 3 kinds of systems (UML):
- a DPS where DP is a functional requirement
- SoS: a business system, a system within a system; the data then flow into a DPS
- a DPS where DP is not a functional requirement but the system does it anyway (DP-ready system): a business system with DP functionality
- but how to get DP into enterprise systems? A model for implementing DP in any system, developed within the SHAMAN project: a capability-based reference architecture
- governance, business and technical (operation) capabilities: a basis for decisions and for assessing the current state
- capability maturity model (CMM): processes for assessment and improvement, as in SW development

Olivier Rouchon: certification and quality at CINES
- they store theses, digitized items, multimedia, documents and scientific data sets
- a data centre for the whole of France
- they have format experts (XML); 11 people
- 15 TB of data
- certification: under national law CINES is the national centre for DP of theses; they have a department, people and money for it; for the procedure and preparations see below
- preparation for certification: testing DRAMBORA, DSA, TRAC, ISO 16363 and ISO 16919
- step: 2009 DRAMBORA audit; 2 risk reviews per year on how the risks are being dealt with
- step: formalization of business processes, 14 processes according to ISO 9001
- management, operational and support processes (presented at iPRES 2010)
- 2009: external pre-audit, 2 people, 19 man-days, based on all available standards, carried out through documentation review and interviews
- 2010: SIAF audit, 4 months; the National Archives of France do this for every archive that stores public data; they audit every 3 years; the report had 800 pages


- 2010: Data Seal of Approval, part of the EU framework for audit and certification of trusted repositories (MoU between the three certification activities)
- 2011: within the APARSEN project they also did ISO 16363; they went through that audit together with DANS and UKDA
- first internal, in January and April 2011 (60 man-days), then external, with 12 experts (KB, BL, NASA etc.) in June 2011 (3 days)

MoU: DSA > ISO 16363 as a second step (internal) > ISO 16363 extended
The audit gets faster the more of them you do, i.e. if it is done regularly it is not that time-consuming.

The National Library of New Zealand passed TRAC certification in autumn 2011.

Andreas Rauber: the impact of preservation actions on a repository
- what happens to the repository itself
- repository simulation: RepoSim
- built for analysing and testing migration: what happens when files grow, what happens when the repository holds more types of formats, etc.
- RepoSim: a flexible simulator, irregular patterns
- so far an internal version (Hibernate, Java, MySQL)
- you can specify which formats the repository accepts, their description, ingest, the settings of hypothetical tools (mainly for migration), and the rules for preservation actions (migration into which format, which versions, which files, how many times; rules + filters)
- a virtual migration can be run, producing graphs that show how long it will take, etc.
- what to migrate, how, into what and over what period is run virtually, and we see the result
- good for IT and HW planning
- good for planning different scenarios, comparing them with the expected development, and planning HW development and investments
- still to be added: the option to enter deletion policies, reports, etc.
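
The kind of question a "virtual migration" answers can be illustrated with a toy calculation. The sketch below is not RepoSim (which the talk described only as an internal Java tool); the class names, the rule format and all throughput figures are assumptions invented for illustration.

```python
from dataclasses import dataclass


@dataclass
class FileGroup:
    """A set of repository files sharing one format (hypothetical model)."""
    fmt: str
    count: int
    avg_size_mb: float


@dataclass
class MigrationRule:
    """Migrate every file of source_fmt to target_fmt with one tool."""
    source_fmt: str
    target_fmt: str
    tool_mb_per_s: float  # assumed tool throughput, not a measured value


def estimate_hours(groups, rules, parallel_workers=1):
    """Return estimated wall-clock hours per rule for a virtual migration."""
    result = {}
    for rule in rules:
        # Total volume of all files the rule applies to.
        total_mb = sum(g.count * g.avg_size_mb
                       for g in groups if g.fmt == rule.source_fmt)
        seconds = total_mb / (rule.tool_mb_per_s * parallel_workers)
        result[(rule.source_fmt, rule.target_fmt)] = seconds / 3600
    return result


# Made-up repository content and rule, purely for illustration.
repo = [FileGroup("TIFF", 100_000, 36.0), FileGroup("PDF", 50_000, 2.0)]
rules = [MigrationRule("TIFF", "JP2", tool_mb_per_s=10.0)]

for (src, dst), hours in estimate_hours(repo, rules, parallel_workers=4).items():
    print(f"{src} -> {dst}: about {hours:.1f} h")
```

Changing the file counts, average sizes or the number of workers and re-running gives the "what if files grow" comparison mentioned in the talk, at a far cruder level than a real simulator.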

José Barateiro: Risk assessment in DP of e-science data and processes
- DP as risk management
- ISO 31000: definition of risk management
- similar to DRAMBORA
- there are many standards for risk management
- the ISO 31000 methodology elaborated into individual steps
- TIMBUS project (http://timbusproject.net); one of the partners is SAP (Germany)

mad talks
- open source SW for LTP: RODA is back; it is being developed within the SCAPE project, with new functions, development plans and an emerging user community
- 4 posters on emulation: emulation within KEEP, emulation for reading rooms in libraries, OPF ecosystem registry
- TOTEM: a metadata standard for describing the technical environment for emulation


ANDS, Ross Wilkinson
- data centres in Australia, at least 3, for different areas of life
- ANDS has existed for almost 3 years; funded by the Australian government
- a huge amount of data will never be used or read by a human, only by automatic processing and mining
- research data must be stored and protected, because it may no longer be possible to create them again, so that they can be reused, new conclusions can be drawn from them, and scientists have them at their disposal
- it has to be done in cooperation; it cannot be done by a single institution alone
- who handles the storage of scientific data in the Czech Republic? The Academy of Sciences? CESNET?
- similar data centres also exist in Great Britain

Rob Sharpe: Considerations for High Throughput Digital Preservation
A presentation by the company Tessella on their performance testing of ingest into SDB at FamilySearch.

- SDB has been developed since 2002, when the first customer was the National Archives UK
- new customer: the UK Parliament
- test with FamilySearch
- 20 TB ingest per day; scanned materials; workflow with antivirus, characterization (PRONOM, JHOVE), etc.
- 1 package is roughly 1 GB; 20,000 packages per day
- 2 Dell PowerEdge R710 servers, total price at most 20,000 pounds
- the limiting factor turned out to be the read speed of the disks on which the data are stored at the start of ingest; they therefore needed 130 parallel disks (50,000 pounds)
- storing on tape is also slow; they therefore needed 8 parallel tape writes (30,000 pounds)
- storage costs 100 pounds per TB
- 7.3 PB per year
- conclusion: writing and reading are slow; tools such as JHOVE and PRONOM are fast enough; the high cost of storage also became apparent

For the ingest of data from the FamilySearch project they needed to ensure a throughput of 20 TB of data per day while keeping adequate procedures for processing the data according to the requirements of OAIS and of the client. The aim of the project was to identify the bottlenecks of ingesting a large volume of data.

Processes such as generating or checking hashes, identifying formats and extracting technical metadata usually require a fast storage system at large volumes. In the FamilySearch project they want to ingest into SDB (content acquisition, content preparation, ingest: fixity check, content/metadata integrity check, characterization, i.e. format identification and validation and extraction of technical metadata) at most 700 MB per second.

The question is how to parallelize such a massive workflow efficiently while minimizing costs. According to their findings, parallelization makes it possible to get around performance problems of tools such as DROID and JHOVE; contrary to their expectations, overall software performance was not a problem. The bigger problems are in the HW, which has to be able to write fast enough.

I.e. the bottleneck was in the HW and in moving data from place to place rather than in the performance of the digital preservation tools.
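
The figures quoted in the talk can be cross-checked with simple arithmetic. The sketch below assumes decimal units (1 TB = 10^12 bytes), which the notes do not state; under that assumption 20 TB per day corresponds to the 20,000 packages of 1 GB and to 7.3 PB per year, and the yearly storage cost follows from the quoted 100 pounds per TB.

```python
TB = 10**12  # decimal terabyte (assumption; the notes do not specify)
GB = 10**9
MB = 10**6
PB = 10**15

SECONDS_PER_DAY = 24 * 60 * 60


def ingest_figures(daily_tb=20, package_gb=1, pounds_per_tb=100):
    """Derive daily, per-second and yearly figures from the daily ingest volume."""
    daily_bytes = daily_tb * TB
    yearly_bytes = daily_bytes * 365
    return {
        "packages_per_day": daily_bytes // (package_gb * GB),
        "sustained_mb_per_s": daily_bytes / MB / SECONDS_PER_DAY,
        "yearly_pb": yearly_bytes / PB,
        "yearly_storage_pounds": yearly_bytes // TB * pounds_per_tb,
    }


for name, value in ingest_figures().items():
    print(f"{name}: {value}")
```

The sustained average comes out at roughly 231 MB per second, well below the 700 MB per second peak mentioned above, which is consistent with sizing the disks and tape drives for peaks rather than for the daily average.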

Benefit for the NK


There is no need to worry about the performance of SW such as DROID or JHOVE.

Ross King: Evolving domains, problems and solutions for LT DP
- information on projects such as SCAPE
- programme report: http://cordis.europa.eu/fp7/ict/telearn-digicult/report-research-digital-preservation_en.pdf (Stephan Strodl, Petar Petrov, Andreas Rauber, Vienna University of Technology, Austria): a nice timeline of preservation projects, a whitepaper about the past of European DP
- funding spent on DP research is gradually growing; projects and funding alone will not solve anything
- ARCOMEM: web archiving, a socially driven web preservation model
  - social web analysis
  - archive enrichment
- ENSURE: evaluation between cost and value, automation of the preservation cycle; testbeds: healthcare, clinical trials, financial services
- SCAPE
  - preservation planning and action workflows: how to make them scalable
  - creating an infrastructure for scalable preservation actions
  - development of a policy-based preservation planning tool with automatic preservation watch
  - 3 testbeds: WA, large-scale repositories, research data sets
- all the projects will produce prototype SW
- digital lifecycle approach
- preservation planning plays a role in all of these projects, together with virtualization
- a slide with trends in DP over recent years
- Research on Digital Preservation within projects co-funded by the European Union in the ICT
  - Ensure
  - Scape
  - Wf4Ever (http://www.wf4ever-project.org/about)
  - Timbus: SW is not enough; it concentrates on the context of organizations; LTP is not only about objects but also about services etc.
  - Totem

Benefit for the NK

Follow projects in the area of long-term preservation of digital data. The latest EU projects such as SCAPE will speed up the development of concrete tools for long-term preservation of digital data.

Record keeping in temporary command settings, Erik Borglund
- preserving documentation of crisis situations arising from the activities of the police etc.
- how to capture the context: flipcharts, videos and notes can be kept; the context of analogue documents is not a problem, the problem is with digital items and conversations
- the national archive should take care of this, but it only accepts paper documents or, for example, photos from the scene
- the question is whether archiving the file material is the same as archiving the course of the proceedings in digital form


webarchiving session

BnF
- 200 TB of web-archived data
- 15 million ARC files
- they have to characterize and validate them, which is time-consuming
- they store them in a shared repository
- SPAR (the LTP system of the French national library) has a capacity of 16 PB
- they use JHOVE2 for characterization and are building a module for ARC files
- they do not want to characterize and validate the content of the ARC files, only identify formats

- PREMIS in METS would be too long, so they will record only metadata at the level of the information package (AIP) that are the same for the whole package; that is, a property is expressed once and then only the information about which files it applies to is added, instead of repeating that information for every file
- they created a special metadata format
- i.e. they can ask the LTP system "give me all information packages that contain format XY" etc.; there is no need to index the metadata of the contents, which would take a long time; they take the same approach for digitized books
- different DP policies and levels of validation for different types of WA data: complete harvests vs. thematic harvests
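
The idea of stating a property once per package and listing the files it applies to can be sketched as follows. This is only an illustration of the principle under assumed data structures; it is not the BnF metadata format, and the package identifiers, file names and format labels are made up.

```python
from collections import defaultdict


def summarize_formats(files):
    """Turn per-file (path, format) pairs into one entry per format.

    Each format is stated once, followed by the files it applies to,
    instead of repeating the format information for every file.
    """
    by_format = defaultdict(list)
    for path, fmt in files:
        by_format[fmt].append(path)
    return dict(by_format)


def packages_containing(packages, fmt):
    """Answer 'give me all information packages that contain format X'."""
    return sorted(pkg_id for pkg_id, summary in packages.items()
                  if fmt in summary)


# Hypothetical packages; identifiers and formats are illustrative only.
packages = {
    "aip-001": summarize_formats([("a.html", "HTML"), ("b.html", "HTML"),
                                  ("c.gif", "GIF")]),
    "aip-002": summarize_formats([("d.pdf", "PDF"), ("e.html", "HTML")]),
}

print(packages["aip-001"])
print(packages_containing(packages, "GIF"))
```

The query only touches the short package-level summaries, which is why no index over the full content metadata is needed.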

NL NZ
- 2 harvests, 20 TB in total
- they are dealing with metadata: how much metadata is a lot and how much is too little
- library policy says that as much metadata as possible must be stored; in terms of metadata size, however, that would be a problem
- for selective web harvest they have a finished workflow (WCT); everything is catalogued

IA
- 16 billion URLs
- the oldest from 1996
- growth is 3 TB per day, 1 PB per year

Euan Cochrane, Dirk von Suchodoletz: Replicating Installed Application and Information Environments onto Emulated or Virtualized Hardware

- capturing and preserving a complete environment on emulated HW
- e.g. take the desktop environment of the prime minister and store it in the archive
- problems with rendering
- computer forensics
- an option for preserving scientific data and records
- the whole thing is about how to replicate an HDD and run the environment on it in a virtual environment
- solution:
- they pulled the HDDs out of several old PCs > identify the HW requirements (analysis of the HDD > automatic estimate of the requirements; this is part of every PC environment) > select emulation/virtualization SW (a tool registry such as TOTEM from the KEEP project) > convert the HDD into a disk image suitable for emulation > try to boot the disk image on the emulated HW > add drivers
- problems with licences, personal data protection and authenticity (20 of the items change: colours etc.)
- QEMU, SPARC processor emulator


Klaus Rechert: Remote Emulation for Migration Services in a Distributed Preservation Framework

- use of emulation as a tool for migration
- tools for migrating certain formats are often not available
- the digital object is placed into an emulated environment (a virtual machine); we then see it in the environment of the emulated system and can open it in the original or another suitable application, save it in a different format and store it back into the virtual machine

Bram Lohman: Emulation as a Business Solution: the Emulation Framework (KEEP project)

- emulation framework: 7 emulators, 6 platforms (x86, Amiga etc.), 23 file formats
- a solution for managing emulation tools
- setup of emulation processes
- an environment that contains emulators; if we load an application or a file into it, it should run as in the original environment
- the environment also contains a tool that shows what format a file is and what environment is needed to run it, based on PRONOM; the environment can then be prepared straight away and the file run in it; it loads the SW image from the OPF application database, which is being built
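
The lookup described in the last point (identify the format, then pick an environment able to run it) can be sketched as a two-step table lookup. This is not the KEEP Emulation Framework API; the identifiers below only imitate the shape of PRONOM PUIDs and are placeholders, and the emulator and image names are hypothetical.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Environment:
    platform: str
    emulator: str        # hypothetical emulator name
    software_image: str  # hypothetical SW image from an application database


# Placeholder registry: the keys are NOT real PRONOM entries.
ENVIRONMENTS = {
    "fmt/EXAMPLE-1": Environment("x86", "emulator-a", "dos-wordprocessor.img"),
    "fmt/EXAMPLE-2": Environment("Amiga", "emulator-b", "amiga-workbench.img"),
}

# Stand-in for a real identification tool such as DROID, which inspects
# file signatures; here the extension alone decides.
EXTENSION_TO_PUID = {".wpd": "fmt/EXAMPLE-1", ".iff": "fmt/EXAMPLE-2"}


def identify_puid(filename: str) -> Optional[str]:
    """Return a placeholder format identifier for the file, if known."""
    for ext, puid in EXTENSION_TO_PUID.items():
        if filename.lower().endswith(ext):
            return puid
    return None


def select_environment(filename: str) -> Optional[Environment]:
    """Pick the environment registered for the file's format, if any."""
    puid = identify_puid(filename)
    return ENVIRONMENTS.get(puid) if puid is not None else None


print(select_environment("LETTER.WPD"))
print(select_environment("unknown.xyz"))
```

A real framework would identify formats by signature rather than by extension and would choose among several candidate environments per format.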

Geoffrey Brown: Developing Virtual CD-ROM Collections: The Voyager Company Publications

- publications of one particular publisher on CD, interactive applications for Mac; of 200 published titles only about 50 are available today
- emulation on today's systems
- HDD snapshot directly in the emulator, i.e. it takes one click and is very fast
- SheepShaver emulator

Evaluation of the Danish large migration project

- before 1998 they had no formats set by law
- between 2005 and 2008 they introduced standards
- the evaluation concerns the standards that were set and the migration into them at the national archive
- the evaluation was done for the funder
- between 2005 and 2008 they spent 30 person-years on the migration, with 10-15 people working on it; 190 thousand USD invested, total costs 26 million USD
- it is not much data; what they actually migrated was about 1777 GB
- various parts of the archive: tapes, population data, data on CD-R, registries and fully electronic data
- they could not read all the files, especially on tapes
- 5 different types of tapes
- some had to be rescued at great expense


Total cost of producing the preservation standard: 10 man-years, 12 thousand USD, including a manual and implementation recommendations.

Pilot: project planning and management, and verification of the information packages.

The aim of the pilot was to obtain a better budget and project plan.

Some problematic data in old formats, such as old databases, need smart people who do repetitive work over a long time; they needed good knowledge management to make it efficient.

Migration method: they wrote the requirements for the tool and a description of how manual migration should be done.

Data preparation (restructuring the data and registering the metadata of the IPs) and preparation of the documentation of the migrated IPs.

Software development: in-house development.

They needed 50 person-years per 1

Conclusions

Migration of standard data is cheaper, migration from some standard tapes is cheaper, etc.

Most of the costs (80) fell on non-standardized data, in producing the migration software. Developing a tool for migrating heterogeneous or non-standard data is the most expensive part.

What they learned: they had not analysed the old data sufficiently.

Project management was loose; they lost money.

Knowledge management: a good description of old data and of all their types, generations, locations etc. does not exist at our institution and we will have difficulties with it; the migration of old data at the NK will be problematic.

Angela Dappert: robust migration workflow for offline media
- What is an archival object (nice slide): a CD is not an archival object; for them it is a hand-held carrier. Better is a bit-stable object, which can have a backup etc., leading to an archival object, which has additional metadata (logical preservation)
- A CD is not searchable, cannot be replicated easily, has a large manual overhead, and its rendering technology becomes obsolete very quickly
- Endangered Archives project: optical disks, CD-R, external HDs, tapes; 67 terabytes in total
- the offline hand-held carriers in the Endangered Archives project were very variable; they contained data with DRM, under copyright, and a number of such problems
- the options they chose between for each data source:
- disk image: one file that contains everything on the carrier
- or extraction of only some of the files
- How important is the carrier itself? We need some information about it; there may be traces of deleted data on it and we may want to keep them. They made disk images of everything possible: hybrid DVDs, audio discs that also held data, etc.
- Which disk image should they use? Not a single disk image format for all data; each type gets its own disk image format
- They did it with robots (disk copying robots); sometimes large-scale disk copying robots could not be used, as they are good at producing CDs but not at ripping data from CDs


- They built their own application with disc stacks and some smaller robots; the robots used LIFO or FIFO; in the end they used FIFO, because LIFO had problems with the CD lifter

At the KB they thought through a fairly complex workflow for how to describe it, etc.

Each robot had its own PC.

They had problems with a number of things, see the presentation.

They did not find good SW for managing the images, only command-line tools, and non-technical staff would make a lot of mistakes with those.

It takes a lot of people before the content gets online.

They have to be well trained and flexible, but also able to do tedious jobs: systematic, patient.

NOTE: important for the NK, where the transfer of data from discs will also have to be dealt with, and has already been dealt with.


KEEP project, Antonio Ciuffreda: towards an integrated migration environment

- Disk transfer tool: converts a disk into an image file. It also contains additional metadata about the file system, the MD5 of the file, etc.
- KEEP is producing a Transfer Tool Framework
- Magnetic media: disk transfer tools for floppy disks, commercial and open source
- Disk2FDI: commercial DOS tool; produces a very accurate floppy disk image, but it takes 1 hour and the result is very large, ten times larger than the floppy disk itself. Tested on about 2260 disks; they tested emulation
- Catweasel: commercial tool, a PCI card; runs on Linux and Win XP, has a GUI. High error rate but faster; the image file quality was low
- Nibtools: free tool; G64 and D64; covers only C64; DOS, Win, Linux, but command line only. Needs a Commodore disk drive and special cables. They tested a few disks; about half of them then did not work in the emulator
- Optical media: they used 5 transfer tools, with the same CDs, DVDs and games for all of them
1. Alcohol 120: commercial; can bypass DRM etc.; supports Win systems. 12 out of 13 worked; 2 MB per second
2. Daemon Tools: commercial; several types of image files (ISO, MDS, MDF); supports Win; three out of 13 did not work
3. CloneCD: commercial; uses IMG or ISO; bypasses SafeDisc 3 protection; supports Win; has a GUI; 11 out of 13 were OK
4. BlindWrite: commercial; supports DVD, Blu-ray, WM, Xbox and other special discs. Generates ISO and some proprietary formats; one ISO image did not work; download speed
5. ImgBurn: does not read subchannel information into the image file (a film cannot be seeked etc.); it is open source; generates DVD, BIN, CUE, IMG; Win and Linux; 4 did not work; it is fast

Conclusions

For magnetic media, commercial and non-commercial tools do not differ in performance; Disk2FDI is accurate but very slow. KEEP will use NibTools.

Optical media: think about copy protection; the tools have similar performance; there will always be errors in the images, between 30 and 10 percent; BlindWrite can handle game discs (Xbox etc.). KEEP will use ImgBurn because it is open source.

For complex images BlindWrite is better.
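
Whichever transfer tool produces the image, recording a checksum next to it is a small step that can be scripted. The sketch below is not the KEEP disk transfer tool; it only illustrates, under assumed file names and an assumed JSON sidecar layout, how an image file's size and MD5 could be stored alongside it.

```python
import hashlib
import json
import os
import tempfile


def describe_image(image_path, chunk_size=1024 * 1024):
    """Read the image in chunks and return its name, size and MD5 digest."""
    md5 = hashlib.md5()
    size = 0
    with open(image_path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            md5.update(chunk)
            size += len(chunk)
    return {"file": os.path.basename(image_path),
            "size_bytes": size,
            "md5": md5.hexdigest()}


def write_sidecar(image_path):
    """Write the description as '<image>.json' (hypothetical layout)."""
    sidecar_path = image_path + ".json"
    with open(sidecar_path, "w", encoding="utf-8") as fh:
        json.dump(describe_image(image_path), fh, indent=2)
    return sidecar_path


# Demonstration on a throwaway file standing in for a real disk image.
with tempfile.TemporaryDirectory() as tmp:
    demo_image = os.path.join(tmp, "floppy.img")
    with open(demo_image, "wb") as fh:
        fh.write(b"\x00" * 4096)
    print(describe_image(demo_image))
    print(os.path.basename(write_sidecar(demo_image)))
```

Reading in chunks keeps memory use constant, which matters for DVD-sized images; re-running `describe_image` later and comparing the digest with the sidecar gives a basic fixity check.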

Benefit for the NK

Consider whether it would be appropriate for the NK to actually run a project to migrate the content of CDs and DVDs to online media. The talks present concrete experience with robotic processing and show what problems the teams had with devising the workflow, choosing the type of ISO image, etc.

BnF: web archiving
They have three layers:

Harvest definition: collection

Harvest instance: crawling metadata


ARC files

A collection will be, for example, the selective "elections 2000" collection, then the individual harvest instances, and then the ARC files. They collect logs, config and reports. These are also stored in ARC: a special metadata ARC for each crawl instance.

PREMIS

Object, agent, event

Objects:
1. ARC files and metadata ARCs
2. harvest instances

Harvest event: in PREMIS, the event is the creation of content files

Events: reports as extensions of the event (host report and harvest report)

Agents: administrator, SW, institutions/organizations that perform the harvest

ContainerMD

http://bibnum.bnf.fr/containerMD-v1/documentation/containerMD-v1.html

Special metadata for web archive material.

http://bibnum.bnf.fr/containerMD-v1

Different SLAs apply to different types of material and to data from different harvests. They expect various benefits from the shared repository; skills for the different formats do not need to be duplicated within the institution.

Next year there should also be a JHOVE2 module for WARC.


Memento: meta search

Benefit for the NK

Their web archiving model could be used at the NK.

Cost models: Danish national library + TU Wien

- Stephan Strodl (TU Wien): they have their own cost model, but apparently only for small-scale automated preservation action costs.
- The Danish national library built its own model, which is meant to be universal and usable anywhere.
- They measured the cost of submission according to the PAIMAS standard.
- When calculating costs they use OAIS and PAIMAS: they map activities onto these models and estimate process costs accordingly.
- costmodelfordigitalpreservation.dk

Benefit for the NK

Relevant to project 0136, which dealt with ways of estimating the cost of long-term storage.

Meet RODA, a Full-Fledged Digital Repository for Long-Term Preservation

- We have been following this project, originally run by the Portuguese national archive, for several years. RODA is now supported by an independent company, which also partly continues its development. So far RODA supports only an archival metadata format (EAD), but further development should include library formats as well.

- RODA is now part of the SCAPE project, where it will be possible to develop the system further and scale it for use in mass production.

- http://redmine.keep.pt/projects/roda-public

Benefit for the NK

Follow further development, possibly also for the INCAD + KNAV projects. For developing LTP at smaller institutions this could become an interesting alternative in the future.

Report on a business trip abroad

Name and surname of the traveller: Mgr. Andrea Fojtů (AF)
Workplace according to the organizational structure: 1.5 Department of Long-Term Preservation of Digital Data (ODODD)
Workplace assignment: 1.5.2 Digital Repository Content Management Department
Reason for the trip: participation in the conference Aligning National Approaches to Digital Preservation
Place (city): Tallinn
Place (country): Estonia
Date (from-to): 22-26 May 2011

Detailed schedule:
Monday 23 May: conference registration; guided tour of the National Library; Keynote Address by Laura Campbell (Library of Congress, USA); Panel 1: Technical Alignment; Panel 2: Organizational Alignment
Tuesday 24 May: Keynote Address by Gunnar Sahlin (National Library of Sweden); Panel 3: Standards Alignment; Panel 4: Legal Alignment; Breakout Sessions for panels 3 & 4
Wednesday 25 May: Panel 5: Education Alignment; Panel 6: Economic Alignment; Breakout Sessions for panels 5 & 6; Synthesis; Closing remarks

Fellow travellers from the NK: PhDr. Bohdana Stoklasová (BS), Ing. Tomáš Svoboda (TS)

Funding: IOP NDK

Aims of the trip: attendance at a conference with international participation; making contacts in the field of long-term preservation of digital documents and electronic legal deposit; a more detailed insight into the long-term preservation of digital documents, (especially) in national libraries

Fulfilment of the aims (specifically): The conclusions of the conference Aligning National Approaches to Digital Preservation largely mirror the conclusions of the workshop The Future of the Past: Shaping new visions for EU research in digital preservation (report available at http://cordis.europa.eu/fp7/ict/telearn-digicult/future-of-the-past_en.pdf), for example regarding the missing economic model for the commercial sector, which would treat long-term preservation as an inseparable part of all its processes. Contact was established with staff of the national libraries of Estonia and Finland (staff working on web archiving and on long-term preservation in general).

Programme and further details: The main aim of the conference was to align national approaches to the long-term preservation of digital documents across all areas, from technical, organizational and educational to standardization, economic and financial.

Materials brought back: conference programme, leaflets of exhibiting companies (Tessella, Equella, Guardtime) + other materials, notes

Date of submission of the report: 8 June 2011
Signature of the submitter:

Signature of the superior:

Posted on the Intranet:
Accepted by the international department:

Annex to this report: Notes from the conference in English

Annex: Notes from the conference in English

Exploring What We Can Do Together: Strategic Alliance for International Collaboration, Laura Campbell

- 185 digital preservation partners in more than 25 countries (education, research, LAM)
- strategic goals: National Content Stewardship Network (national digital collection, technical architecture, public policy, outreach)
- NDIIPP Content Domain Map: a mind map of geospatial, audiovisual, image & text and web content
- then & now: cognitive surplus vs digital libraries
- digital preservation solution framework: actively working together, special interest groups, establishing a common index, international digital collection (freely available)

PANEL 1: Technical Alignment (The role of testing), Prof. Dr. Michael Seadle (Panel Head)

- to collaborate on requiring and implementing rigorous and independent tests

DNB Contribution to the Tallinn Alignment, Sabine Schrimpf

- key theme is infrastructure
- a network of hardware and software that permits the operation of SW applications
- the question of interoperability is crucial (standards, technical specifications)
- SW elements
- components of the DP infrastructure were compared to pallets on the railways
o Source: PARSE.Insight Roadmap 2020
- persistent ID resolvers, certification process
- kopal (ingest, koLibRI)
- nestor (German Network of Expertise)
- DP4Lib: Digital Preservation as a Service; reduce dependency between components
o redundant storage at different locations
o koLibRI modules
- LUKII: set up as an economical LOCKSS network in Germany
- SHAMAN
- APARSEN: wants to bring coherence and cohesion to digital preservation research
o trends in DP research projects
o modular DP systems
o distributed as SOA
o elimination of technology dependencies

EDINA: The UK LOCKSS Alliance, Adam Rusbridge

- EDINA offers underlying technical support & coordination
- threats to digital stewardship
o failure (media, HW, SW, network, format obsolescence, natural disaster, economic/organizational failure)
o attack (insider/outsider)
o operator error
- source: Requirement to Digital Preservation
- the projects PEPRS and PECAN help identify coverage and requirements for DP

Public testing, Michael Seadle

- traditional physical archiving relies heavily on trusted institutions
- distrust, not trust, needs to be the basis of digital archiving; testing therefore plays a key role
- goals of testing: demonstrate functionality, reveal weaknesses, provide data for planning improvements
- key issues for testing: integrity, authenticity (can the origin or genuineness be shown), usability (can migration/emulation be demonstrated), access, financial integrity
- DrWho (drwho1com)
- bit stream testing is the most important; authenticity and usability may be impaired
o the type of storage media, the number of copies + the frequency of checking and replacement get us to the relevant results
o no reliable metrics exist, however (what is an acceptable loss, etc.)
- without well documented, peer reviewed, publicly available test results, librarians are buying archiving systems on faith

Presentation without a title, Andy Rauber

- evaluation vs testing vs benchmarking
- DP today: evaluation rather than testing, and far from benchmarking (a few tests, but nowhere near a definition of benchmarks)
o existing evaluations are not repeatable
o focus on the simple things
o building the frameworks before having clear test scenarios
- necessary to move towards comparative benchmarking
- what is needed: a commitment that we want a culture of benchmarking and comparative evaluation; an understanding of what we want to benchmark; benchmark data + ground truth; measurement scales and measures that remain constant; knowledge bases to collect these

Organizing digital preservation on an international level, Michelle Gallinger (NDIIPP)

- focus on a national DP agenda
- community driven, action oriented (National Digital Stewardship Alliance)
o present a distributed national digital collection for the benefit of citizens

The European Research Arena, David Giaretta

- technologies: GEANT, EGEE/EGI
- EU research projects: TIMBUS, BLOG4EVER, SCAPE, ENSURE, APARSEN, ARCOMEM, WF4EVER
o SCIDIP-ES (2011-2014)
- Alliance for Permanent Access (APA): formed as a legal entity 3 years ago
o opportunities for networking
- ISO 16363: Audit and Certification of Trustworthy Digital Repositories
- ISO 16919: Requirements for Bodies Providing Audit and Certification

Observation from the MetaArchive Cooperative program, Martin Halbert

- distributed DP programs differ from other programs
o replication of content, distribution of the replicated copies to distinct geographical locations, and a network organization to connect the replicated copies
- MetaArchive: established in 2003, funded by NDIIPP
o seeks to foster broader awareness of digital preservation issues
- IIPC: members are all institutions that focus on WA
o 39 members: national and university libraries + other organizations (Internet Archive)
o ISO standard WARC format for web archives + Heritrix and NutchWAX
o growing membership (Africa & South America)

DAY 2

Keynote address: International and National Collaboration in the digital age, Gunnar Sahlin

- 2012: a new law for e-deposit
- Samsök (search together) in 2005 (upgrade, new system)
- Swepub and long-term preservation
- consortium of the Swedish research libraries for licensing e-journals and databases (ICOLC)
- Open Access and e-publishing (all universities have their repositories for e-pubs)
- NL as aggregator for Europeana, TEL, Apres, Athena, EUscreen
o common system for the preservation of digital materials
o common search portal for materials from the Swedish National Library and the Swedish National Archive

Raivo Ruusalepp

- standards: RAC, DSA, CIDOC (CRM), PAIMAS, ISA (DG), DDI
- use of information security standards for digital preservation
- information security: administrative and technical (physical = data security vs IT = communication)
- a company implemented security measurement with typical cyber crime scenarios
- survey of security
o provision for information security in national legislation and development plans (12 of the respondents: ISO 27000 series, only 2 formal audits, the rest are looking into it; ½ of the respondents do not use standards or formal measurements to control information security)
o IT & disaster plan: 65 (data recovery from the off-site location tested: 0)
- alignment
o better use of community standards for information security and preservation
o agreement on security requirements

Standards-based approach to preservation planning, Matthew Woollard

- ISO 27001: very expensive implementation, 100 000 pounds
- Basic: Data Seal of Approval Guidelines (helps you understand your business better)
- Audit and Certification of Trustworthy Digital Repositories
- Memorandum of Understanding to Create a European Framework
- ISO 16363 external; DSA or ISO 16363 self-audit

Best Practices & Standards, Bram van der Werf

- self-assessment, trust, audits, certification, trust
- ISO 30300 (draft): Record Management

PANEL 4: Legal Alignment

Legal deposit & Web Archiving, Adrienne Muir

- legal deposit provisions: purpose, scope, deposit mechanisms, roles & responsibilities & liability, access provisions, sanctions
- implementation: definitions, scope, offline/online, freely available/pay-wall, technology neutral, incremental
- legal deposit vs voluntary (interim/hybrid approach, model agreements and licenses, flexibility)
- other legal issues: intellectual property rights, preservation, access, unlawful material, privacy/data protection
- voluntary approaches have disadvantages but may be necessary and can be useful

Breakout Session

- standards bring along alignment by themselves, but only if you use them to the full, not halfway
- depends on the community (users): enforceable or voluntary compliance (standards)
- next steps for standards alignment
o corpora as benchmarks
o export/import completeness
o educational standards
o validation tools
o accredited training courses to accredit auditors
o framework standards

DAY 3

PANEL 5: Educational Alignment

- key elements of the DCC Curation Lifecycle Model
- Framework for the Education Alignment (USA: grad programs for digital preservation, new models for grad programs, internship programs (diverse knowledge), workshops)
o sharing tools
o national programs
- issues related to digital preservation
o nature of costs and business models
o strategies for selection & appraisal
o ground roles and responsibilities
o effectiveness and demand for services
- focus of the panel: factors influencing the actual sustainability of a digital archive
- 2 considerations: collaboration + user demand
- challenges + gaps: span national boundaries, public + private funding, education, exportation, DP certification, competition, funding gaps, policy, selection criteria, roles & responsibilities, standards
- Magazzini Digitali: e-legal deposit in Italy
- PADI: a failed project (discontinued in 2010)

Presentation without a title, Neil Grindley

- how much does it cost to manage information? what institutional financial strategies are required to facilitate effective preservation? what general economic frameworks are required to enable information to persist and be accessible?
- JISC 2010: Infrastructure for Education and Research Programme
- archival storage and preservation activities constitute a very small proportion of the overall costs: 15%, against 31% for access and 55% for outreach/acquisition/ingest
o approx. 333 Euros for a set of 1000 records
- KRDS2, p. 83: future tool development supporting automation of ingest

Sustainable Preservation in North America: ADPNet & Friends, Aaron Trehub

- solution: distributed digital preservation (in at least 3 copies vs LOCKSS 6 copies)
- DDP + LOCKSS = PLN (open SW developed at Stanford)
- MetaArchive, COPPUL (Canada), ADPNet (www.adpn.org)


Accepted by the international department

Seamus Ross: digital curation and preservation
- preserving data sets: use of statistical databases and research data vs protection of textual information
- personal data: no longer just photos in a box (Flickr etc.)
- banks: lots of personal data with future use for historians; it must be preserved, and institutions are doing so as well
- see mckinsey.com, big data full report (PDF)
- so why do DP (slide 16): future generations expect it; so that historians and scientists have some sources; a legacy of the present for the future; an information ecosystem to enable storytelling
- emphasis shifts from protecting textual information to protecting complex databases

A capability model for DP, Ch. Becker et al.
- research within the SHAMAN and SCAPE projects
- SoS: systems of systems
- 3 kinds of systems (UML)
- DPS: DP as a functional requirement
- SoS: a business system, a system within a system; the data then flow into the DPS
- DPS where DP is not a functional requirement but the system does it anyway (DP-ready system): a business system with DP functionality
- but how to get DP into enterprise systems: a model for implementing DP in any system; within the SHAMAN project, a capability-based reference architecture
- governance, business and technical (operation) capability: a basis for decisions and for assessing the current state
- capability maturity model (CMM): processes for assessment and improvement, taken from SW development

Olivier Rouchon: certification and quality at CINES
- they store theses, digitized material, multimedia, documents and scientific data sets
- a data centre for the whole of France
- they have experts on formats and XML: 11 people
- 15 TB of data
- certification: under national law, CINES is the national centre for DP of theses; they have a department, people and money for it; the procedure and preparations are described below
- preparation for certification: testing DRAMBORA, DSA, TRAC, ISO 16363 and ISO 16919
- step: 2009 DRAMBORA audit; 2 risk reviews per year on how the risks are being dealt with
- step: formalization of business processes, 14 processes according to ISO 9001
- management, operational and support processes (presented at iPRES 2010)
- 2009: external pre-audit, 2 people, 19 man-days, based on all available standards, carried out by reviewing documentation and by interviews
- 2010: SIAF audit, 4 months; carried out by the National Archives of France for every archive that stores public data; the audit is done every 3 years; the report had 800 pages


- 2010: Data Seal of Approval, part of the EU framework for audit and certification of trusted repositories (an MoU between the three certification activities)

- 2011: within the APARSEN project they also did ISO 16363; together with DANS and UKDA they went through that audit

- first internal, January and April 2011 (60 man-days), then external with 12 experts (KB, BL, NASA etc.) in June 2011 (3 days)

MoU: DSA > ISO 16363 as the second step (internal) > ISO 16363 extended. The audit gets faster the more of them you do, i.e. if it is done regularly it is not so time-consuming.

The National Library of New Zealand passed TRAC certification in autumn 2011.

Andreas Rauber: the impact of preservation actions on repositories
- what happens to the repository itself
- repository simulation: RepoSim
- for analysis and testing of migration: what happens when files grow in size, what if the repository holds more types of formats, etc.
- RepoSim: a flexible simulator, irregular patterns
- so far an internal version: Hibernate, Java, MySQL
- it is possible to specify which formats the repository accepts, their description, ingest settings, hypothetical tools (mainly for migration), and rules for preservation activities (migration to which format, which version, which files, how many times; rules + filters)
- a virtual migration can be run; it produces graphs that show how long it will take, etc.
- what to migrate, how, to what and over what period: it runs virtually and we see the result
- good for IT and HW planning
- good for planning different scenarios, comparing them with the expected development, and planning HW growth and investment
- still to be added: the option to enter deletion policies, reports, etc.
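The kind of question such a simulation answers can be illustrated with a toy calculation. This is a minimal sketch and not RepoSim itself: the class names, the repository contents and the tool throughput figures are all invented for illustration, and a real simulator would also model growth over time and irregular ingest patterns.

```python
from dataclasses import dataclass


@dataclass
class FormatHolding:
    fmt: str        # format label (illustrative)
    files: int      # number of files held in this format
    avg_mb: float   # average file size in MB


@dataclass
class MigrationRule:
    source_fmt: str
    target_fmt: str
    mb_per_second: float  # assumed throughput of a hypothetical migration tool


def simulate(holdings, rules):
    """Estimate hours of work for each rule that matches something in the repository."""
    by_fmt = {h.fmt: h for h in holdings}
    hours = {}
    for rule in rules:
        holding = by_fmt.get(rule.source_fmt)
        if holding is None:
            continue  # the rule's filter matches no files, so nothing is migrated
        total_mb = holding.files * holding.avg_mb
        hours[(rule.source_fmt, rule.target_fmt)] = total_mb / rule.mb_per_second / 3600
    return hours


# Invented repository contents and rules, for illustration only.
holdings = [FormatHolding("TIFF", 1_000_000, 20.0), FormatHolding("DOC", 50_000, 0.5)]
rules = [MigrationRule("TIFF", "JP2", 10.0), MigrationRule("WPD", "PDF/A", 5.0)]
estimate = simulate(holdings, rules)
for (src, dst), h in estimate.items():
    print(f"{src} -> {dst}: about {h:.0f} hours")
```

Changing the holdings or the assumed tool throughput and re-running gives a quick comparison of scenarios before any hardware is bought.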

José Barateiro: Risk assessment in DP of e-science data and processes
- DP as risk management
- ISO 31000: definition of risk management
- similar to DRAMBORA
- there are many standards for risk management
- the ISO 31000 methodology broken down into individual steps
- TIMBUS project, http://timbusproject.net; one of the partners is SAP (Germany)

mad talks
- open source SW for LTP: RODA is back; it is being developed within the SCAPE project; new functions, plans for development and for building a user community
- 4 posters on emulation: emulation within KEEP, emulation for library reading rooms, OPF ecosystem, registry
- TOTEM: a metadata standard for describing the technical environment for emulation


ANDS, Ross Wilkinson
- data centres in Australia: at least 3, for different areas of life
- ANDS has existed for almost 3 years; money from the Australian government
- a huge amount of data will never be used or read by a human, only mined by automated processes
- research data must be stored and protected because it may no longer be possible to recreate them, and in such a way that they can be reused, new conclusions can be drawn from them, and scientists have them at their disposal
- this has to be done in cooperation; it cannot be done by a single institution
- who deals with storing scientific data in the Czech Republic: Academy of Sciences, CESNET
- similar data centres also exist in Great Britain

Rob Sharpe: Considerations for High Throughput Digital Preservation

A presentation by the company Tessella on their testing of ingest performance into SDB at FamilySearch.

- SDB has been in development since 2002, when the first customer was the National Archives (UK)
- new customer: UK Parliament
- test with FamilySearch
- 20 TB ingested per day; scanned materials; workflow with antivirus, characterization (PRONOM, JHOVE), etc.
- 1 package is roughly 1 GB; 20 thousand packages per day
- 2 Dell PowerEdge R710 servers, total price at most 20,000 pounds
- the limiting factor turned out to be the read speed of the disks on which the data are stored at the start of ingest; they therefore needed 130 parallel disks (50 thousand pounds)
- storage to tape is also slow; they therefore needed 8 parallel tape writes (30 thousand pounds)
- storage costs 100 pounds per TB
- 7.3 PB per year
- conclusion: writing and reading are slow; tools such as JHOVE and PRONOM are fast enough; storage costs also turned out to be high
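The figures in the list above can be cross-checked with simple arithmetic. The sketch below only restates numbers from the notes (20 TB per day, roughly 1 GB per package, 100 pounds per TB); decimal units and a 365-day year are assumptions, and the function names are made up for this example.

```python
SECONDS_PER_DAY = 24 * 60 * 60


def sustained_mb_per_s(tb_per_day):
    """Average rate in MB/s needed to move tb_per_day in one pass (1 TB = 1,000,000 MB)."""
    return tb_per_day * 1_000_000 / SECONDS_PER_DAY


def packages_per_day(tb_per_day, gb_per_package):
    """Number of packages per day for a given package size (1 TB = 1,000 GB)."""
    return tb_per_day * 1000 / gb_per_package


def yearly_volume_pb(tb_per_day):
    """Volume accumulated over 365 days, in PB (1 PB = 1,000 TB)."""
    return tb_per_day * 365 / 1000


def yearly_storage_cost_gbp(tb_per_day, gbp_per_tb):
    """Cost of storing one year of ingest at a flat price per TB."""
    return tb_per_day * 365 * gbp_per_tb


rate = sustained_mb_per_s(20)
daily_packages = packages_per_day(20, 1)
volume = yearly_volume_pb(20)
cost = yearly_storage_cost_gbp(20, 100)
print(f"{rate:.1f} MB/s, {daily_packages:.0f} packages/day, {volume} PB/year, {cost} GBP/year")
```

The average single-pass rate is only a lower bound on what the hardware must deliver, because each package is read and written several times during the workflow.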

To ingest data from the FamilySearch project they needed to guarantee a throughput of 20 TB of data per day while keeping adequate procedures for processing the data according to OAIS and the customer's requirements. The aim of the project was to identify the bottlenecks in ingesting large volumes of data.

Processes such as generating hashes or checking them, identifying formats and extracting technical metadata usually require a large and, at high volumes, fast storage system. In the FamilySearch project they want to ingest into SDB (content acquisition, content preparation, ingest, fixity check, content metadata integrity check, characterization, i.e. identification and validation of formats and extraction of technical metadata) at most 700 MB per second.
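A fixity check of the kind mentioned above can be sketched with the Python standard library. This is an illustration under stated assumptions, not the SDB implementation: SHA-256 and the 1 MB chunk size are arbitrary choices, and the helper names are hypothetical. Reading in chunks keeps memory use constant, which is why disk read speed rather than the hashing itself tends to become the limit.

```python
import hashlib
import tempfile
from pathlib import Path


def file_digest(path, algorithm="sha256", chunk_size=1024 * 1024):
    """Compute the hex digest of a file, reading it in fixed-size chunks."""
    h = hashlib.new(algorithm)
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()


def fixity_ok(path, expected_hex, algorithm="sha256"):
    """Return True if the file still matches the digest recorded at ingest."""
    return file_digest(path, algorithm) == expected_hex.lower()


# Small demonstration on a temporary file standing in for an ingest package.
with tempfile.TemporaryDirectory() as tmp:
    sample = Path(tmp) / "package.bin"
    sample.write_bytes(b"abc")
    recorded = file_digest(sample)        # digest recorded at ingest
    intact = fixity_ok(sample, recorded)  # unchanged file passes
    sample.write_bytes(b"abd")            # simulate corruption
    damaged = fixity_ok(sample, recorded) # changed file fails
```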

They looked at how to parallelize such a massive workflow efficiently while minimizing costs. According to their findings, parallelization makes it possible to get around the performance problems of tools such as DROID and JHOVE; overall, contrary to their expectations, software performance was not a problem. The bigger problems are in the HW, which has to be able to write fast enough.

In other words, the bottleneck was in the HW and in moving data from place to place rather than in the performance of digital preservation tools.

Benefit for the NK


There is no need to fear the performance of software such as DROID or JHOVE.

Ross King: Evolving domains, problems and solutions for LT DP
- information about projects such as SCAPE
- programme: http://cordis.europa.eu/fp7/ict/telearn-digicult/report-research-digital-preservation_en.pdf (Stephan Strodl, Petar Petrov and Andreas Rauber, all Vienna University of Technology, Austria). A nice timeline of preservation projects; a white paper about the past of European DP
- funding spent on DP research is gradually growing; projects and funding alone will not solve anything
- ARCOMEM: archiving of web archives, a socially driven web preservation model
- social web analysis
- archive enrichment
- ENSURE: evaluation between cost and value, automation of the preservation cycle; testbeds: healthcare, clinical trials, financial services
- SCAPE
- preservation planning and action workflows: how to make them scalable
- building an infrastructure for scalable preservation actions
- development of a policy-based preservation planning tool with automated preservation watch
- 3 testbeds: WA, large-scale repositories, research data sets
- all projects will produce prototype SW
- digital lifecycle approach
- preservation planning plays a role in all these projects, together with virtualization
- a slide with trends in DP over recent years
- Research on Digital Preservation within projects co-funded by the European Union in the ICT
- Ensure
- Scape
- Wf4Ever: http://www.wf4ever-project.org/about
- Timbus: SW is not enough; it focuses on the context of organizations; LTP is not only about objects but also about services, etc.
- Totem

Benefit for the NK

Follow projects in the field of long-term preservation of digital data. The latest EU projects such as SCAPE will speed up the development of concrete tools for long-term preservation of digital data.

Record keeping in temporary command settings, Erik Borglund
- protection of documentation on crisis situations arising from the work of the police etc.
- how to capture the context: flipcharts, videos and written records can be kept; the context of analogue documents is not a problem, the problem lies with digital material and conversations
- the national archive should take care of this, but it accepts only paper documents or, for example, photos from the scene
- the question of archiving file material is the same as that of archiving the course of proceedings in digital form


Web archiving session

BnF
- 200 TB of web-archived data
- 15 million ARC files; they have to be characterized and validated, which is time-consuming
- stored in a shared repository
- SPAR (the LTP system of the French national library) has a capacity of 16 PB
- they use JHOVE2 for characterization and are creating a module for ARC files
- they do not want to characterize and validate the content of the ARC files, only identify the formats

- PREMIS in METS would be too long, so they will record only metadata at the level of the information package (AIP) that are the same for the whole package; more precisely, a property is stated once and then only the information about which files it applies to is added, instead of repeating the information for every file
- they created a special metadata format
- i.e. they can ask the LTP system "give me all information packages that contain format XY" and so on; there is no need to index the metadata of the contents, which would take a long time; they use the same approach for digitized books
- different DP policies and levels of validation for different types of WA data: complete harvests vs thematic harvests
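The idea of stating a property once per package and listing the files it applies to can be sketched as follows. This is only an illustration of the principle: the data layout, the function names and the example file names are hypothetical and do not reproduce the BnF metadata format.

```python
from collections import defaultdict


def package_level_records(files):
    """Group files by (property, value) so each value is written once per package."""
    grouped = defaultdict(list)
    for name, props in files.items():
        for key, value in props.items():
            grouped[(key, value)].append(name)
    return dict(grouped)


def packages_containing(packages, key, value):
    """Answer 'give me all packages that contain format XY' from package-level records."""
    return sorted(pid for pid, records in packages.items() if (key, value) in records)


# Invented example package: three files, two distinct formats.
files = {
    "a.html": {"format": "text/html"},
    "b.html": {"format": "text/html"},
    "c.png": {"format": "image/png"},
}
records = package_level_records(files)
packages = {"aip-001": records, "aip-002": {("format", "image/gif"): ["d.gif"]}}
html_packages = packages_containing(packages, "format", "text/html")
```

With millions of files per harvest, the number of distinct property values stays small, so the package-level record grows far more slowly than one record per file would.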

NL NZ
- 2 harvests, 20 TB in total
- they are dealing with metadata: how much metadata is a lot and how much is too little
- library policy says that as much metadata as possible must be stored, but that would be a problem in terms of metadata size
- for selective web harvesting they have a finished workflow (WCT); everything is catalogued

IA
- 16 billion URLs
- the oldest from 1996
- growth is 3 TB per day, 1 PB per year

Euan Cochrane, Dirk von Suchodoletz: Replicating Installed Application and Information Environments onto Emulated or Virtualized Hardware

- capturing and preserving a complete environment on emulated HW
- e.g. take the desktop environment of the prime minister and store it in an archive
- problems with rendering
- computer forensics
- an option for protecting scientific data and records
- it is all about how to replicate an HDD and run the environment it contains in a virtual environment
- solution
- they pulled the HDDs out of several old PCs > identify the HW requirements (analysis of the HDD > the requirements are estimated automatically, since this information is part of every PC environment) > choose the emulation/virtualization SW (a tool registry such as TOTEM from the KEEP project) > convert the HDD into a disk image suitable for emulation > try to boot the disk image on emulated HW > add drivers

- problems with licences, personal data protection and authenticity (in 20% of cases things change: colours etc.)
- QEMU, SPARC processor emulator


Klaus Rechert: Remote Emulation for Migration Services in a Distributed Preservation Framework
- use of emulation as a tool for migration
- tools for migrating certain formats are often not available
- we insert the digital object into an emulated environment (virtual machine); we then see it in the environment of the emulated system, can open it in the original or another suitable application, save it in a different format, and store it again in the virtual machine

Bram Lohman: Emulation as a Business Solution: the Emulation Framework (KEEP project)
- emulation framework: 7 emulators, 6 platforms (x86, Amiga etc.), 23 file formats
- a solution for managing emulation tools
- setup of emulation processes
- an environment that contains emulators; if we load an application or a file into it, it should run as in the original environment
- the environment also contains a tool that shows what format a file is in and which environment is needed to run it; based on PRONOM, the environment can be prepared straight away and the file run in it; it loads the SW image from an application database that OPF is building

Geoffrey Brown: Developing Virtual CD-ROM Collections: The Voyager Company Publications

- publications of a particular publisher on CD, interactive applications for Mac; of the 200 published, only about 50 are available today
- emulation on today's systems
- HDD snapshot directly in the emulator, i.e. it takes one click and is very fast
- SheepShaver emulator

Evaluation of the Danish large migration project
- Before 1998 they had no formats set by law.
- Between 2005 and 2008 they introduced standards.
- The evaluation concerns the standards that were set and the migration into them at the national archive.
- The evaluation was done for the body that funded the work.
- Between 2005 and 2008 they spent 30 person-years on migration, had 10-15 people on it, and invested 190 thousand USD; total costs were 26 million USD.

It is not much data; what they actually migrated was about 1777 GB.

- Various parts of the archive: tapes, population data, data on CD-R, registries and data filled in electronically.
- They could not read all the files, especially on tapes.
- 5 different types of tapes.
- Some had to be rescued at great expense.


Total cost of producing the preservation standard: 10 man-years, 12 thousand USD, including the manual and implementation recommendations.

Pilot: planning and management of the project, and verifying the information packages.

The aim of the pilot was to obtain a better budget and project plan.

Some problematic data in old formats, such as old databases, need clever people who do repetitive work that takes a long time; they needed good knowledge management to make it efficient.

Method of migration: they wrote the requirements for the tool and a description of how manual migration should be done.

Data preparation (restructure the data and register the metadata of the IPs) and preparation of documentation for the migrated IPs.

Software development: in-house development.

They needed 50 person-years per 1

Conclusions

Migration of standard data is cheaper, migration from some standard tapes is cheaper, etc.

Most of the costs (80%) fell on non-standardized data, in building the migration software. Developing a tool for migrating heterogeneous or non-standard data is the most expensive part.

What they learned: their old data had not been analysed sufficiently.

Their project management was loose; they lost money.

Knowledge management: a good description of old data and of all their types, generations, locations, etc. does not exist at our institution and we will have difficulties with it; migration of old data at the NK will be problematic.

Angela Dappert: robust migration workflow for offline media
- What is an archival object? Nice slide: a CD is not an archival object; for them it is a hand-held carrier. A bit-stable object is better, since it can have a backup etc., leading to an archival object that has further metadata (logical preservation).
- A CD is not searchable, cannot easily be replicated, has a large manual overhead, and its rendering technology becomes obsolete very quickly.
- Endangered Archives project: optical disks, CD-R, external HDs, tapes; 67 terabytes in total.
- OFFLINE hand-held carriers in the Endangered Archives project were very variable; they contained data with DRM, under copyright, and with a number of similar problems.
- Options they chose between for each data source:
- Disk image: a single file containing everything on the carrier.
- Or extraction of selected files only.
- How important is the carrier itself? We need some information about it; it may hold traces of deleted data that we may want to keep. They made disk images of everything possible: hybrid DVDs, audio discs that also held data, etc.
- Which disk image should they use? Not a single disk image format for all data; each special case gets its own disk image format.
- They did it with robots (disk copying robots). Sometimes large-scale disk copying robots could not be used: they are good at producing CDs but not at ripping data from CDs.


- They built their own application with disk stacks and some smaller robots. They used LIFO or FIFO and in the end settled on FIFO; LIFO had problems with the CD lifter.

At the KB they thought through a fairly complex workflow, how to describe it, etc.

Each robot had its own PC.

They had problems with a number of things, see the presentation.

They did not find good software for managing images, only command-line tools, with which non-technical staff would make many mistakes.

It takes a lot of people before the content gets online.

Staff must be well trained and flexible, but also able to do tedious jobs: systematic, patient.

NOTE: important for NK, where transferring data from disks will also have to be addressed, and has in fact already been addressed.


KEEP project, Antonio Ciuffreda: towards integrated migration environment

- Disk transfer tool: converts a disk into an image file. The result also carries further metadata about the file system, the MD5 of the file, etc.
- KEEP is building a Transfer Tool Framework.
- Magnetic media: disk transfer tools for floppy disks, commercial and open source.
- Disk2FDI: commercial DOS tool; very accurate floppy disk image; it takes 1 hour and the result is very large, ten times larger than the floppy disk itself. He tested about 2260 disks; they tested emulation.
- Catweasel: commercial tool; a PCI card; runs on Linux and Win XP; has a GUI. High error rate but faster; image file quality was low.
- Nibtools: free tool; G64 and D64; covers only the C64; DOS, Win, Linux, but command line only. Needs a Commodore disk drive and special cables. They tested a few disks; about half then did not work in the emulator.
- Optical media: they used 5 transfer tools, each on the same CDs, DVDs and games.
1. Alcohol 120%: commercial; can bypass DRM etc.; supports Win systems. 12 of 13 worked; 2 MB per second.
2. Daemon Tools: commercial; several types of image files (ISO, MDS, MDF); supports Win. Three of 13 did not work.
3. CloneCD: commercial; uses IMG or ISO; bypasses SafeDisc 3 protection; supports Win; has a GUI. 11 of 13 were OK.
4. BlindWrite: commercial; supports DVD, Blu-ray, WM, Xbox and other special discs. Generates ISO and some proprietary ISO image formats. One did not work; transfer speed not noted.
5. ImgBurn: does not read subchannel information into the image file (a film cannot be seeked, etc.); it is open source; generates DVD, BIN, CUE, IMG; Win and Linux. 4 did not work; it is fast.
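The disk transfer step noted above (an image file accompanied by metadata such as the MD5 of the file) can be sketched roughly as follows. This is only a minimal illustration, not the KEEP transfer tool: the function names `describe_image` and `verify_image` and the record layout are assumptions made for this example, and file-system metadata is left out.

```python
import hashlib
import os


def describe_image(image_path, chunk_size=1024 * 1024):
    """Return a small metadata record (name, size, MD5) for a disk image file.

    Hypothetical record layout, chosen only for this illustration.
    """
    md5 = hashlib.md5()
    with open(image_path, "rb") as handle:
        # Read in chunks so that large images do not have to fit in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            md5.update(chunk)
    return {
        "file_name": os.path.basename(image_path),
        "size_bytes": os.path.getsize(image_path),
        "md5": md5.hexdigest(),
    }


def verify_image(image_path, record):
    """Recompute the checksum and compare it with a previously stored record."""
    return describe_image(image_path)["md5"] == record["md5"]
```

Storing the record next to the image makes it possible to re-check later that the image has not changed since the transfer.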

Conclusions

For magnetic media, commercial and non-commercial tools do not differ in performance. Disk2FDI is accurate but very slow. KEEP will use NibTools.

Optical media: copy protection must be kept in mind. The tools perform similarly; there will always be errors in the images, between 30 and 10 percent. BlindWrite can handle game discs (Xbox etc.). KEEP will use ImgBurn because it is open source.

For complex images BlindWrite is better.

Benefit for NK

Consider whether NK should indeed run a project to migrate the content of CDs and DVDs to online media. The presentations report concrete experience with robotic processing and show what problems arose in designing the workflow, choosing the type of ISO image, etc.

BnF web archiving
They have three layers:

Harvest definition: collection

Harvest instance: crawling metadata


ARC files

A collection would be, for example, the selective harvest "elections 2000"; below it come the individual harvest instances and then the ARCs. They collect logs, config and reports. These are also stored in an ARC: a special metadata ARC for each crawling instance.

PREMIS

Object, agent, event

Objects: 1. ARC files and metadata ARCs; 2. harvest instances

Harvest event in PREMIS: event "creation of content files"

Events: reports as extensions of the event (host report and harvest report)

Agents: administrator, SW, institutions/organizations that perform the harvest
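The object / agent / event split noted above can be pictured with a few plain data classes. This is only an illustrative sketch under stated assumptions: the class and field names are invented for the example and are neither the PREMIS schema nor BnF's implementation.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Agent:
    name: str
    role: str  # e.g. "administrator", "software", "organization"


@dataclass
class ArchivedObject:
    identifier: str
    kind: str  # "arc_file", "metadata_arc" or "harvest_instance"


@dataclass
class Event:
    event_type: str
    objects: List[ArchivedObject] = field(default_factory=list)
    agents: List[Agent] = field(default_factory=list)
    reports: List[str] = field(default_factory=list)  # e.g. host report, harvest report


def record_harvest(instance_id, arc_names, agents, reports):
    """Build one harvest event linking a harvest instance to its ARC files."""
    instance = ArchivedObject(instance_id, "harvest_instance")
    arcs = [ArchivedObject(name, "arc_file") for name in arc_names]
    return Event("creation of content files", [instance] + arcs, list(agents), list(reports))
```

A call such as `record_harvest("instance-1", ["a.arc", "b.arc"], [Agent("crawler", "software")], ["host report", "harvest report"])` yields a single event that ties the reports and agents to the harvested files; all identifiers here are placeholders.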

ContainerMD

http://bibnum.bnf.fr/containerMD-v1/documentation/containerMD-v1.html

Special metadata for material from the web archive

http://bibnum.bnf.fr/containerMD-v1

Different SLAs for different types of material and for data from different harvests; shared repository; they expect various benefits; skills for different formats need not be duplicated within the institution.

Next year there should also be a JHOVE2 module for WARC.


Memento: meta search

Benefit for NK

Their web archiving model could be used at NK.

Cost models: Danish national library + TU Wien
- Stephan Strodl, TU Vienna: they have their own cost model, but apparently only for the cost of small-scale automated preservation actions.
- Danish national library: they built their own model, which is meant to be universal and usable anywhere.
- They measured the cost of submission according to the PAIMAS standard.
- When calculating costs they use OAIS and PAIMAS: they map activities onto these models and estimate process costs accordingly.
- costmodelfordigitalpreservation.dk

Benefit for NK

Relevant to project 0136: the talk dealt with options for estimating the costs of long-term storage.

Meet RODA, a Full-Fledged Digital Repository for Long-Term Preservation

- Originally a project of the Portuguese national archive; we have been following it for several years. RODA is now supported by an independent company, which also partly continues its development. So far RODA supports only the archival metadata format (EAD), but further development should also cover library formats.
- RODA is now part of the SCAPE project, where the system can be developed further and scaled for use in mass production.
- http://redmine.keep.pt/projects/roda-public

Benefit for NK

Follow further development, possibly also for the INCAD + KNAV projects. For LTP development in smaller institutions this could be an interesting alternative in the future.

Report from a foreign business trip

Name of the trip participant: Mgr. Andrea Fojtů (AF)
Workplace according to the organizational structure: 1.5 Department of Long-Term Preservation of Digital Data (ODODD)
Workplace assignment: 1.5.2 Digital Repository Content Management Department
Reason for the trip: participation in the conference Aligning National Approaches to Digital Preservation
Place (city): Tallinn
Place (country): Estonia
Date (from-to): 22-26 May 2011
Detailed schedule:
Monday 23 May: conference registration; guided tour of the National Library; Keynote Address by Laura Campbell (Library of Congress, USA); Panel 1: Technical Alignment; Panel 2: Organizational Alignment
Tuesday 24 May: Keynote Address by Gunnar Sahlin (National Library of Sweden); Panel 3: Standards Alignment; Panel 4: Legal Alignment; Breakout Sessions for panels 3 & 4
Wednesday 25 May: Panel 5: Education Alignment; Panel 6: Economic Alignment; Breakout Sessions for panels 5 & 6; Synthesis; Closing remarks
Fellow travellers from NK: PhDr. Bohdana Stoklasová (BS), Ing. Tomáš Svoboda (TS)
Funding: IOP NDK
Trip objectives: attendance at a conference with international participation; gaining contacts in the field of long-term preservation of digital documents and electronic legal deposit; a more detailed insight into long-term preservation of digital documents, (especially) in national libraries
Fulfilment of the trip objectives (specifically): The conclusions of the conference Aligning National Approaches to Digital Preservation largely mirror the conclusions of the workshop The Future of the Past: Shaping new visions for EU research in digital preservation (report available at http://cordis.europa.eu/fp7/ict/telearn-digicult/future-of-the-past_en.pdf), for example regarding the missing economic model for the commercial sector, which would treat long-term preservation as an integral part of all its processes. Contact was established with staff of the national libraries of Estonia and Finland (staff working on web archiving and on long-term preservation in general).
Programme and further details: The main aim of the conference was to align national approaches to long-term preservation of digital documents across all areas, from technical, organizational and educational to standardization, economic and financial ones.
Materials brought back: conference programme, leaflets of exhibiting companies (Tessella, Equella, Guardtime) + other materials, notes
Date of submission of the report: 8 June 2011
Signature of the submitter:
Signature of the supervisor:
Posted on the intranet:
Received by the international department:
Attachment to this report: conference notes in English

Attachment: Conference notes in English

Exploring What We Can Do Together: Strategic Alliance for International Collaboration, Laura Campbell

- 185 digital preservation partners in more than 25 countries (education, research, LAM)
- strategic goals: National Content Stewardship Network (national digital collection, technical architecture, public policy, outreach)
- NDIIPP Content Domain Map: a mind map of geospatial, audiovisual, image & text, and web content
- then & now: cognitive surplus vs digital libraries / digital preservation
- solution framework: actively working together, special interest groups, establishing a common index, international digital collection (freely available)

PANEL 1: Technical Alignment (The role of testing), Prof. Dr. Michael Seadle (Panel Head)

to collaborate on requiring and implementing rigorous and independent tests

DNB Contribution to the Tallinn Alignment Sabine Schrimpf

- key theme is infrastructure
- network of hardware and software that permits the operation of application SW
- question of interoperability is crucial (standards, technical specifications)
- SW elements
- components of the DP infrastructure were compared to the pallets at railroads
  o source: PARSE.Insight Roadmap 2020
- persistent ID resolvers, certification process
- kopal (ingest, koLibRI)
- nestor (German Network of Expertise)
- DP4Lib: Digital Preservation as a Service, reduce dependency between components
  o redundant storage at different locations
  o koLibRI modules
- LuKII: set up as an economical LOCKSS network in Germany
- SHAMAN
- APARSEN wants to bring coherence and cohesion to digital preservation research
  o trends in DP research projects
  o modular DP systems
  o distributed as SOA
  o elimination of technology dependencies

EDINA THE UK LOCKSS Alliance Adam Russbridge

- EDINA offers underlying technical support & coordination
- threats to digital stewardship
  o failure (media, HW, SW, network, format obsolescence, natural disaster, economic/organizational failure)
  o attack (insider/outsider)
  o operator error
- source: Requirement to Digital Preservation
- projects PEPRS and PECAN help identify coverage and requirements for DP

Public testing Michael Seadle

- traditional physical archiving relies heavily on trusted institutions
- distrust, not trust, needs to be the basis of digital archiving; testing therefore plays a key role
- goals of testing: demonstrate functionality, reveal weaknesses, provide data for planning improvements
- key issues for testing: integrity, authenticity (can the origin or genuineness be shown), usability (can migration/emulation be demonstrated), access, financial integrity
- DrWho (drwho1com)
- bit stream testing is the most important; authenticity and usability may be impaired
  o the type of storage media, the number of copies + the frequency of checking and replacement get us to the relevant results
  o no reliable metrics exist, however (what is an acceptable loss etc.)
- without well documented, peer reviewed, publicly available test results, librarians are buying archiving systems on faith

Presentation without a title Andy Rauber

- evaluation vs testing vs benchmarking
- DP testing: evaluation rather than testing, far from benchmarking (few tests, but not near a definition of benchmarks)
  o existing evaluations are not repeatable
  o focus on the simple things
  o building the frameworks before having clear test scenarios
- necessary to move towards comparative benchmarking
- what is needed: commitment that we want a culture of benchmarking and comparative evaluation; understanding of what we want to benchmark; benchmark data + ground truth; measurement scales and measures that remain constant; knowledge bases to collect these

Organizing digital preservation on an international level Michelle Gallinger (NDIIPP)

- focus on a national DP agenda
- community driven, action oriented (National Digital Stewardship Alliance)
  o present a distributed national digital collection for the benefit of citizens

The European Research Arena, David Giaretta

- technologies: GEANT, EGEE/EGI
- EU research projects: TIMBUS, BLOG4EVER, SCAPE, ENSURE, APARSEN, ARCOMEM, WF4EVER
  o SCIDIP-ES (2011-2014)
- Alliance for Permanent Access (APA) formed as a legal entity 3 years ago
  o opportunities for networking
- ISO 16363: Audit and Certification of Trustworthy Digital Repositories
- ISO 16919: Requirements for Bodies Providing Audit and Certification

Observation from the MetaArchive Cooperative program Martin Halbert

- distributed DP programs differ from other programs
  o replication of content, distribution of these replicated copies to distinct geographical locations, and network organization to connect these replicated copies
- MetaArchive established in 2003, funded by NDIIPP
  o seeks to foster broader awareness of digital preservation issues
- IIPC: members are all institutions that focus on WA
  o 39 members: national and university libraries + other organizations (Internet Archive)
  o ISO standard WARC format for web archives + Heritrix and NutchWAX
  o growing membership (Africa & South America)

DAY 2

Keynote address: International and National Collaboration in the digital age, Gunnar Sahlin

- 2012: a new law for e-deposit
- Samsök (search together) in 2005 (upgrade, new system)
- Swepub and long-term preservation
- consortium of the Swedish research libraries for licensing e-journals and databases (ICOLC)
- Open Access and e-publishing (all universities have their repositories for e-pubs)
- NL: aggregator for Europeana, TEL, Apres, Athena, EUscreen
  o common system for the preservation of digital materials
  o common search portal for materials from the Swedish National Library and the Swedish National Archive

Raivo Ruusalepp

- standards: RAC, DSA, CIDOC (CRM), PAIMAS, ISA (DG), DDI
- use of information security standards for digital preservation
- information security: administrative and technical (physical = data security vs IT = communication)
- company implemented security measurement with typical cyber crime scenarios
- survey of security
  o provision for information security in national legislation and development plans (12 of the respondents: ISO 27000 series; only 2 formal audits, the rest are looking into it; ½ of the respondents do not use standards or formal measurements to control information security)
  o IT & disaster plan: 65 (data recovery from the off-site location tested: 0)
- alignment
  o better use of community standards for information security and preservation
  o agreement on security requirements

Standards based approach to preservation planning, Matthew Woollard

- ISO 27001: very expensive implementation, 100 000 pounds
- Basic Data Seal of Approval Guidelines (helps understand your business better)
- Audit and Certification of Trustworthy Digital Repositories
- Memorandum of Understanding to Create a European Framework
- ISO 16363 external; DSA or ISO 16363 self audit

Best Practices & Standards, Bram van der Werf

- self assessment trust, audits/certification trust
- ISO 30300 (draft): Record Management

PANEL 4 Legal Alignment

Legal deposit & Web Archiving, Adrienne Muir
- legal deposit provisions: purpose, scope, deposit mechanisms, roles & responsibilities & liability, access provisions, sanctions
- implementation: definitions, scope (offline/online, freely available/pay wall), technology neutral, incremental
- legal deposit vs voluntary (interim/hybrid approach, model agreements and licenses, flexibility)
- other legal issues: intellectual property rights (preservation, access), unlawful material, privacy/data protection
- voluntary approaches have disadvantages but may be necessary and can be useful

Breakout Session

- standards bring along alignment by themselves, but only if you use them to the full, not halfway
- depends on community (users): enforceable or voluntary compliance (standards)
- next steps for standards alignment
  o corpora as benchmarks
  o export/import completeness
  o educational standards
  o validation tools
  o accredited training courses to accredit auditors
  o framework standards

DAY 3
PANEL 5: Educational Alignment

- key elements of the DCC Curation Lifecycle Model
- Framework for the Education Alignment (USA: grad programs for digital preservation, new models for grad programs, internship programs (diverse knowledge), workshops)
  o sharing tools
  o national programs
- related issues to digital preservation
  o nature of costs and business models
  o strategies for selection & appraisal
  o ground roles and responsibilities
  o effectiveness and demand for services

- focus of the panel: factors influencing the actual sustainability of a digital archive
- 2 considerations: collaboration + user demand
- challenges + gaps: span national boundaries, public + private funding, education, exportation, DP certification, competition, funding gaps, policy, selection criteria, roles & responsibilities, standards
- Magazzini Digitali: e-legal deposit in Italy
- PADI: a failed project (discontinued in 2010)

Presentation without a title, Neil Grindley

- how much does it cost to manage information; what institutional financial strategies are required to facilitate effective preservation; what general economic frameworks are required to enable information to persist and be accessible
- JISC 2010: Infrastructure for Education and Research Programme
- archival storage and preservation activities constitute a very small proportion of the overall costs (15 %; access 31 %; outreach, acquisition, ingest 55 %)
  o approx. 333 Euros for a set of 1000 records
- KRDS2 p. 83: future tool development supporting automation of ingest

Sustainable Preservation in North America: ADPNet & Friends, Aaron Trehub

- solution: distributed digital preservation (in at least 3 copies vs LOCKSS 6 copies)
- DDP + LOCKSS = PLN; open SW developed at Stanford
- MetaArchive, COPPUL (Canada), ADPNet (www.adpn.org)


- 2010: Data Seal of Approval, part of the EU framework for audit and certification of trusted repositories (MoU between the three certification initiatives)
- 2011: within the APARSEN project they also did ISO 16363; together with DANS and UKDA they went through that audit
- first an internal audit, January and April 2011 (60 man-days), then an external one with 12 experts (KB, BL, NASA, etc.) in June 2011 (3 days)

MoU: DSA -> ISO 16363 as the second step (internal) -> ISO 16363 extended
An audit gets faster the more of them you do, i.e. if it is done regularly it is not so time-consuming.

The National Library of New Zealand passed TRAC certification in autumn 2011.

Andreas Rauber: impact of preservation actions on repositories
- what happens to the repository itself
- repository simulation: RepoSim
- for analysis and for testing migration: what happens when files grow, what if the repository holds more types of formats, etc.
- RepoSim: a flexible simulator, irregular patterns
- so far an internal version; Hibernate, Java, MySQL
- it is possible to specify which formats are accepted, their description, ingest, settings of hypothetical tools (mainly for migration), and rules for preservation actions (migration to which format, which version, which files, how many times; rules + filters)
- a virtual migration can be run; it produces graphs that show how long it will take, etc.
- what to migrate, how, to what and over what period is played out virtually and we see the result
- good for IT and HW planning
- good for planning different scenarios, comparison with the expected development, planning of HW growth and investments
- still to be finished: the option to specify deletion policies, reports, etc.
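The idea of a "virtual migration" can be illustrated with a toy calculation. The sketch below is not RepoSim and does not use its interface; the function name, the constant yearly growth rate and the fixed migration throughput are all assumptions made purely for illustration.

```python
def simulate_migration(initial_files, yearly_growth, files_per_hour, years):
    """Estimate, year by year, how long migrating the whole repository would take.

    initial_files  -- number of files held at the start (assumed value)
    yearly_growth  -- fractional growth per year, e.g. 0.5 for +50 % (assumed)
    files_per_hour -- throughput of a hypothetical migration tool (assumed)
    Returns a list of (year, files_held, hours_to_migrate_everything).
    """
    results = []
    files = initial_files
    for year in range(1, years + 1):
        files = int(files * (1 + yearly_growth))  # collection grows first
        results.append((year, files, files / files_per_hour))
    return results
```

For instance, `simulate_migration(1000, 0.5, 100, 2)` returns `[(1, 1500, 15.0), (2, 2250, 22.5)]`, showing how a later migration costs more simply because the collection has grown.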

José Barateiro: Risk assessment in DP of e-science data and processes
- DP as risk management
- ISO 31000: definition of risk management
- similar to DRAMBORA
- there are many standards for risk management
- the ISO 31000 methodology broken down into individual steps
- TIMBUS project: http://timbusproject.net; one of the partners is SAP (Germany)

Mad talks
- open source SW for LTP: RODA is back; it is being developed within the SCAPE project; new features, development plans and the creation of a user community
- 4 posters on emulation: emulation within KEEP, emulation for library reading rooms, OPF ecosystem, registry
- TOTEM: metadata standard for describing the technical environment for emulation


ANDS, Ross Wilkinson
- data centres in Australia, at least 3, for different areas of life
- ANDS has existed for almost 3 years; funded by the Australian government
- a huge amount of data will never be used or read by a human, only mined by automatic processes
- research data must be stored and preserved because it may no longer be possible to re-create them; they must be kept so that they can be reused, so that new conclusions can be drawn from them, and so that scientists have them available
- this has to be done in cooperation; it cannot be done by a single institution alone
- who deals with storing scientific data in the Czech Republic? Academy of Sciences, CESNET
- similar data centres also exist in Great Britain

Rob Sharpe: Considerations for High Throughput Digital Preservation
Presentation by the company Tessella: their performance testing of ingest into SDB at FamilySearch

- SDB has been developed since 2002, when the first customer was the National Archives (UK)
- new customer: UK Parliament
- test with FamilySearch
- 20 TB ingested per day; scanned materials; workflow with antivirus, characterization (PRONOM, JHOVE), etc.
- 1 package is roughly 1 GB; 20 thousand packages per day
- 2 Dell PowerEdge R710 servers, total price at most 20,000 pounds
- the limiting factor turned out to be the read speed of the disks on which the data sit at the start of ingest; they therefore needed 130 parallel disks (50 thousand pounds)
- stored on tapes, which is also slow; they therefore needed 8 parallel tape writes (30 thousand pounds)
- storage costs 100 pounds per TB
- 7.3 PB per year
- conclusion: writing and reading are slow; tools such as JHOVE and PRONOM are fast enough; storage costs also turned out to be high
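The figures above can be cross-checked with simple arithmetic. The sketch below assumes decimal units (1 TB = 10^12 bytes, 1 PB = 1000 TB) and a 365-day year; the function names are made up for this illustration and the yearly cost is only the quoted price per TB multiplied out, not a figure from the talk.

```python
SECONDS_PER_DAY = 24 * 60 * 60
BYTES_PER_TB = 10 ** 12  # assumption: decimal terabytes


def sustained_rate_mb_per_s(tb_per_day):
    """Average data rate in MB/s needed to move tb_per_day within 24 hours."""
    return tb_per_day * BYTES_PER_TB / SECONDS_PER_DAY / 10 ** 6


def yearly_volume_pb(tb_per_day, days=365):
    """Volume ingested per year in PB (1 PB = 1000 TB assumed)."""
    return tb_per_day * days / 1000


def yearly_storage_cost_gbp(tb_per_day, gbp_per_tb, days=365):
    """Storage cost of one year of ingest at a flat price per TB."""
    return tb_per_day * days * gbp_per_tb
```

With 20 TB per day this gives an average of roughly 231 MB/s and 7.3 PB per year, consistent with the yearly figure in the notes; at 100 pounds per TB, one year of ingest would correspond to 730,000 pounds of storage under these assumptions.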

For the ingest of data from the FamilySearch project they needed to guarantee a throughput of 20 TB of data per day while keeping adequate data-processing procedures as required by OAIS and by the customer. The aim of the project was to identify the bottlenecks in ingesting large volumes of data.

Processes such as generating or checking hashes, identifying formats and extracting technical metadata usually require a large storage system, and with large volumes a fast one. In the FamilySearch project they want to ingest into SDB (content acquisition, content preparation, ingest, fixity check, content metadata integrity check, characterization, i.e. format identification and validation and extraction of technical metadata) at most 700 MB per second.

They looked at how to parallelize such a massive workflow efficiently while minimizing costs. According to their findings, parallelization makes it possible to work around the performance problems of tools such as DROID and JHOVE; contrary to their expectations, overall software performance was not a problem. The bigger problems lie in the HW, which has to be able to write fast enough.

In other words, the bottleneck was in the HW and in moving data from place to place rather than in the performance of the digital preservation tools.
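As an illustration of parallelizing one of the steps mentioned above, the following sketch runs fixity checks for several files concurrently. It is not Tessella's SDB workflow: the choice of SHA-256, of a thread pool, and the function names are assumptions made for this example.

```python
import hashlib
from concurrent.futures import ThreadPoolExecutor


def sha256_of(path, chunk_size=1024 * 1024):
    """Compute the SHA-256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def check_fixity(expected, max_workers=8):
    """Check many files in parallel.

    expected -- dict mapping file path to its expected hex digest
    Returns a dict mapping each path to True (match) or False (mismatch).
    """
    paths = list(expected)
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        actual = list(pool.map(sha256_of, paths))
    return {path: digest == expected[path] for path, digest in zip(paths, actual)}
```

Adding workers only helps while the storage can deliver data fast enough, which matches the observation above that the disks, not the tools, were the limiting factor.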

Benefit for NK

There is no need to fear the performance of SW such as DROID or JHOVE.

Ross King: Evolving domains, problems and solutions for LT DP
- information on projects such as SCAPE
- programme: http://cordis.europa.eu/fp7/ict/telearn-digicult/report-research-digital-preservation_en.pdf; Stephan Strodl, Petar Petrov, Andreas Rauber (all Vienna University of Technology, Austria). Nice: timeline for preservation projects, a whitepaper about the past of European DP
- funding spent on DP research is gradually growing; projects and funding alone will not solve anything
- ARCOMEM: archiving of web archives, socially driven web preservation model
- social web analysis
- archive enrichment
- ENSURE: evaluation between cost and value, automation of the preservation cycle; testbeds: healthcare, clinical trials, financial services
- SCAPE
- preservation planning and action workflows: how to make them scalable
- building an infrastructure for scalable preservation actions
- development of a policy-based preservation planning tool with automatic preservation watch
- 3 testbeds: WA, large-scale repositories, research data sets
- all projects will produce prototype SW
- digital lifecycle approach
- preservation planning plays a role in all these projects, together with virtualization
- slide with trends in DP over recent years
- Research on Digital Preservation within projects co-funded by the European Union in the ICT
- Ensure
- Scape
- Wf4Ever: http://www.wf4ever-project.org/about
- Timbus: SW is not enough; it focuses on the context of organizations; LTP is not only about objects but also about services, etc.
- Totem

Benefit for NK

Follow projects in the field of long-term preservation of digital data. Recent EU projects such as SCAPE will speed up the development of concrete tools for long-term preservation of digital data.

Record keeping in temporary command settings, Erik Borglund
- preservation of documentation on crisis situations arising from the activities of the police etc.
- how to capture the context: flipcharts, videos and notes can be kept; the context of analogue documents is not a problem, the problem lies with digital material and conversations
- the national archive should take care of this, but it only accepts paper documents or, for example, photographs from the scene
- question: is archiving file material the same as archiving the course of proceedings in digital form


Web archiving session

BnF
- 200 TB of web-archived data
- 15 million ARCs; they have to characterize and validate them, which is time-consuming
- they store them in a shared repository
- SPAR (the LTP system of the French national library) has a capacity of 16 PB
- they use JHOVE2 for characterization and are building a module for ARCs
- they do not want to characterize and validate the content of ARCs, only to identify formats

- PREMIS in METS would be too long, so they will record only metadata at the level of the information package (AIP) that are the same for the whole package; that is, a property is expressed once and then only the information about which files it applies to is added, instead of repeating the information for every file
- they created a special metadata format
- i.e. they can ask the LTP system "give me all information packages that contain format XY" etc.; there is no need to index the metadata of the contents, which would take a long time; they use the same approach for digitized books
- different DP policies and validation levels for different types of WA data: complete harvests vs thematic harvests
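The package-level approach described above (state a property once and list the files it applies to, then query packages by format) can be sketched as follows. This is only an illustration with made-up names and data; it does not reproduce BnF's metadata format or the SPAR query interface.

```python
from collections import defaultdict


def summarize_package(files):
    """Group files by format so each format is stated once per package.

    files -- list of (file_name, format_name) pairs
    Returns {format_name: [file names in that format]}.
    """
    summary = defaultdict(list)
    for name, fmt in files:
        summary[fmt].append(name)
    return dict(summary)


def packages_containing(packages, fmt):
    """Return sorted ids of packages whose summary mentions the given format.

    packages -- {package_id: summary as produced by summarize_package}
    """
    return sorted(pid for pid, summary in packages.items() if fmt in summary)
```

A query such as `packages_containing(packages, "GIF")` then only touches the compact per-package summaries rather than metadata for every harvested file.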

NL NZ
- 2 harvests, 20 TB in total
- they are dealing with metadata: how much metadata is a lot and how much is too little
- library policy says that as much metadata as possible must be stored; in terms of metadata size, however, that would be a problem
- for selective web harvesting they have a finished workflow (WCT); everything is catalogued

IA
- 16 billion URLs
- the oldest from 1996
- growth is 3 TB per day, 1 PB per year

Euan Cochrane, Dirk von Suchodoletz: Replicating Installed Application and Information Environments onto Emulated or Virtualized Hardware

- capturing and preserving a complete environment on emulated HW
- e.g. taking the desktop environment of a prime minister and storing it in an archive
- display problems
- computer forensics
- an option for preserving scientific data and records
- the whole thing is about how to replicate an HDD and run the environment on it in a virtual environment
- solution
- they pulled the HDDs out of several old PCs -> identify the HW requirements (HDD analysis -> automatic estimate of requirements; this is part of every PC environment) -> select emulation/virtualization SW (tool registry such as TOTEM from the KEEP project) -> convert the HDD into a disk image suitable for emulation -> try to boot the disk image on emulated HW -> add drivers
- problems with licences, personal data protection and authenticity (20 % of things change, colours etc.)
- QEMU, SPARC processor emulator

CO-FINANCED FROM EU STRUCTURAL FUNDS (EUROPEAN REGIONAL DEVELOPMENT FUND) THROUGH THE IOP


Klaus Rescher: Remote Emulation for Migration Services in a Distributed Preservation Framework

Using emulation as a migration tool.

Migration tools for certain formats are often unavailable. The digital object is placed into an emulated environment (a virtual machine), where it becomes visible inside the emulated system. It can then be opened in the original or another suitable application, saved in a different format, and stored back in the virtual machine.

Bram Lohman: Emulation as a Business Solution: the Emulation Framework (KEEP project)

- emulation framework: 7 emulators, 6 platforms (x86, Amiga etc.), 23 file formats
- a solution for managing emulation tools and setting up emulation processes
- an environment that contains emulators; an application or file loaded into it should run as it did in the original environment
- the environment also contains a tool that shows, based on PRONOM, what format a file has and what environment is needed to run it; that environment can then be prepared directly and the file run in it
- it loads the SW image from the OPF application database, which is being built

Geoffrey Brown: Developing Virtual CD-ROM Collections: The Voyager Company Publications

- publications of one particular publisher on CD, interactive applications for Mac; of the 200 titles published, only about 50 are available today
- emulation on today's systems
- HDD snapshot directly in the emulator, i.e. one click and very fast
- SheepShaver emulator

Evaluation of a large Danish migration project

Before 1998 they had no formats stipulated by law. Between 2005 and 2008 they introduced standards. The evaluation concerns the stipulated standards and the migration to them in the national archive. The evaluation was done for the funder. Between 2005 and 2008 they spent 30 person-years on migration with 10-15 people; 190 thousand USD was invested; total costs were 26 million USD.

In reality it is not much data: what they migrated was about 1777 GB.

Various parts of the archive: tapes, population data, data on CD-R, registries, and electronic data. They could not read all the files, especially those on tapes (5 different tape types). Some had to be rescued at great expense.


Total cost of producing the preservation standard: 10 man-years, 12 thousand USD, including the manual and implementation recommendations.

Pilot: project planning and management, and verification of the information packages.

The goal of the pilot was to obtain a better budget and project plan.

Some problematic data in old formats, such as old databases, need smart people doing repetitive, long-running work; they needed good knowledge management to make this efficient.

Migration method: they wrote requirements for the tool and a description of how manual migration should be done.

Data preparation (restructuring data and registering the metadata of the IPs) and preparation of documentation for the migrated IPs.

Software development: in-house development.

They needed 50 person-years for 1

Conclusions

Migrating standard data is cheaper, migrating from some standard tapes is cheaper, etc.

Most of the costs (80%) fell on non-standardized data, in producing the migration software. Developing a tool for migrating heterogeneous or non-standard data is the most expensive part.

Lessons learned: the old data had not been analysed sufficiently.

Project management was loose and they lost money.

Knowledge management: a good description of the old data and of all their types, generations, locations etc. does not exist at our institution and we will have difficulties with this; the migration of old data at the NK will be problematic.

Angela Dappert: robust migration workflow for offline media

- What is an archival object (nice slide): a CD is not an archival object; for them it is a hand-held carrier. A bit-stable object is better, since it can have backups etc.; the next step is the archival object, which carries additional metadata (logical preservation)
- A CD is not searchable, cannot be replicated easily, has a large manual overhead, and its rendering technology becomes obsolete very quickly
- Endangered Archives project: optical disks, CD-R, external HDs, tapes; 67 terabytes in total
- The offline hand-held carriers in the Endangered Archives project were very variable; they contained data with DRM, under copyright, and with a number of similar problems
- Options they chose between for each data source:
- Disk image: a single file that contains everything on the carrier
- Or extraction of selected files only
- How important is the carrier itself? We need to have some information about it; it may hold traces of deleted data, which we may want to keep. They made disk images of everything possible: hybrid DVDs, audio discs that also held data, etc.
- Which disk image format should they use? Not a single disk image format for all data; each special case gets its own disk image format
- They did it with robots (disk copying robots); large-scale disk copying robots sometimes could not be used, because they are good at producing CDs but not at ripping data from CDs


- They built their own application with disc stacks and some smaller robots, working either LIFO or FIFO; in the end they used FIFO, because LIFO had problems with the CD lifter

At KB they thought through a fairly complex workflow, how to describe it, etc.

Each robot had its own PC.

They had problems with a number of things; see the presentation.

They did not find good SW for managing images, only command-line tools, and non-technical staff would make a lot of mistakes with those.

It takes a lot of people before the content gets online.

Staff must be well trained and flexible, but also able to do tedious jobs: systematic, patient.

NOTE: important for the NK, where the transfer of data from discs will also have to be addressed, and has already been addressed.


KEEP project, Antonio Cuiffreda: Towards integrated migration environment

- Disk transfer tool: converts a disk into an image file. The result also contains additional metadata about the file system, the MD5 of the file, etc.
- KEEP is building a Transfer Tool Framework
- Magnetic media: disk transfer tools for floppy disks, commercial and open source
- Disk2FDI: commercial DOS tool; produces a very accurate floppy disk image, but it takes 1 hour and the result is very large, ten times bigger than the floppy disk itself. Tested on about 2260 disks; emulation was tested
- Catweasel: commercial tool, a PCI card; runs on Linux and Win XP, has a GUI. High error rate but faster; the image file quality was low
- Nibtools: free tool, G64 and D64, covers only the C64; DOS, Win, Linux, but command line only. Needs a Commodore disk drive and special cables. They tested a few disks; about half of them then did not work in the emulator
- Optical media: they used 5 transfer tools, all on the same CDs, DVDs and games
1 Alcohol 120%: commercial, can bypass DRM etc., supports Win systems. 12 of 13 worked; 2 MB per second
2 Daemon Tools: commercial, several image file types (ISO, MDS, MDF), supports Win; three of 13 did not work
3 CloneCD: commercial, uses IMG or ISO, bypasses SafeDisc 3 protection, supports Win, has a GUI; 11 of 13 were OK
4 Blindwrite: commercial, supports DVD, Blu-ray, WM, Xbox and other special discs. Generates ISO and some proprietary ISO image formats; one did not work; ripping speed (no figure given)
5 ImgBurn: does not read subchannel information into the image file (a film cannot be seeked etc.); open source; generates DVD, BIN, CUE, IMG; Win and Linux; 4 did not work; it is fast
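The note that a disk transfer tool stores the image file together with metadata such as its MD5 can be illustrated with a minimal Python sketch. It is not the KEEP tool: the function name, the record layout and the throwaway demo file are assumptions made for illustration only.

```python
import hashlib
import os
import tempfile


def describe_image(path, chunk_size=1024 * 1024):
    """Return name, size and MD5 of an image file, reading it in chunks."""
    md5 = hashlib.md5()
    size = 0
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            md5.update(chunk)
            size += len(chunk)
    return {"file": os.path.basename(path), "size_bytes": size, "md5": md5.hexdigest()}


# Demo on a small temporary file standing in for a real disk image.
with tempfile.NamedTemporaryFile(suffix=".img", delete=False) as tmp:
    tmp.write(b"hello")
    demo_path = tmp.name

record = describe_image(demo_path)
os.remove(demo_path)
print(record)
```

Reading in chunks keeps memory use constant, which matters once the input is a full CD or DVD image rather than a five-byte test file.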

Conclusions

For magnetic media, commercial and non-commercial tools do not differ in performance; Disk2FDI is accurate but very slow. KEEP will use NibTools.

Optical media: keep copy protection in mind. The tools perform similarly; there will always be errors in the images, between 30 and 10 percent. BlindWrite can handle game discs (Xbox etc.). KEEP will use ImgBurn because it is open source.

For complex images BlindWrite is better.
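The error range quoted in the conclusions can be checked against the per-tool counts noted for the optical media tests. The sketch below assumes that every tool was run on the same 13 discs, which the notes suggest but do not state for Blindwrite and ImgBurn.

```python
# Number of test discs that failed per tool, taken from the notes above.
# Assumption: each tool was tested on the same set of 13 discs.
TOTAL = 13
failed = {
    "Alcohol 120%": 1,
    "Daemon Tools": 3,
    "CloneCD": 2,
    "Blindwrite": 1,
    "ImgBurn": 4,
}

failure_rate = {tool: count / TOTAL for tool, count in failed.items()}

# Print the tools from most to least reliable.
for tool, rate in sorted(failure_rate.items(), key=lambda item: item[1]):
    print(f"{tool}: {rate:.0%} of discs failed")
```

Under that assumption the failure rates run from roughly 8% to roughly 31%, which is consistent with the "between 30 and 10 percent" figure in the conclusions.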

Benefit for the NK

Consider whether it would be appropriate for the NK to actually run a project migrating the content of CDs and DVDs to online media. The presentations give concrete experience with robotic processing and show the problems encountered in designing the workflow, choosing the ISO image type, etc.

BnF web archiving

They have three layers:

Harvest definition: collection

Harvest instance: crawling metadata


ARC files

A collection would be, for example, the selective "elections 2000" harvest, followed by the individual harvest instances and then the ARC files. They collect logs, config and reports. These are also stored in an ARC: a special metadata ARC for every crawling instance.

PREMIS

Object, agent, event

Objects: 1. ARC files and metadata ARCs; 2. harvest instances

Harvest event in PREMIS: event = creation of content files

Events: reports as extensions of the event (host report and harvest report)

Agents: administrator, SW, institutions, organizations that perform the harvest

ContainerMD

http://bibnum.bnf.fr/containerMD-v1/documentation/containerMD-v1.html

special metadata for material from the web archive

http://bibnum.bnf.fr/containerMD-v1

Different SLAs for different types of material and for different data from different harvests; a shared repository; they expect various benefits; skills for different formats need not be duplicated within the institution.

Next year there should also be a JHOVE2 module for WARC.


Memento: meta search engine

Benefit for the NK

Their web archiving model could be used at the NK.

Cost models: Danish national library + TU Wien

- Stephan Strodl, TU Wien: they have their own cost model, but apparently only for small-scale automated preservation action costs
- Danish national library: they built their own model, which is meant to be universal and usable anywhere
- They measured the cost of submission according to the PAIMAS standard
- When calculating costs they use OAIS and PAIMAS: they map activities onto these models and estimate process costs accordingly
- costmodelfordigitalpreservation.dk

Benefit for the NK

Relevant to project 0136: options for estimating the costs of long-term storage were discussed.

Meet RODA: a Full-Fledged Digital Repository for Long-Term Preservation

- Originally a project of the Portuguese national archive, which we have been following for several years. RODA is now supported by an independent company, which also partly continues its development. So far RODA supports only the archival metadata format (EAD), but further development should also include library formats
- RODA is now part of the SCAPE project, where the system can be developed further and scaled for use in mass production
- http://redmine.keep.pt/projects/roda-public

Benefit for the NK

Follow further development, possibly also for the INCAD + KNAV projects; for LTP development at smaller institutions this could be an interesting alternative in the future.

Report from a foreign business trip

Name and surname of the participant: Mgr. Andrea Fojt (AF)

Workplace according to the organizational structure: 15 Department of Long-Term Preservation of Digital Data (ODODD)
Workplace assignment: 152 Department of Digital Repository Content Management
Reason for the trip: Participation in the conference Aligning National Approaches to Digital Preservation
Place, city: Tallinn
Place, country: Estonia
Date (from-to): 22-26 May 2011

Detailed schedule:
Monday 23 May: registration at the conference, guided tour of the National Library; Keynote Address by Laura Campbell, Library of Congress, USA; Panel 1: Technical Alignment; Panel 2: Organizational Alignment
Tuesday 24 May: Keynote Address by Gunnar Sahlin, National Library of Sweden; Panel 3: Standards Alignment; Panel 4: Legal Alignment; Breakout Sessions for panels 3 & 4
Wednesday 25 May: Panel 5: Education Alignment; Panel 6: Economic Alignment; Breakout Sessions for panels 5 & 6; Synthesis; Closing remarks

Travelling companions from the NK: PhDr. Bohdana Stoklasová (BS), Ing. Tomáš Svoboda (TS)

Funding: IOP NDK

Objectives of the trip: Attendance at a conference with international participation; gaining contacts in the area of long-term preservation of digital documents and electronic legal deposit; a more detailed insight into the issues of long-term preservation of digital documents (especially) in national libraries

Fulfilment of the objectives (specifically): The conclusions of the conference Aligning National Approaches to Digital Preservation largely mirror the conclusions of the workshop The Future of the Past: Shaping new visions for EU research in digital preservation (report available at http://cordis.europa.eu/fp7/ict/telearn-digicult/future-of-the-past_en.pdf), e.g. regarding the missing economic model for the commercial sector, which would treat long-term preservation as an integral part of all its processes. Contact was established with staff of the national libraries of Estonia and Finland (staff for web archiving and for long-term preservation in general).

Program and further details: The main goal of the conference was to align national approaches to long-term preservation of digital documents across all areas, from technical, organizational and educational to standardization, economic and financial.

Materials brought back: conference programme, leaflets of exhibiting companies (Tessella, Equella, Guardtime) + further materials, notes

Date of report submission: 8 June 2011
Signature of the submitter:

Signature of the superior:

Posted on the Intranet:
Received by the international department:

Appendix to this report: Conference notes in English

Appendix: Conference notes in English

Exploring What We Can Do Together: Strategic Alliance for International Collaboration, Laura Campbell

- 185 digital preservation partners in more than 25 countries (education, research, LAM)
- strategic goals: National Content Stewardship Network (national digital collection, technical architecture, public policy, outreach)
- NDIIPP Content Domain Map: a mind map of geospatial, audiovisual, image & text, and web content
- then & now: cognitive surplus vs. digital libraries/digital preservation
- solution framework: actively working together, special interest groups, establishing a common index, international digital collection (freely available)

PANEL 1: Technical Alignment (The role of testing), Prof. Dr. Michael Seadle (Panel Head)

- to collaborate on requiring and implementing rigorous and independent tests

DNB Contribution to the Tallinn Alignment, Sabine Schrimpf

- key theme is infrastructure
- network of hardware and software that permits the operation of application SW
- question of interoperability is crucial (standards, technical specifications)
- SW elements
- components of the DP infrastructure were compared to the pallets at railroads
  o Source: PARSE.Insight Roadmap 2020
- persistent ID resolvers, certification process
- kopal (ingest: koLibRI)
- nestor (German Network of Expertise)
- DP4Lib: Digital Preservation as a Service; reduce dependency between components
  o redundant storage at different locations
  o koLibRI modules
- LuKII: set up as an economical LOCKSS network in Germany
- SHAMAN
- APARSEN: wants to bring coherence and cohesion to digital preservation research
  o trends in DP research projects
  o modular DP systems
  o distributed as SOA
  o elimination of technology dependencies

EDINA: The UK LOCKSS Alliance, Adam Russbridge

- EDINA offers underlying technical support & coordination
- threats to digital stewardship
  o failure (media, HW, SW, network, format obsolescence, natural disaster, economic/organizational failure)
  o attack (insider/outsider)
  o operator error
- source: Requirement to Digital Preservation
- projects PEPRS and PECAN help identify coverage and requirements for DP

Public testing, Michael Seadle

- traditional physical archiving relies heavily on trusted institutions
- distrust, not trust, needs to be the basis of digital archiving; testing therefore plays a key role
- goals of testing: demonstrate functionality, reveal weaknesses, provide data for planning improvements
- key issues for testing: integrity, authenticity (can the origin or genuineness be shown), usability (can migration/emulation be demonstrated), access, financial integrity
- DrWho (drwho1.com)
- bit stream testing is the most important; authenticity and usability may be impaired
  o the type of storage media, the number of copies + the frequency of checking and replacement get us to the relevant results
  o no reliable metrics exist, however (what is an acceptable loss, etc.)
- without well documented, peer reviewed, publicly available test results, librarians are buying archiving systems on faith
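As an illustration of bit stream testing over several copies, the following Python sketch compares checksums of replicas and flags the ones that disagree with the majority. The majority vote, the site names and the in-memory byte strings are assumptions chosen for brevity; they are not taken from the talk, and a real audit would read the copies from storage.

```python
import hashlib
from collections import Counter


def sha256(data: bytes) -> str:
    """Hex SHA-256 digest of a byte string."""
    return hashlib.sha256(data).hexdigest()


def audit_copies(copies):
    """Compare digests of all copies; return the majority digest and the damaged copies."""
    digests = {name: sha256(data) for name, data in copies.items()}
    majority, votes = Counter(digests.values()).most_common(1)[0]
    if votes <= len(copies) / 2:
        raise ValueError("no majority among copies; cannot decide which is intact")
    damaged = sorted(name for name, digest in digests.items() if digest != majority)
    return majority, damaged


# Three hypothetical replicas, one of which has a flipped character.
copies = {"site-a": b"payload", "site-b": b"payload", "site-c": b"payl0ad"}
majority, damaged = audit_copies(copies)
print(damaged)
```

How often such an audit runs and how many copies it covers are exactly the parameters the talk names as decisive, and for which no agreed metrics exist yet.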

Presentation without a title, Andy Rauber

- evaluation vs. testing vs. benchmarking
- DP testing: evaluation rather than testing, and far from benchmarking (few tests, but not near a definition of benchmarks)
  o existing evaluations are not repeatable
  o focus on the simple things
  o building the frameworks before having clear test scenarios
- necessary to move towards comparative benchmarking
- what is needed: a commitment that we want a culture of benchmarking and comparative evaluation; an understanding of what we want to benchmark; benchmark data + ground truth; measurement scales and measures that remain constant; knowledge bases to collect these

Organizing digital preservation on an international level, Michelle Gallinger (NDIIPP)

- focus on a national DP agenda
- community driven, action oriented (National Digital Stewardship Alliance)
  o present a distributed national digital collection for the benefit of citizens

The European Research Arena, David Giaretta

- technologies: GEANT, EGEE/EGI
- EU research projects: TIMBUS, BLOG4EVER, SCAPE, ENSURE, APARSEN, ARCOMEM, WF4EVER
  o SCIDIP-ES (2011-2014)
- Alliance for Permanent Access (APA) formed as a legal entity 3 years ago
  o opportunities for networking
- ISO 16363 Audit and Certification of Trustworthy Digital Repositories
- ISO 16919 Requirements for Bodies Providing Audit and Certification

Observation from the MetaArchive Cooperative program, Martin Halbert

- distributed DP programs differ from other programs
  o replication of content, distribution of these replicated copies to distinct geographical locations, and network organization to connect these replicated copies
- MetaArchive established in 2003, funded by NDIIPP
  o seeks to foster broader awareness of digital preservation issues
- IIPC members are all institutions that focus on WA
  o 39 members: national and university libraries + other organizations (Internet Archive)
  o ISO standard WARC format for web archives + Heritrix and NutchWAX
  o growing membership (Africa & South America)

DAY 2

Keynote address: International and National Collaboration in the digital age, Gunnar Sahlin

- 2012: a new law for e-deposit
- Samsök (search together) in 2005 (upgrade, new system)
- Swepub and long-term preservation
- Consortium of the Swedish research libraries for licensing e-journals and databases (ICOLC)
- Open Access and e-publishing (all universities have their repositories for e-pubs)
- NL: aggregator for Europeana, TEL, Apres, Athena, EUscreen
  o common system for the preservation of digital materials
  o common search portal for materials from the Swedish National Library and Swedish National Archive

Raivo Ruusalepp

- standards: RAC, DSA, CIDOC (CRM), PAIMAS, ISAD(G), DDI
- use of information security standards for digital preservation
- information security: administrative and technical (physical = data security vs. IT = communication)
- company implemented security measurement with typical cyber crime scenarios
- survey of security
  o provision for information security in national legislation and development plans (12 of the respondents: ISO 27000 series; only 2 formal audits, the rest are looking into it; ½ of the respondents do not use standards or formal measurements to control information security)
  o IT & disaster plan: 65 (data recovery from the off-site location tested: 0)
- alignment
  o better use of community standards for information security and preservation
  o agreement on security requirements

Standards-based approach to preservation planning, Matthew Woollard

- ISO 27001: very expensive implementation, 100 000 pounds
- Basic: Data Seal of Approval Guidelines (helps understand your business better)
- Audit and Certification of Trustworthy Digital Repositories
- Memorandum of Understanding to Create a European Framework
- ISO 16363 external; DSA or ISO 16363 self-audit

Best Practices & Standards, Bram van der Werf

- self-assessment: trust
- audits/certification: trust
- ISO 30300 (draft) Record Management

PANEL 4: Legal Alignment

Legal deposit & Web Archiving, Adrienne Muir

- legal deposit provisions: purpose, scope, deposit mechanisms, roles & responsibilities & liability, access provisions, sanctions
- implementation: definitions, scope (offline/online, freely available/paywall), technology neutral/incremental
- legal deposit vs. voluntary (interim/hybrid approach, model agreements and licenses, flexibility)
- other legal issues: intellectual property rights, preservation, access, unlawful material, privacy/data protection
- voluntary approaches have disadvantages but may be necessary and can be useful

Breakout Session

- standards bring along alignment by themselves, but only if you use them to the full, not halfway
- depends on community (users): enforceable or voluntary compliance (standards)
- next step for standard alignment
  o corpora as benchmarks
  o export/import completeness
  o educational standards
  o validation tools
  o accredited training courses to accredit auditors
  o framework standards

DAY 3

PANEL 5: Educational Alignment

- key elements of the DCC Curation Lifecycle Model
- Framework for the Education Alignment (USA: grad programs for digital preservation, new models for grad programs, internship programs (diverse knowledge), workshops)
  o sharing tools
  o national programs
- related issues to digital preservation
  o nature of costs and business models
  o strategies for selection & appraisal
  o ground roles and responsibilities
  o effectiveness and demand for services
- focus of the panel: factors influencing the actual sustainability of a digital archive
- 2 considerations: collaboration + user demand
- challenges + gaps: span national boundaries, public + private funding, education, exportation, DP certification, competition, funding gaps, policy, selection criteria, roles & responsibilities, standards
- Magazzini Digitali: e-legal deposit in Italy
- PADI: a failed project (discontinued in 2010)

Presentation without a title, Neil Grindley

- how much does it cost to manage information; what institutional financial strategies are required to facilitate effective preservation; what general economic frameworks are required to enable information to persist and be accessible
- JISC 2010: Infrastructure for Education and Research Programme
- archival storage and preservation activities constitute a very small proportion of the overall costs: 15, vs. 31 for access and 55 for outreach, acquisition and ingest
  o approx. 333 Euros for a set of 1000 records
- KRDS2 p. 83: future tool development supporting automation of ingest

Sustainable Preservation in North America: ADPNet & Friends, Aaron Trehub

- solution: distributed digital preservation (in at least 3 copies vs. LOCKSS 6 copies)
- DDP + LOCKSS = PLN; open SW developed at Stanford
- MetaArchive, COPPUL (Canada), ADPNet (www.adpn.org)


ANDS, Ross Wilkinson

- data centres in Australia, at least 3, for different areas of life
- ANDS has existed for almost 3 years, funded by the Australian government
- a huge amount of the data will never be used or read by a human, only by automated mining processes
- research data must be stored and protected, because it may no longer be possible to recreate them; they must be kept so that they can be reused, so that new conclusions can be drawn from them, and so that scientists have them at their disposal
- this has to be done in cooperation; it cannot be done by a single institution alone
- who deals with the storage of scientific data in the Czech Republic? Academy of Sciences, CESNET
- similar data centres also exist in Great Britain

Rob Sharpe: Considerations for High Throughput Digital Preservation

Presentation by the company Tessella: their performance testing of ingest into SDB at FamilySearch.

- SDB has been developed since 2002, when the first customer was the National Archives UK
- new customer: UK Parliament
- test with FamilySearch
- 20 TB ingest per day; scanned materials; workflow with antivirus, characterization (PRONOM, JHOVE) etc.
- 1 package is roughly 1 GB; 20 thousand packages per day
- 2 servers, Dell PowerEdge R710; total price max. 20,000 pounds
- it turned out that the limiting factor is the read speed of the disks on which the data are stored at the start of ingest; they therefore needed 130 parallel disks (50 thousand pounds)
- the data are stored on tapes, which is also slow; they therefore needed 8 parallel tape writes (30 thousand pounds)
- storage costs 100 pounds per TB
- 7.3 PB per year
- conclusion: writing and reading are slow; tools such as JHOVE and PRONOM are fast enough; storage costs also turned out to be high

To ingest data from the FamilySearch project they needed to ensure a throughput of 20 TB of data per day while keeping adequate procedures for processing the data according to the requirements of OAIS and of the client. The project was about identifying the bottlenecks in ingesting large volumes of data.

Processes such as generating hashes or checking them, identifying formats and extracting technical metadata usually require a fast storage system when volumes are large. In the FamilySearch project they want to ingest into SDB (content acquisition, content preparation, ingest: fixity check, content/metadata integrity check, characterization, i.e. identification and validation of formats and extraction of technical metadata) at most 700 MB per second.

They looked at how to parallelize such a massive workflow efficiently while minimizing costs. According to their findings, parallelization makes it possible to work around performance problems of tools such as DROID and JHOVE; overall, and contrary to their expectations, software performance was not a problem. The bigger problems are in the HW, which has to be able to write fast enough.

In other words, the bottleneck was in the HW and in moving data from place to place rather than in the performance of the digital preservation tools.
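The throughput figures above can be cross-checked with simple arithmetic. The sketch below assumes decimal units (1 TB = 10^12 bytes) and a perfectly even load over 24 hours; neither assumption is stated in the notes.

```python
# Decimal units are an assumption; the notes do not say which convention was meant.
TB = 10**12
GB = 10**9
MB = 10**6
PB = 10**15

daily_volume = 20 * TB          # ingest target per day
package_size = 1 * GB           # roughly one package
seconds_per_day = 24 * 60 * 60

sustained_mb_per_s = daily_volume / seconds_per_day / MB
packages_per_day = daily_volume // package_size
yearly_pb = daily_volume * 365 / PB
peak_to_average = 700 / sustained_mb_per_s  # 700 MB/s is the stated maximum

print(f"sustained rate: {sustained_mb_per_s:.0f} MB/s")
print(f"packages per day: {packages_per_day}")
print(f"yearly volume: {yearly_pb:.1f} PB")
print(f"peak / average: {peak_to_average:.1f}")
```

Under these assumptions 20 TB per day corresponds to 20 thousand packages per day and about 7.3 PB per year, and the stated maximum of 700 MB per second is roughly three times the average rate, which leaves room for uneven load.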

Benefit for the NK


There is no need to fear the performance of SW such as DROID or JHOVE.

Ross King: Evolving domains, problems and solutions for LT DP

- info on projects such as SCAPE
- programme: http://cordis.europa.eu/fp7/ict/telearn-digicult/report-research-digital-preservation_en.pdf (Stephan Strodl, Petar Petrov, Andreas Rauber, all Vienna University of Technology, Austria). A nice timeline for preservation projects; a whitepaper about the past of European DP
- Funding spent on DP research is gradually growing. Projects and funding alone will not solve anything
- ARCOMEM: web archiving, socially driven web preservation model
- social web analysis
- archive enrichment
- ENSURE: evaluation between cost and value, automation of the preservation cycle; testbeds: healthcare, clinical trials, financial services
- SCAPE
- preservation planning and action workflows: how to make them scalable
- building an infrastructure for scalable preservation actions
- development of a policy-based preservation planning tool with automatic preservation watch
- 3 testbeds: WA, large-scale repositories, research data sets
- all the projects will produce prototype SW
- digital lifecycle approach
- preservation planning plays a role in all of these projects, together with virtualization
- slide with trends in DP over recent years
- Research on Digital Preservation within projects co-funded by the European Union in the ICT
- Ensure
- Scape
- Wf4Ever: http://www.wf4ever-project.org/about
- Timbus: SW is not enough; it focuses on the context of organizations; LTP is not only about objects but also about services etc.
- Totem

Benefit for the NK

Follow projects in the area of long-term preservation of digital data. The latest EU projects such as SCAPE will speed up the development of concrete tools for long-term preservation of digital data.

Record keeping in temporary command settings, Erik Borglund

Preservation of documentation of crisis situations arising from the activities of the police and similar bodies. How to capture the context: flipcharts, videos and minutes can be preserved. The context of analogue documents is not a problem; the problem lies with digital material and conversations. The national archive should take care of this, but it accepts only paper documents or, for example, photos from the place of the proceedings. The question of archiving file material is the same as that of archiving the course of the proceedings in digital form.


webarchiving session

BnF 200TB webarchivovanyacutech dat15 milion ARC musiacute je charakterizovat validovat asov naacutero neacuteuklaacutedajiacute v shared repositorySPAR (LTP systeacutem Francouzskeacute NK) maacute kapacitu 16PBpouiacutevajiacute jhove2 na charakterizaci vytvaacute ejiacute modul na arcynecht jiacute d lat charakterizaci a validaci pro obsah arc jen identifikaci formaacutet

PREMIS v METSu by byl p iacuteli dlouhyacute budou tedy zapisovat jen metadata na uacuterovniinforma niacuteho baliacuteku (AIP) kt jsou stejnaacute pro celyacute baliacutek resp 1 vlastnost se vyjaacuted iacute a pak se ktomu jen p idaacute informace o tom kt fily tomu odpoviacutedajiacute namiacutesto opakovaacuteniacute teacute infromace prokadyacute filevytvo ili speciaacutelniacute metadatovyacute formaacutettj jsou schopni se LTP systeacutemu zeptat dej mi vechny informa niacute baliacute ky ktereacute obsahujiacute formaacutetXY apod neniacute ale t eba indexovat metadata t ch obsah to by trvalo dlouho stejnyacute p iacutestupmajiacute I pro digitalizovaneacute knihyr zneacute DP policy a uacuterovn validace pro r zneacute typy wa dat kompletniacute sklizn vs teacutematickeacute sklizn

NL NZ
- 2 harvests, 20 TB in total
- they are working on metadata: how much metadata is a lot and how much is too little?
- library policy says that as much metadata as possible must be stored, but that would be a problem in terms of metadata volume
- for the selective web archive they have a finished workflow in WCT; everything is catalogued

IA
- 16 billion URLs, the oldest from 1996
- growth is 3 TB per day, 1 PB per year
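The two growth figures quoted for IA can be checked against each other with simple arithmetic. The sketch below assumes decimal units (1 PB = 1000 TB) and a 365-day year; both are assumptions, since the notes do not say which units were meant.

```python
TB_PER_DAY = 3        # daily growth quoted in the notes
DAYS_PER_YEAR = 365   # assumption: non-leap year
TB_PER_PB = 1000      # assumption: decimal units

tb_per_year = TB_PER_DAY * DAYS_PER_YEAR
pb_per_year = tb_per_year / TB_PER_PB

print(f"{tb_per_year} TB per year, about {pb_per_year:.2f} PB per year")
```

Under these assumptions the daily figure works out to roughly the 1 PB per year given in the notes.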

Euan Cochrane, Dirk von Suchodoletz: Replicating Installed Application and Information Environments onto Emulated or Virtualized Hardware

- capturing and preserving a complete environment on emulated HW
- e.g. take the desktop environment of the prime minister and store it in an archive
- display problems
- computer forensics
- an option for preserving scientific data and records
- the whole thing is about how to replicate an HDD and run the environment stored on it in a virtual environment
- solution: they pulled the HDDs out of several old PCs > identify the HW requirements (HDD analysis > the requirements are estimated automatically, since they are part of every PC environment) > choose emulation/virtualization SW (tool registry such as TOTEM from the KEEP project) > convert the HDD to a disk image suitable for emulation > try to boot the disk image on the emulated HW > add drivers
- problems with licences, personal data protection and authenticity (in 20 [%] of cases the colours change, etc.)
- QEMU, SPARC processor emulator


Klaus Rechert: Remote Emulation for Migration Services in a Distributed Preservation Framework

Use of emulation as a migration tool. For certain formats no migration tools are available. The digital object is placed into an emulated environment (a virtual machine), where it becomes visible inside the emulated system; it can then be opened in the original or another suitable application, saved in a different format and stored again in the virtual machine.

Bram Lohman: Emulation as a Business Solution: the Emulation Framework (KEEP project)

- emulation framework: 7 emulators, 6 platforms (x86, Amiga and others), 23 file formats
- a solution for managing emulation tools and setting up emulation processes
- an environment that contains emulators; if an application or a file is loaded into it, it should run as it did in the original environment
- the environment also contains a tool that, for a given file, shows its format and the environment needed to run it, based on PRONOM; the environment can then be prepared directly and the file run in it
- it loads the SW image from a database of applications that OPF is building

Geoffrey Brown: Developing Virtual CD-ROM Collections: The Voyager Company Publications

- publications of one particular publisher on CD, interactive applications for the Mac; of the 200 titles published, only about 50 are available today
- emulation on present-day systems
- HDD snapshot directly in the emulator, i.e. a single click and very fast
- SheepShaver emulator

Evaluation of the Danish large migration project

Before 1998 the formats were not prescribed by law. Between 2005 and 2008 standards were introduced. The evaluation concerns these standards and the migration to them at the national archive. The evaluation was done for the body that funded the work. Between 2005 and 2008 they spent 30 person-years on migration, with 10-15 people working on it; 190 thousand USD was invested, total costs 26 million USD.

It is not much data; what they actually migrated was about 1777 GB.

Various parts of the archive: tapes, population data, data on CD-R, registries and data filled in electronically. They could not read all files, especially those on tape. There were 5 different types of tape. Some had to be rescued at great expense.


Total cost of producing the preservation standard: 10 man-years, 12 thousand USD, including the manual and implementation recommendations.

Pilot: project planning and management, and verification of the information packages.

The goal of the pilot was to obtain a better budget and project plan.

Some problematic data in old formats, such as old databases, need clever people doing repetitive work over a long time; good knowledge management was needed to make this efficient.

Migration method: they wrote requirements for the tool and a description of how manual migration should be done.

Data preparation (restructuring the data and registering the metadata of the IPs) and preparation of the documentation for the migrated IPs.

Software development: in-house development.

They needed 50 person-years for 1 [unit missing in the source].

Conclusions

Migration of standard data is cheaper; migration from some standard tapes is cheaper, etc.

Most of the costs, 80 [%], went on non-standardized data, in producing the migration software. Developing a tool for migrating heterogeneous or non-standard data is the most expensive part.

What they learned: the old data had not been analysed sufficiently.

Project management was loose and they lost money.

Knowledge management: a good description of old data and of all their types, generations, locations, etc. does not exist at our institution and this will cause difficulties; migration of old data at NK will be problematic.

Angela Dappert: robust migration workflow for offline media

- What is an archival object? Nice slide: a CD is not an archival object; for them it is a hand-held carrier. Better is a bit-stable object, which can have a backup etc., leading to an archival object with additional metadata (logical preservation)
- A CD is not searchable, cannot easily be replicated, carries a large manual overhead, and its rendering technology becomes obsolete very quickly
- Endangered Archives project: optical disks, CD-R, external HDs, tapes, 67 terabytes in total
- The offline hand-held carriers in the Endangered Archives project were very varied; they contained data with DRM, under copyright, and a range of similar problems
- Options they chose between for each data source:
- Disk image: a single file containing everything on the carrier
- Or extraction of only some files
- How important is the carrier itself? We need some information about it; it may hold traces of deleted data that we may want to keep. They made disk images of everything possible: hybrid DVDs, audio discs that also held data, etc.
- Which disk image should they use? Not a single disk image format for all data; a specific disk image format for each special case
- They did it with robots (disk copying robots); in some cases large-scale disk copying robots could not be used, because they are good at producing CDs but not at ripping data from CDs


- They built their own application with disk stacks and some smaller robots; the robots used LIFO or FIFO, and in the end they used FIFO, because LIFO had problems with the CD lifter

At the KB they thought through a fairly complex workflow, how to describe it, etc.

There was a PC at every robot.

They had problems with a range of things (see the presentation).

They did not find good SW for managing the images, only command-line tools, and non-technical staff would make a lot of mistakes with those.

It takes a lot of people before the content gets online.

They have to be well trained and flexible, but also able to do tedious jobs: systematic, patient.

NOTE: important for NK, where the transfer of data from disks will also have to be dealt with, and already has been.


KEEP project, Antonio Ciuffreda: towards an integrated migration environment

- Disk transfer tool: converts a disk into an image file. It also holds further metadata about the file system, the MD5 of the file, etc.

- KEEP is producing a Transfer Tool Framework
- Magnetic media: disk transfer tools for floppy disks, commercial and open source
- Disk2FDI: commercial DOS tool; very accurate image of the floppy disk, but it takes 1 hour and the result is very large, ten times bigger than the floppy disk itself. They tested about 2260 disks and tested emulation
- Catweasel: commercial tool, a PCI card; runs on Linux and Win XP, has a GUI. High error rate, but faster; the quality of the image file was low
- Nibtools: free tool, G64 and D64, covers only the C64; DOS, Win, Linux, but command line only. Needs a Commodore disk drive and special cables. They tested a few disks; about half of them then did not work in the emulator
- Optical media: they used 5 transfer tools, all on the same CDs, DVDs and games
1. Alcohol 120%: commercial, can bypass DRM etc., supports Win systems. 12 of 13 worked; 2 MB per second
2. DAEMON Tools: commercial, several types of image file (ISO, MDS, MDF), supports Win; three of 13 did not work
3. CloneCD: commercial, uses IMG or ISO, bypasses SafeDisc 3 protection, supports Win, has a GUI; 11 of 13 were OK
4. BlindWrite: commercial, supports DVD, Blu-ray, WM, Xbox and other special disks. Generates ISO and some proprietary ISO image formats; one did not work; transfer speed [value missing in the source]
5. ImgBurn: does not read subchannel information into the image file (a film cannot be fast-forwarded, etc.); it is open source, generates DVD, BIN, CUE, IMG; Win and Linux; 4 did not work; it is fast
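The notes above say that the disk transfer tool stores metadata such as the file system and an MD5 checksum alongside the image file. A minimal Python sketch of that idea follows. It is not the KEEP tool itself: the function names, the record fields and the `FAT12` label are hypothetical, and a 1 KiB file of zero bytes stands in for a real floppy image.

```python
import hashlib
import json
import os
import tempfile


def md5_of_file(path, chunk_size=1024 * 1024):
    """Compute the MD5 of a file in chunks, so large images fit in memory."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def describe_image(path, file_system):
    """Build a small metadata record for a disk image (hypothetical fields)."""
    return {
        "image_file": os.path.basename(path),
        "size_bytes": os.path.getsize(path),
        "file_system": file_system,  # supplied by the caller, not detected
        "md5": md5_of_file(path),
    }


# Stand-in for a real image: 1 KiB of zero bytes in a temporary directory.
with tempfile.TemporaryDirectory() as tmp:
    image_path = os.path.join(tmp, "floppy.img")
    with open(image_path, "wb") as handle:
        handle.write(b"\x00" * 1024)
    record = describe_image(image_path, file_system="FAT12")

print(json.dumps(record, indent=2))
```

Storing the checksum next to the image lets a later step confirm that the image has not changed since the transfer.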

Conclusions

For magnetic media, commercial and non-commercial tools do not differ in performance; Disk2FDI is accurate but very slow. KEEP will use NibTools.

Optical media: keep copy protection in mind; the tools perform similarly; there will always be errors in the images, between 30 and 10 percent; BlindWrite handles game disks (Xbox etc.). KEEP will use ImgBurn because it is open source.

For complex images BlindWrite is better.

Benefit for NK

Consider whether NK should in fact run a project to migrate the content of CDs and DVDs to online media. The speakers presented concrete experience with robotic processing and showed what problems they had in designing the workflow, choosing the type of ISO image, etc.

BnF web archiving

They have three layers:

Harvest definition: collection

Harvest instance: crawling metadata


ARC files

A collection would be, for example, the selective harvest "elections 2000", then the individual harvest instances, and then the ARC files. They collect logs, configuration and reports. These are also stored in ARC: a special metadata ARC for each crawling instance.

PREMIS

Object, agent, event

Objects: 1. ARC files and metadata ARCs, 2. harvest instances

Harvest event in PREMIS: an event of creation of content files

Events: reports as an extension of the event (host report and harvest report)

Agents: administrator, SW, institutions, organizations that perform the harvest

containerMD

http://bibnum.bnf.fr/containerMD-v1/documentation/containerMD-v1.html

Special metadata for items from the web archive

http://bibnum.bnf.fr/containerMD-v1

Different SLAs for different types of material and for data from different harvests. From the shared repository they expect various benefits: skills for different formats do not have to be duplicated within the institution.

Next year a JHOVE2 module for WARC should also exist.


Memento: meta search

Benefit for NK

Their web archiving model could be used at NK.

Cost models: Danish national library + TU Wien
- Stephan Strodl, TU Wien: they have their own cost model, but only small-scale; the cost of automated preservation actions seems
- Danish national library: they built their own model, which is meant to be universal and usable anywhere
- They measured the cost of submission according to the PAIMAS standard
- When calculating costs they use OAIS and PAIMAS, map activities onto these models and then estimate process costs accordingly
- costmodelfordigitalpreservation.dk

Benefit for NK

Relevant to project 0136, which dealt with ways of estimating the cost of long-term storage.

Meet RODA, a Full-Fledged Digital Repository for Long-Term Preservation

- Originally a project of the Portuguese national archive, which we have been following for several years. The RODA system is now supported, and partly also further developed, by an independent company. So far RODA supports only the archival metadata format (EAD), but further development should also cover library formats
- RODA is now part of the SCAPE project, where it will be possible to develop the system further and scale it for use in mass production
- http://redmine.keep.pt/projects/roda-public

Benefit for NK

Follow further development, possibly also for the INCAD + KNAV projects; for developing LTP for smaller institutions this could become an interesting alternative.

Report on a business trip abroad

Name of the participant: Mgr. Andrea Fojtů (AF)

Workplace according to the organizational structure: 1.5 Department of Long-Term Preservation of Digital Data (ODODD)
Assigned unit: 1.5.2 Department of Digital Repository Content Management
Purpose of the trip: participation in the conference Aligning National Approaches to Digital Preservation
Place (city): Tallinn
Place (country): Estonia
Date (from-to): 22-26 May 2011

Detailed schedule:

Monday 23 May: conference registration, guided tour of the National Library
Keynote Address by Laura Campbell, Library of Congress, USA
Panel 1: Technical Alignment
Panel 2: Organizational Alignment

Tuesday 24 May: Keynote Address by Gunnar Sahlin, National Library of Sweden
Panel 3: Standards Alignment
Panel 4: Legal Alignment
Breakout Sessions for panels 3 & 4

Wednesday 25 May:
Panel 5: Education Alignment
Panel 6: Economic Alignment
Breakout Sessions for panels 5 & 6
Synthesis
Closing remarks

Fellow travellers from NK: PhDr. Bohdana Stoklasová (BS), Ing. Tomáš Svoboda (TS)

Funding: IOP NDK
Aims of the trip: attendance at a conference with international participation; making contacts in the field of long-term preservation of digital documents and electronic legal deposit; a more detailed insight into long-term preservation of digital documents, (mainly) in national libraries

Fulfilment of the aims (specifically): The conclusions of the conference Aligning National Approaches to Digital Preservation largely repeat the conclusions of the workshop The Future of the Past: Shaping new visions for EU research in digital preservation (report available at http://cordis.europa.eu/fp7/ict/telearn-digicult/future-of-the-past_en.pdf), for example regarding the missing economic model for the commercial sector, one that would treat long-term preservation as an integral part of all its processes. Contact was established with staff of the national libraries of Estonia and Finland (staff working on web archiving and on long-term preservation in general).

Programme and further details: The main aim of the conference was to align national approaches to long-term preservation of digital documents across all areas, from technical, organizational and educational to standardization, economic and financial.

Materials brought back: conference programme, leaflets of exhibiting companies (Tessella, Equella, Guardtime) + other materials, notes

Date of submission of the report: 8 June 2011
Signature of the person submitting the report:

Signature of the superior:

Posted on the Intranet:
Received by the international department:

Annex to this report: Notes from the conference in English

Annex: Notes from the conference in English

Exploring What We Can Do Together: Strategic Alliance for International Collaboration, Laura Campbell

- 185 digital preservation partners in more than 25 countries (education, research, LAM)
- strategic goals: National Content Stewardship Network (national digital collection, technical architecture, public policy, outreach)
- NDIIPP Content Domain Map: a mind map of geospatial, audiovisual, image & text and web content
- then & now: cognitive surplus vs digital libraries/digital preservation
- solution framework: actively working together, special interest groups, establishing a common index, international digital collection (freely available)

PANEL 1: Technical Alignment (The role of testing), Prof. Dr. Michael Seadle (Panel Head)

to collaborate on requiring and implementing rigorous and independent tests

DNB Contribution to the Tallinn Alignment, Sabine Schrimpf

- key theme is infrastructure: the network of hardware and software that permits the operation of application SW
- the question of interoperability is crucial (standards, technical specifications)
- SW elements
- components of the DP infrastructure were compared to the pallets at railroads
  o source: PARSE.Insight Roadmap 2020
- persistent ID resolvers, certification process
- kopal (ingest: koLibRI)
- nestor (German Network of Expertise)
- DP4Lib: Digital Preservation as a Service; reduce dependency between components
  o redundant storage at different locations
  o koLibRI modules
- LuKII: set up as an economical LOCKSS network in Germany
- SHAMAN
- APARSEN: wants to bring coherence and cohesion to digital preservation research
  o trends in DP research projects
  o modular DP systems
  o distributed as SOA
  o elimination of technology dependencies

EDINA: The UK LOCKSS Alliance, Adam Rusbridge

- EDINA offers underlying technical support & coordination
- threats to digital stewardship
  o failure (media, HW, SW, network, format obsolescence, natural disaster, economic/organizational failure)
  o attack (insider/outsider)
  o operator error
- source: Requirement to Digital Preservation
- projects PEPRS and PECAN help identify coverage and requirements for DP

Public testing, Michael Seadle

- traditional physical archiving relies heavily on trusted institutions
- distrust, not trust, needs to be the basis of digital archiving; testing therefore plays a key role
- goals of testing: demonstrate functionality, reveal weaknesses, provide data for planning improvements
- key issues for testing: integrity, authenticity (can the origin or genuineness be shown?), usability (can migration/emulation be demonstrated?), access, financial integrity
- DrWho (drwho1.com)
- bit stream testing is the most important; authenticity and usability may be impaired
  o the type of storage media, the number of copies and the frequency of checking and replacement get us to the relevant results
  o no reliable metrics exist, however (what is an acceptable loss, etc.)
- without well documented, peer reviewed, publicly available test results, librarians are buying archiving systems on faith
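The bit stream testing mentioned above can be illustrated with a minimal fixity audit in Python. This is a sketch under stated assumptions, not a method presented in the talk: the copies are held in memory as byte strings, the site names are hypothetical, and SHA-256 is chosen here only as a commonly used checksum.

```python
import hashlib


def sha256_hex(data):
    """Return the SHA-256 digest of a byte string as hex."""
    return hashlib.sha256(data).hexdigest()


def audit_copies(expected_digest, copies):
    """Return the names of copies whose current digest differs from the recorded one."""
    return sorted(
        name for name, data in copies.items()
        if sha256_hex(data) != expected_digest
    )


original = b"archived bit stream"
recorded = sha256_hex(original)  # digest recorded at ingest

# Three hypothetical storage sites; one copy has a single changed byte.
copies = {
    "site-a": original,
    "site-b": original,
    "site-c": b"archived bit strean",
}

damaged = audit_copies(recorded, copies)
print(damaged)
```

How often such an audit runs and how many copies it covers are exactly the parameters the notes name as decisive, and for which they say no agreed metrics exist.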

Presentation without a title, Andy Rauber

- evaluation vs testing vs benchmarking
- DP is at the stage of evaluation rather than testing, and far from benchmarking (a few tests, but nowhere near a definition of benchmarks)
  o existing evaluations are not repeatable
  o focus on the simple things
  o building the frameworks before having clear test scenarios
- necessary to move towards comparative benchmarking
- what is needed: a commitment that we want a culture of benchmarking and comparative evaluation; an understanding of what we want to benchmark; benchmark data + ground truth; measurement scales and measures that remain constant; knowledge bases to collect these

Organizing digital preservation on an international level, Michelle Gallinger (NDIIPP)

- focus on a national DP agenda
- community driven, action oriented (National Digital Stewardship Alliance)
  o present a distributed national digital collection for the benefit of citizens

The European Research Arena, David Giaretta

- technologies: GEANT, EGEE/EGI
- EU research projects: TIMBUS, BLOG4EVER, SCAPE, ENSURE, APARSEN, ARCOMEM, WF4EVER
  o SCIDIP-ES (2011-2014)
- Alliance for Permanent Access (APA) formed as a legal entity 3 years ago
  o opportunities for networking
- ISO 16363 Audit and Certification of Trustworthy Digital Repositories
- ISO 16919 Requirements for Bodies Providing Audit and Certification

Observation from the MetaArchive Cooperative program, Martin Halbert

- distributed DP programs differ from other programs
  o replication of content, distribution of the replicated copies to distinct geographical locations, and a network organization to connect these replicated copies
- MetaArchive established in 2003, funded by NDIIPP
  o seeks to foster broader awareness of digital preservation issues
- IIPC members are all institutions that focus on WA
  o 39 members: national and university libraries + other organizations (Internet Archive)
  o ISO standard WARC format for web archives + Heritrix and NutchWAX
  o growing membership (Africa & South America)

DAY 2

Keynote address: International and National Collaboration in the digital age, Gunnar Sahlin

- 2012: a new law for e-deposit
- Samsök (search together) in 2005 (upgrade, new system)
- SwePub and long-term preservation
- Consortium of the Swedish research libraries for licensing e-journals and databases (ICOLC)
- Open Access and e-publishing (all universities have their repositories for e-pubs)
- NL as aggregator for Europeana: TEL, Apres, Athena, EUscreen
  o common system for the preservation of digital materials
  o common search portal for materials from the Swedish National Library and the Swedish National Archive

Raivo Ruusalepp

- standards: RAC, DSA, CIDOC (CRM), PAIMAS, ISAD(G), DDI
- use of information security standards for digital preservation
- information security: administrative and technical (physical = data security vs IT = communication)
- a company implemented security measurement with typical cyber crime scenarios
- survey of security
  o provision for information security in national legislation and development plans (1/2 of the respondents: ISO 27000 series, only 2 formal audits, the rest are looking into it; 1/2 of the respondents do not use standards or formal measurements to control information security)
  o IT & disaster plan: 65 [%] (data recovery from the off-site location tested: 0)
- alignment
  o better use of community standards for information security and preservation
  o agreement on security requirements

Standards based approach to preservation planning, Matthew Woollard

- ISO 27001: very expensive implementation, 100 000 pounds
- Basic: Data Seal of Approval Guidelines (helps you understand your business better)
- Audit and Certification of Trustworthy Digital Repositories
- Memorandum of Understanding to Create a European Framework
- ISO 16363: external; DSA or ISO 16363: self audit

Best Practices & Standards, Bram van der Werf

- self assessment: trust
- audits/certification: trust
- ISO 30300 (draft) Record Management

PANEL 4: Legal Alignment

Legal deposit & Web Archiving, Adrienne Muir

- legal deposit provisions: purpose, scope, deposit mechanisms, roles & responsibilities & liability, access provisions, sanctions
- implementation: definitions, scope, offline/online, freely available/paywall, technology neutral/incremental
- legal deposit vs voluntary (interim/hybrid approach, model agreements and licenses, flexibility)
- other legal issues: intellectual property rights, preservation, access, unlawful material, privacy/data protection
- voluntary approaches have disadvantages but may be necessary and can be useful

Breakout Session

- standards bring along alignment by themselves, but only if you use them to the full, not halfway
- depends on the community (users): enforceable or voluntary compliance (standards)
- next steps for standards alignment
  o corpora as benchmarks
  o export/import completeness
  o educational standards
  o validation tools
  o accredited training courses to accredit auditors
  o framework standards

DAY 3

PANEL 5: Educational Alignment

- key elements of the DCC Curation Lifecycle Model
- Framework for the Education Alignment (USA: grad programs for digital preservation, new models for grad programs, internship programs (diverse knowledge), workshops)
  o sharing tools
  o national programs
- related issues to digital preservation
  o nature of costs and business models
  o strategies for selection & appraisal
  o ground roles and responsibilities
  o effectiveness and demand for services
- focus of the panel: factors influencing the actual sustainability of a digital archive
- 2 considerations: collaboration + user demand
- challenges + gaps: span national boundaries, public + private funding, education, exportation, DP certification, competition, funding gaps, policy, selection criteria, roles & responsibilities, standards
- Magazzini Digitali: e-legal deposit in Italy
- PADI: a failed project (discontinued in 2010)

Presentation without a title, Neil Grindley

- how much does it cost to manage information? what institutional financial strategies are required to facilitate effective preservation? what general economic frameworks are required to enable information to persist and be accessible?
- JISC 2010: Infrastructure for Education and Research Programme
- archival storage and preservation activities constitute a very small proportion of the overall costs: 15 [%]; access 31 [%]; outreach, acquisition and ingest 55 [%]
  o approx. 333 euros for a set of 1000 records
- KRDS2, p. 83
- future tool development: supporting automation of ingest

Sustainable Preservation in North America: ADPNet & Friends, Aaron Trehub

- solution: distributed digital preservation (in at least 3 copies vs LOCKSS 6 copies)
- DDP + LOCKSS = PLN; open SW developed at Stanford
- MetaArchive, COPPUL (Canada), ADPNet (www.adpn.org)
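The reason for keeping several geographically separate copies can be sketched as a simple majority vote over replica checksums. This is a deliberately simplified illustration and not the LOCKSS polling protocol: the node names are hypothetical, the replicas are byte strings held in memory, and the function `majority_digest` is introduced here only for the example.

```python
import hashlib
from collections import Counter


def majority_digest(replicas):
    """Return (winning digest, nodes that disagree), or (None, []) without a strict majority."""
    if not replicas:
        return None, []
    digests = {node: hashlib.sha256(data).hexdigest() for node, data in replicas.items()}
    winner, votes = Counter(digests.values()).most_common(1)[0]
    if votes * 2 <= len(replicas):
        return None, []  # no strict majority: the replicas cannot decide by voting
    dissenters = sorted(node for node, digest in digests.items() if digest != winner)
    return winner, dissenters


good = b"preserved content"
replicas = {
    "node-1": good,
    "node-2": good,
    "node-3": b"preserved c0ntent",  # simulated damage on one node
}

winner, dissenters = majority_digest(replicas)
print(dissenters)
```

With three copies a single damaged replica is outvoted and can be repaired from the others; with only two differing copies there is no majority, which is one reason the notes speak of at least 3 copies.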


Do not be afraid of the performance of SW such as DROID or JHOVE.

Ross King: Evolving domains, problems and solutions for LT DP
- information about projects such as SCAPE
- programme: http://cordis.europa.eu/fp7/ict/telearn-digicult/report-research-digital-preservation_en.pdf
- Stephan Strodl, Petar Petrov, Andreas Rauber (all Vienna University of Technology, Austria): a nice Timeline for preservation projects, a whitepaper about the past of European DP
- Funding spent on DP research is gradually growing. Projects and funding alone will not solve anything
- ARCOMEM: archiving for web archives, socially driven web preservation model
- social web analysis
- archive enrichment
- ENSURE: evaluation between cost and value, automation of the preservation cycle; testbeds: healthcare, clinical trials, financial services
- SCAPE
- preservation planning and action workflows, and how to make them scalable
- building an infrastructure for scalable preservation actions
- developing a policy-based preservation planning tool with automatic preservation watch
- 3 testbeds: web archives, large-scale repositories, research data sets
- all projects will produce prototype SW
- digital lifecycle approach
- preservation planning plays a role in all these projects, together with virtualization
- a slide with trends in DP over recent years
- Research on Digital Preservation within projects co-funded by the European Union in the ICT
- Ensure
- Scape
- Wf4Ever: http://www.wf4ever-project.org/about
- Timbus: SW is not enough; focuses on the context of organizations; LTP is not only about objects but also about services, etc.
- Totem

Benefit for NK

Follow projects in the field of long-term preservation of digital data. Recent EU projects such as SCAPE should speed up the development of concrete tools for long-term digital preservation.

Record keeping in temporary command settings Erik Borglundochrana dokumentace ke krizovyacutem situaciacutem vzniklyacutech z innosti policie apodjak zachytit kontext lze uchovat flipcharty videa zaacutepisy ale kontextu analogovyacutech dokument neniacute probleacutem probleacutem je s digitaacutelniacutemi v cmi a rozhovorym l by se o to starat naacuterodniacute archiv ten ovem bere jen papiacuteroveacute dokumenty nebo nap fotky zmiacutesta jednaacuteniacute otaacutezka archivace spisoveacuteho materiaacutelu je to sameacute jako archivace pr b hu jednaacuteniacutev digitaacutelniacute podob

SPOLUFINANCOVAacuteNO ZE STRUKTURAacuteLNIacuteCH FOND EU (EVROPSKEacuteHO FONDU PRO REGIONAacuteLNIacute ROZVOJ) PROST EDNICTVIacuteM IOP

7

webarchiving session

BnF 200TB webarchivovanyacutech dat15 milion ARC musiacute je charakterizovat validovat asov naacutero neacuteuklaacutedajiacute v shared repositorySPAR (LTP systeacutem Francouzskeacute NK) maacute kapacitu 16PBpouiacutevajiacute jhove2 na charakterizaci vytvaacute ejiacute modul na arcynecht jiacute d lat charakterizaci a validaci pro obsah arc jen identifikaci formaacutet

PREMIS v METSu by byl p iacuteli dlouhyacute budou tedy zapisovat jen metadata na uacuterovniinforma niacuteho baliacuteku (AIP) kt jsou stejnaacute pro celyacute baliacutek resp 1 vlastnost se vyjaacuted iacute a pak se ktomu jen p idaacute informace o tom kt fily tomu odpoviacutedajiacute namiacutesto opakovaacuteniacute teacute infromace prokadyacute filevytvo ili speciaacutelniacute metadatovyacute formaacutettj jsou schopni se LTP systeacutemu zeptat dej mi vechny informa niacute baliacute ky ktereacute obsahujiacute formaacutetXY apod neniacute ale t eba indexovat metadata t ch obsah to by trvalo dlouho stejnyacute p iacutestupmajiacute I pro digitalizovaneacute knihyr zneacute DP policy a uacuterovn validace pro r zneacute typy wa dat kompletniacute sklizn vs teacutematickeacute sklizn

NL NZ2 sklizn 20 TB dohromadyeiacute metadata kolik metadat je hodn a kolik maacutelopolicy knihovny iacutekaacute e se musiacute uklaacutedat co nejviacutece metadat to by byl ovem z hlediska velikostimetadat probleacutempro selektivniacute webarchvest majiacute hotoveacute workflow WCT ve se katalogizuje

IA16 miliard URLnejstariacute z roku 19963TB za den 1PB za rok je p iacuterustek

Euan Cochrane Dirk von Suchodoletz Replicating InstalledApplication and Information Environments onto Emulated orVirtualized Hardware

- zachyceniacute uchovaacuteniacute celkoveacuteho prost ediacute na emulovanyacute HW- nap vziacutet prost ediacute desktopu p edsedy vlaacutedy a uloit v archivu- probleacutemy se zobrazeniacutem- computer forensic- monost pro ochranu v deckyacutech dat a zaacuteznam- celeacute je to o tom jak replikovat HDD a pustit prost ediacute kt na n m je ve virtuaacutelniacutem prost ediacute- eeniacute- vykuchali HDD z n kolika staryacutech PC gt identifikovat naacuteroky na HW (analyacuteza HDD gt odhad

naacuterok automaticky je to sou aacutest kadeacuteho PC prost ediacute) gt vybrat emula niacutevirtualiza niacute SW(tool registry jako nap TOTEM z projektu KEEP) gt uacuteprava HDD na disk image vhodnyacute proemulaci gt zkusit nabootovat image disku na emulovaneacutem HW gt p idat drivery

- probleacutemy s licencemi ochranou osobniacutech dat autenticitou (20 v ciacute se zm niacute barvy apod)- QEMU sparc processor emulator

SPOLUFINANCOVAacuteNO ZE STRUKTURAacuteLNIacuteCH FOND EU (EVROPSKEacuteHO FONDU PRO REGIONAacuteLNIacute ROZVOJ) PROST EDNICTVIacuteM IOP

8

Klaus Rescher Remote Emulation for Migration Services in aDistributed Preservation Frameworkpouitiacute emulace jako naacutestroje pro migraci

mnohdy nejsou dostupneacute naacutestroje pro migraci ur ityacutech formaacutetDig objekt vloiacuteme do emulovaneacuteho prost ediacute (virtuaacutelniacuteho stroje) pak ho vidiacuteme v prost ediacuteemulovaneacuteho systeacutemu m eme ho otev iacutet v p vodniacute nebo vhodneacute aplikaci uloit jako jinyacuteformaacutet a uloit op t do virtuaacutelniacuteho stroje

Bram Lohman: Emulation as a Business Solution: the Emulation Framework (KEEP project)

- Emulation Framework: 7 emulators, 6 platforms (x86, Amiga and others), 23 file formats
- a solution for managing emulation tools and setting up emulation processes
- an environment containing emulators; an application or file loaded into it should run as it did in its original environment
- the environment also includes a tool that, based on PRONOM, shows which format a file has and which environment is needed to run it; that environment can then be prepared directly and the file run in it
- it loads the software image from an application database that OPF is building

Geoffrey Brown: Developing Virtual CD-ROM Collections: The Voyager Company Publications

- publications of one specific publisher on CD, interactive applications for Mac; of 200 published titles only about 50 are available today
- emulation on today's systems
- HDD snapshot directly in the emulator, i.e. it takes a single click and is very fast
- SheepShaver emulator

Evaluation of a large Danish migration project

Before 1998 they had no formats prescribed by law.
Between 2005 and 2008 they introduced standards.
The evaluation concerns the prescribed standards and the migration into them at the national archive.
The evaluation was carried out for the funder.
Between 2005 and 2008 they spent 30 person-years on migration, with 10-15 people assigned to it; 190 thousand USD was invested, total costs were 26 million USD.

In real terms the amount of data migrated was not large: about 1777 GB.

Various parts of the archive: tapes, population data, data on CD-R, registries and electronically populated data.
They could not read all the files, especially those on tapes.
There were 5 different types of tape.
Some had to be rescued at great expense.


Total cost of producing the preservation standard: 10 man-years, 12 thousand USD, including the manual and implementation recommendations.

Pilot: project planning and management, and verification of the information packages.

The aim of the pilot was to arrive at a better budget and project plan.

Some problematic data in old formats, such as old databases, need smart people who do repetitive work over a long time; good knowledge management was needed to make this efficient.

Migration method: they wrote requirements for a tool and a description of how manual migration should be done.

Data preparation (restructuring the data and registering the metadata of the IPs) and preparation of documentation for the migrated IPs.

Software development: in-house development.

They needed 50 person-years for 1

Conclusions

Migrating standard data is cheaper, migrating from some standard tapes is cheaper, etc.

Most of the costs (80%) went on non-standardized data, in building the migration software. Developing a tool for migrating heterogeneous or non-standard data is the most expensive part.

What they learned: the old data had not been analysed sufficiently.

Project management was loose and they lost money.

Knowledge management: a good description of the old data and all its types, generations, locations etc. does not exist at NK and this will cause us difficulties; migration of old data at NK will be problematic.

Angela Dappert: robust migration workflow for offline media

- What is an archival object (nice slide): a CD is not an archival object; for them it is a hand-held carrier. Better is a bit-stable object, which can have backups etc., and beyond that an archival object that carries additional metadata (logical preservation).
- A CD is not searchable, cannot easily be replicated and has a large manual overhead; rendering technology becomes obsolete very quickly.
- Endangered Archives project: optical discs, CD-R, external HDs, tapes, 67 terabytes in total.
- The offline hand-held carriers in the Endangered Archives project were very varied; they contained data with DRM, data under copyright and a range of similar problems.
- Options they chose between for each data source:
  o disk image: a single file containing everything that is on the carrier
  o or extraction of selected files only
- How important is the carrier itself? Some information about it is needed; it may hold traces of deleted data that we may want to keep. They made disk images of all sorts of things: hybrid DVDs, audio discs that also held data, etc.
- Which disk image format should be used? Not a single disk image format for all data, but a special disk image format for each kind.
- They used disk copying robots; in some cases large-scale disk copying robots could not be used, because they are good at producing CDs but not at ripping data from CDs.


- They built their own application with disk stacks and some smaller robots; the robots worked either LIFO or FIFO. In the end they used FIFO, because LIFO had problems with the CD lifter.
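To make the two feeding orders concrete, here is a minimal sketch of how the same stack of discs would be processed under FIFO and LIFO. The function name and the disc labels are hypothetical; the notes do not describe how the project's own application was implemented.

```python
from collections import deque


def processing_order(discs, mode):
    """Return the order in which discs are taken from a stack loaded in the given sequence."""
    if mode not in ("FIFO", "LIFO"):
        raise ValueError("mode must be 'FIFO' or 'LIFO'")
    stack = deque(discs)
    order = []
    while stack:
        # FIFO takes the disc that was loaded first, LIFO the one loaded last.
        order.append(stack.popleft() if mode == "FIFO" else stack.pop())
    return order


if __name__ == "__main__":
    loaded = ["disc-1", "disc-2", "disc-3"]  # hypothetical labels
    print("FIFO:", processing_order(loaded, "FIFO"))
    print("LIFO:", processing_order(loaded, "LIFO"))
```

With FIFO the processing order matches the loading order, which also makes it easier to match each resulting image to the list of discs prepared in advance.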

At the KB they thought through a fairly complex workflow, how to describe it, etc.

Each robot had its own PC.

They had problems with a number of things (see the presentation).

They did not find good software for managing the images, only command-line tools, and non-technical staff would make many mistakes with those.

It takes a lot of people before the content gets online.

Staff must be well trained and flexible, but also able to do tedious jobs: systematic and patient.

NOTE: important for NK, where transferring data from discs will also have to be addressed, and has already been addressed before.


KEEP project, Antonio Ciuffreda: towards an integrated migration environment

- Disk transfer tool: converts a disk into an image file. The result also carries additional metadata about the file system, MD5 checksums of the files, etc.
- KEEP is building a Transfer Tool Framework.
- Magnetic media: disk transfer tools for floppy disks, commercial and open source:
  o Disk2FDI: commercial DOS tool, very accurate floppy disk image; it takes 1 hour and the result is very large, ten times larger than the floppy disk itself. They tested about 2260 disks and tested emulation.
  o Catweasel: commercial tool, a PCI card, runs on Linux and Windows XP, has a GUI. High error rate but faster; image file quality was low.
  o Nibtools: free tool, G64 and D64, covers only the C64; DOS, Windows, Linux, but command line only. Needs a Commodore disk drive and special cables. They tested a few disks; about half then did not work in the emulator.
- Optical media: they used 5 transfer tools, all on the same CDs, DVDs and games:
  1 Alcohol 120%: commercial, can bypass DRM etc., supports Windows systems. 12 of 13 worked; 2 MB per second.
  2 Daemon Tools: commercial, several image file types (ISO, MDS, MDF), supports Windows; three of 13 did not work.
  3 CloneCD: commercial, uses IMG or ISO, bypasses SafeDisc 3 protection, supports Windows, has a GUI; 11 of 13 were OK.
  4 BlindWrite: commercial, supports DVD, Blu-ray, WM, Xbox and other special discs. Generates ISO and some proprietary ISO image formats; one did not work; transfer speed.
  5 ImgBurn: does not read subchannel information into the image file (a film cannot be scrubbed, etc.); it is open source, generates DVD, BIN, CUE, IMG; Windows and Linux; 4 did not work; it is fast.

Conclusions

For magnetic media, commercial and non-commercial tools do not differ in performance; Disk2FDI is accurate but very slow. KEEP will use NibTools.

Optical media: copy protection has to be taken into account; the tools perform similarly; there will always be errors in the images, between 30 and 10 percent; BlindWrite handles game discs (Xbox etc.). KEEP will use ImgBurn because it is open source.

For complex images BlindWrite is better.
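The notes mention that the disk transfer tool stores MD5 checksums alongside the image. As a minimal sketch of that idea (not the KEEP tool itself, whose interface the notes do not describe), the following computes an MD5 checksum for an image file and returns it with basic file information. The function names and the manifest fields are assumptions made for this example.

```python
import hashlib
from pathlib import Path


def md5_of_file(path, chunk_size=1 << 20):
    """Compute the MD5 hex digest of a file, reading it in chunks to keep memory use low."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def image_manifest(image_path):
    """Return a small record describing a disk image (hypothetical manifest layout)."""
    path = Path(image_path)
    return {
        "file": path.name,
        "size_bytes": path.stat().st_size,
        "md5": md5_of_file(path),
    }
```

Calling `image_manifest("floppy001.img")` on an existing file (the file name is hypothetical) returns its name, size and checksum; comparing the stored checksum with a freshly computed one later is a simple way to detect a damaged image.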

Benefit for NK

Consider whether it would be worthwhile for NK to actually run a project migrating CD and DVD content to online media. The presentations give concrete experience with robotic processing and show the problems encountered in designing the workflow, choosing the type of ISO image, etc.

BnF: web archiving

They have three layers:

Harvest definition: collection

Harvest instance: crawling metadata


ARC files

A collection would be, for example, the selective "elections 2000" harvest, followed by the individual harvest instances and then the ARC files.
They collect logs, configuration and reports.
These are also stored in ARC: special metadata ARC files for each crawling instance.

PREMIS

Object, agent, event

Objects: 1. ARC files and metadata ARC files; 2. harvest instances

Harvest event in PREMIS: the event is the creation of content files

Events: reports as extensions of the event (host report and harvest report)

Agents: administrator, software, institutions and organizations that perform the harvest

ContainerMD

http://bibnum.bnf.fr/containerMD-v1/documentation/containerMD-v1.html

special metadata for material from the web archive

http://bibnum.bnf.fr/containerMD-v1

Different SLAs for different types of material and for data from different harvests; a shared repository; different benefits are expected; skills for different formats do not have to be duplicated within the institution.

Next year there should also be a JHOVE2 module for WARC.


Memento: metasearch engine

Benefit for NK

Their web archiving model could be used at NK.

Cost models: Danish national library + TU Wien

- Stephan Strodl, TU Wien: they have their own cost model, but it seems to cover only the cost of small-scale automated preservation actions
- Danish national library: they built their own model, intended to be universal and usable anywhere
- They measured the cost of submission according to the PAIMAS standard
- When calculating costs they use OAIS and PAIMAS: activities are mapped onto these models and process costs are estimated from that
- costmodelfordigitalpreservation.dk

Benefit for NK

Relevant to project 0136, which dealt with options for estimating the cost of long-term storage.

Meet RODA: a Full-Fledged Digital Repository for Long-Term Preservation

- Originally a project of the Portuguese national archive, which we have been following for several years. RODA is now supported by an independent company, which also partly continues its development. So far RODA supports only the archival metadata format (EAD), but further development should also cover library formats.
- RODA is now part of the SCAPE project, where it will be possible to develop the system further and scale it for use in mass production.
- http://redmine.keep.pt/projects/roda-public

Benefit for NK

Follow further development, possibly also for the INCAD + KNAV projects; for LTP development at smaller institutions this could become an interesting alternative in the future.

Report on a foreign business trip

Name and surname of the participant: Mgr. Andrea Fojtů (AF)

Workplace according to the organizational structure: 1.5 Department of Long-Term Preservation of Digital Data (ODODD)
Workplace (assignment): 1.5.2 Digital Repository Content Management Department
Purpose of the trip: Participation in the conference Aligning National Approaches to Digital Preservation
Place (city): Tallinn
Place (country): Estonia
Date (from-to): 22-26 May 2011

Detailed schedule:
Monday 23 May: conference registration, guided tour of the National Library; Keynote Address by Laura Campbell, Library of Congress (USA); Panel 1: Technical Alignment; Panel 2: Organizational Alignment
Tuesday 24 May: Keynote Address by Gunnar Sahlin, National Library of Sweden; Panel 3: Standards Alignment; Panel 4: Legal Alignment; Breakout Sessions for panels 3 & 4
Wednesday 25 May: Panel 5: Education Alignment; Panel 6: Economic Alignment; Breakout Sessions for panels 5 & 6; Synthesis; Closing remarks

Fellow travellers from NK: PhDr. Bohdana Stoklasová (BS), Ing. Tomáš Svoboda (TS)

Funding: IOP NDK
Objectives of the trip: attendance at a conference with international participation; establishing contacts in the field of long-term preservation of digital documents and electronic legal deposit; a more detailed insight into the issues of long-term preservation of digital documents, especially in national libraries

Fulfilment of the trip objectives (specifically): The conclusions of the conference Aligning National Approaches to Digital Preservation largely mirror the conclusions of the workshop The Future of the Past: Shaping new visions for EU research in digital preservation (report available at http://cordis.europa.eu/fp7/ict/telearn-digicult/future-of-the-past_en.pdf), for example regarding the missing economic model for the commercial sector, which would treat long-term preservation as an integral part of all its processes. Contact was established with staff of the national libraries of Estonia and Finland (staff responsible for web archiving and for long-term preservation in general).

Programme and further details: The main goal of the conference was to align national approaches to the long-term preservation of digital documents across all areas: technical, organizational, educational, standardization, economic and financial.

Materials brought back: conference programme, leaflets of exhibiting companies (Tessella, Equella, Guardtime) + other materials, notes

Date of submission of the report: 8 June 2011
Signature of the submitter:

Signature of the superior:

Posted on the Intranet:
Received by the international department:

Appendix to this report: Notes from the conference, in English

Appendix: Notes from the conference, in English

Exploring What We Can Do Together: Strategic Alliance for International Collaboration, Laura Campbell

- 185 digital preservation partners in more than 25 countries (education, research, LAM)
- strategic goals: National Content Stewardship Network (national digital collection, technical architecture, public policy, outreach)
- NDIIPP Content Domain Map: a mind map of geospatial, audiovisual, image & text and web content
- then & now: cognitive surplus vs digital libraries / digital preservation
- solution framework: actively working together, special interest groups, establishing a common index, international digital collection (freely available)

PANEL 1: Technical Alignment (The role of testing), Prof. Dr. Michael Seadle (Panel Head)

- to collaborate on requiring and implementing rigorous and independent tests

DNB Contribution to the Tallinn Alignment, Sabine Schrimpf

- key theme is infrastructure
- a network of hardware and software that permits the operation of application software
- the question of interoperability is crucial (standards, technical specifications)
- SW elements
- components of the DP infrastructure were compared to the pallets at railroads
  o source: PARSE.Insight Roadmap 2020
- persistent ID resolvers, certification process
- kopal (ingest, koLibRI)
- nestor (German Network of Expertise)
- DP4lib: Digital Preservation as a Service; reduce dependency between components
  o redundant storage at different locations
  o koLibRI modules
- LuKII: set up as an economical LOCKSS network in Germany
- SHAMAN
- APARSEN: wants to bring coherence and cohesion to digital preservation research
  o trends in DP research projects
  o modular DP systems
  o distributed as SOA
  o elimination of technology dependencies

EDINA: The UK LOCKSS Alliance, Adam Rusbridge

- EDINA offers underlying technical support & coordination
- threats to digital stewardship
  o failure (media, HW, SW, network, format obsolescence, natural disaster, economic/organizational failure)
  o attack (insider/outsider)
  o operator error
- source: Requirement to Digital Preservation
- projects PEPRS and PECAN help identify coverage and requirements for DP

Public testing, Michael Seadle

- traditional physical archiving relies heavily on trusted institutions
- distrust, not trust, needs to be the basis of digital archiving; testing therefore plays a key role
- goals of testing: demonstrate functionality, reveal weaknesses, provide data for planning improvements
- key issues for testing: integrity, authenticity (can the origin or genuineness be shown), usability (can migration/emulation be demonstrated), access, financial integrity
- DrWho (drwho1com)
- bit stream testing is the most important; authenticity and usability may be impaired
  o the type of storage media, the number of copies + the frequency of checking and replacement get us to the relevant results
  o no reliable metrics exist, however (what is an acceptable loss etc.)
- without well documented, peer reviewed, publicly available test results, librarians are buying archiving systems on faith

Presentation without a title, Andy Rauber

- evaluation vs testing vs benchmarking
- DP testing today: evaluation rather than testing, and far from benchmarking (a few tests, but nowhere near a definition of benchmarks)
  o existing evaluations are not repeatable
  o focus on the simple things
  o building the frameworks before having clear test scenarios
- necessary to move towards comparative benchmarking
- what is needed: a commitment that we want a culture of benchmarking and comparative evaluation; an understanding of what we want to benchmark; benchmark data + ground truth; measurement scales and measures that remain constant; knowledge bases to collect these

Organizing digital preservation on an international level, Michelle Gallinger (NDIIPP)

- focus on a national DP agenda
- community driven, action oriented (National Digital Stewardship Alliance)
  o present a distributed national digital collection for the benefit of citizens

The European Research Arena, David Giaretta

- technologies: GEANT, EGEE/EGI
- EU research projects: TIMBUS, BLOG4EVER, SCAPE, ENSURE, APARSEN, ARCOMEM, WF4EVER
  o SCIDIP-ES (2011-2014)
- Alliance for Permanent Access (APA) formed as a legal entity 3 years ago
  o opportunities for networking
- ISO 16363: Audit and Certification of Trustworthy Digital Repositories
- ISO 16919: Requirements for Bodies Providing Audit and Certification

Observation from the MetaArchive Cooperative program, Martin Halbert

- distributed DP programs are different from other programs
  o replication of content, distribution of these replicated copies to distinct geographical locations, and network organization to connect these replicated copies
- MetaArchive established in 2003, funded by NDIIPP
  o seeks to foster broader awareness of digital preservation issues
- IIPC members are all institutions that focus on WA
  o 39 members: national and university libraries + other organizations (Internet Archive)
  o ISO standard WARC format for web archives + Heritrix and NutchWAX
  o growing membership (Africa & South America)

DAY 2

Keynote address: International and National Collaboration in the digital age, Gunnar Sahlin

- 2012: a new law for e-deposit
- Samsök (search together) in 2005 (upgrade, new system)
- SwePub and long-term preservation
- Consortium of the Swedish research libraries for licensing e-journals and databases (ICOLC)
- Open Access and e-publishing (all universities have their repositories for e-pubs)
- NL as aggregator for Europeana: TEL, Apres, Athena, EUscreen
  o common system for the preservation of digital materials
  o common search portal for materials from the Swedish National Library and Swedish National Archive

Raivo Ruusalepp

- standards: RAC, DSA, CIDOC (CRM), PAIMAS, ISAD(G), DDI
- use of information security standards for digital preservation
- information security: administrative and technical (physical = data security vs IT = communication)
- company-implemented security measurement with typical cyber crime scenarios
- survey of security
  o provision for information security in national legislation and development plans (1/2 of the respondents: ISO 27000 series, only 2 formal audits, the rest are looking into it; 1/2 of the respondents do not use standards or formal measurements to control information security)
  o IT & disaster plan: 65 (data recovery from the off-site location tested: 0)
- alignment
  o better use of community standards for information security and preservation
  o agreement on security requirements

Standards-based approach to preservation planning, Matthew Woollard

- ISO 27001: very expensive implementation, 100 000 pounds
- Basic: Data Seal of Approval Guidelines (helps understand your business better)
- Audit and Certification of Trustworthy Digital Repositories
- Memorandum of Understanding to Create a European Framework
- ISO 16363 external; DSA or ISO 16363 self-audit

Best Practices & Standards, Bram van der Werf

- self-assessment: trust
- audits/certification: trust
- ISO 30300 (draft): Records Management

PANEL 4: Legal Alignment

Legal deposit & Web Archiving, Adrienne Muir

- legal deposit provisions: purpose, scope, deposit mechanisms, roles & responsibilities & liability, access provisions, sanctions
- implementation: definitions, scope (offline/online, freely available/pay wall), technology-neutral/incremental
- legal deposit vs voluntary (interim/hybrid approach, model agreements and licenses, flexibility)
- other legal issues: intellectual property rights (preservation, access), unlawful material, privacy/data protection
- voluntary approaches have disadvantages but may be necessary and can be useful

Breakout Session

- standards bring along alignment by themselves, but only if you use them to the full, not halfway
- depends on community (users); enforceable or voluntary compliance (standards)
- next steps for standard alignment
  o corpora as benchmarks
  o export/import completeness
  o educational standards
  o validation tools
  o accredited training courses to accredit auditors
  o framework standards

DAY 3

PANEL 5: Educational Alignment

- key elements of the DCC Curation Lifecycle Model
- Framework for the Education Alignment (USA: grad programs for digital preservation, new models for grad programs, internship programs (diverse knowledge), workshops)
  o sharing tools
  o national programs
- related issues to digital preservation
  o nature of costs and business models
  o strategies for selection & appraisal
  o ground roles and responsibilities
  o effectiveness and demand for services
- focus of the panel: factors influencing the actual sustainability of a digital archive
- 2 considerations: collaboration + user demand
- challenges + gaps: span national boundaries, public + private funding, education, exportation, DP certification, competition, funding gaps, policy, selection criteria, roles & responsibilities, standards
- Magazzini Digitali: e-legal deposit in Italy
- PADI: a failed project (discontinued in 2010)

Presentation without a title, Neil Grindley

- how much does it cost to manage information? what institutional financial strategies are required to facilitate effective preservation? what general economic frameworks are required to enable information to persist and be accessible?
- JISC 2010: Infrastructure for Education and Research Programme
- archival storage and preservation activities constitute a very small proportion of the overall costs: 15%, compared with 31% for access and 55% for outreach, acquisition and ingest
  o approx. 333 Euros for a set of 1000 records
- KRDS2, p. 83: future tool development supporting automation of ingest

Sustainable Preservation in North America: ADPNet & Friends, Aaron Trehub

- solution: distributed digital preservation (in at least 3 copies vs LOCKSS 6 copies)
- DDP + LOCKSS = PLN; open SW developed at Stanford
- MetaArchive, COPPUL (Canada), ADPNet (www.adpn.org)
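The reason for keeping several copies is that a damaged copy can be detected by comparison with the others. The sketch below illustrates that idea with a simple majority vote over SHA-256 digests. It is not the LOCKSS polling protocol, and the site names and function name are hypothetical.

```python
import hashlib
from collections import Counter


def audit_replicas(replicas):
    """Compare copies of one object held at several sites.

    replicas maps a site name to the bytes stored there. Returns the digest
    agreed on by a strict majority and the sorted list of sites that disagree.
    """
    if not replicas:
        raise ValueError("at least one replica is required")
    digests = {site: hashlib.sha256(data).hexdigest() for site, data in replicas.items()}
    majority_digest, votes = Counter(digests.values()).most_common(1)[0]
    if votes <= len(replicas) / 2:
        raise ValueError("no majority among replicas")
    damaged = sorted(site for site, digest in digests.items() if digest != majority_digest)
    return majority_digest, damaged


if __name__ == "__main__":
    copies = {"site-a": b"content", "site-b": b"content", "site-c": b"c0ntent"}
    print(audit_replicas(copies)[1])
```

With three copies a single damaged copy can be outvoted; with only two, a disagreement cannot be resolved, which is one argument for the minimum of three copies mentioned in the talk.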


Web archiving session

BnF
- 200 TB of web-archived data, 15 million ARC files
- these have to be characterized and validated, which is time-consuming
- they are stored in the shared repository SPAR (the LTP system of the French national library), which has a capacity of 16 PB
- they use JHOVE2 for characterization and are building a module for ARC files
- they do not want to characterize and validate the content of the ARC files, only identify formats
- PREMIS in METS would be too long, so they will record only metadata at the level of the information package (AIP) that is the same for the whole package: a property is expressed once, followed by the list of files it applies to, instead of repeating the information for every file
- they created a special metadata format
- i.e. they can ask the LTP system "give me all information packages that contain format XY" and the like; there is no need to index the metadata of the content itself, which would take a long time; they take the same approach for digitized books
- different DP policies and levels of validation for different types of WA data: complete harvests vs thematic harvests
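The package-level approach described for BnF (state a property once and list the files it applies to) can be sketched as follows. This only illustrates the idea; it is not the BnF metadata format, and the function names, package identifiers and file names are hypothetical.

```python
from collections import defaultdict


def summarize_package(files):
    """Group the files of one package by format.

    files is a list of (file name, format) pairs. Each format is stated once,
    followed by the sorted list of files that have it.
    """
    by_format = defaultdict(list)
    for name, fmt in files:
        by_format[fmt].append(name)
    return {fmt: sorted(names) for fmt, names in by_format.items()}


def packages_with_format(packages, fmt):
    """Return the ids of all packages whose summary mentions the given format."""
    return sorted(pid for pid, summary in packages.items() if fmt in summary)


if __name__ == "__main__":
    packages = {
        "aip-1": summarize_package([("a.html", "text/html"), ("b.png", "image/png")]),
        "aip-2": summarize_package([("c.html", "text/html")]),
    }
    print(packages_with_format(packages, "image/png"))
```

Because the query only looks at the per-package summaries, it can answer "which packages contain format XY" without indexing the content of the individual files.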

NL NZ2 sklizn 20 TB dohromadyeiacute metadata kolik metadat je hodn a kolik maacutelopolicy knihovny iacutekaacute e se musiacute uklaacutedat co nejviacutece metadat to by byl ovem z hlediska velikostimetadat probleacutempro selektivniacute webarchvest majiacute hotoveacute workflow WCT ve se katalogizuje

IA16 miliard URLnejstariacute z roku 19963TB za den 1PB za rok je p iacuterustek

Euan Cochrane Dirk von Suchodoletz Replicating InstalledApplication and Information Environments onto Emulated orVirtualized Hardware

- zachyceniacute uchovaacuteniacute celkoveacuteho prost ediacute na emulovanyacute HW- nap vziacutet prost ediacute desktopu p edsedy vlaacutedy a uloit v archivu- probleacutemy se zobrazeniacutem- computer forensic- monost pro ochranu v deckyacutech dat a zaacuteznam- celeacute je to o tom jak replikovat HDD a pustit prost ediacute kt na n m je ve virtuaacutelniacutem prost ediacute- eeniacute- vykuchali HDD z n kolika staryacutech PC gt identifikovat naacuteroky na HW (analyacuteza HDD gt odhad

naacuterok automaticky je to sou aacutest kadeacuteho PC prost ediacute) gt vybrat emula niacutevirtualiza niacute SW(tool registry jako nap TOTEM z projektu KEEP) gt uacuteprava HDD na disk image vhodnyacute proemulaci gt zkusit nabootovat image disku na emulovaneacutem HW gt p idat drivery

- probleacutemy s licencemi ochranou osobniacutech dat autenticitou (20 v ciacute se zm niacute barvy apod)- QEMU sparc processor emulator

SPOLUFINANCOVAacuteNO ZE STRUKTURAacuteLNIacuteCH FOND EU (EVROPSKEacuteHO FONDU PRO REGIONAacuteLNIacute ROZVOJ) PROST EDNICTVIacuteM IOP

8

Klaus Rescher Remote Emulation for Migration Services in aDistributed Preservation Frameworkpouitiacute emulace jako naacutestroje pro migraci

mnohdy nejsou dostupneacute naacutestroje pro migraci ur ityacutech formaacutetDig objekt vloiacuteme do emulovaneacuteho prost ediacute (virtuaacutelniacuteho stroje) pak ho vidiacuteme v prost ediacuteemulovaneacuteho systeacutemu m eme ho otev iacutet v p vodniacute nebo vhodneacute aplikaci uloit jako jinyacuteformaacutet a uloit op t do virtuaacutelniacuteho stroje

Bram Lohman Emulation as a Business Solution the EmulationFramework Keep projekt

emulation framework 7 emulaacutetor 6 platforem (x86 Amiga aj) 23 file formaacuteteeniacute pro spraacutevu emula niacutech naacutestrojsetup emula niacutech procesprost ediacute kt obsahuje emulaacutetory a pokud do n j nahrajeme aplikaci nebo soubor m l by sespustit jako v p vodniacutem prost ediacuteprost ediacute obsahuje I naacutestroj kt u soubor ukaacutee jakyacute je to formaacutet a jakeacute prost ediacute je pot eba projeho sput niacute na zaacuteklad PRONOMu rovnou lze to prost ediacute p ipravit a soubor v n m spustit na te SW image z databaacuteze aplikaciacute OPF kteraacute se buduje

Geofrey Brown Developing Virtual CD ROM Collections The VoyagerCompany Publications

- publikace konkreacutetniacuteho vydavatelstviacute na CD interaktivniacute aplikace pro Mac z 200 vydanyacutech jenyniacute dostupnyacutech pouze asi 50

- emulace do dneniacutech systeacutem- hdd snapshot p iacutemo v emulaacutetoru tj je to na jedno kliknutiacute a velmi rychleacute- sheepshaver emulaacutetor

Evaulation of danish large migration projectP ed rokem 1998 nem li formaacutety stanoveneacute zaacutekonemMezi rokem 2005 a 8 zavedli standardyHodnoceni se tyacutekaacute stanovenyacutech standard a migrace do nich v naacuterodniacutem archivuHodnoceni d lali pro toho kdo to financovalMezi rokem 2005 a 8 straacutevili 30 person years na migraci m li 10=15 lidi na to investovalo190 tis USD celkoveacute naacuteklady 26 milionu USD

Neniacute to moc dat reaacutelneacute co migrovali asi 1777GB

R zneacute aacutesti archivu tapes data o populaci data na cd r registries a data elektronicky pln naNemohli p e iacutest vechny soubory zvlaacute na paacuteskach5 r znyacutech typu pasekN ktereacute museli za draheacute peniacuteze zachra ovat

SPOLUFINANCOVAacuteNO ZE STRUKTURAacuteLNIacuteCH FOND EU (EVROPSKEacuteHO FONDU PRO REGIONAacuteLNIacute ROZVOJ) PROST EDNICTVIacuteM IOP

9

Celkoveacute naacuteklady na vyrobeniacute preservation standardu 10 men years 12 tis USD v etnmanuaacutelu a implementa niacutech doporu eni

Pilot plaacutenovaniacute a management projektu a ov it informa niacute baliacute ky

Ciacutelem bylo v pilotu ziacuteskat lepiacute budget a plaacuten of projekt

N kteraacute problematickaacute data ve staryacutech formaacutetech jako stareacute databaacuteze atd pot ebujiacute chytreacute lidi kteryacuted lajiacute repetitivniacute praciacute trvajiacuteciacute dlouhopot ebovali dobry knowledge management aby to byloefektivniacute

Zp sob migrace napsali poadavky na nastroj a popis toho jak by se mela d lat manuaacutelniacute migrace

P iacuteprava dat (restructure data a registrovat metadata of IP) a p iacuteprava dokumentace t chmigrovanyacutech IP

Vyacutevoj softwaru inhouse development

Pot ebovali 50 person years na 1

Zaacutev ry

migrace standardniacutech dat je levn jiacute migrace z n kteryacutech pasek standardniacutech je levn jiacute atd

V tina 80 nakladu padla na nestardizovanaacute data p i vyacuterob softwaru na migraci Vyacutevojnaacutestroje na migraci heterogenniacutech dat nebo nestandardniacutech dat je nejdraiacute

Co se nau ili nem li dostate neacute analyzovanaacute stara data

Projekt management m li loose ztratili peniacuteze

Knowledge management dobry popis staryacutech dat a vech jejich typu generaci umiacutest ni atd u naacutes neexistuje a budeme s tiacutem miacutet potiacutee migrace staryacutech dat v NK budeproblematickaacute

Angela Dappert: robust migration workflow for offline media

- What is an archival object (nice slide): a CD is not an archival object; for them it is a hand-held carrier. A bit-stable object is better: it can have backups etc., and it leads to an archival object that carries additional metadata (logical preservation).
- A CD is not searchable, cannot be replicated easily and carries a large manual overhead; rendering technology becomes obsolete very quickly.
- The Endangered Archives project: optical disks, CD-Rs, external HDs, tapes; 67 terabytes in total.
- The offline hand-held carriers in the Endangered Archives project were very variable; they contained data with DRM, under copyright, and with a number of similar problems.
- Options they chose between for each data source:
- a disk image, i.e. one file that contains everything on the carrier,
- or extraction of only some of the files.
- How important is the carrier itself? We need some information about it; it may hold traces of deleted data, and we may want to keep them. They made disk images of everything possible: hybrid DVDs, audio discs that also held data, etc.
- Which disk image format should they use? Not a single disk image format for all data, but a specific disk image format for each special case.
- They did it with disk copying robots. Sometimes large-scale disk copying robots could not be used: they are good at producing CDs but not at ripping data from CDs.

CO-FINANCED FROM EU STRUCTURAL FUNDS (EUROPEAN REGIONAL DEVELOPMENT FUND) THROUGH THE IOP

- They built their own application with disk stacks and some smaller robots. They considered LIFO or FIFO and in the end used FIFO; LIFO had problems with the CD lifter.

At the KB they thought through a fairly complex workflow, how to describe it, etc.

Each robot had its own PC.

They had problems with a number of things (see the presentation).

They did not find good software for managing the images, only command-line tools, and non-technical staff would make many mistakes with those.

It takes a lot of people before the content gets online.

The staff must be well trained and flexible, but also able to do tedious jobs: systematic, patient.

NOTE: important for the NK, where the transfer of data from disks will also have to be addressed, and has already been addressed.

KEEP project, Antonio Ciuffreda: towards an integrated migration environment

- Disk transfer tool: converts a disk into an image file. The image also carries further metadata about the file system, the MD5 of the file, etc.
- KEEP is building a Transfer Tool Framework.
- Magnetic media: disk transfer tools for floppy disks, commercial and open source.
- Disk2FDI: a commercial DOS tool that makes a very accurate floppy disk image; it takes 1 hour, and the result is very large, ten times larger than the floppy disk itself. They tested about 2260 disks and tested emulation.
- Catweasel: a commercial tool, a PCI card; runs on Linux and Windows XP and has a GUI. High error rate but faster; the quality of the image file was low.
- Nibtools: a free tool, G64 and D64, covers only the C64; DOS, Windows, Linux, but command line only. It needs a Commodore disk drive and special cables. They tested a few disks; about half of them then did not work in the emulator.

- Optical media: they used 5 transfer tools, all on the same CDs, DVDs and games.
1. Alcohol 120%: commercial, can bypass DRM etc., supports Windows systems. 12 of 13 worked; 2 MB per second.
2. Daemon Tools: commercial, several image file types (ISO, MDS, MDF), supports Windows; three of 13 did not work.
3. CloneCD: commercial, uses IMG or ISO, bypasses SafeDisc 3 protection, supports Windows, has a GUI; 11 of 13 were OK.
4. BlindWrite: commercial, supports DVD, Blu-ray, WM, Xbox and other special discs. Generates ISO and some proprietary ISO image formats; one did not work (transfer speed not recorded).
5. ImgBurn: does not read subchannel information into the image file (a film cannot be seeked, etc.); it is open source, generates DVD, BIN, CUE, IMG, runs on Windows and Linux; 4 did not work; it is fast.
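The notes say the disk transfer tool stores metadata such as an MD5 alongside the image file. A minimal sketch of that idea follows; the function names, the JSON sidecar layout and the field names are assumptions made for illustration and are not taken from the KEEP tools.

```python
import hashlib
import json
from pathlib import Path


def describe_image(image_path: Path, chunk_size: int = 1 << 20) -> dict:
    """Return a small metadata record for a disk image file (hypothetical layout)."""
    digest = hashlib.md5()
    with image_path.open("rb") as handle:
        # Read in chunks so that large images do not have to fit in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return {
        "file_name": image_path.name,
        "size_bytes": image_path.stat().st_size,
        "md5": digest.hexdigest(),
    }


def write_sidecar(image_path: Path) -> Path:
    """Write the record next to the image as <image name>.json and return its path."""
    sidecar = image_path.with_name(image_path.name + ".json")
    sidecar.write_text(json.dumps(describe_image(image_path), indent=2))
    return sidecar
```

Keeping the checksum in a separate sidecar file is only one possible design; it lets the image be re-verified later without touching the image itself.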

Conclusions

For magnetic media, the performance of commercial and non-commercial tools does not differ. Disk2FDI is accurate but very slow. KEEP will use NibTools.

Optical media: keep copy protection in mind. The tools perform similarly; there will always be errors in the images, between 30 and 10 percent. BlindWrite can handle game discs, Xbox, etc. KEEP will use ImgBurn because it is open source.

For complex images, BlindWrite is better.
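The error range quoted in the conclusions can be checked against the per-tool counts in the notes. The sketch below assumes that all five tools were run on the same 13 discs; the notes state this explicitly only for some of the tools, so the figures for BlindWrite and ImgBurn rest on that assumption.

```python
# Failed discs per tool as recorded in the notes; TESTED = 13 is assumed for all tools.
TESTED = 13
FAILURES = {
    "Alcohol 120%": 1,   # 12 of 13 worked
    "Daemon Tools": 3,   # three of 13 did not work
    "CloneCD": 2,        # 11 of 13 were OK
    "BlindWrite": 1,     # one did not work
    "ImgBurn": 4,        # 4 did not work
}


def failure_rate(failed: int, tested: int = TESTED) -> float:
    """Share of discs that could not be imaged, in percent."""
    return 100.0 * failed / tested


RATES = {tool: round(failure_rate(n), 1) for tool, n in FAILURES.items()}

for tool, rate in sorted(RATES.items(), key=lambda item: item[1]):
    print(f"{tool:<14} {rate:5.1f} %")
```

Under that assumption the rates run from about 8 % to about 31 %, which is in line with the "between 30 and 10 percent" summary.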

Benefit for the NK

Consider whether it would be appropriate for the NK to actually run a project migrating the content of CDs and DVDs to online media. These talks present concrete experience with robotic processing and show the problems the speakers had with designing the workflow, choosing the type of ISO image, etc.

BnF web archiving

They have three layers:

Harvest definition: the collection

Harvest instance: crawling metadata

ARC files

A collection would be, for example, the selective "Elections 2000" crawl, followed by the individual harvest instances and then the ARCs. They collect logs, config and reports. These are also stored in an ARC: a special metadata ARC for each crawling instance.

PREMIS

Object, agent, event

Objects: 1. ARC files and metadata ARCs, 2. harvest instances

Harvest event in PREMIS: event = creation of content files

Events: reports as extensions of the event (host report and harvest report)

Agents: administrator, software, institutions/organizations that perform the harvest
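The object, agent and event mapping described above can be sketched as a small data model. This is an illustration only: the class and field names are hypothetical, and the sketch does not reproduce the PREMIS XML schema or the BnF implementation.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Agent:
    name: str
    agent_type: str  # e.g. "software", "organization", "administrator"


@dataclass
class PremisObject:
    identifier: str
    object_type: str  # e.g. "ARC file", "metadata ARC", "harvest instance"


@dataclass
class Event:
    event_type: str
    objects: List[PremisObject]
    agents: List[Agent]
    reports: List[str] = field(default_factory=list)  # reports attached as event extensions


def harvest_event(instance_id: str, arc_names: List[str],
                  crawler: str, institution: str) -> Event:
    """Model one harvest as a 'creation of content files' event."""
    objects = [PremisObject(instance_id, "harvest instance")]
    objects += [PremisObject(name, "ARC file") for name in arc_names]
    agents = [Agent(crawler, "software"), Agent(institution, "organization")]
    return Event("creation of content files", objects, agents,
                 reports=["host report", "harvest report"])
```

A call such as `harvest_event("harvest-001", ["a.arc.gz"], "crawler", "library")` (all identifiers invented) yields one event that links the harvest instance, its ARC files and the agents that performed the harvest.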

ContainerMD

http://bibnum.bnf.fr/containerMD-v1/documentation/containerMD-v1.html

Special metadata for material from the web archive

http://bibnum.bnf.fr/containerMD-v1

Different SLAs for different types of material and for data from different harvests; a shared repository; they expect various benefits; skills for the various formats need not be duplicated within the institution.

Next year there should also be a JHOVE2 module for WARC.

Memento: meta-search

Benefit for the NK

Their model of web archiving could be used at the NK.

Cost models: Danish national library + TU Wien

- Stephan Strodl, TU Wien: they have their own cost model, but it seems to cover only the cost of small-scale automated preservation actions.
- The Danish national library built its own model, which should be universal and usable anywhere.
- They measured the cost of submission according to the PAIMAS standard.
- When calculating costs they use OAIS and PAIMAS: they map activities onto these models and then estimate the cost of the processes accordingly.
- costmodelfordigitalpreservation.dk
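The approach of mapping activities onto a reference model and then estimating process costs can be illustrated with a short sketch. The activity names, hours and hourly rates below are invented for the example; only the OAIS functional entity names are real, and nothing here reproduces the Danish cost model itself.

```python
from collections import defaultdict
from typing import Dict, List, Tuple

# (activity, OAIS functional entity, hours, hourly rate); all values are invented.
ACTIVITIES: List[Tuple[str, str, float, float]] = [
    ("agree submission format with producer", "Ingest", 40.0, 50.0),
    ("validate and load packages", "Ingest", 120.0, 40.0),
    ("replace storage media", "Archival Storage", 60.0, 45.0),
    ("monitor format obsolescence", "Preservation Planning", 30.0, 60.0),
]


def cost_by_function(activities: List[Tuple[str, str, float, float]]) -> Dict[str, float]:
    """Sum hours * rate for every activity mapped to the same functional entity."""
    totals: Dict[str, float] = defaultdict(float)
    for _name, function, hours, rate in activities:
        totals[function] += hours * rate
    return dict(totals)


TOTALS = cost_by_function(ACTIVITIES)
GRAND_TOTAL = sum(TOTALS.values())
```

Grouping by functional entity makes estimates from different institutions comparable, because each institution reports its costs against the same set of functions.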

Benefit for the NK

Relevant to project 0136: the session dealt with options for estimating the costs of long-term storage.

Meet RODA, a Full-Fledged Digital Repository for Long-Term Preservation

- Originally a project of the Portuguese national archive; we have been following it for several years. RODA is now supported by an independent company, which also partly continues its development. So far RODA supports only an archival metadata format (EAD), but further development should also include library formats.
- RODA is now part of the SCAPE project, where it will be possible to develop the system further and scale it for use in mass production.
- http://redmine.keep.pt/projects/roda-public

Benefit for the NK

Follow further development, possibly also for the INCAD + KNAV projects; for LTP development at smaller institutions this could become an interesting alternative in the future.

Report from a foreign business trip

Name and surname of the traveller: Mgr. Andrea Fojtů (AF)

Workplace by organizational structure: 1.5 Department of Long-term Preservation of Digital Data (ODODD)
Workplace assignment: 1.5.2 Department of Digital Repository Content Management
Reason for the trip: participation in the conference Aligning National Approaches to Digital Preservation
Place (city): Tallinn
Place (country): Estonia
Date (from-to): 22.-26. 5. 2011

Detailed schedule:
Monday 23. 5.: registration for the conference, guided tour of the National Library; Keynote Address by Laura Campbell, Library of Congress, USA; Panel 1: Technical Alignment; Panel 2: Organizational Alignment
Tuesday 24. 5.: Keynote Address by Gunnar Sahlin, National Library of Sweden; Panel 3: Standards Alignment; Panel 4: Legal Alignment; Breakout Sessions for panels 3 & 4
Wednesday 25. 5.: Panel 5: Education Alignment; Panel 6: Economic Alignment; Breakout Sessions for panels 5 & 6; Synthesis; Closing remarks

Fellow travellers from the NK: PhDr. Bohdana Stoklasová (BS), Ing. Tomáš Svoboda (TS)

Funding: IOP NDK

Goals of the trip: attendance at a conference with international participation, gaining contacts in the field of long-term preservation of digital documents and electronic legal deposit, and a more detailed insight into the issues of long-term preservation of digital documents (especially) in national libraries

Fulfilment of the goals (specifically): The conclusions of the conference Aligning National Approaches to Digital Preservation largely mirror the conclusions of the workshop The Future of the Past: Shaping new visions for EU research in digital preservation (report available at http://cordis.europa.eu/fp7/ict/telearn-digicult/future-of-the-past_en.pdf), e.g. regarding the missing economic model for the commercial sector, which would treat long-term preservation as an integral part of all its processes. Contact was established with staff of the national libraries of Estonia and Finland (staff responsible for web archiving and for long-term preservation in general).

Programme and further details: The main goal of the conference was to align national approaches to the long-term preservation of digital documents across all areas, from the technical, organizational and educational to standardization, economics and finance.

Materials brought back: conference programme, leaflets of the exhibiting companies (Tessella, Equella, Guardtime) + other materials, notes

Date of submission of the report: 8. 6. 2011
Signature of the submitter:

Signature of the superior:

Posted on the Intranet:
Received in the international department:

Annex to this report: Conference notes in English

Annex: Conference notes in English

Exploring What We Can Do Together: Strategic Alliance for International Collaboration, Laura Campbell

- 185 digital preservation partners in more than 25 countries (education, research, LAM)
- strategic goals: National Content Stewardship Network (national digital collection, technical architecture, public policy, outreach)
- NDIIPP Content Domain Map: a mind map of geospatial, audiovisual, image & text and web content
- then & now: cognitive surplus vs digital libraries / digital preservation
- solution framework: actively working together, special interest groups, establishing a common index, international digital collection (freely available)

PANEL 1: Technical Alignment (The role of testing), Prof. Dr. Michael Seadle (Panel Head)

to collaborate on requiring and implementing rigorous and independent tests

DNB Contribution to the Tallinn Alignment, Sabine Schrimpf

- key theme is infrastructure: a network of hardware and software that permits the operation of application software
- the question of interoperability is crucial (standards, technical specifications)
- software elements
- components of the DP infrastructure were compared to the pallets used on railroads
  o source: PARSE.Insight Roadmap 2020
- persistent ID resolvers, certification process
- kopal (ingest: koLibRI)
- nestor (German Network of Expertise)
- DP4Lib (Digital Preservation as a Service): reduce dependency between components
  o redundant storage at different locations
  o koLibRI modules
- LUKII: set up as an economical LOCKSS network in Germany
- SHAMAN
- APARSEN: wants to bring coherence and cohesion to digital preservation research
  o trends in DP research projects
  o modular DP systems
  o distributed as SOA
  o elimination of technology dependencies

EDINA: The UK LOCKSS Alliance, Adam Rusbridge

- EDINA offers underlying technical support & coordination
- threats to digital stewardship
  o failure (media, HW, SW, network, format obsolescence, natural disaster, economic/organizational failure)
  o attack (insider/outsider)
  o operator error
- source: Requirement to Digital Preservation
- the projects PEPRS and PECAN help identify coverage and requirements for DP

Public testing, Michael Seadle

- traditional physical archiving relies heavily on trusted institutions
- distrust, not trust, needs to be the basis of digital archiving; testing therefore plays a key role
- goals of testing: demonstrate functionality, reveal weaknesses, provide data for planning improvements
- key issues for testing: integrity, authenticity (can the origin or genuineness be shown?), usability (can migration/emulation be demonstrated?), access, financial integrity
- DrWho (drwho1com)
- bit stream testing is the most important; authenticity and usability may be impaired
  o the type of storage media, the number of copies + the frequency of checking and replacement get us to the relevant results
  o no reliable metrics exist, however (what is an acceptable loss, etc.)
- without well documented, peer reviewed, publicly available test results, librarians are buying archiving systems on faith
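The point about bit stream testing, the number of copies and the frequency of checking can be illustrated with a small fixity audit. The sketch below compares SHA-256 digests of several copies and flags the ones that disagree with the majority; the majority rule and all names are assumptions made for the example, not something described in the talk.

```python
import hashlib
from collections import Counter
from typing import Dict


def audit_copies(copies: Dict[str, bytes]) -> dict:
    """Compare the digests of all copies and report those that differ from the majority.

    `copies` maps a storage location label to the bytes read from that location.
    If two digests tie, the one encountered first is treated as the majority.
    """
    digests = {loc: hashlib.sha256(data).hexdigest() for loc, data in copies.items()}
    majority, votes = Counter(digests.values()).most_common(1)[0]
    damaged = sorted(loc for loc, digest in digests.items() if digest != majority)
    return {"majority_digest": majority, "agreeing": votes, "damaged": damaged}
```

Running such an audit on a schedule is one way to turn "number of copies + frequency of checking" into recorded test results; what counts as an acceptable loss remains a policy decision, as the notes point out.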

Presentation without a title, Andy Rauber

- evaluation vs testing vs benchmarking
- DP testing today is evaluation rather than testing, and far from benchmarking (a few tests, but nowhere near a definition of benchmarks)
  o existing evaluations are not repeatable
  o focus on the simple things
  o building the frameworks before having clear test scenarios
- necessary to move towards comparative benchmarking
- what is needed: a commitment that we want a culture of benchmarking and comparative evaluation; an understanding of what we want to benchmark; benchmark data + ground truth; measurement scales and measures that remain constant; knowledge bases to collect these

Organizing digital preservation on an international level, Michelle Gallinger (NDIIPP)

- focus on a national DP agenda
- community driven, action oriented (National Digital Stewardship Alliance)
  o present a distributed national digital collection for the benefit of citizens

The European Research Arena, David Giaretta

- technologies: GEANT, EGEE/EGI
- EU research projects: TIMBUS, BLOG4EVER, SCAPE, ENSURE, APARSEN, ARCOMEM, WF4EVER
  o SCIDIP-ES (2011-2014)
- Alliance for Permanent Access (APA): formed as a legal entity 3 years ago
  o opportunities for networking
- ISO 16363: Audit and Certification of Trustworthy Digital Repositories
- ISO 16919: Requirements for Bodies Providing Audit and Certification

Observation from the MetaArchive Cooperative program, Martin Halbert

- distributed DP programs differ from other programs
  o replication of content, distribution of these replicated copies to distinct geographical locations, and network organization to connect these replicated copies
- MetaArchive: established in 2003, funded by NDIIPP
  o seeks to foster broader awareness of digital preservation issues
- IIPC: members are all institutions that focus on WA
  o 39 members: national and university libraries + other organizations (Internet Archive)
  o ISO standard WARC format for web archives + Heritrix and NutchWAX
  o growing membership (Africa & South America)

DAY 2

Keynote address: International and National Collaboration in the digital age, Gunnar Sahlin

- 2012: a new law for e-deposit
- Samsök (search together) in 2005 (upgrade, new system)
- Swepub and long-term preservation
- Consortium of the Swedish research libraries for licensing e-journals and databases (ICOLC)
- Open Access and e-publishing (all universities have their repositories for e-pubs)
- NL as aggregator for Europeana, TEL, Apres, Athena, EUscreen
  o common system for the preservation of digital materials
  o common search portal for materials from the Swedish National Library and the Swedish National Archive

Raivo Ruusalepp

- standards: RAC, DSA, CIDOC (CRM), PAIMAS, ISA (DG), DDI
- use of information security standards for digital preservation
- information security: administrative and technical (physical = data security vs IT = communication)
- company implemented security measurement with typical cyber crime scenarios
- survey of security
  o provision for information security in national legislation and development plans (12 of the respondents: ISO 27000 series; only 2 formal audits, the rest are looking into it; ½ of the respondents do not use standards or formal measurements to control information security)
  o IT & disaster plan: 65 (data recovery from the off-site location tested: 0)
- alignment
  o better use of community standards for information security and preservation
  o agreement on security requirements

Standards based approach to preservation planning, Matthew Woollard

- ISO 27001: very expensive implementation, 100 000 pounds
- Basic: Data Seal of Approval Guidelines (helps understand your business better)
- Audit and Certification of Trustworthy Digital Repositories
- Memorandum of Understanding to Create a European Framework
- ISO 16363 external, DSA or ISO 16363 self audit

Best Practices & Standards, Bram van der Werf

- self-assessment: trust
- audits/certification: trust
- ISO 30300 (draft): Record Management

PANEL 4: Legal Alignment

Legal deposit & Web Archiving, Adrienne Muir

- legal deposit provisions: purpose, scope, deposit mechanisms, roles & responsibilities & liability, access provisions, sanctions
- implementation: definitions, scope (offline/online, freely available/pay wall), technology neutral, incremental
- legal deposit vs voluntary (interim/hybrid approach, model agreements and licenses, flexibility)
- other legal issues: intellectual property rights, preservation, access, unlawful material, privacy/data protection
- voluntary approaches have disadvantages but may be necessary and can be useful

Breakout Session

- standards bring along alignment by themselves, but only if you use them to the full, not halfway
- depends on the community (users): enforceable or voluntary compliance (standards)
- next steps for standards alignment
  o corpora as benchmarks
  o export/import completeness
  o educational standards
  o validation tools
  o accredited training courses to accredit auditors
  o framework standards

DAY 3

PANEL 5: Educational Alignment

- key elements of the DCC Curation Lifecycle Model
- Framework for the Education Alignment (USA: grad programs for digital preservation, new models for grad programs, internship programs (diverse knowledge), workshops)
  o sharing tools
  o national programs
- related issues to digital preservation
  o nature of costs and business models
  o strategies for selection & appraisal
  o ground roles and responsibilities
  o effectiveness and demand for services
- focus of the panel: factors influencing the actual sustainability of a digital archive
- 2 considerations: collaboration + user demand
- challenges + gaps: span national boundaries, public + private funding, education, exportation, DP certification, competition, funding gaps, policy, selection criteria, roles & responsibilities, standards
- Magazzini Digitali: e-legal deposit in Italy
- PADI: a failed project (discontinued in 2010)

Presentation without a title, Neil Grindley

- how much does it cost to manage information? what institutional financial strategies are required to facilitate effective preservation? what general economic frameworks are required to enable information to persist and be accessible?
- JISC 2010: Infrastructure for Education and Research Programme
- archival storage and preservation activities constitute a very small proportion of the overall costs: 1531; access: 55; outreach, acquisition, ingest
  o approx. 333 Euros for a set of 1000 records
- KRDS2, p. 83: future tool development supporting automation of ingest

Sustainable Preservation in North America: ADPNet & Friends, Aaron Trehub

- solution: distributed digital preservation (in at least 3 copies vs LOCKSS 6 copies)
- DPP + LOCKSS = PLN; open SW developed at Stanford
- MetaArchive, COPPUL (Canada), ADPNet: www.adpn.org

Klaus Rechert: Remote Emulation for Migration Services in a Distributed Preservation Framework

Using emulation as a tool for migration.

Migration tools are often not available for certain formats. We place the digital object into an emulated environment (a virtual machine); we then see it inside the emulated system, can open it in the original or another suitable application, save it in a different format, and store the result again in the virtual machine.

Bram Lohman: Emulation as a Business Solution: the Emulation Framework, KEEP project

- Emulation framework: 7 emulators, 6 platforms (x86, Amiga, etc.), 23 file formats.
- A solution for managing emulation tools.
- Setup of emulation processes.
- An environment that contains emulators; if we load an application or a file into it, it should run as in the original environment.
- The environment also contains a tool that shows, for a given file, what format it is and what environment is needed to run it, based on PRONOM; the environment can then be prepared directly and the file launched in it.
- It loads the SW image from the OPF application database, which is being built.

Geoffrey Brown: Developing Virtual CD-ROM Collections: The Voyager Company Publications

- Publications of one particular publisher on CD, interactive applications for the Mac; of the 200 titles published, only about 50 are available today.
- Emulation on today's systems.
- HDD snapshot directly in the emulator, i.e. it takes one click and is very fast.
- SheepShaver emulator.

Evaluation of the Danish large migration project

Before 1998 they had no formats prescribed by law.
Between 2005 and 2008 they introduced standards.
The evaluation concerns the prescribed standards and the migration into them at the national archive.
The evaluation was done for the body that funded the work.
Between 2005 and 2008 they spent 30 person-years on migration, with 10-15 people working on it; 190 thousand USD was invested; total costs were 26 million USD.

In reality it is not much data: they migrated about 1777 GB.

Various parts of the archive: tapes, population data, data on CD-R, registries, and electronically populated data.
They could not read all the files, especially on the tapes.
5 different types of tape.
Some had to be rescued at great expense.

Total cost of producing the preservation standard: 10 man-years, 12 thousand USD, including the manual and implementation recommendations.

Pilot: project planning and management, and verification of the information packages.

The aim of the pilot was to obtain a better budget and a better project plan.

Some problematic data in old formats, such as old databases, need smart people who do repetitive work over a long period; good knowledge management was needed to make this efficient.

Method of migration: they wrote requirements for the tool and a description of how manual migration should be carried out.

P iacuteprava dat (restructure data a registrovat metadata of IP) a p iacuteprava dokumentace t chmigrovanyacutech IP

Vyacutevoj softwaru inhouse development

Pot ebovali 50 person years na 1

Zaacutev ry

migrace standardniacutech dat je levn jiacute migrace z n kteryacutech pasek standardniacutech je levn jiacute atd

V tina 80 nakladu padla na nestardizovanaacute data p i vyacuterob softwaru na migraci Vyacutevojnaacutestroje na migraci heterogenniacutech dat nebo nestandardniacutech dat je nejdraiacute

Co se nau ili nem li dostate neacute analyzovanaacute stara data

Projekt management m li loose ztratili peniacuteze

Knowledge management dobry popis staryacutech dat a vech jejich typu generaci umiacutest ni atd u naacutes neexistuje a budeme s tiacutem miacutet potiacutee migrace staryacutech dat v NK budeproblematickaacute

Angela Dappert rubust migration workflow pro offline media- Co je archival object hezky slide cd neniacute archive object je to pro ne hand held carrier

lepiacute je bit stable object ten m e miacutet backup atd a k archivniacutemu objektu kteryacute maacute daliacutemetadat logical preservation

- Cd neniacute searchable nedaacute se snadno replikovat ma large manual overhead renderingtechnology zastaraacutevaacute velmi rychle

- Projekt endangered archives optical disks cdr external HD tapes celkem 67 terrabytes- OFFLINE hand held nosi e byly v tom projektu endangered archives velmi variabilniacute

obsahovaly data s drm pod copyrightem a radou t ch probleacutem - Moznosti mezi kteryacutemi se rozhodovali u kadeacuteho zdroje dat - Disk image jeden soubor kteryacute obsahuje vechno co na n m je- Nebo extrakce jen n kteryacutech souboru- Jak d leityacute je ten vlastniacute nosi Pot ebujeme o n m miacutet n jakeacute informace m ou tam byt

stopy po smazaniacute n jakyacutech dat a chceme je t eba miacutet Disk image d lali ze veho moneacuteho hybridniacute dvd Zvuky kde byla i data atd

- Jakyacute disk image byl m li pouiacutet Ne jen jeden formaacutet disk image pro vechna data pro kadeacutespeciaacutelniacute disk image formaacutet

- D lali to robotama disk copying robots n kdy large scal disk copying robots nelo pouiacutetumiacute dob e vyraacuteb t cd ale ne ripovat data z cd

SPOLUFINANCOVAacuteNO ZE STRUKTURAacuteLNIacuteCH FOND EU (EVROPSKEacuteHO FONDU PRO REGIONAacuteLNIacute ROZVOJ) PROST EDNICTVIacuteM IOP

10

- Ud lali si svoji aplikaci s diska stacks a n jakeacute meniacute roboty pouiacutevali LIFO nebo FIFOnakonec pouili fifo lifo mel probleacutemy se zveda kou CD

V Kb promysleli pom rn sloiteacute workflow jak to popsat atd

U kadeacuteho robota m li PC

Probleacutemy m li s radou v ci see presentation

Nenali doby sw pro management imagu jen command liny ale netechnicky staff by nasekalradu bot

Je to hodn lidi ne se to dostane na online

Musiacute byt dob e vychovaniacute flexibiln ale taky umet d lat tidieus jobs systematic patient

POZOR d leiteacute pro NK kde se p evod dat z disk bude takeacute eit a u i eil

SPOLUFINANCOVAacuteNO ZE STRUKTURAacuteLNIacuteCH FOND EU (EVROPSKEacuteHO FONDU PRO REGIONAacuteLNIacute ROZVOJ) PROST EDNICTVIacuteM IOP

11

Keep propjekt Antonio Cuiffreda towards integrated migrationenvironment

- Disk transfer tool P evede disk na image file Obsahuje tady daliacute metadata o file systeacutemua md5 souboru atd

- Keep vyraacuteb jiacute Transfer tool Framework- Magnetic media disk transfer tools for flopy disk komer niacute a opensource- Disk2FDI komer niacute DOS tool velmi p esnyacute image floppy disku trvaacute mu to 1 hodinu a celyacute

to je pak velmi velkyacute desetkraacutet vetiacute ne byl vlastniacute floppy disk Testoval asi 2260 diskutestovali emulaci

- Catweasel komer niacute nastroj je to PCI card bezi na linuxech a win xp ma gui Velkachybovost ale rychlejsi image file kvalita byla nizka

- Nibtools free tool G64 a D54 covers ony C64 dos win linux ale to command linPot ebuje commodor disk drive a special cables Testovali par disku asi p lka nefungovalapak v emulaacutetoru

- Optical media pouili 5 transfer tools u vech stejny cd a dvd a games1 Alcohol 120 komer niacute umiacute obchaacutezet drm atd support Win systems

Ze 13 fungovalo 12 2MB za sekundu2 Deamon tools commercial n kolik typu image files ISOP MDS MDF support win tri ze 13

nbefungovaly3 CloneCD commercial pouiacutevaacute IMG nebo ISO obchaacuteziacute safedisk3 protection support Win

ma gui 11 bylo ok ze 134 Blindwrite commercial podporuje dvd blue ray WM Xbox a daliacute speciaacutelniacute disky

Generuje ISO a n jakyacute proprietaacuterniacute formaacutety iso imagu jeden nefungoval rychlost stahovaniacute5 ImgBurn ne te do image file subchannel informace (nelze posouvat film atd) je to

opensource generuje dvd bin cue img win a linux 4 nefungovali je rychlyacute

Zaacutev ry

Pro magnetickaacute media komer niacute a nekomer niacute vyacutekon neniacute rozdiacutelnyacute disk2FDi je p esnyacute ale velmipomalyacute Keep pouije NibTools

Optical myslet na ochranu proti kopiacuterovaniacute majiacute podobny vyacutekon vdycky budou chyby v t chimages mezi 30 a 10 proc blindWrite umi herniacute disky xbox atd Keep pouije ImgBurn protoe je toopen source

Pro komplex images je lepiacute Blindwrite

P iacutenos pro NK

Zvaacuteit zda by v NK nebylo vhodneacute opravdu ud lat projekt na migraci obsahuCD a DVD na online media Zde prezentujiacute konkreacutetniacute zkuenosti s robotickyacutemzpracovaacutevaacuteniacute a ukazujiacute jakeacute probleacutemy m li s vymyacuteleniacutem workflow volboutypu ISO image atd

BNF archivace webuMajiacute tri vrstvy

Harvest definition collection

Harvest instance crawling metadata

SPOLUFINANCOVAacuteNO ZE STRUKTURAacuteLNIacuteCH FOND EU (EVROPSKEacuteHO FONDU PRO REGIONAacuteLNIacute ROZVOJ) PROST EDNICTVIacuteM IOP

12

ARC files

Collection bude naprilad selektivni volby 2000 pak jednotliveacute harvest instances a pak arcySbiacuterajiacute logy config a reportTohle skladuji takeacute v arcu specialni arc metadata pro kadyacute crawling instance

Premis

Object agent event

Objects1 arc files a metadata arcy2 harvest instances

Harvest event in premis event creation of content files

Events reporty jako extense eventu host report a harvest report

Agents afdministator sw instituitiolns organizations kteryacute perfomujou harvrst

ContainerMD

httpbibnumbnffrcontainerMD v1documentationcontainerMD v1html

zvlaacutetniacute metadata pro v ci z Web Archivu

httpbibnumbnffrcontainerMD v1

odlisny SLA pro ruzny typy materialu pro ruznaacute data z ruznych sklizni shared repository ocekavajiruzne benefity sklils pro ruzne formaty neniacute t eba v instituci dublovat

pristi rok by merl existovsat taky jhov2 modul pro warc

SPOLUFINANCOVAacuteNO ZE STRUKTURAacuteLNIacuteCH FOND EU (EVROPSKEacuteHO FONDU PRO REGIONAacuteLNIacute ROZVOJ) PROST EDNICTVIacuteM IOP

13

memento Meta vyhledava

P iacutenos pro NK

Jejich model archivace webu by se dal vyuiacutet v NK

Cost models daacutenskaacute NK + TUWien- Stephan strodl TU Viden majiacute sv j cost model ale jen small scale automated preservation

action cost se zda- Daacutenska naacuterodniacute knihovna d lali sv j model kteryacute by mel byt univerzaacutelniacute a pouitelnyacute

kdekoli- M ili cost of submission podle standardu paimas- P i po iacutetaacuteni cost pouiacutevajiacute oais a paimas mapuji aktivity na tyhle modely a pak podle toho

odhaduji ceny procesu- Costmodelfordigitalpreservationdk

P iacutenos pro NK

K projektu 0136 tam se eily monosti odhadovaacuteniacute naacuteklad na dlouhodobeacute uloeniacute

Meet RODA a Full Fledged Digital Repository for Long TermPreservation

- P vodn projekt Portugalskeacuteho naacuterodniacuteho archivu sledujeme a n kolik let Te systeacutemRODA podporuje nezaacutevislaacute firma a aacuteste n ho takeacute daacutele vyviacutejiacute Zatiacutem RODA podporuje pouzearchivniacute formaacutet metadat (EAD) ale daliacute vyacutevoj by m l zahrnout i knihovnickeacute formaacutety

- RODA je te sou aacutestiacute projektu SCAPE kde bude moneacute systeacutem daacutele vyviacutejet a kaacutelovat propouitiacute v masivniacute produkci

- httpredminekeepptprojectsroda public

P iacutenos pro NK

Sledovat daliacute vyacutevoj monaacute i pro projekty INCAD + KNAV pro vyacutevoj LTP pro meniacute instituce by tohlemohla byacutet v budoucnu zajiacutemavaacute alternativa

Report on a foreign business trip

Name and surname of the participant: Mgr. Andrea Fojtů (AF)
Workplace according to the organizational structure: 1.5 Department of Long-Term Preservation of Digital Data (ODODD)
Workplace, assignment: 1.5.2 Department of Digital Repository Content Management
Reason for the trip: participation in the conference Aligning National Approaches to Digital Preservation
Place (city): Tallinn
Place (country): Estonia
Date (from-to): 22.-26. 5. 2011

Detailed schedule:
Monday 23. 5.: conference registration; guided tour of the National Library; Keynote Address by Laura Campbell (Library of Congress, USA); Panel 1: Technical Alignment; Panel 2: Organizational Alignment
Tuesday 24. 5.: Keynote Address by Gunnar Sahlin (National Library of Sweden); Panel 3: Standards Alignment; Panel 4: Legal Alignment; Breakout Sessions for panels 3 & 4
Wednesday 25. 5.: Panel 5: Education Alignment; Panel 6: Economic Alignment; Breakout Sessions for panels 5 & 6; Synthesis; Closing remarks

Fellow travellers from NK: PhDr. Bohdana Stoklasová (BS), Ing. Tomáš Svoboda (TS)
Funding: IOP NDK
Objectives of the trip: attending a conference with international participation; making contacts in the field of long-term preservation of digital documents and electronic legal deposit; gaining a more detailed insight into long-term preservation of digital documents, (especially) in national libraries.

Fulfilment of the objectives (specifically): The conclusions of the conference Aligning National Approaches to Digital Preservation largely mirror the conclusions of the workshop The Future of the Past: Shaping new visions for EU research in digital preservation (report available at http://cordis.europa.eu/fp7/ict/telearn-digicult/future-of-the-past_en.pdf), for example regarding the missing economic model for the commercial sector that would treat long-term preservation as an integral part of all its processes. Contact was established with staff of the national libraries of Estonia and Finland (staff working on web archiving and on long-term preservation in general).

Programme and further details: The main aim of the conference was to align national approaches to long-term preservation of digital documents across all areas, from technical, organizational and educational to standardization, economic and financial.

Materials brought back: conference programme, leaflets of exhibiting companies (Tessella, Equella, Guardtime) + other materials, notes
Date of submission of the report: 8. 6. 2011
Signature of the submitter:
Signature of the superior:
Posted on the Intranet:
Received by the international department:
Appendix to this report: conference notes in English

Appendix: Conference notes in English

Exploring What We Can Do Together: Strategic Alliance for International Collaboration (Laura Campbell)

- 185 digital preservation partners in more than 25 countries (education, research, LAM)
- strategic goals: National Content Stewardship Network (national digital collection, technical architecture, public policy, outreach)
- NDIIPP Content Domain Map: a mind map of geospatial, audiovisual, image & text, and web content
- then & now: cognitive surplus vs digital libraries / digital preservation
- solution framework: actively working together, special interest groups, establishing a common index, international digital collection (freely available)

PANEL 1: Technical Alignment (The role of testing), Prof. Dr. Michael Seadle (Panel Head)

- to collaborate on requiring and implementing rigorous and independent tests

DNB Contribution to the Tallinn Alignment (Sabine Schrimpf)

- key theme is infrastructure
- network of hardware and software that permits operation of application of SW
- question of interoperability is crucial (standards, technical specifications)
- SW elements
- components of the DP infrastructure were compared to the pallets at railroads
  o source: PARSE.Insight Roadmap 2020
- persistent ID resolvers, certification process
- kopal (ingest, koLibRI)
- nestor (German Network of Expertise)
- DP4Lib: Digital Preservation as a Service, reduce dependency between components
  o redundant storage at different locations
  o koLibRI modules
- LUKII: set up as an economical LOCKSS network in Germany
- SHAMAN
- APARSEN: wants to bring coherence and cohesion to digital preservation research
  o trends in DP research projects
  o modular DP systems
  o distributed as SOA
  o elimination of technology dependencies

EDINA: The UK LOCKSS Alliance (Adam Rusbridge)

- EDINA offers underlying technical support & coordination
- threats to digital stewardship
  o failure (media, HW, SW, network, format obsolescence, natural disaster, economic/organizational failure)
  o attack (insider/outsider)
  o operator error
- source: Requirement to Digital Preservation
- projects PEPRS and PECAN help identify coverage and requirements for DP

Public testing (Michael Seadle)

- traditional physical archiving relies heavily on trusted institutions
- distrust, not trust, needs to be the basis of digital archiving; testing therefore plays a key role
- goals of testing: demonstrate functionality, reveal weaknesses, provide data for planning improvements
- key issues for testing: integrity, authenticity (can the origin or genuineness be shown), usability (can migration/emulation be demonstrated), access, financial integrity
- DrWho (drwho1.com)
- bit stream testing is the most important; authenticity and usability may be impaired
  o the type of storage media, the number of copies + frequencies of checking and replacement get us to the relevant results
  o no reliable metrics exist, however (what is an acceptable loss, etc.)
- without well-documented, peer-reviewed, publicly available test results, librarians are buying archiving systems on faith
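
The bit-stream testing point can be illustrated with a minimal sketch of a fixity audit across several copies. The storage locations, the sample content and the majority-vote rule are assumptions made for this illustration; they are not taken from the talk.

```python
import hashlib
from collections import Counter


def sha256_of(data: bytes) -> str:
    """Return the SHA-256 digest of a bit stream as a hex string."""
    return hashlib.sha256(data).hexdigest()


def audit_copies(copies: dict[str, bytes]) -> dict[str, bool]:
    """Report, per location, whether its copy matches the majority digest."""
    digests = {location: sha256_of(data) for location, data in copies.items()}
    majority_digest, _ = Counter(digests.values()).most_common(1)[0]
    return {location: digest == majority_digest for location, digest in digests.items()}


# Hypothetical storage locations holding copies of the same object.
copies = {
    "site-a": b"archived bit stream",
    "site-b": b"archived bit stream",
    "site-c": b"archived bit strfam",  # simulated bit rot in one copy
}

report = audit_copies(copies)
damaged = sorted(location for location, ok in report.items() if not ok)
print(damaged)
```

With only two copies a mismatch can be detected but not attributed to one of them, which is one reason the number of copies and the checking frequency matter for any test result.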

Presentation without a title (Andy Rauber)

- evaluation vs testing vs benchmarking
- DP testing and evaluation: evaluation rather than testing, far from benchmarking (few tests, but not near a definition of benchmarks)
  o existing evaluations are not repeatable
  o focus on the simple things
  o building the frameworks before having clear test scenarios
- necessary to move towards comparative benchmarking
- what is needed: commitment to a culture of benchmarking and comparative evaluation; understanding of what we want to benchmark; benchmark data + ground truth; measurement scales and measures that remain constant; knowledge bases to collect these

Organizing digital preservation on an international level (Michelle Gallinger, NDIIPP)

- focus on a national DP agenda
- community-driven, action-oriented (National Digital Stewardship Alliance)
  o present a distributed national digital collection for the benefit of citizens

The European Research Arena (David Giaretta)

- technologies: GEANT, EGEE/EGI
- EU research projects: TIMBUS, BLOG4EVER, SCAPE, ENSURE, APARSEN, ARCOMEM, WF4EVER
  o SCIDIP-ES (2011-2014)
- Alliance for Permanent Access (APA) formed as a legal entity 3 years ago
  o opportunities for networking
- ISO 16363: Audit and Certification of Trustworthy Digital Repositories
- ISO 16919: Requirements for Bodies Providing Audit and Certification

Observation from the MetaArchive Cooperative program (Martin Halbert)

- distributed DP programs differ from other programs
  o replication of content, distribution of these replicated copies to distinct geographical locations, and network organization to connect these replicated copies
- MetaArchive established in 2003, funded by NDIIPP
  o seeks to foster broader awareness of digital preservation issues
- IIPC: members are all institutions that focus on WA
  o 39 members: national and university libraries + other organizations (Internet Archive)
  o ISO standard WARC format for web archives + Heritrix and NutchWAX
  o growing membership (Africa & South America)

DAY 2

Keynote address: International and National Collaboration in the digital age (Gunnar Sahlin)

- 2012: a new law for e-deposit
- Samsök (search together) in 2005 (upgrade, new system)
- Swepub and long-term preservation
- Consortium of the Swedish research libraries for licensing e-journals and databases (ICOLC)
- Open Access and e-publishing (all universities have their repositories for e-pubs)
- NL aggregator for the Europeana: TEL, Apres, Athena, EU screen
  o common system for the preservation of digital materials
  o common search portal for materials from the Swedish National Library and Swedish National Archive

Raivo Ruusalepp

- standards: RAC, DSA, CIDOC (CRM), PAIMAS, ISAD(G), DDI
- use of information security standards for digital preservation
- information security: administrative and technical (physical = data security vs IT = communication)
- company-implemented security measurement with typical cyber crime scenarios
- survey of security
  o provision for information security in national legislation and development plan (12 of the respondents: ISO 27000 series; only 2 formal audits, the rest are looking into it; ½ of the respondents do not use standards or formal measurements to control information security)
  o IT & disaster plan: 65 (data recovery from the off-site location tested: 0)
- alignment
  o better use of community standards for information security and preservation
  o agreement on security requirements

Standards-based approach to preservation planning (Matthew Woollard)

- ISO 27001: very expensive implementation, 100 000 pounds
- basic: Data Seal of Approval Guidelines (helps understand your business better)
- Audit and Certification of Trustworthy Digital Repositories
- Memorandum of Understanding to Create a European Framework
- ISO 16363 external; DSA or ISO 16363 self-audit

Best Practices & Standards (Bram van der Werf)

- self-assessment: trust
- audits/certification: trust
- ISO 30300 (draft): Record Management

PANEL 4: Legal Alignment

Legal deposit & Web Archiving (Adrienne Muir)

- legal deposit provisions: purpose, scope, deposit mechanisms, roles & responsibilities & liability, access provisions, sanctions
- implementation: definitions, scope (offline/online, freely available/pay wall), technology-neutral, incremental
- legal deposit vs voluntary (interim/hybrid approach, model agreements and licenses, flexibility)
- other legal issues: intellectual property rights, preservation, access, unlawful material, privacy/data protection
- voluntary approaches have disadvantages but may be necessary and can be useful

Breakout Session

- standards bring along alignment by themselves, but only if you use them to the full, not halfway
- depends on community (users): enforceable or voluntary compliance (standards)
- next step for standard alignment
  o corpora as benchmarks
  o export/import completeness
  o educational standards
  o validation tools
  o accredited training courses to accredit auditors
  o framework standards

DAY 3

PANEL 5: Educational Alignment

- key elements of the DCC Curation Lifecycle Model
- Framework for the Education Alignment (USA: grad programs for digital preservation, new models for grad programs, internship programs (diverse knowledge), workshops)
  o sharing tools
  o national programs
- related issues to digital preservation
  o nature of costs and business models
  o strategies for selection & appraisal
  o ground roles and responsibilities
  o effectiveness and demand for services
- focus of the panel: factors influencing the actual sustainability of a digital archive
- 2 considerations: collaboration + user demand
- challenges + gaps: span national boundaries, public + private funding, education, exportation, DP certification, competition, funding gaps, policy, selection criteria, roles & responsibilities, standards
- Magazzini Digitali: e-legal deposit in Italy
- PADI: a failed project (discontinued in 2010)

Presentation without a title (Neil Grindley)

- how much does it cost to manage information; what institutional financial strategies are required to facilitate effective preservation; what general economic frameworks are required to enable information to persist and be accessible
- JISC 2010: Infrastructure for Education and Research Programme
- archival storage and preservation activities constitute a very small proportion of the overall costs: 15%; access 31%; outreach, acquisition, ingest 55%
  o approx. 333 Euros for a set of 1000 records
- KRDS2, p. 83: future tool development supporting automation of ingest

Sustainable Preservation in North America: ADPNet & Friends (Aaron Trehub)

- solution: distributed digital preservation (in at least 3 copies vs LOCKSS 6 copies)
- DDP + LOCKSS = PLN; open SW developed at Stanford
- MetaArchive, COPPUL (Canada), ADPNet (www.adpn.org)


CO-FINANCED FROM EU STRUCTURAL FUNDS (EUROPEAN REGIONAL DEVELOPMENT FUND) THROUGH THE IOP

Total cost of producing the preservation standard: 10 man-years, 12 thousand USD, including the manual and implementation recommendations.

Pilot: project planning and management, and verification of the information packages.

The aim of the pilot was to obtain a better budget and project plan.

Some problematic data in old formats, such as old databases, need smart people doing repetitive, long-running work; good knowledge management was needed to make this efficient.

Migration method: they wrote requirements for the tool and a description of how manual migration should be done.

Data preparation (restructuring the data and registering the metadata of the IPs) and preparation of documentation for the migrated IPs.

Software development: in-house development.

They needed 50 person-years for 1

Conclusions

Migration of standard data is cheaper, migration from some standard tapes is cheaper, etc.

Most of the costs (80%) went on non-standardized data when building the migration software. Developing a tool for migrating heterogeneous or non-standard data is the most expensive part.

Lessons learned: the old data had not been analyzed sufficiently.

Project management was loose and they lost money.

Knowledge management: a good description of old data and all their types, generations, locations, etc. does not exist at NK and we will have difficulties with this; migration of old data at NK will be problematic.

Angela Dappert: robust migration workflow for offline media

- What is an archival object (nice slide): a CD is not an archival object, it is just a hand-held carrier. A bit-stable object is better, since it can be backed up etc.; it leads to the archival object, which carries additional metadata (logical preservation).
- A CD is not searchable, cannot be replicated easily, has a large manual overhead, and its rendering technology becomes obsolete very quickly.
- Endangered Archives project: optical disks, CD-R, external HDs, tapes, 67 terabytes in total.
- The offline hand-held carriers in the Endangered Archives project were very varied; they contained data with DRM, under copyright, and with a range of related problems.
- Options they chose between for each data source:
  - Disk image: a single file that contains everything on the carrier.
  - Or extraction of selected files only.
- How important is the carrier itself? Do we need some information about it? It may hold traces of deleted data, which we may want to keep. They made disk images of everything possible: hybrid DVDs, audio discs that also held data, etc.
- Which disk image format should they use? Not a single disk image format for all data; each special case gets its own disk image format.
- They used disk-copying robots; some large-scale disk-copying robots could not be used, since they are good at producing CDs but not at ripping data from CDs.


- They built their own application with disk stacks and some smaller robots; they tried LIFO and FIFO and ended up using FIFO, because LIFO had problems with the CD lifter.

At the KB they thought through a fairly complex workflow, how to describe it, etc.

Each robot had its own PC.

They had problems with a number of things (see the presentation).

They found no good software for managing the images, only command-line tools, with which non-technical staff would make many mistakes.

It takes a lot of people before the content gets online.

Staff must be well trained and flexible, but also able to do tedious jobs: systematic and patient.

NOTE: important for NK, where the transfer of data from disks will also have to be addressed, and has already been addressed.


KEEP project (Antonio Ciuffreda): towards an integrated migration environment

- Disk transfer tool: converts a disk into an image file. The output also carries additional metadata about the file system, the MD5 checksum of the file, etc.
- KEEP is building a Transfer Tool Framework.
- Magnetic media: disk transfer tools for floppy disks, both commercial and open source:
  - Disk2FDI: commercial DOS tool; produces a very accurate floppy disk image, but takes 1 hour and the result is very large, ten times larger than the floppy disk itself. About 2260 disks tested; emulation was tested as well.
  - Catweasel: commercial tool, a PCI card; runs on Linux and Windows XP and has a GUI. High error rate but faster; image file quality was low.
  - Nibtools: free tool; G64 and D64 formats, covers only the C64; runs on DOS, Windows and Linux, but command line only. Needs a Commodore disk drive and special cables. A few disks were tested; about half of them then failed in the emulator.
- Optical media: 5 transfer tools were used, all on the same CDs, DVDs and games:
  1. Alcohol 120%: commercial; can bypass DRM etc.; supports Windows systems. 12 of 13 worked; 2 MB per second.
  2. Daemon Tools: commercial; several image file types (ISO, MDS, MDF); supports Windows. 3 of 13 did not work.
  3. CloneCD: commercial; uses IMG or ISO; bypasses SafeDisc 3 protection; supports Windows; has a GUI. 11 of 13 were OK.
  4. BlindWrite: commercial; supports DVD, Blu-ray, WM, Xbox and other special discs. Generates ISO and some proprietary image formats. One did not work; transfer speed not stated.
  5. ImgBurn: does not read subchannel information into the image file (so seeking within a film is not possible, etc.); open source; generates DVD, BIN, CUE, IMG; Windows and Linux. 4 did not work; it is fast.
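
What the disk transfer tool records alongside an image can be sketched as follows. This is only an illustration under stated assumptions: the field names, the file name `floppy-0001.img` and the `FAT12` label are hypothetical and do not describe the KEEP tool's actual output format.

```python
import hashlib
import json
import os
import tempfile


def describe_image(path: str, file_system: str) -> dict:
    """Build a small metadata record for a disk image file, including its MD5."""
    md5 = hashlib.md5()
    with open(path, "rb") as handle:
        # Read in 1 MiB chunks so large images do not have to fit in memory.
        for chunk in iter(lambda: handle.read(1024 * 1024), b""):
            md5.update(chunk)
    return {
        "image_file": os.path.basename(path),
        "size_bytes": os.path.getsize(path),
        "file_system": file_system,  # supplied by the operator in this sketch
        "md5": md5.hexdigest(),
    }


# Create a stand-in "disk image" so the example is runnable on its own.
with tempfile.TemporaryDirectory() as workdir:
    image_path = os.path.join(workdir, "floppy-0001.img")
    with open(image_path, "wb") as handle:
        handle.write(b"\x00" * 1024)
    record = describe_image(image_path, file_system="FAT12")

print(json.dumps(record, indent=2))
```

Storing the checksum next to the image lets later copies of the image be verified without access to the original carrier.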

Conclusions

For magnetic media: commercial and non-commercial tools do not differ in performance; Disk2FDI is accurate but very slow. KEEP will use NibTools.

Optical media: keep copy protection in mind; the tools perform similarly; there will always be errors in the images, between 10 and 30 percent; BlindWrite handles game discs (Xbox etc.). KEEP will use ImgBurn because it is open source.

For complex images BlindWrite is better.
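
The error range quoted in the conclusions can be checked against the per-tool counts noted above. The sketch assumes that every tool was run on the same 13 discs; the notes state the total of 13 explicitly only for some tools, so the denominator for BlindWrite and ImgBurn is an assumption.

```python
DISCS_TESTED = 13  # assumption: every tool was run on the same 13 discs

# Number of discs that did not work, as recorded in the notes above.
failed_discs = {
    "Alcohol 120%": 1,  # 12 of 13 worked
    "Daemon Tools": 3,  # 3 of 13 did not work
    "CloneCD": 2,       # 11 of 13 were OK
    "BlindWrite": 1,    # one did not work (total of 13 assumed)
    "ImgBurn": 4,       # 4 did not work (total of 13 assumed)
}

failure_rate = {
    tool: failed / DISCS_TESTED * 100 for tool, failed in failed_discs.items()
}

for tool, rate in sorted(failure_rate.items(), key=lambda item: item[1]):
    print(f"{tool:<13} {rate:5.1f}% of discs failed")
```

Under that assumption the failure rates run from roughly 8% to 31%, which is broadly in line with the range given in the conclusions.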

P iacutenos pro NK

Zvaacuteit zda by v NK nebylo vhodneacute opravdu ud lat projekt na migraci obsahuCD a DVD na online media Zde prezentujiacute konkreacutetniacute zkuenosti s robotickyacutemzpracovaacutevaacuteniacute a ukazujiacute jakeacute probleacutemy m li s vymyacuteleniacutem workflow volboutypu ISO image atd

BnF web archiving

They have three layers:

Harvest definition: collection

Harvest instance: crawling metadata


ARC files

A collection would be, for example, the selective "elections 2000" harvest, followed by the individual harvest instances and then the ARC files. They collect logs, config and reports. These are also stored in ARC, as special metadata ARC files for each crawling instance.

PREMIS

Object, agent, event

Objects: 1. ARC files and metadata ARC files; 2. harvest instances

A harvest is recorded as a PREMIS event: creation of content files

Events: reports as event extensions (host report and harvest report)

Agents: administrator, software, institutions and organizations that perform the harvest
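
The object, agent and event roles described above can be sketched as plain data structures. This is a hypothetical illustration, not the BnF's PREMIS profile or the PREMIS XML schema: all class names, field names, identifiers and file names are invented for the example.

```python
from dataclasses import dataclass, field


@dataclass
class Agent:
    identifier: str
    agent_type: str  # e.g. "software", "organization", "person"


@dataclass
class PreservationObject:
    identifier: str
    category: str  # "ARC file", "metadata ARC file" or "harvest instance"


@dataclass
class Event:
    event_type: str
    objects: list[PreservationObject]
    agents: list[Agent]
    # Reports attached to the event as extensions (name -> file reference).
    extensions: dict[str, str] = field(default_factory=dict)


# Hypothetical identifiers for one harvest.
crawler = Agent("agent:crawler", "software")
library = Agent("agent:library", "organization")

harvest = PreservationObject("harvest:0001", "harvest instance")
arc = PreservationObject("arc:0001", "ARC file")
metadata_arc = PreservationObject("arc:0001-meta", "metadata ARC file")

event = Event(
    event_type="creation of content files",
    objects=[harvest, arc, metadata_arc],
    agents=[crawler, library],
    extensions={"host report": "hosts-report.txt", "harvest report": "crawl-report.txt"},
)

print(event.event_type, len(event.objects), sorted(event.extensions))
```

Keeping the reports as extensions of the event, rather than as separate objects, follows the note above that host and harvest reports are treated as event extensions.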

ContainerMD

http://bibnum.bnf.fr/containerMD-v1/documentation/containerMD-v1.html

Special metadata for Web Archive content

http://bibnum.bnf.fr/containerMD-v1

Different SLAs for different types of material and for data from different harvests. Shared repository: they expect various benefits; skills for different formats need not be duplicated within the institution.

Next year a JHOVE2 module for WARC should also exist.


Memento: meta search

Benefit for NK

Their web archiving model could be used at NK.

Cost models: Danish national library + TU Wien

- Stephan Strodl, TU Wien: they have their own cost model, but only small scale; automated preservation action cost.
- Danish national library: they built their own model, which should be universal and usable anywhere.
- They measured the cost of submission according to the PAIMAS standard.
- When calculating costs they use OAIS and PAIMAS: they map activities onto these models and then estimate the prices of processes accordingly.
- costmodelfordigitalpreservation.dk

Benefit for NK

Relevant to project 0136, which dealt with options for estimating the costs of long-term storage.



CO-FINANCED BY THE EU STRUCTURAL FUNDS (EUROPEAN REGIONAL DEVELOPMENT FUND) THROUGH THE IOP

- They built their own application with disc stacks and some smaller robots, working in either LIFO or FIFO order. In the end they used FIFO; LIFO had problems with the CD lifting arm.

At the KB they thought through a fairly complex workflow, how to describe it, etc.

Each robot had its own PC.

They had problems with a number of things; see the presentation.

They did not find good software for managing the images, only command-line tools, and non-technical staff would make many mistakes with those.

It takes a lot of people before the content gets online.

Staff must be well trained and flexible, but also able to do tedious jobs: systematic and patient.

NOTE: important for the NK, where the transfer of data from discs will also have to be addressed, and in part already has been.


KEEP project, Antonio Cuiffreda: towards an integrated migration environment

- Disk transfer tool: converts a disk into an image file. The image also carries additional metadata about the file system, the MD5 of the file, etc.
- KEEP is producing a Transfer Tool Framework.
- Magnetic media: disk transfer tools for floppy disks, commercial and open source.
- Disk2FDI: commercial DOS tool; produces a very accurate floppy disk image, but takes 1 hour per disk, and the result is very large, ten times larger than the floppy disk itself. Tested on about 2260 disks; they also tested emulation.
- Catweasel: commercial tool, a PCI card; runs on Linux and Win XP and has a GUI. Faster, but with a high error rate; image file quality was low.
- Nibtools: free tool; G64 and D64 formats; covers only the C64; runs on DOS, Win and Linux, but command line only. Requires a Commodore disk drive and special cables. They tested a few disks; about half of them then failed in the emulator.
- Optical media: they used 5 transfer tools, all on the same set of CDs, DVDs and games:
1. Alcohol 120%: commercial; can bypass DRM etc.; supports Windows systems. Of 13 discs, 12 worked; 2 MB per second.
2. Daemon Tools: commercial; several image file types (ISO, MDS, MDF); supports Windows; three of 13 did not work.
3. CloneCD: commercial; uses IMG or ISO; bypasses SafeDisc 3 protection; supports Windows; has a GUI; 11 of 13 were OK.
4. BlindWrite: commercial; supports DVD, Blu-ray, WM, Xbox and other special discs. Generates ISO and some proprietary image formats; one disc did not work (transfer speed not noted).
5. ImgBurn: does not read subchannel information into the image file (so, for example, seeking within a film is not possible); it is open source; generates DVD, BIN, CUE and IMG; Win and Linux; 4 discs did not work; it is fast.
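The first point above, an image file accompanied by metadata such as its MD5, can be sketched roughly as follows. This is only an illustration under stated assumptions: the function names, the JSON sidecar layout and the `.json` suffix are hypothetical and are not taken from the KEEP Transfer Tool Framework.

```python
import hashlib
import json
import os


def describe_image(path, chunk_size=1 << 20):
    """Return name, size and MD5 of a disk image, reading it in chunks."""
    md5 = hashlib.md5()
    with open(path, "rb") as fh:
        # Read fixed-size chunks so large images do not have to fit in memory.
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            md5.update(chunk)
    return {
        "file": os.path.basename(path),
        "size_bytes": os.path.getsize(path),
        "md5": md5.hexdigest(),
    }


def write_sidecar(path):
    """Store the description next to the image as <image>.json (assumed layout)."""
    sidecar = path + ".json"
    with open(sidecar, "w", encoding="utf-8") as fh:
        json.dump(describe_image(path), fh, indent=2)
    return sidecar
```

Recomputing the MD5 later and comparing it with the stored value shows whether the image has changed since transfer; file-system metadata would have to come from the transfer tool itself.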

Conclusions

Magnetic media: there is no performance difference between commercial and non-commercial tools. Disk2FDI is accurate but very slow. KEEP will use NibTools.

Optical media: copy protection has to be kept in mind. The tools perform similarly, and there will always be errors in the images, between 10 and 30 percent. BlindWrite can handle game discs (Xbox etc.). KEEP will use ImgBurn because it is open source.

For complex images BlindWrite is better.

Benefit for the NK

Consider whether the NK should actually run a project to migrate the content of CDs and DVDs to online media. The presenters shared concrete experience with robotic processing and showed the problems they had with designing the workflow, choosing the type of ISO image, etc.

BnF web archiving

They have three layers:

- Harvest definition: the collection
- Harvest instance: crawl metadata
- ARC files

A collection would be, for example, the selective harvest for the 2000 elections, followed by the individual harvest instances and then the ARCs. They collect logs, configuration and reports. These are also stored in ARC: a special metadata ARC for each crawl instance.

PREMIS

Object, agent, event.

Objects: 1. ARC files and metadata ARCs; 2. harvest instances.

Harvest event: in PREMIS this is an event, the creation of content files.

Events: reports are attached as extensions of the event (host report and harvest report).

Agents: administrator, software, institutions; the organizations that perform the harvest.
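The object/agent/event split described above can be pictured as a small data structure. The sketch below is only an illustration: the class and field names are hypothetical, it is not the PREMIS schema, and it does not reproduce the BnF implementation.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class PremisObject:
    identifier: str
    kind: str  # assumed labels: "arc-file", "metadata-arc", "harvest-instance"


@dataclass
class PremisAgent:
    identifier: str
    role: str  # assumed labels: "administrator", "software", "institution"


@dataclass
class PremisEvent:
    identifier: str
    event_type: str
    objects: List[PremisObject] = field(default_factory=list)
    agents: List[PremisAgent] = field(default_factory=list)
    # Reports attached as extensions of the event, as in the notes above.
    reports: List[str] = field(default_factory=list)


def harvest_event(instance_id, arc_names, agents):
    """Build one harvest event linking a harvest instance, its ARC files and agents."""
    objects = [PremisObject(instance_id, "harvest-instance")]
    objects += [PremisObject(name, "arc-file") for name in arc_names]
    return PremisEvent(
        identifier=f"event-{instance_id}",
        event_type="harvest",
        objects=objects,
        agents=list(agents),
        reports=["host report", "harvest report"],
    )
```

For example, `harvest_event("h1", ["a.arc.gz"], [PremisAgent("crawler", "software")])` yields one event that references the harvest instance, one ARC file and the crawling software.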

ContainerMD

http://bibnum.bnf.fr/containerMD-v1/documentation/containerMD-v1.html

Special metadata for material from the web archive.

http://bibnum.bnf.fr/containerMD-v1

Different SLAs for different types of material and for data from different harvests. From the shared repository they expect various benefits: skills for different formats do not need to be duplicated within the institution.

Next year there should also be a JHOVE2 module for WARC.

Memento: meta search.

Benefit for the NK

Their web archiving model could be used at the NK.

Cost models: Danish national library + TU Wien

- Stephan Strodl, TU Wien: they have their own cost model, but it seems to cover only the cost of small-scale automated preservation actions.
- The Danish national library built its own model, which should be universal and usable anywhere.
- They measured the cost of submission according to the PAIMAS standard.
- When calculating costs they use OAIS and PAIMAS: they map activities onto these models and then estimate process costs accordingly.
- costmodelfordigitalpreservation.dk

Benefit for the NK

Relevant to project 0136, which dealt with ways of estimating the cost of long-term storage.
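The approach noted above for the Danish model, mapping activities onto OAIS and then pricing them, can be illustrated with a minimal sketch. The function name, the activity tuples and the single hourly rate are assumptions made for the example; they are not taken from the Danish cost model, and any figures passed in would be placeholders.

```python
from collections import defaultdict

# The six functional entities of the OAIS reference model.
OAIS_ENTITIES = {
    "Ingest",
    "Archival Storage",
    "Data Management",
    "Administration",
    "Preservation Planning",
    "Access",
}


def estimate_costs(activities, hourly_rate):
    """Sum cost per OAIS entity from (activity, entity, hours) tuples."""
    totals = defaultdict(float)
    for name, entity, hours in activities:
        if entity not in OAIS_ENTITIES:
            raise ValueError(f"{name!r} is mapped to unknown entity {entity!r}")
        totals[entity] += hours * hourly_rate
    return dict(totals)
```

Forcing every activity through a fixed entity list is the point of the exercise: an activity that cannot be mapped is flagged instead of silently disappearing from the estimate.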

Meet RODA: a Full-Fledged Digital Repository for Long-Term Preservation

- Originally a project of the Portuguese national archive, which we have been following for several years. RODA is now supported by an independent company, which also partly continues its development. So far RODA supports only the archival metadata format (EAD), but further development should also cover library formats.

- RODA is now part of the SCAPE project, where it will be possible to develop the system further and scale it for use in mass production.

- http://redmine.keep.pt/projects/roda-public

Benefit for the NK

Follow further development, possibly also for the INCAD + KNAV projects on LTP development. For smaller institutions this could become an interesting alternative in the future.

Report on a foreign business trip

Name of trip participant: Mgr. Andrea Fojtů (AF)

Workplace by organizational structure: 1.5 Department of Long-Term Preservation of Digital Data (ODODD)
Workplace, unit: 1.5.2 Digital Repository Content Management Department
Reason for trip: participation in the conference Aligning National Approaches to Digital Preservation
Place, city: Tallinn
Place, country: Estonia
Date (from-to): 22-26 May 2011

Detailed schedule:

Monday 23 May: conference registration, guided tour of the National Library; Keynote Address by Laura Campbell, Library of Congress, USA; Panel 1: Technical Alignment; Panel 2: Organizational Alignment

Tuesday 24 May: Keynote Address by Gunnar Sahlin, National Library of Sweden; Panel 3: Standards Alignment; Panel 4: Legal Alignment; Breakout Sessions for panels 3 & 4

Wednesday 25 May: Panel 5: Education Alignment; Panel 6: Economic Alignment; Breakout Sessions for panels 5 & 6; Synthesis; Closing remarks

Fellow travellers from the NK: PhDr. Bohdana Stoklasová (BS), Ing. Tomáš Svoboda (TS)

Funding: IOP NDK

Trip objectives: attendance at a conference with international participation; establishing contacts in the field of long-term preservation of digital documents and electronic legal deposit; a more detailed insight into the issues of long-term preservation of digital documents, especially in national libraries.

Fulfilment of trip objectives (specifically): The conclusions of the conference Aligning National Approaches to Digital Preservation largely mirror the conclusions of the workshop The Future of the Past: Shaping new visions for EU research in digital preservation (report available at http://cordis.europa.eu/fp7/ict/telearn-digicult/future-of-the-past_en.pdf), for example regarding the missing economic model for the commercial sector, which would treat long-term preservation as an integral part of all its processes. Contact was established with staff of the national libraries of Estonia and Finland (staff working on web archiving and on long-term preservation in general).

Programme and further details: The main goal of the conference was to align national approaches to long-term preservation of digital documents across all areas, from technical, organizational and educational through standardization to economic and financial.

Materials brought back: conference programme, leaflets from exhibiting companies (Tessella, Equella, Guardtime) + other materials, notes

Date report submitted: 8 June 2011
Signature of submitter:

Signature of superior:

Posted on the Intranet:
Received by the international department:

Attachment to this report: Conference notes in English

Attachment: Conference notes in English

Exploring What We Can Do Together: Strategic Alliance for International Collaboration, Laura Campbell

- 185 digital preservation partners in more than 25 countries (education, research, LAM)
- strategic goals: National Content Stewardship Network (national digital collection, technical architecture, public policy, outreach)
- NDIIPP Content Domain Map: a mind map of geospatial, audiovisual, image & text, and web content
- then & now: cognitive surplus vs. digital libraries / digital preservation
- solution framework: actively working together, special interest groups, establishing a common index, international digital collection (freely available)

PANEL 1: Technical Alignment (The role of testing), Prof. Dr. Michael Seadle (Panel Head)

- to collaborate on requiring and implementing rigorous and independent tests

DNB Contribution to the Tallinn Alignment, Sabine Schrimpf

- key theme is infrastructure: a network of hardware and software that permits the operation of application software
- the question of interoperability is crucial (standards, technical specifications, SW elements)
- the components of the DP infrastructure were compared to pallets on railroads
  o source: PARSE.Insight Roadmap 2020
- persistent ID resolvers, certification process
- kopal (ingest, koLibRI)
- nestor (German Network of Expertise)
- DP4lib (Digital Preservation as a Service): reduce dependency between components
  o redundant storage at different locations
  o koLibRI modules
- LuKII: set up as an economical LOCKSS network in Germany
- SHAMAN
- APARSEN: wants to bring coherence and cohesion to digital preservation research
  o trends in DP research projects
  o modular DP systems
  o distributed as SOA
  o elimination of technology dependencies

EDINA: The UK LOCKSS Alliance, Adam Russbridge

- EDINA offers underlying technical support & coordination
- threats to digital stewardship
  o failure (media, HW, SW, network, format obsolescence, natural disaster, economic/organizational failure)
  o attack (insider/outsider)
  o operator error
- source: Requirement to Digital Preservation
- projects PEPRS and PECAN help identify coverage and requirements for DP

Public testing, Michael Seadle

- traditional physical archiving relies heavily on trusted institutions
- distrust, not trust, needs to be the basis of digital archiving; testing therefore plays a key role
- goals of testing: demonstrate functionality, reveal weaknesses, provide data for planning improvements
- key issues for testing: integrity, authenticity (can the origin or genuineness be shown?), usability (can migration/emulation be demonstrated?), access, financial integrity
- DrWho (drwho1.com)
- bit-stream testing is the most important; authenticity and usability may be impaired
  o the type of storage media, the number of copies, and the frequency of checking and replacement get us to the relevant results
  o no reliable metrics exist, however (what is an acceptable loss, etc.)
- without well-documented, peer-reviewed, publicly available test results, librarians are buying archiving systems on faith
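Bit-stream testing over several copies, as raised in the notes above, can be illustrated by a simple fixity audit: hash every replica and report those that disagree with the majority. This is a minimal sketch under stated assumptions; the function names and the choice of SHA-256 are not from the talk, and it is not the LOCKSS polling protocol.

```python
import hashlib
from collections import Counter


def sha256_of(path, chunk_size=1 << 20):
    """Hash one file in chunks so large files do not have to fit in memory."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def audit_copies(paths):
    """Compare replicas of one file; return (majority_digest, deviating_paths)."""
    if not paths:
        raise ValueError("no copies given")
    digests = {p: sha256_of(p) for p in paths}
    majority, votes = Counter(digests.values()).most_common(1)[0]
    # Without a strict majority there is no basis for calling any copy correct.
    if votes <= len(paths) / 2:
        raise ValueError("no strict majority among copies")
    deviating = sorted(p for p, d in digests.items() if d != majority)
    return majority, deviating
```

How often such an audit runs and how many copies it covers are exactly the parameters the notes list as decisive; the sketch says nothing about what loss rate would be acceptable.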

Presentation without a title, Andy Rauber

- evaluation vs. testing vs. benchmarking
- DP today: testing and evaluation rather than benchmarking; far from benchmarking (a few tests, but nowhere near a definition of benchmarks)
  o existing evaluations are not repeatable
  o focus on the simple things
  o building the frameworks before having clear test scenarios
- necessary to move towards comparative benchmarking
- what is needed: a commitment that we want a culture of benchmarking and comparative evaluation; an understanding of what we want to benchmark; benchmark data + ground truth; measurement scales and measures that remain constant; knowledge bases to collect these

Organizing digital preservation on an international level, Michelle Gallinger (NDIIPP)

- focus on a national DP agenda
- community-driven, action-oriented (National Digital Stewardship Alliance)
  o present a distributed national digital collection for the benefit of citizens

The European Research Arena, David Giaretta

- technologies: GEANT, EGEE/EGI
- EU research projects: TIMBUS, BLOG4EVER, SCAPE, ENSURE, APARSEN, ARCOMEM, WF4EVER
  o SCIDIP-ES (2011-2014)
- Alliance for Permanent Access (APA): formed as a legal entity 3 years ago
  o opportunities for networking
- ISO 16363: Audit and Certification of Trustworthy Digital Repositories
- ISO 16919: Requirements for Bodies Providing Audit and Certification

Observation from the MetaArchive Cooperative program, Martin Halbert

- distributed DP programs differ from other programs
  o replication of content, distribution of these replicated copies to distinct geographical locations, and network organization to connect these replicated copies
- MetaArchive: established in 2003, funded by NDIIPP
  o seeks to foster broader awareness of digital preservation issues
- IIPC: members are all institutions that focus on web archiving (WA)
  o 39 members: national and university libraries + other organizations (Internet Archive)
  o ISO standard WARC format for web archives + Heritrix and NutchWAX
  o growing membership (Africa & South America)

DAY 2

Keynote address: International and National Collaboration in the Digital Age, Gunnar Sahlin

- 2012: a new law for e-deposit
- Samsök (search together) in 2005 (upgrade, new system)
- Swepub and long-term preservation
- Consortium of the Swedish research libraries for licensing e-journals and databases (ICOLC)
- Open Access and e-publishing (all universities have their repositories for e-pubs)
- NL as aggregator for Europeana, TEL, Apres, Athena, EUscreen
  o common system for the preservation of digital materials
  o common search portal for materials from the Swedish National Library and the Swedish National Archive

Raivo Ruusalepp

- standards: RAC, DSA, CIDOC (CRM), PAIMAS, ISAD(G), DDI
- use of information security standards for digital preservation
- information security: administrative and technical (physical = data security vs. IT = communication)
- company implemented security measurement with typical cyber-crime scenarios
- survey of security
  o provision for information security in national legislation and development plans (12 of the respondents; ISO 27000 series: only 2 formal audits, the rest are looking into it; ½ of the respondents do not use standards or formal measurements to control information security)
  o IT & disaster plan: 65 (data recovery from the off-site location tested: 0)
- alignment
  o better use of community standards for information security and preservation
  o agreement on security requirements

Standards-based approach to preservation planning, Matthew Woollard

- ISO 27001: very expensive implementation, 100 000 pounds
- Basic Data Seal of Approval Guidelines (helps understand your business better)
- Audit and Certification of Trustworthy Digital Repositories
- Memorandum of Understanding to Create a European Framework
- ISO 16363 external; DSA or ISO 16363 self-audit

Best Practices & Standards, Bram van der Werf

- self-assessment trust, audits/certification trust
- ISO 30300 (draft): Records Management

PANEL 4: Legal Alignment

Legal deposit & Web Archiving, Adrienne Muir

- legal deposit provisions: purpose, scope, deposit mechanisms, roles & responsibilities & liability, access provisions, sanctions
- implementation: definitions, scope (offline/online, freely available/pay wall), technology-neutral, incremental
- legal deposit vs. voluntary (interim/hybrid approach, model agreements and licenses, flexibility)
- other legal issues: intellectual property rights, preservation, access, unlawful material, privacy/data protection
- voluntary approaches have disadvantages but may be necessary and can be useful

Breakout Session

- standards bring along alignment by themselves, but only if you use them to the full, not halfway
- depends on community (users): enforceable or voluntary compliance (standards)
- next steps for standards alignment
  o corpora as benchmarks
  o export/import completeness
  o educational standards
  o validation tools
  o accredited training courses to accredit auditors
  o framework standards

DAY 3

PANEL 5: Educational Alignment

- key elements of the DCC Curation Lifecycle Model
- Framework for the Education Alignment (USA: grad programs for digital preservation, new models for grad programs, internship programs (diverse knowledge), workshops)
  o sharing tools
  o national programs
- related issues to digital preservation
  o nature of costs and business models
  o strategies for selection & appraisal
  o ground roles and responsibilities
  o effectiveness and demand for services
- focus of the panel: factors influencing the actual sustainability of a digital archive
- 2 considerations: collaboration + user demand
- challenges + gaps: span national boundaries, public + private funding, education, exportation, DP certification, competition, funding gaps, policy, selection criteria, roles & responsibilities, standards
- Magazzini Digitali: e-legal deposit in Italy
- PADI: a failed project (discontinued in 2010)

Presentation without a title, Neil Grindley

- how much does it cost to manage information? what institutional financial strategies are required to facilitate effective preservation? what general economic frameworks are required to enable information to persist and be accessible?
- JISC 2010: Infrastructure for Education and Research Programme
- archival storage and preservation activities constitute a very small proportion of the overall costs: 15%; access 31%; outreach, acquisition and ingest 55%
  o approx. 333 euros for a set of 1000 records
- KRDS2 p. 83: future tool development supporting automation of ingest

Sustainable Preservation in North America: ADPNet & Friends, Aaron Trehub

- solution: distributed digital preservation (in at least 3 copies vs. LOCKSS 6 copies)
- DDP + LOCKSS = PLN; open SW developed at Stanford
- MetaArchive, COPPUL (Canada), ADPNet (www.adpn.org)

Page 30: Marek Melichar ODODD . . . Preservation Working Group HAAG · 1 Marek Melichar ODODD . . . Preservation Working Group HAAG Datum (od-do) - . . 10. 5. Cesta do Haagu 11. 5. Haagu 12

SPOLUFINANCOVAacuteNO ZE STRUKTURAacuteLNIacuteCH FOND EU (EVROPSKEacuteHO FONDU PRO REGIONAacuteLNIacute ROZVOJ) PROST EDNICTVIacuteM IOP

11

Keep propjekt Antonio Cuiffreda towards integrated migrationenvironment

- Disk transfer tool P evede disk na image file Obsahuje tady daliacute metadata o file systeacutemua md5 souboru atd

- Keep vyraacuteb jiacute Transfer tool Framework- Magnetic media disk transfer tools for flopy disk komer niacute a opensource- Disk2FDI komer niacute DOS tool velmi p esnyacute image floppy disku trvaacute mu to 1 hodinu a celyacute

to je pak velmi velkyacute desetkraacutet vetiacute ne byl vlastniacute floppy disk Testoval asi 2260 diskutestovali emulaci

- Catweasel komer niacute nastroj je to PCI card bezi na linuxech a win xp ma gui Velkachybovost ale rychlejsi image file kvalita byla nizka

- Nibtools free tool G64 a D54 covers ony C64 dos win linux ale to command linPot ebuje commodor disk drive a special cables Testovali par disku asi p lka nefungovalapak v emulaacutetoru

- Optical media pouili 5 transfer tools u vech stejny cd a dvd a games1 Alcohol 120 komer niacute umiacute obchaacutezet drm atd support Win systems

Ze 13 fungovalo 12 2MB za sekundu2 Deamon tools commercial n kolik typu image files ISOP MDS MDF support win tri ze 13

nbefungovaly3 CloneCD commercial pouiacutevaacute IMG nebo ISO obchaacuteziacute safedisk3 protection support Win

ma gui 11 bylo ok ze 134 Blindwrite commercial podporuje dvd blue ray WM Xbox a daliacute speciaacutelniacute disky

Generuje ISO a n jakyacute proprietaacuterniacute formaacutety iso imagu jeden nefungoval rychlost stahovaniacute5 ImgBurn ne te do image file subchannel informace (nelze posouvat film atd) je to

opensource generuje dvd bin cue img win a linux 4 nefungovali je rychlyacute

Zaacutev ry

Pro magnetickaacute media komer niacute a nekomer niacute vyacutekon neniacute rozdiacutelnyacute disk2FDi je p esnyacute ale velmipomalyacute Keep pouije NibTools

Optical myslet na ochranu proti kopiacuterovaniacute majiacute podobny vyacutekon vdycky budou chyby v t chimages mezi 30 a 10 proc blindWrite umi herniacute disky xbox atd Keep pouije ImgBurn protoe je toopen source

Pro komplex images je lepiacute Blindwrite

P iacutenos pro NK

Zvaacuteit zda by v NK nebylo vhodneacute opravdu ud lat projekt na migraci obsahuCD a DVD na online media Zde prezentujiacute konkreacutetniacute zkuenosti s robotickyacutemzpracovaacutevaacuteniacute a ukazujiacute jakeacute probleacutemy m li s vymyacuteleniacutem workflow volboutypu ISO image atd

BNF archivace webuMajiacute tri vrstvy

Harvest definition collection

Harvest instance crawling metadata

SPOLUFINANCOVAacuteNO ZE STRUKTURAacuteLNIacuteCH FOND EU (EVROPSKEacuteHO FONDU PRO REGIONAacuteLNIacute ROZVOJ) PROST EDNICTVIacuteM IOP

12

ARC files

Collection bude naprilad selektivni volby 2000 pak jednotliveacute harvest instances a pak arcySbiacuterajiacute logy config a reportTohle skladuji takeacute v arcu specialni arc metadata pro kadyacute crawling instance

Premis

Object agent event

Objects1 arc files a metadata arcy2 harvest instances

Harvest event in premis event creation of content files

Events reporty jako extense eventu host report a harvest report

Agents afdministator sw instituitiolns organizations kteryacute perfomujou harvrst

ContainerMD

httpbibnumbnffrcontainerMD v1documentationcontainerMD v1html

zvlaacutetniacute metadata pro v ci z Web Archivu

httpbibnumbnffrcontainerMD v1

odlisny SLA pro ruzny typy materialu pro ruznaacute data z ruznych sklizni shared repository ocekavajiruzne benefity sklils pro ruzne formaty neniacute t eba v instituci dublovat

pristi rok by merl existovsat taky jhov2 modul pro warc

SPOLUFINANCOVAacuteNO ZE STRUKTURAacuteLNIacuteCH FOND EU (EVROPSKEacuteHO FONDU PRO REGIONAacuteLNIacute ROZVOJ) PROST EDNICTVIacuteM IOP

13

memento Meta vyhledava

P iacutenos pro NK

Jejich model archivace webu by se dal vyuiacutet v NK

Cost models daacutenskaacute NK + TUWien- Stephan strodl TU Viden majiacute sv j cost model ale jen small scale automated preservation

action cost se zda- Daacutenska naacuterodniacute knihovna d lali sv j model kteryacute by mel byt univerzaacutelniacute a pouitelnyacute

kdekoli- M ili cost of submission podle standardu paimas- P i po iacutetaacuteni cost pouiacutevajiacute oais a paimas mapuji aktivity na tyhle modely a pak podle toho

odhaduji ceny procesu- Costmodelfordigitalpreservationdk

P iacutenos pro NK

K projektu 0136 tam se eily monosti odhadovaacuteniacute naacuteklad na dlouhodobeacute uloeniacute

Meet RODA a Full Fledged Digital Repository for Long TermPreservation

- P vodn projekt Portugalskeacuteho naacuterodniacuteho archivu sledujeme a n kolik let Te systeacutemRODA podporuje nezaacutevislaacute firma a aacuteste n ho takeacute daacutele vyviacutejiacute Zatiacutem RODA podporuje pouzearchivniacute formaacutet metadat (EAD) ale daliacute vyacutevoj by m l zahrnout i knihovnickeacute formaacutety

- RODA je te sou aacutestiacute projektu SCAPE kde bude moneacute systeacutem daacutele vyviacutejet a kaacutelovat propouitiacute v masivniacute produkci

- httpredminekeepptprojectsroda public

P iacutenos pro NK

Sledovat daliacute vyacutevoj monaacute i pro projekty INCAD + KNAV pro vyacutevoj LTP pro meniacute instituce by tohlemohla byacutet v budoucnu zajiacutemavaacute alternativa

Zpraacuteva ze zahrani niacute sluebniacute cesty

Jmeacuteno a p iacutejmeniacute uacute astniacuteka cesty Mgr Andrea Fojt (AF)

Pracovit dle organiza niacute struktury 15 Odd leniacute dlouhodobeacute ochrany digitaacutelniacutech dat (ODODD)Pracovit za azeniacute 152 Odd leniacute spraacutevy obsahu digitaacutelniacuteho repozitaacute eD vod cesty Uacute ast na konferenci Aligning National Approches to Digital

preservationMiacutesto m sto TallinnMiacutesto zem EstonskoDatum (od do) 22 265 2011Podrobnyacute asovyacute harmonogram Pond liacute 235 registrace na konferenci komentovanaacute prohliacutedka

Naacuterodniacute knihovny Keynote Address by Laura Campbell Kongresovaacute knihovna USAPanel 1 Technical AlignmentPanel 2 Organizational Alignment

Uacuteteryacute 245 Keynote Address by Gunnar Sahlin Naacuterodniacuteknihovna veacutedskaPanel 3 Standards AlignmentPanel 4 Legal AlignmentBreakout Sessions for panels 3 amp 4

St eda 255Panel 5 Education AlignmentPanel 6 Economic AlignmentBreakout Sessions for panels 5amp6SynthesisClosing remarks

Spolucestujiacuteciacute z NK PhDr Bohdana Stoklasovaacute (BS)Ing Tomaacute Svoboda (TS)

Finan niacute zajit niacute IOP NDKCiacutele cesty P iacutetomnost na konferenci s mezinaacuterodniacute uacute astiacute ziacuteskaacuteniacute kontakt

pro oblast dlouhodobeacute ochrany digitaacutelniacuteho dokument apovinneacuteho elektronickeacuteho vyacutetisku podrobn jiacute vhled toproblematiky dlouhodobeacute ochrany digitaacutelniacutech dokument(zejmeacutena) v naacuterodniacutech knihovnaacutech

Pln niacute ciacutel cesty (konkreacutetn ) Zaacutev ry konference Aligning National Approaches to DigitalPreservation vesm s kopiacuterujiacute zaacutev ry wokshopu The Future ofthe Past Shaping new visions for EU research in digitalpreservation (zpraacuteva dostupnaacute nahttpcordiseuropaeufp7icttelearn digicultfuture of thepast_enpdf) nap v p iacutepadeacute chyb jiacuteciacute ekonomickeacuteho modelu prokomer niacute sfeacuteru kteraacute by dlouhodobou ochranu vniacutemala jakoneodd litelnou sou aacutest vech svyacutech proces Byl navaacutezaacuten kontakt s pracovniacuteky naacuterodniacutech knihoven Estonska aFinska (pracovniacuteciacute pro archivaci webu a dlouhodobou ochranuobecn )

Program a daliacute podrobn jiacute informace Hlavniacutem ciacutelem konference bylo sjednotit naacuterodniacute postupy v oblastidlouhodobeacute ochrany digitaacutelniacutech dokument nap iacute vemioblastmi od technickyacutech organiza niacutech vzd laacutevaciacutech a postandardiza niacute ekonomickeacute a finan niacute

P ivezeneacute materiaacutely konferen niacute program letaacuteky vystavujiacuteciacutech firem (Tessella EquellaGuardtime) + daliacute materiaacutely zaacutepisky

Datum p edloeniacute zpraacutevy 862011Podpis p edkladatele zpraacutevy

Podpis nad iacutezeneacuteho

Vloeno na IntranetP ijato v mezinaacuterodniacutem odd leniacute

P iacuteloha k teacuteto zpraacutev Poznaacutemky z konference v anglickeacutem jazyce

P iacuteloha Poznaacutemky z konference v anglickeacutem jazyce

Exploring What We Can Do Together Strategic Alliance for International Collaboration Laura Campbell

185 digital preservation partners in more than 25 countries (education research LAM)strategic goals National Content Stewardship Network (national digital collection technicalarchitecture public policy outreachNDIIPP Content Domain Map a mind map of geospatial audiovisual imageamptext and webcontentthen amp now cognitive surplus vs digital librariesdigital preservationsolution framework actively working together special interest groups establishing acommon index international digital collection (freely available)

PANEL 1Technical Alignment (The role of testing) Prof Dr Michael Seadle (Panel Head)

to collaborate on requiring and implementing rigorous and independent tests

DNB Contribution to the Tallinn Alignment Sabine Schrimpf

key theme is infrastructurenetwork of hard and software that permits operation of application of SWquestion of interoperability is crucial (standards technical specifications)SW elementsComponents of the DP infrastructure was compared to the pallets at railroads

o Source PARSEInsight Roadmap 2020PersistentID resolvers certification processkopal (ingest KoLIBri)nestor (German Network of Expertise)DP4Lib Digital Preservation as a Service reduce dependency between components

o redundant storage at different locationso KOLiBRI Modules

LUKII set up as an economical LOCKSS network in GermanySHAMANAPARSEN wants to bring coherence and cohesion to the digital preservation research

o trends in DP research projectso modular DP systemso distributed as SOAo elimination of technology dependencies

EDINA THE UK LOCKSS Alliance Adam Russbridge

EDINA offers underlying technical support amp coordinationthreats to digital stewardship

o failure (media HW SW network format obsolesce natural disastereconomicorganization failure)

o attack (insideroutsider)o operator error

source Requirement to Digital Preservationprojects PEPRS and PECAN help identify coverage and requirements for DP

Public testing Michael Seadle

traditional physical archiving relies heavily on trusted institutionsdistrust not trust need to be the basis of digital archiving testing therefore plays a key rolegoals of testing demonstrate functionality reveal weaknesses provide data for planningimprovementskey issues for testing integrity authenticity (can the origin or geniuses be shown) usability(can migrationemulation be demonstrated) access financial integrityDrWho (drwho1com)bit stream testing is the most important authenticity and usability may be impaired

o the type of storage media the number of copies + frequencies of checking andreplacement get us to the relevant results

o no reliable metrics exists however (what is an acceptable loss etc)without well documented peer reviewed publicly available test results librarians are buyingarchiving systems on faith

Presentation without a title Andy Rauber

evaluation vs testing vs benchmarkingDP testing and testing evaluation rather than testing far from benchmarking (few tests butnot near a definition of benchmarks)

o existing evaluations are not repeatableo focus on the simple thingso building the frameworks before having clear test scenarios

necessary to move towards comparative benchmarkingwhat is needed commit that we want a culture of benchmarking and comparativeevaluation understanding of what we want to benchmark benchmark data + ground truthmeasurement scales and measures that remain constant knowledge bases to collect these

Organizing digital preservation on an international level Michelle Gallinger (NDIIPP)

focus on a national DP agenda
community-driven, action-oriented (National Digital Stewardship Alliance)
  o present a distributed national digital collection for the benefit of citizens

The European Research Arena David Giaretta

technologies: GEANT, EGEE/EGI
EU research projects: TIMBUS, BLOG4EVER, SCAPE, ENSURE, APARSEN, ARCOMEM, WF4EVER
  o SCIDIP-ES (2011-2014)
Alliance for Permanent Access (APA) formed as a legal entity 3 years ago
  o opportunities for networking
ISO 16363 Audit and Certification of Trustworthy Digital Repositories
ISO 16919 Requirements for Bodies Providing Audit and Certification

Observation from the MetaArchive Cooperative program Martin Halbert

distributed DP programs differ from other programs
  o replication of content, distribution of these replicated copies to distinct geographical locations, and network organization to connect these replicated copies
MetaArchive established in 2003, funded by NDIIPP
  o seeks to foster broader awareness of digital preservation issues
IIPC members are all institutions that focus on WA
  o 39 members: national and university libraries + other organizations (Internet Archive)
  o ISO standard WARC format for web archives + Heritrix and NutchWAX
  o growing membership (Africa & South America)

DAY 2

Keynote address: International and National Collaboration in the digital age, Gunnar Sahlin

2012: a new law for e-deposit
Samsök (search together) in 2005 (upgrade, new system)
Swepub and long-term preservation
Consortium of the Swedish research libraries for licensing e-journals and databases (ICOLC)
Open Access and e-publishing (all universities have their repositories for e-pubs)
NL aggregator for the Europeana, TEL, Apres, Athena, EUscreen
  o common system for the preservation of digital materials
  o common search portal for materials from the Swedish National Library and Swedish National Archive

Raivo Ruusalepp

standards: RAC, DSA, CIDOC (CRM), PAIMAS, ISAD(G), DDI
use of information security standards for digital preservation
information security: administrative and technical (physical = data security vs IT = communication)
company implemented security measurement with typical cyber crime scenarios
survey of security
  o provision for information security in national legislation and development plans (12 of the respondents: ISO 27000 series; only 2 formal audits, the rest are looking into it; ½ of the respondents do not use standards or formal measurements to control information security)
  o IT & disaster plan: 65 (data recovery from the off-site location tested: 0)
alignment
  o better use of community standards for information security and preservation
  o agreement on security requirements

Standards-based approach to preservation planning, Matthew Woollard

ISO 27001: very expensive implementation, 100 000 pounds
Basic: Data Seal of Approval Guidelines (helps understand your business better)
Audit and Certification of Trustworthy Digital Repositories
Memorandum of Understanding to Create a European Framework
ISO 16363 external, DSA or ISO 16363 self-audit

Best Practices & Standards, Bram van der Werf

self-assessment trust
audits/certification trust
ISO 30300 (draft) Record Management

PANEL 4 Legal Alignment

Legal deposit & Web Archiving, Adrienne Muir

legal deposit provisions: purpose, scope, deposit mechanisms, roles & responsibilities & liability, access provisions, sanctions
implementation: definitions, scope (offline/online, freely available/paywall), technology-neutral, incremental
legal deposit vs voluntary (interim/hybrid approach, model agreements and licenses, flexibility)
other legal issues: intellectual property rights, preservation, access, unlawful material, privacy/data protection
voluntary approaches have disadvantages but may be necessary and can be useful

Breakout Session

standards bring along alignment by themselves, but only if you use them to the full, not halfway
depends on community (users): enforceable or voluntary compliance (standards)
next steps for standard alignment:
  o corpora as benchmarks
  o export/import completeness
  o educational standards
  o validation tools
  o accredited training courses to accredit auditors
  o framework standards

DAY 3

PANEL 5 Educational Alignment

key elements of the DCC Curation Lifecycle Model
Framework for the Education Alignment (USA: grad programs for digital preservation, new models for grad programs, internship programs (diverse knowledge), workshops)
  o sharing tools
  o national programs
related issues to digital preservation
  o nature of costs and business models
  o strategies for selection & appraisal
  o ground roles and responsibilities
  o effectiveness and demand for services
focus of the panel: factors influencing the actual sustainability of a digital archive
2 considerations: collaboration + user demand
challenges + gaps: span national boundaries, public + private funding, education, exportation, DP certification, competition, funding gaps, policy, selection criteria, roles & responsibilities, standards
Magazzini Digitali: e-legal deposit in Italy
PADI: a failed project (discontinued in 2010)

Presentation without a title Neil Grindley

how much does it cost to manage information? what institutional financial strategies are required to facilitate effective preservation? what general economic frameworks are required to enable information to persist and be accessible?
JISC 2010 Infrastructure for Education and Research Programme
archival storage and preservation activities constitute a very small proportion of the overall costs: 15 %; 31 % access; 55 % outreach, acquisition, ingest
  o approx. 333 Euros for a set of 1000 records
KRDS2 p. 83: future tool development supporting automation of ingest

Sustainable Preservation in North America: ADPNet & Friends, Aaron Trehub

solution: distributed digital preservation (in at least 3 copies vs LOCKSS 6 copies)
DDP + LOCKSS = PLN; open SW developed at Stanford
MetaArchive, COPPUL (Canada), ADPNet (www.adpn.org)

CO-FINANCED BY THE EU STRUCTURAL FUNDS (EUROPEAN REGIONAL DEVELOPMENT FUND) THROUGH THE IOP

ARC files

A collection is, for example, "selective, elections 2000"; below it come the individual harvest instances, and then the ARCs.
They collect logs, config and reports.
These are also stored in an ARC: a special metadata ARC for each crawling instance.

PREMIS

Object, agent, event

Objects:
1. ARC files and metadata ARCs
2. harvest instances

Harvest event: in PREMIS, the event "creation of content files"

Events: reports as an extension of the event (host report and harvest report)

Agents: administrator, software, institutions and organizations that perform the harvest
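The object/agent/event mapping noted above can be sketched as a small data model. This is only an illustration under stated assumptions: the class and field names (`Agent`, `PreservationObject`, `HarvestEvent`, `describe_harvest`) are hypothetical and do not reproduce the PREMIS schema or any library's actual implementation, and the rule that a file name containing "metadata" marks a metadata ARC is invented for the example.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Agent:
    """Who or what performed the harvest (administrator, software, organization)."""
    identifier: str
    kind: str


@dataclass(frozen=True)
class PreservationObject:
    """An ARC file, a metadata ARC, or a whole harvest instance."""
    identifier: str
    kind: str


@dataclass
class HarvestEvent:
    """The harvest, recorded as a 'creation of content files' event."""
    identifier: str
    agents: list
    objects: list
    # Host and harvest reports ride along as an extension of the event.
    reports: dict = field(default_factory=dict)
    event_type: str = "creation of content files"


def describe_harvest(instance_id, arc_names, agents, reports):
    """Build the object list and the single harvest event for one crawl."""
    objects = [PreservationObject(instance_id, "harvest_instance")]
    for name in arc_names:
        # Assumption for this sketch: metadata ARCs carry "metadata" in the name.
        kind = "metadata_arc" if "metadata" in name else "arc_file"
        objects.append(PreservationObject(name, kind))
    return HarvestEvent(f"event-{instance_id}", list(agents), objects, dict(reports))


event = describe_harvest(
    "harvest-001",
    ["harvest-001-0001.arc", "harvest-001-metadata.arc"],
    [Agent("crawler", "software"), Agent("admin-1", "administrator")],
    {"host_report": "hosts.txt", "harvest_report": "crawl.txt"},
)
```

Keeping the reports on the event rather than on the objects follows the note that they extend the event; a real implementation would serialize these records to PREMIS XML.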

ContainerMD

http://bibnum.bnf.fr/containerMD-v1/documentation/containerMD-v1.html

special metadata for items from the web archive

http://bibnum.bnf.fr/containerMD-v1

Different SLAs for different types of material and for different data from different harvests. From a shared repository they expect various benefits: skills for different formats need not be duplicated within each institution.

Next year a JHOVE2 module for WARC should also exist.


Memento: meta search engine

Benefit for the NK

Their web archiving model could be used at the NK.

Cost models: Danish national library + TU Wien
- Stephan Strodl, TU Wien: they have their own cost model, but it seems to cover only the cost of small-scale automated preservation actions
- The Danish national library built its own model, which should be universal and usable anywhere
- They measured the cost of submission according to the PAIMAS standard
- When calculating costs they use OAIS and PAIMAS: they map activities onto these models and estimate process costs accordingly
- costmodelfordigitalpreservation.dk

Benefit for the NK

Relevant to project 0136: the session dealt with options for estimating the costs of long-term storage.

Meet RODA, a Full-Fledged Digital Repository for Long-Term Preservation

- We have been following this project, originally run by the Portuguese national archive, for several years. RODA is now supported by an independent company, which also partly continues its development. So far RODA supports only the archival metadata format (EAD), but further development should also cover library formats.

- RODA is now part of the SCAPE project, where the system can be developed further and scaled for use in mass production.

- http://redmine.keep.pt/projects/roda-public

Benefit for the NK

Follow further development. For the INCAD + KNAV projects, too, this could become an interesting alternative for LTP development at smaller institutions.

Report from a business trip abroad

Name of trip participant: Mgr. Andrea Fojtů (AF)
Workplace per organizational structure: 1.5 Department of Long-Term Preservation of Digital Data (ODODD)
Assigned unit: 1.5.2 Digital Repository Content Management Department
Purpose of trip: participation in the conference Aligning National Approaches to Digital Preservation
Place (city): Tallinn
Place (country): Estonia
Dates (from-to): 22-26 May 2011

Detailed schedule:
Monday 23 May: conference registration, guided tour of the National Library, Keynote Address by Laura Campbell (Library of Congress, USA), Panel 1: Technical Alignment, Panel 2: Organizational Alignment
Tuesday 24 May: Keynote Address by Gunnar Sahlin (National Library of Sweden), Panel 3: Standards Alignment, Panel 4: Legal Alignment, Breakout Sessions for panels 3 & 4
Wednesday 25 May: Panel 5: Education Alignment, Panel 6: Economic Alignment, Breakout Sessions for panels 5 & 6, Synthesis, Closing remarks

Fellow travellers from the NK: PhDr. Bohdana Stoklasová (BS), Ing. Tomáš Svoboda (TS)
Funding: IOP NDK

Trip objectives: attend a conference with international participation; gain contacts in the field of long-term preservation of digital documents and electronic legal deposit; gain deeper insight into long-term preservation of digital documents, (especially) in national libraries.

Fulfilment of trip objectives (specifically): The conclusions of the conference Aligning National Approaches to Digital Preservation largely mirror those of the workshop The Future of the Past: Shaping new visions for EU research in digital preservation (report available at http://cordis.europa.eu/fp7/ict/telearn-digicult/future-of-the-past_en.pdf), for example regarding the missing economic model for the commercial sector, which would treat long-term preservation as an integral part of all its processes. Contact was established with staff of the national libraries of Estonia and Finland (staff working on web archiving and on long-term preservation in general).

Programme and further details: The main goal of the conference was to align national approaches to long-term preservation of digital documents across all areas, from technical, organizational and educational to standardization, economic and financial.

Materials brought back: conference programme, leaflets of exhibiting companies (Tessella, Equella, Guardtime) + other materials, notes

Date report submitted: 8 June 2011
Signature of submitter:
Signature of supervisor:
Posted on the intranet:
Received by the International Department:

Attachment to this report: conference notes in English

Attachment: Conference notes in English

Exploring What We Can Do Together: Strategic Alliance for International Collaboration, Laura Campbell

185 digital preservation partners in more than 25 countries (education, research, LAM)
strategic goals: National Content Stewardship Network (national digital collection, technical architecture, public policy, outreach)
NDIIPP Content Domain Map: a mind map of geospatial, audiovisual, image & text, and web content
then & now: cognitive surplus vs digital libraries/digital preservation
solution framework: actively working together, special interest groups, establishing a common index, international digital collection (freely available)

PANEL 1 Technical Alignment (The role of testing), Prof. Dr. Michael Seadle (Panel Head)

to collaborate on requiring and implementing rigorous and independent tests

DNB Contribution to the Tallinn Alignment, Sabine Schrimpf

key theme is infrastructure
network of hardware and software that permits operation of application SW
question of interoperability is crucial (standards, technical specifications)
SW elements
components of the DP infrastructure were compared to the pallets at railroads
  o Source: PARSE.Insight Roadmap 2020
Persistent ID resolvers, certification process
kopal (ingest: koLibRI)
nestor (German Network of Expertise)
DP4Lib: Digital Preservation as a Service, reduce dependency between components
  o redundant storage at different locations
  o koLibRI modules
LUKII: set up as an economical LOCKSS network in Germany
SHAMAN
APARSEN: wants to bring coherence and cohesion to digital preservation research


o sharing toolso national programs

related issues to digital preservationo nature of costs and business modelso strategies for selection amp appraisalo ground roles and responsibilitieso effectiveness and demand for services

focus of the panel factors influencing the actual sustainability of a digital archive2 considerations collaboration + user demandchallenges + gaps span national boundaries public + private funding educationexportation DP certification competition funding gaps policy selection criteria roles ampresponsibilities standardsMagazzini Digitali e legal deposit in ItalyPADI a failed project (discontinued in 2010)

Presentation without a title Neil Grindley

how much does it cost to manage information what institutional financial strategies arerequired to facilitate effective preservation what general economic frameworks arerequired to enable information to persist and be accessible

JISC 2010 Infrastructure for Education and Research Programmearchival storage and preservation activities are constituting a very small proportion of theoverall costs 1531 access 55 outreach acquisition ingest

o approx 333 Euros for a set of 1000 recordsKRDS2 p 83 future tool development supporting automation of ingest

Sustainable Preservation in North America ADPNet amp Friends Aaron Trehub

solution distributed digital preservation (in at least 3 copies vs LOCKSS 6 copies)DPP + LOCKSS = PLN open SW developed at StanfordMetaArchive COPPUL Canada ADPNet wwwadpnorg


Report on a Business Trip Abroad

Name and surname of the traveller: Mgr. Andrea Fojt (AF)

Workplace per organizational structure: 1.5 Department of Long-Term Preservation of Digital Data (ODODD)
Workplace assignment: 1.5.2 Digital Repository Content Management Department
Purpose of the trip: Participation in the conference Aligning National Approaches to Digital Preservation
Place (city): Tallinn
Place (country): Estonia
Date (from-to): 22-26 May 2011

Detailed schedule:

Monday 23 May: conference registration, guided tour of the National Library, Keynote Address by Laura Campbell (Library of Congress, USA), Panel 1 Technical Alignment, Panel 2 Organizational Alignment

Tuesday 24 May: Keynote Address by Gunnar Sahlin (National Library of Sweden), Panel 3 Standards Alignment, Panel 4 Legal Alignment, Breakout Sessions for panels 3 & 4

Wednesday 25 May: Panel 5 Education Alignment, Panel 6 Economic Alignment, Breakout Sessions for panels 5 & 6, Synthesis, Closing remarks

Fellow travellers from NK: PhDr. Bohdana Stoklasová (BS), Ing. Tomáš Svoboda (TS)

Funding: IOP NDK

Trip objectives: Attendance at a conference with international participation; making contacts in the areas of long-term preservation of digital documents and electronic legal deposit; gaining a more detailed insight into long-term preservation of digital documents, (especially) in national libraries.

Fulfilment of the trip objectives (specifically): The conclusions of the conference Aligning National Approaches to Digital Preservation largely mirror the conclusions of the workshop The Future of the Past: Shaping new visions for EU research in digital preservation (report available at http://cordis.europa.eu/fp7/ict/telearn-digicult/future-of-the-past_en.pdf), for example regarding the missing economic model for the commercial sector, which would treat long-term preservation as an inseparable part of all its processes. Contact was established with staff of the national libraries of Estonia and Finland (staff working on web archiving and on long-term preservation in general).

Programme and further details: The main goal of the conference was to align national approaches to long-term preservation of digital documents across all areas, from technical, organizational and educational through to standardization, economic and financial.

Materials brought back: conference programme, leaflets of exhibiting companies (Tessella, Equella, Guardtime) + other materials, notes

Date of report submission: 8 June 2011
Signature of the submitter:

Signature of the supervisor:

Posted on the intranet:
Received by the International Department:

Attachment to this report: Conference notes in English

Attachment: Conference notes in English

Exploring What We Can Do Together: Strategic Alliance for International Collaboration, Laura Campbell

185 digital preservation partners in more than 25 countries (education, research, LAM)
strategic goals: National Content Stewardship Network (national digital collection, technical architecture, public policy, outreach)
NDIIPP Content Domain Map: a mind map of geospatial, audiovisual, image & text and web content
then & now: cognitive surplus vs digital libraries / digital preservation
solution framework: actively working together, special interest groups, establishing a common index, international digital collection (freely available)

PANEL 1: Technical Alignment (The role of testing), Prof. Dr. Michael Seadle (Panel Head)

to collaborate on requiring and implementing rigorous and independent tests

DNB Contribution to the Tallinn Alignment, Sabine Schrimpf

key theme is infrastructure
network of hardware and software that permits operation of application software
question of interoperability is crucial (standards, technical specifications)
SW elements / components of the DP infrastructure were compared to pallets at railroads
  o Source: PARSE.Insight Roadmap 2020
Persistent ID resolvers, certification process
kopal (ingest: koLibRI)
nestor (German Network of Expertise)
DP4Lib: Digital Preservation as a Service, reduce dependency between components
  o redundant storage at different locations
  o koLibRI modules
LUKII: set up as an economical LOCKSS network in Germany
SHAMAN
APARSEN: wants to bring coherence and cohesion to digital preservation research
  o trends in DP research projects
  o modular DP systems
  o distributed as SOA
  o elimination of technology dependencies

EDINA: The UK LOCKSS Alliance, Adam Rusbridge

EDINA offers underlying technical support & coordination
threats to digital stewardship
  o failure (media, HW, SW, network, format obsolescence, natural disaster, economic/organizational failure)
  o attack (insider/outsider)
  o operator error
source: Requirement to Digital Preservation
projects PEPRS and PECAN help identify coverage and requirements for DP

Public testing, Michael Seadle

traditional physical archiving relies heavily on trusted institutions
distrust, not trust, needs to be the basis of digital archiving; testing therefore plays a key role
goals of testing: demonstrate functionality, reveal weaknesses, provide data for planning improvements
key issues for testing: integrity, authenticity (can the origin or genuineness be shown), usability (can migration/emulation be demonstrated), access, financial integrity
DrWho (drwho1com)
bit stream testing is the most important; authenticity and usability may be impaired
  o the type of storage media, the number of copies + the frequency of checking and replacement get us to the relevant results
  o no reliable metrics exist, however (what is an acceptable loss, etc.)
without well documented, peer reviewed, publicly available test results, librarians are buying archiving systems on faith
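The bit stream testing mentioned above is usually done as a fixity check: a digest recorded at ingest is compared with a digest recomputed later. The following is only a minimal sketch of that idea, not a method described in the talk; the function names (sha256_of, fixity_ok, demo) and the choice of SHA-256 are assumptions made for illustration.

```python
import hashlib
import tempfile
from pathlib import Path


def sha256_of(path, chunk_size=1 << 20):
    """Return the hex SHA-256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with Path(path).open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def fixity_ok(path, recorded_digest):
    """True if the file's current digest equals the digest recorded at ingest."""
    return sha256_of(path) == recorded_digest


def demo():
    """Store a file, check it, flip one bit, check again (hypothetical data)."""
    with tempfile.TemporaryDirectory() as tmp:
        stored = Path(tmp) / "object.bin"
        stored.write_bytes(b"archived payload")
        recorded = sha256_of(stored)          # digest kept with the metadata
        before = fixity_ok(stored, recorded)  # untouched copy
        data = bytearray(stored.read_bytes())
        data[0] ^= 0x01                       # simulate single-bit damage
        stored.write_bytes(bytes(data))
        after = fixity_ok(stored, recorded)   # damaged copy
    return before, after
```

Such a check only detects damage to the bit stream; it says nothing about authenticity or usability, which the notes list as separate test issues.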

Presentation without a title, Andy Rauber

evaluation vs testing vs benchmarking
DP today: evaluation rather than testing, and far from benchmarking (a few tests, but nowhere near a definition of benchmarks)
  o existing evaluations are not repeatable
  o focus on the simple things
  o building the frameworks before having clear test scenarios
necessary to move towards comparative benchmarking
what is needed: a commitment to a culture of benchmarking and comparative evaluation; an understanding of what we want to benchmark; benchmark data + ground truth; measurement scales and measures that remain constant; knowledge bases to collect these

Organizing digital preservation on an international level, Michelle Gallinger (NDIIPP)

focus on a national DP agenda
community driven, action oriented (National Digital Stewardship Alliance)
  o present a distributed national digital collection for the benefit of citizens

The European Research Arena, David Giaretta

technologies: GEANT, EGEE/EGI
EU research projects: TIMBUS, BLOG4EVER, SCAPE, ENSURE, APARSEN, ARCOMEM, WF4EVER
  o SCIDIP-ES (2011-2014)
Alliance for Permanent Access (APA) formed as a legal entity 3 years ago
  o opportunities for networking
ISO 16363: Audit and Certification of Trustworthy Digital Repositories
ISO 16919: Requirements for Bodies Providing Audit and Certification

Observation from the MetaArchive Cooperative program, Martin Halbert

distributed DP programs differ from other programs
  o replication of content, distribution of these replicated copies to distinct geographical locations, and network organization to connect these replicated copies
MetaArchive established in 2003, funded by NDIIPP
  o seeks to foster broader awareness of digital preservation issues
IIPC members are all institutions that focus on WA
  o 39 members: national and university libraries + other organizations (Internet Archive)
  o ISO standard WARC format for web archives + Heritrix and NutchWAX
  o growing membership (Africa & South America)

DAY 2

Keynote address: International and National Collaboration in the digital age, Gunnar Sahlin

2012: a new law for e-deposit
Samsök (search together) in 2005 (upgrade, new system)
Swepub and long-term preservation
Consortium of the Swedish research libraries for licensing e-journals and databases (ICOLC)
Open Access and e-publishing (all universities have their repositories for e-pubs)
NL as aggregator for Europeana: TEL, Apres, Athena, EUscreen
  o common system for the preservation of digital materials
  o common search portal for materials from the Swedish National Library and the Swedish National Archive

Raivo Ruusalepp

standards: RAC, DSA, CIDOC (CRM), PAIMAS, ISA (DG), DDI
use of information security standards for digital preservation
information security: administrative and technical (physical = data security vs IT = communication)
company implemented security measurement with typical cyber crime scenarios
survey of security
  o provision for information security in national legislation and development plans (12 of the respondents: ISO 27000 series; only 2 formal audits, the rest are looking into it; ½ of the respondents do not use standards or formal measurements to control information security)
  o IT & disaster plan: 65 (data recovery from the off-site location tested: 0)
alignment
  o better use of community standards for information security and preservation
  o agreement on security requirements

Standards-based approach to preservation planning, Matthew Woollard

ISO 27001: very expensive implementation (100 000 pounds)
Basic Data Seal of Approval Guidelines (helps understand your business better)
Audit and Certification of Trustworthy Digital Repositories
Memorandum of Understanding to Create a European Framework
ISO 16363 external, DSA or ISO 16363 self-audit

Best Practices & Standards, Bram van der Werf

self-assessment: trust
audits/certification: trust
ISO 30300 (draft): Record Management

PANEL 4: Legal Alignment

Legal deposit & Web Archiving, Adrienne Muir

legal deposit provisions: purpose, scope, deposit mechanisms, roles & responsibilities & liability, access provisions, sanctions
implementation: definitions, scope (offline/online, freely available/pay wall), technology neutral, incremental
legal deposit vs voluntary (interim/hybrid approach, model agreements and licenses, flexibility)
other legal issues: intellectual property rights (preservation, access), unlawful material, privacy/data protection
voluntary approaches have disadvantages but may be necessary and can be useful

Breakout Session

standards bring along alignment by themselves, but only if you use them to the full, not halfway
depends on community (users): enforceable or voluntary compliance (standards)
next steps for standards alignment
  o corpora as benchmarks
  o export/import completeness
  o educational standards
  o validation tools
  o accredited training courses to accredit auditors
  o framework standards

DAY 3

PANEL 5: Educational Alignment

key elements of the DCC Curation Lifecycle Model
Framework for the Education Alignment (USA: grad programs for digital preservation, new models for grad programs, internship programs (diverse knowledge), workshops)
  o sharing tools
  o national programs
related issues to digital preservation
  o nature of costs and business models
  o strategies for selection & appraisal
  o ground roles and responsibilities
  o effectiveness and demand for services
focus of the panel: factors influencing the actual sustainability of a digital archive
2 considerations: collaboration + user demand
challenges + gaps: span national boundaries, public + private funding, education, exportation, DP certification, competition, funding gaps, policy, selection criteria, roles & responsibilities, standards
Magazzini Digitali: e-legal deposit in Italy
PADI: a failed project (discontinued in 2010)

Presentation without a title, Neil Grindley

how much does it cost to manage information? what institutional financial strategies are required to facilitate effective preservation? what general economic frameworks are required to enable information to persist and be accessible?
JISC 2010 Infrastructure for Education and Research Programme
archival storage and preservation activities constitute a very small proportion of the overall costs: 15%, vs 31% access and 55% outreach, acquisition, ingest
  o approx. 333 Euros for a set of 1000 records
KRDS2, p. 83: future tool development supporting automation of ingest

Sustainable Preservation in North America: ADPNet & Friends, Aaron Trehub

solution: distributed digital preservation (in at least 3 copies vs LOCKSS 6 copies)
DDP + LOCKSS = PLN; open SW developed at Stanford
MetaArchive, COPPUL (Canada), ADPNet (www.adpn.org)
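Why several geographically separate copies help can be sketched with a simple majority comparison of digests: with at least three copies, a single damaged copy is outvoted and can be identified for repair. This is a simplified illustration only and is not the LOCKSS polling protocol; the function name audit_replicas and the site names in the usage note are hypothetical.

```python
import hashlib
from collections import Counter


def audit_replicas(replicas):
    """Compare copies of one object held at several sites.

    replicas: non-empty dict mapping a site name to the bytes stored there.
    Returns (majority_digest, damaged_sites). If no digest is held by a
    strict majority of sites, returns (None, all_sites) because no copy
    can be trusted over the others.
    """
    if not replicas:
        raise ValueError("at least one replica is required")
    digests = {site: hashlib.sha256(data).hexdigest()
               for site, data in replicas.items()}
    winner, votes = Counter(digests.values()).most_common(1)[0]
    if votes * 2 <= len(digests):
        return None, sorted(digests)
    damaged = sorted(site for site, d in digests.items() if d != winner)
    return winner, damaged
```

For example, audit_replicas({"site_a": b"x", "site_b": b"x", "site_c": b"y"}) reports site_c as damaged, whereas two disagreeing copies alone cannot be resolved.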

Page 34: Marek Melichar ODODD . . . Preservation Working Group HAAG · 1 Marek Melichar ODODD . . . Preservation Working Group HAAG Datum (od-do) - . . 10. 5. Cesta do Haagu 11. 5. Haagu 12

Program a daliacute podrobn jiacute informace Hlavniacutem ciacutelem konference bylo sjednotit naacuterodniacute postupy v oblastidlouhodobeacute ochrany digitaacutelniacutech dokument nap iacute vemioblastmi od technickyacutech organiza niacutech vzd laacutevaciacutech a postandardiza niacute ekonomickeacute a finan niacute

P ivezeneacute materiaacutely konferen niacute program letaacuteky vystavujiacuteciacutech firem (Tessella EquellaGuardtime) + daliacute materiaacutely zaacutepisky

Datum p edloeniacute zpraacutevy 862011Podpis p edkladatele zpraacutevy

Podpis nad iacutezeneacuteho

Vloeno na IntranetP ijato v mezinaacuterodniacutem odd leniacute

P iacuteloha k teacuteto zpraacutev Poznaacutemky z konference v anglickeacutem jazyce

P iacuteloha Poznaacutemky z konference v anglickeacutem jazyce

Exploring What We Can Do Together Strategic Alliance for International Collaboration Laura Campbell

185 digital preservation partners in more than 25 countries (education research LAM)strategic goals National Content Stewardship Network (national digital collection technicalarchitecture public policy outreachNDIIPP Content Domain Map a mind map of geospatial audiovisual imageamptext and webcontentthen amp now cognitive surplus vs digital librariesdigital preservationsolution framework actively working together special interest groups establishing acommon index international digital collection (freely available)

PANEL 1Technical Alignment (The role of testing) Prof Dr Michael Seadle (Panel Head)

to collaborate on requiring and implementing rigorous and independent tests

DNB Contribution to the Tallinn Alignment Sabine Schrimpf

key theme is infrastructurenetwork of hard and software that permits operation of application of SWquestion of interoperability is crucial (standards technical specifications)SW elementsComponents of the DP infrastructure was compared to the pallets at railroads

o Source PARSEInsight Roadmap 2020PersistentID resolvers certification processkopal (ingest KoLIBri)nestor (German Network of Expertise)DP4Lib Digital Preservation as a Service reduce dependency between components

o redundant storage at different locationso KOLiBRI Modules

LUKII set up as an economical LOCKSS network in GermanySHAMANAPARSEN wants to bring coherence and cohesion to the digital preservation research

o trends in DP research projectso modular DP systemso distributed as SOAo elimination of technology dependencies

EDINA THE UK LOCKSS Alliance Adam Russbridge

EDINA offers underlying technical support amp coordinationthreats to digital stewardship

o failure (media HW SW network format obsolesce natural disastereconomicorganization failure)

o attack (insideroutsider)o operator error

source Requirement to Digital Preservationprojects PEPRS and PECAN help identify coverage and requirements for DP

Public testing Michael Seadle

traditional physical archiving relies heavily on trusted institutionsdistrust not trust need to be the basis of digital archiving testing therefore plays a key rolegoals of testing demonstrate functionality reveal weaknesses provide data for planningimprovementskey issues for testing integrity authenticity (can the origin or geniuses be shown) usability(can migrationemulation be demonstrated) access financial integrityDrWho (drwho1com)bit stream testing is the most important authenticity and usability may be impaired

o the type of storage media the number of copies + frequencies of checking andreplacement get us to the relevant results

o no reliable metrics exists however (what is an acceptable loss etc)without well documented peer reviewed publicly available test results librarians are buyingarchiving systems on faith

Presentation without a title Andy Rauber

evaluation vs testing vs benchmarkingDP testing and testing evaluation rather than testing far from benchmarking (few tests butnot near a definition of benchmarks)

o existing evaluations are not repeatableo focus on the simple thingso building the frameworks before having clear test scenarios

necessary to move towards comparative benchmarkingwhat is needed commit that we want a culture of benchmarking and comparativeevaluation understanding of what we want to benchmark benchmark data + ground truthmeasurement scales and measures that remain constant knowledge bases to collect these

Organizing digital preservation on an international level Michelle Gallinger (NDIIPP)

focus on an national DP agendacommunity driven action oriented (National Digital Stewardship Alliance)

o present a distributed national digital collection for the benefit of citizens

The European Research Arena David Giaretta

technologies GEANT EGEEEGIEU research projects TIMBUS BLOG4EVER SCAPE ENSURE APARSEN ARCOMEM WF4EVER

o SCIDIP ES (2011 2014)Alliance for Permanent Access (APA) formed as a legal entity 3 years ago

o opportunities for networkingISO 16363 Audit and Certification of Trustworthy Digital repositoriesISO 16919 Requirements for Bodies Providing Audit and Certification

Observation from the MetaArchive Cooperative program Martin Halbert

distributed DP programs different from other programso replication of content distribution of these replicated copies to distinct geographical

locations and network organization to connect these replicated copiesMetaArchive established in 2003 funded by NDIIPP

o seeks to foster broader awareness to digital preservation issuesIIPC members are all institutions that focus on WA

o 39 members national university libraries + other organizations (Internet Archive)

o ISO standard WARC format for web archives + Heretrix and Nutchwaxo growing membership (Africa amp South America)

DAY 2

Keynote addressInternational and National Collaboration in the digital age Gunnar Sahlin

2012 a new law for e depositSamsoumlk (search together) in 2005 (upgrade new system)Swepub and long term preservationConsortium of the Swedish research libraries for licensing e journals and databases (ICOLC)Open Access and e publishing (all universities have their repositories for e pubs)NL aggregator for the Europeana TEL Apres Athena EU screen

o common system for the preservation of digital materialso common search portal for materials from the Swedish National Library and Swedish

National Archive

Raivo Ruusalepp

standard RAC DSA CIDOC (CRM) PAIMAS ISA (DG) DDIuse of information security standards for digital preservationinformation security administrative and technical (physical = data security vs IT =communication)company implemented security measurement with typical cyber crime scenariossurvey of security

o provision for information security in national legislation and development plane (12of the respondents ISO 27000 series only 2 formal audits the rest are looking intoit frac12 of the respondents do not use standards or formal measurements to controlinformation security)

o IT amp disaster plan 65 (data recovery from the off site location tested 0)alignment

o better use of community standards for information security and preservationo agreement on security requirements

Standards based approach to preservation planning MatthewWoollard

ISO 27001 very expensive implementation 100 000 poundBasic Data Seal of Approval Guidelines (helps understand your business better)Audit And Certification of Trustworthy Digital RepositoriesMemorandum of Understanding to Create a European FrameworkISO 16363 external DSA or ISO 16363 self audit

Best Practices amp Standards Bram van der Werf

self assessment trust auditscertification trustISO 30300 (draft) Record Management

PANEL 4 Legal Alignment

Legal deposit amp Web Archiving Adrienne Muirlegal deposit provisions purpose scope deposit mechanisms roles amp responsibilities ampliability access provisions sanctionsimplementation definitions scope offlineonline freely availablepay well technologyneutralincrementallegal deposit vs voluntary (interimhybrid approach model agreements and licensesflexibility)other legal issues intellectual property rights preservation access unlawful materialprivacydata protectionvoluntary approaches have disadvantages but maybe necessary and can be useful

Breakout Session

standards bring along alignment by themselves but only if you use the to the full not halfwaydepends on community (users) enforceable or voluntary compliance (standards)next step for standard alignment

o corpora as benchmarkso export import completenesso educational standardso validation toolso accredited training courses to accredit auditorso framework standards

DAY 3PANEL 5 Educational Alignment

key elements of the DCC Curation Lifecycle ModelFramework for the Education Alignment (USA grads programs for digital preservation newmodels for grad programs internship programs (diverse knowledge) workshops)

o sharing toolso national programs

related issues to digital preservationo nature of costs and business modelso strategies for selection amp appraisalo ground roles and responsibilitieso effectiveness and demand for services

focus of the panel factors influencing the actual sustainability of a digital archive2 considerations collaboration + user demandchallenges + gaps span national boundaries public + private funding educationexportation DP certification competition funding gaps policy selection criteria roles ampresponsibilities standardsMagazzini Digitali e legal deposit in ItalyPADI a failed project (discontinued in 2010)

Presentation without a title Neil Grindley

how much does it cost to manage information what institutional financial strategies arerequired to facilitate effective preservation what general economic frameworks arerequired to enable information to persist and be accessible

JISC 2010 Infrastructure for Education and Research Programmearchival storage and preservation activities are constituting a very small proportion of theoverall costs 1531 access 55 outreach acquisition ingest

o approx 333 Euros for a set of 1000 recordsKRDS2 p 83 future tool development supporting automation of ingest

Sustainable Preservation in North America ADPNet amp Friends Aaron Trehub

solution distributed digital preservation (in at least 3 copies vs LOCKSS 6 copies)DPP + LOCKSS = PLN open SW developed at StanfordMetaArchive COPPUL Canada ADPNet wwwadpnorg

Page 35: Marek Melichar ODODD . . . Preservation Working Group HAAG · 1 Marek Melichar ODODD . . . Preservation Working Group HAAG Datum (od-do) - . . 10. 5. Cesta do Haagu 11. 5. Haagu 12

P iacuteloha Poznaacutemky z konference v anglickeacutem jazyce

Exploring What We Can Do Together Strategic Alliance for International Collaboration Laura Campbell

185 digital preservation partners in more than 25 countries (education research LAM)strategic goals National Content Stewardship Network (national digital collection technicalarchitecture public policy outreachNDIIPP Content Domain Map a mind map of geospatial audiovisual imageamptext and webcontentthen amp now cognitive surplus vs digital librariesdigital preservationsolution framework actively working together special interest groups establishing acommon index international digital collection (freely available)

PANEL 1Technical Alignment (The role of testing) Prof Dr Michael Seadle (Panel Head)

to collaborate on requiring and implementing rigorous and independent tests

DNB Contribution to the Tallinn Alignment Sabine Schrimpf

key theme is infrastructurenetwork of hard and software that permits operation of application of SWquestion of interoperability is crucial (standards technical specifications)SW elementsComponents of the DP infrastructure was compared to the pallets at railroads

o Source PARSEInsight Roadmap 2020PersistentID resolvers certification processkopal (ingest KoLIBri)nestor (German Network of Expertise)DP4Lib Digital Preservation as a Service reduce dependency between components

o redundant storage at different locationso KOLiBRI Modules

LUKII set up as an economical LOCKSS network in GermanySHAMANAPARSEN wants to bring coherence and cohesion to the digital preservation research

o trends in DP research projectso modular DP systemso distributed as SOAo elimination of technology dependencies

EDINA THE UK LOCKSS Alliance Adam Russbridge

EDINA offers underlying technical support amp coordinationthreats to digital stewardship

o failure (media HW SW network format obsolesce natural disastereconomicorganization failure)

o attack (insideroutsider)o operator error

source Requirement to Digital Preservationprojects PEPRS and PECAN help identify coverage and requirements for DP

Public testing Michael Seadle

traditional physical archiving relies heavily on trusted institutionsdistrust not trust need to be the basis of digital archiving testing therefore plays a key rolegoals of testing demonstrate functionality reveal weaknesses provide data for planningimprovementskey issues for testing integrity authenticity (can the origin or geniuses be shown) usability(can migrationemulation be demonstrated) access financial integrityDrWho (drwho1com)bit stream testing is the most important authenticity and usability may be impaired

o the type of storage media the number of copies + frequencies of checking andreplacement get us to the relevant results

o no reliable metrics exists however (what is an acceptable loss etc)without well documented peer reviewed publicly available test results librarians are buyingarchiving systems on faith

Presentation without a title Andy Rauber

evaluation vs testing vs benchmarkingDP testing and testing evaluation rather than testing far from benchmarking (few tests butnot near a definition of benchmarks)

o existing evaluations are not repeatableo focus on the simple thingso building the frameworks before having clear test scenarios

necessary to move towards comparative benchmarkingwhat is needed commit that we want a culture of benchmarking and comparativeevaluation understanding of what we want to benchmark benchmark data + ground truthmeasurement scales and measures that remain constant knowledge bases to collect these

Organizing digital preservation on an international level Michelle Gallinger (NDIIPP)

focus on an national DP agendacommunity driven action oriented (National Digital Stewardship Alliance)

o present a distributed national digital collection for the benefit of citizens

The European Research Arena David Giaretta

technologies GEANT EGEEEGIEU research projects TIMBUS BLOG4EVER SCAPE ENSURE APARSEN ARCOMEM WF4EVER

o SCIDIP ES (2011 2014)Alliance for Permanent Access (APA) formed as a legal entity 3 years ago

o opportunities for networkingISO 16363 Audit and Certification of Trustworthy Digital repositoriesISO 16919 Requirements for Bodies Providing Audit and Certification

Observation from the MetaArchive Cooperative program Martin Halbert

distributed DP programs different from other programso replication of content distribution of these replicated copies to distinct geographical

locations and network organization to connect these replicated copiesMetaArchive established in 2003 funded by NDIIPP

o seeks to foster broader awareness to digital preservation issuesIIPC members are all institutions that focus on WA

o 39 members national university libraries + other organizations (Internet Archive)

o ISO standard WARC format for web archives + Heretrix and Nutchwaxo growing membership (Africa amp South America)

DAY 2

Keynote address
International and National Collaboration in the digital age - Gunnar Sahlin

2012: a new law for e-deposit
Samsök (search together) in 2005 (upgrade, new system)
Swepub and long-term preservation
Consortium of the Swedish research libraries for licensing e-journals and databases (ICOLC)
Open Access and e-publishing (all universities have their repositories for e-pubs)
NL aggregator for the Europeana: TEL, Apres, Athena, EUscreen
o common system for the preservation of digital materials
o common search portal for materials from the Swedish National Library and Swedish National Archive

Raivo Ruusalepp

standards: RAC, DSA, CIDOC (CRM), PAIMAS, ISAD(G), DDI
use of information security standards for digital preservation
information security: administrative and technical (physical = data security vs. IT = communication)
company-implemented security measurement with typical cyber crime scenarios
survey of security
o provision for information security in national legislation and development plans (12 of the respondents: ISO 27000 series; only 2 formal audits, the rest are looking into it; ½ of the respondents do not use standards or formal measurements to control information security)
o IT & disaster plan: 65 (data recovery from the off-site location tested: 0)
alignment
o better use of community standards for information security and preservation
o agreement on security requirements

Standards based approach to preservation planning - Matthew Woollard

ISO 27001: very expensive implementation (100 000 pounds)
Basic: Data Seal of Approval Guidelines (helps understand your business better)
Audit and Certification of Trustworthy Digital Repositories
Memorandum of Understanding to Create a European Framework
ISO 16363 external, DSA or ISO 16363 self-audit

Best Practices & Standards - Bram van der Werf

self-assessment: trust
audits/certification: trust
ISO 30300 (draft): Record Management

PANEL 4 Legal Alignment

Legal deposit & Web Archiving - Adrienne Muir

legal deposit provisions: purpose, scope, deposit mechanisms, roles & responsibilities & liability, access provisions, sanctions
implementation: definitions, scope (offline/online, freely available/pay wall), technology-neutral/incremental
legal deposit vs. voluntary (interim/hybrid approach, model agreements and licenses, flexibility)
other legal issues: intellectual property rights, preservation, access, unlawful material, privacy/data protection
voluntary approaches have disadvantages but may be necessary and can be useful

Breakout Session

standards bring along alignment by themselves, but only if you use them to the full, not halfway
depends on community (users): enforceable or voluntary compliance (standards)
next steps for standard alignment
o corpora as benchmarks
o export/import completeness
o educational standards
o validation tools
o accredited training courses to accredit auditors
o framework standards

DAY 3

PANEL 5 Educational Alignment

key elements of the DCC Curation Lifecycle Model
Framework for the Education Alignment (USA: grad programs for digital preservation, new models for grad programs, internship programs (diverse knowledge), workshops)
o sharing tools
o national programs
related issues to digital preservation
o nature of costs and business models
o strategies for selection & appraisal
o ground roles and responsibilities
o effectiveness and demand for services
focus of the panel: factors influencing the actual sustainability of a digital archive
2 considerations: collaboration + user demand
challenges + gaps: span national boundaries, public + private funding, education, exportation, DP certification, competition, funding gaps, policy, selection criteria, roles & responsibilities, standards
Magazzini Digitali: e-legal deposit in Italy
PADI: a failed project (discontinued in 2010)

Presentation without a title - Neil Grindley

how much does it cost to manage information? what institutional financial strategies are required to facilitate effective preservation? what general economic frameworks are required to enable information to persist and be accessible?
JISC 2010 Infrastructure for Education and Research Programme
archival storage and preservation activities constitute a very small proportion of the overall costs (15%; access 31%; outreach/acquisition/ingest 55%)
o approx 333 Euros for a set of 1000 records
KRDS2 p. 83: future tool development supporting automation of ingest

Sustainable Preservation in North America: ADPNet & Friends - Aaron Trehub

solution: distributed digital preservation (in at least 3 copies vs. LOCKSS 6 copies)
DDP + LOCKSS = PLN, open SW developed at Stanford
MetaArchive, COPPUL (Canada), ADPNet (www.adpn.org)
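The idea of keeping at least 3 geographically separate copies can be sketched as a simple fixity audit: hash every copy and flag any site whose copy disagrees with the majority. This is only an illustration under stated assumptions, not the LOCKSS polling protocol; the site names, the sample payloads, and the `audit_replicas` helper are hypothetical.

```python
import hashlib
from collections import Counter
from typing import Dict, List, Tuple


def sha256(data: bytes) -> str:
    """Hex SHA-256 digest of one stored copy."""
    return hashlib.sha256(data).hexdigest()


def audit_replicas(replicas: Dict[str, bytes], min_copies: int = 3) -> Tuple[str, List[str]]:
    """Return (majority digest, sorted sites whose copy disagrees with it)."""
    if len(replicas) < min_copies:
        raise ValueError(f"need at least {min_copies} copies, got {len(replicas)}")
    digests = {site: sha256(data) for site, data in replicas.items()}
    majority, votes = Counter(digests.values()).most_common(1)[0]
    # Without a strict majority there is no trustworthy reference copy.
    if votes <= len(replicas) // 2:
        raise RuntimeError("no majority among replicas; manual review needed")
    damaged = sorted(site for site, digest in digests.items() if digest != majority)
    return majority, damaged


# Hypothetical sites; site_c holds a copy with a single corrupted character.
replicas = {
    "site_a": b"archived object",
    "site_b": b"archived object",
    "site_c": b"archived 0bject",
}
majority, damaged = audit_replicas(replicas)
print(damaged)
```

With three copies, one damaged copy can be detected and repaired from the other two; with only two copies, a mismatch shows that something is wrong but not which copy to trust.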
