Marek Melichar ODODD Preservation Working Group
The Hague
Dates (from-to): 10 May: travel to The Hague
11 May: The Hague; 12 May: 9:30 The Hague, then travel to Prague
NK 0 136
see
10 May - 11:50 departure from
11 May - General
12 May - General
19:00 departure from Amsterdam Schiphol
25 May 2011
IIPC PRESERVATION GROUP, 11 AND 12 MAY 2011, THE HAGUE
Marek Melichar, 13 May 2011
CONTENTS
JHOVE 2
Migration ARC > WARC
Preservation metadata for the web archive
e-Depot
IIPC WG FOR PRESERVATION
JHOVE 2: BnF, by the company ATOS Origin
WARC or HTML. The discussion will be about
JHOVE 1
obtain from the web archive, i.e. install JHOVE2 in
the infrastructure. If we were able to monitor the validation process from
MIGRATION ARC > WARC: debate on migrating ARC to WARC; it is a preservation action
Compare ARC and WARC and decide accordingly
PRESERVATION GROUP
PRESERVATION METADATA FOR THE WEB ARCHIVE
BnF is working on implementing PREMIS for the web archive, either for ARC or for WARC
WEB ARCHIVE
NK
for IOP: ingest of data from the web archive
SOFTWARE AND FORMAT RISKS
documentation for
httpwwwignaciogccomnetpreserverisksphp
E-DEPOT
- it holds (journals, digitized masters and papers, also the web archive)
- a digitized master in 20 s from tape
e-Depot, and will be available in English to everyone
e-Depot
process documentation in the e-Depot for reporting, management, IT, cost analyses, etc.
e-Depot
digitisation in
xy etc.), so arconu process data store also for management and CRM
Data model - follow OPF and what they develop
NTACE
Projects SCAPE and KEEP
EU ever
ISO standard for web archives; how much web archive content there is in the Czech Republic
COST; Recollection tool (loc.gov); SEE WEB; quality assurance in the context of web archiving
Marketing for web archiving
Popularising web archiving among technical staff and IT communities
collection policy, digital strategy
and then in the KB
Salt Lake City, USA
Dates (from-to):
14 May: departure from Prague; 16 May: PREMIS workshop; 17-19 May: Archiving 2011 conference; 20 May: departure from SLC
15 May: flight Washington-Detroit-Salt Lake City; 15 May: workshops T1D "Color in Image Capture Archiving" and T2A "Scanner & Camera Imaging Performance: Benchmarking, Compliance and Workflow Monitoring"; 17-19 May: Archiving 2011 conference
NK, Archiving conference, IOP-NDK
0136
N/A, 6 June 2011
Date: 6 June 2011, Signature; Date, Signature
Date, Signature
PREMIS tutorial
----------------------------------------------------------
Implementation of a High Performance Architecture for Managing and Storing Web-Harvested Collections. Michael Smorul and Joseph JaJa, University of Maryland (USA)
httpswikiumiacsumdeduadaptimages66bArchiving11-smorulpdf
- web archiving
- the WARC manager application
Using Tape for Large-Scale Digital Preservation. Gary Wright, FamilySearch (USA)
- (The Church of Jesus Christ of Latter-day Saints)
NDK - DRPS, Ingest Tools. Storage type: storage grid, NetApp FAS3170
- information lifecycle management; optimisation of the storage layer between the grid and Rosetta
- human error
- hardware errors
- data integrity validation
- maximisation
- integrity verification
metadata, content
Moving On: When it is Time to Re-Archive. Michael Selway, Quantum Corporation (USA)
it cannot be done any other way with HDDs
Data migration: how to approach the migration
- HDD: years
- split migration (full vs. split migration)
FamilySearch: An End-to-End Process for Scanning, Characterizing, Preserving, and Providing Access to Very Large Collections of Vital Records. Tom Creighton, FamilySearch (USA); Jonathan Tilbury, Tessella plc (UK); and Mark Evans, Tessella Inc. (USA)
- at FamilySearch: microfilms from the mid-19th century
- the DPS is their
The Audit and Certification of FDsys
- FDsys audit
TRAC, http://www.gpo.gov/fdsys
- TRAC: HathiTrust, UNT, Portico, MetaArchive, Chronopolis
- compliance scale 1-
How Long is Long-Term Data Storage? (Focal). Barry M. Lunt, Brigham Young University; and Douglas Hansen, Wayne Rust, and Mark Worthington, Millenniata Inc. (USA)
- FLASH technology: electron leakage
- and yet errors appeared
- Library 1: 21
- Library 2: 18
- 1050 years
- HDD: 1-7 years
Quality Assurance of Digital Information in Long-Term Digital Preservation. University of Technology (Sweden)
- significant properties
- in documents from
Towards Interoperable Preservation Repositories: Repository Exchange Package Use Cases and Best Practices. Joseph Pawletko, New York University; and Priscilla Caplan, Florida Center for Library Automation (USA)
- TIPR
- RXP: METS and PREMIS semantics; the metadata files are not inside the METS but alongside it
- succession
- disaster recovery
- as AIP
- software migration
- diversification
- data migration
SARKK: Comprehensive Digital Archive Services for Finnish Municipalities
-Savon Tietohallinto Oy (Finland)
- a company in ICT, long-
- preservation planning functionality
Magnetic Tape Technology: economic advantages for preservation. Gary Francis, Oracle (USA)
- Oracle presentation
- Clipper Group 2010: "In search for the long-term archiving solution: tape delivers significant TCO advantage over disk"
- 5 TB on 1 cartridge (Sun/Oracle)
-----------------------------------------
Color In Digital Preservation. Robert Buckley, University of Rochester/NewMarket Imaging; Steven Puglia, National Archives and Records Administration; and Michael Stelmach, Library of Congress (USA)
- colour gamut
- AdobeRGB, ProPhoto RGB
- colour reproduction
Multispectral Image Archiving of Watermarks in Historical Papers. Peter Meinlschmidt, Wilhelm-Klauditz-Institut, Fraunhofer-Institute for Wood Research, and Volker
Braunschweig (Germany)
- reflected light
- transmitted light
- multispectral imaging
Implementing a Quality Assurance Program for Monitoring Scanner Performance. Michael J. Horsley and John T. Berezich, National Archives and Records Administration (USA)
- microfilm, scanning
- NARA 2004 Guidelines
https://docs.google.com/viewer?url=http%3A%2F%2Fwww.archives.gov%2Fpreservation%2Ftechnical%2Fguidelines.pdf
- Metamorfoze
- etc., see slides
- quantitative performance slides
- web-based database, SharePoint
Preservation in a Digital Age. Jay Verkler, FamilySearch (USA)
- with Tessella, for SDB
- data loss is intrinsic
- preservation as a service
Curation of the End-of-Term Web Archive: Classification and Metrics. Kathleen Murray, Lauren Ko, and Mark Phillips, University of North Texas (USA)
- EOTCD archive: http://research.library.unt.edu/eotcd/wiki/Main_Page
- 16 TB of data
- (not WARCs)
- then it is joined together
DAITSS Grows Up: Migrating to a Second Generation Preservation System (Focal). Priscilla Caplan and Carol Chou, Florida Center for Library Automation (USA)
- DAITSS 1 could not be installed anywhere else :-)
- instructions
- in processes
- PSP
- refresh function
- disseminate: do a refresh and export the new AIP as a DIP
- withdraw: remove the AIP from storage, retaining provenance
- 80 TB
A Community Driven Micro Services Architecture Supporting Long Term Digital Preservation. Mark Evans and Bill Steel, Tessella Inc. (USA); and Robert Sharpe, James Carr, Alan Gairey, and Jonathan Tilbury, Tessella plc (UK)
- with NDIIPP, in micro-services
- joined into a process; they can then be arbitrarily
- workflow
- bit-stream
- cloud (S3, http://aws.amazon.com/s3); SaaS functionality; multi-tenancy will come soon
- functionality, policy, etc.
Note from the USA:
- 44 TB of SIPs in the test (10 MB JP2 files)
- SDB community, founded 2008
System
Lisa LaPlant and Blake Edwards, US Government Printing Office (USA)
FDsys, OAIS, dig MODS - z
<part>2</part>
DMD data model definition z
http://www.gpo.gov/fdsys/search/pagedetails.action?st=prague&granuleId=&packageId=DCPD-200900228
-----------------------------------------
Preservation Starts from the Beginning. Michael Wash, US Department of Transportation (USA)
Autographic Kodak 1916
following the No. 3A Autographic Kodak Special of 1916, which was the first rangefinder camera. It had a Kodak Anastigmat f/6.3 lens and a Kodamatic shutter with speeds from 1/2 to 1/200 sec, plus bulb and time mode. http://camerapedia.wikia.com/wiki/No._1A_Autographic_Kodak_Special
Kodak Advantix
stopped
Colorite: A Flexible Cross-Platform Software Solution for Automatic Image Quality Analysis Using Arbitrary Targets. Henrik Johansson, National Library of Sweden (Sweden)
o the target is detected automatically
o TIFF, JPEG, JP2, PNG
o state-of-the-art feature-based image matching algorithm; ImageMagick
o can be switched off in future to speed up the process for BATCH processing
What if the Image Quality Analysis Rates My D. Dietmar Wueller, Image Engineering (Germany)
- Golden Thread
- UTT target
-lens reflex
Establishing Resolution Requirements for Digitizing Transmissive Content: A Use Case Approach. Michael Stelmach, Library of Congress; Don Williams, Image Science Associates LLC; and Steven Puglia, National Archives and Records Administration (USA)
- based on the 10% SFR limiting resolution criterion: how much of the image information will be captured
o 1st half of the 20th century: 1200-1600 PPI
o 2nd half of the 20th century: up to 2800 PPI
Digitise More, Pay Less: Optimising the Workprocess for both Heritage Institute and Imaging Provider. Olaf Slijkhuis, Pictura Imaginis (the Netherlands)
- document, quality, parameters, etc.
- image inspection
Debate with FamilySearch, 20 May 2011 -------
also on Digital Preservation issues
will have to provide free of charge
and synchronise
an effort on their part to cooperate with archives and libraries
CO-FINANCED BY THE EU STRUCTURAL FUNDS (EUROPEAN REGIONAL DEVELOPMENT FUND) THROUGH THE IOP
Business trip report. Project: Creation of the National Digital Library
CZ 10611000706386
Name of traveller: Jan Hutař
Workplace per organisational structure: ODF 81
Position: head of department
Purpose of trip: attending the iPRES 2011 conference
Place (city): Singapore
Place (country): Singapore
Dates (from-to): 30 October to 5 November 2011
Detailed schedule: 30-31 October: flight Prague > Dubai > Singapore
1 November: start of conference, tutorials
2-4 November: conference
4-5 November: return flight Singapore > Dubai > Prague
Fellow traveller from NK: Mgr. Marek Melichar (funded from project 0136)
Funding: IOP, Creation of the National Digital Library
Relation to project: gaining new information about digital preservation and about projects in other libraries; consultations with colleagues and vendors
Trip goals: see relation to project; use all outputs for planning and running the NDK project and for future work on digital preservation in NK/NDK
Fulfilment of trip goals: fulfilled; see the detailed notes below and the proceedings at SPS
Further details: SUMMARY AND BENEFIT TO THE NDK PROJECT
- a noticeable rise of emulation as a long-term preservation approach (in previous years it was migration); the two approaches look set to complement each other
- a shift towards preserving complex data, databases etc.; NK does not address this yet
- many contributions are usable for NK and NDK (web archiving and preservation at the National Library of France, audits, emulation, information about the SDB system (Tessella) and the RODA system, certification, see Rouchon, etc.)
- information about problems and solutions when using tools such as JHOVE, PRONOM and others in a real environment
- 2 contributions on backing up optical discs, a current problem at NK too; ideally follow the procedures described in the proceedings
- a clear need for international cooperation and for adherence to standards so that such cooperation is possible
- for more detail see below
Project publicity support: N/A
Related materials
Material: conference proceedings. Storage location: SPS, folder with trip reports
Report submitted: 15 November 2011
Signature of submitter
Date, Signature
Signature of supervisor: 15 November 2011
Posted on the intranet
Received by the international department
Seamus Ross: digital curation and preservation
- preserving data sets: use of statistical databases and research data vs. preservation of textual information
- personal data is no longer just photos in a box (Flickr etc.)
- banks hold a lot of personal data that historians will use in the future; it has to be kept, and institutions do keep it
- see mckinsey.com, big data full report (PDF)
- why do DP at all (slide 16): future generations expect it; historians and scientists need sources; a record of the present for the future; an information ecosystem to enable storytelling
- the emphasis is moving from preserving textual information to preserving complex databases
A capability model for DP, Ch. Becker et al.
- research within the SHAMAN and SCAPE projects
- SoS: systems of systems
- 3 kinds of systems (UML):
- a DPS, where DP is a functional requirement
- SoS: a business system as a system within a system; the data then flows into the DPS
- a system where DP is not a functional requirement but which does it anyway (a DP-ready system): a business system with DP functionality
- how to get DP into enterprise systems: a model for implementing DP in any system, developed within the SHAMAN project as a capability-based reference architecture
- governance, business and technical (operation) capabilities; a basis for decisions and for assessing the current state
- capability maturity model (CMM): assessment and improvement processes, taken from software development
Olivier Rouchon: certification and quality at CINES
- they store theses, digitised material, multimedia, documents and scientific data sets
- a data centre for the whole of France
- they have experts on formats and XML, 11 people
- 15TB of data
- certification: by national law CINES is the national centre for DP of theses; they have a department, staff and money for it; procedure and preparations below
- preparing for certification, they tested DRAMBORA, DSA, TRAC, ISO 16363 and ISO 16919
- step: 2009 DRAMBORA audit; two risk reviews a year to check how the risks are being handled
- step: formalisation of business processes, 14 processes according to ISO 9001: management, operational and support processes (presented at iPRES 2010)
- 2009: external pre-audit, 2 people, 19 man-days, based on all available standards, carried out by checking documentation and by interviews
- 2010: SIAF audit, 4 months; carried out by the National Archives of France for every archive that stores public data; they audit every 3 years; the report had 800 pages
- 2010: Data Seal of Approval, part of the EU framework for audit and certification of trusted repositories (an MoU between the three certification activities)
- 2011: within the APARSEN project they also did ISO 16363; they went through that audit together with DANS and UKDA
- first internal, in January and April 2011 (60 man-days), then external, with 12 experts (KB, BL, NASA etc.) in June 2011 (3 days)
- MoU: DSA > ISO 16363 as the second step (internal) > ISO 16363 extended
- an audit gets faster the more of them you do, i.e. if it is done regularly it is not so time-consuming
- the National Library of New Zealand passed TRAC certification in autumn 2011
Andreas Rauber: the impact of preservation actions on repositories
- what happens to the repository itself
- RepoSim, a repository simulation, for analysis and for testing migration: what happens when files get bigger, what if the repository holds more format types, etc.
- RepoSim is a flexible simulator that handles irregular patterns
- so far an internal version: Hibernate, Java, MySQL
- you can specify which formats the repository accepts and their description, the ingest settings, hypothetical tools (mainly for migration), and rules for preservation actions (migration to which format, which version, which files, how many times; rules plus filters)
- you can run a virtual migration; the resulting graphs show how long it will take, etc.
- what to migrate, how, to what and over what period all happens virtually, and we see the result
- good for IT and hardware planning
- good for planning different scenarios, comparing them with the expected development, and planning hardware growth and investment
- still to be done: the option to enter deletion policies, reports, etc.
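The idea of a "virtual migration" can be illustrated with a very small sketch. This is not RepoSim (which is described above only as an internal Java tool); the class names, the single-rule model and the tool throughput below are all assumptions made for the illustration.

```python
from dataclasses import dataclass


@dataclass
class Holding:
    """Files of one format held in a hypothetical repository."""
    fmt: str
    files: int
    avg_size_mb: float


@dataclass
class MigrationRule:
    """Migrate every file of source_fmt using a tool of assumed speed."""
    source_fmt: str
    target_fmt: str
    throughput_mb_s: float  # assumed tool throughput, not a measured value


def virtual_migration_hours(holdings, rule, yearly_growth=0.0, years=0):
    """Estimate the run time of the rule after `years` of compound growth."""
    growth = (1.0 + yearly_growth) ** years
    total_mb = sum(
        h.files * growth * h.avg_size_mb
        for h in holdings
        if h.fmt == rule.source_fmt
    )
    return total_mb / rule.throughput_mb_s / 3600.0


# Made-up holdings: only the TIFF files are affected by the rule.
holdings = [Holding("TIFF", 1000, 36.0), Holding("PDF", 5000, 2.0)]
rule = MigrationRule("TIFF", "JP2", throughput_mb_s=10.0)

for years in (0, 5):
    hours = virtual_migration_hours(holdings, rule, yearly_growth=0.10, years=years)
    print(f"after {years} years: {hours:.2f} h")
```

A real simulator would also model ingest over time, several tools and filters; the sketch only shows why such a run is useful for hardware planning.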
José Barateiro: Risk assessment in DP of e-science data and processes
- DP as risk management
- ISO 31000: definition of risk management
- similar to DRAMBORA
- there are many standards for risk management
- the ISO 31000 methodology broken down into individual steps
- TIMBUS project, http://timbusproject.net; one of the partners is SAP (Germany)
mad talks
- open source software for LTP: RODA is back and is being developed within the SCAPE project; new features, development plans and the start of a user community
- 4 posters on emulation: emulation within KEEP, emulation for library reading rooms, the OPF ecosystem, registries
- TOTEM: a metadata standard for describing the technical environment for emulation
ANDS, Ross Wilkinson
- data centres in Australia, at least 3, for different areas of life
- ANDS has existed for almost 3 years; funded by the Australian government
- huge amounts of data will never be used or read by a human, only mined by automatic processes
- research data must be stored and preserved because it may no longer be possible to recreate it; it must remain reusable, allow new conclusions to be drawn, and be available to scientists
- this has to be done in cooperation; a single institution cannot do it alone
- who deals with storing scientific data in the Czech Republic? The Academy of Sciences? CESNET?
- similar data centres exist in Great Britain too
Rob Sharpe: Considerations for High Throughput Digital Preservation
Presentation by the company Tessella: their performance testing of ingest into SDB at FamilySearch.
- SDB has been developed since 2002, when the first customer was the National Archives (UK)
- new customer: the UK Parliament
- test with FamilySearch
- 20 TB ingested per day; scanned material; workflow with antivirus, characterisation (PRONOM, JHOVE), etc.
- 1 package is roughly 1 GB, i.e. 20,000 packages a day
- 2 Dell PowerEdge R710 servers, together costing at most 20,000 pounds
- the limiting factor turned out to be the read speed of the disks on which the data sits at the start of ingest, so they needed 130 parallel disks (50,000 pounds)
- writing to tape is slow as well, so they needed 8 parallel tape writes (30,000 pounds)
- storage costs 100 pounds per TB
- 7.3 PB per year
- conclusion: writing and reading are slow; tools such as JHOVE and PRONOM are fast enough; storage costs also turned out to be high
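The figures in this list can be cross-checked with a few lines of arithmetic. The sketch below assumes decimal units (1 TB = 10^12 bytes) and a 365-day year; neither is stated in the talk notes, and the function names are made up for the illustration.

```python
TB = 10**12  # bytes; decimal terabyte (assumption, the notes do not say TB vs TiB)
MB = 10**6   # bytes
SECONDS_PER_DAY = 86_400


def sustained_rate_mb_s(tb_per_day):
    """Average rate needed to move tb_per_day within 24 hours."""
    return tb_per_day * TB / SECONDS_PER_DAY / MB


def yearly_volume_pb(tb_per_day, days=365):
    """Volume accumulated in a year, in petabytes."""
    return tb_per_day * days / 1000


def yearly_storage_cost_gbp(tb_per_day, cost_per_tb_gbp, days=365):
    """Cost of storing one year of ingest at a flat price per TB."""
    return tb_per_day * days * cost_per_tb_gbp


rate = sustained_rate_mb_s(20)
volume = yearly_volume_pb(20)
cost = yearly_storage_cost_gbp(20, 100)

print(f"sustained ingest rate: {rate:.1f} MB/s")
print(f"yearly volume: {volume:.1f} PB")
print(f"yearly storage cost: {cost:,} pounds")
```

Under these assumptions 20 TB a day corresponds to the 7.3 PB a year quoted above, and the average rate stays well below the 700 MB/s peak mentioned for the workflow.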
For ingesting the FamilySearch data they needed a throughput of 20 TB of data per day while keeping adequate data-processing procedures as required by OAIS and by the customer. The aim of the project was to identify the bottlenecks when ingesting large volumes of data.
Processes such as generating or checking hashes, identifying formats and extracting technical metadata usually require a fast storage system when volumes are large. In the FamilySearch project they want to ingest into SDB (content acquisition, content preparation, ingest, fixity check, content and metadata integrity check, characterisation, i.e. format identification and validation and extraction of technical metadata) at up to 700 MB per second.
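Hash generation and the later fixity check mentioned above can be sketched with Python's standard `hashlib`. This is a generic illustration, not the SDB implementation; the function names and the 1 MiB chunk size are assumptions. Reading in chunks keeps memory use constant, so the run time is dominated by how fast storage can deliver the bytes, which matches the bottleneck described in the notes.

```python
import hashlib
import os
import tempfile


def file_digest(path, algorithm="sha256", chunk_size=1024 * 1024):
    """Compute a hex digest of a file, reading it in fixed-size chunks."""
    digest = hashlib.new(algorithm)
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def fixity_ok(path, expected_hex, algorithm="sha256"):
    """Fixity check: recompute the digest and compare it with the stored one."""
    return file_digest(path, algorithm) == expected_hex.lower()


if __name__ == "__main__":
    # Small self-contained demonstration on a temporary file.
    with tempfile.NamedTemporaryFile(delete=False) as tmp:
        tmp.write(b"scanned page bytes")
    stored = file_digest(tmp.name)          # recorded at ingest
    print("fixity ok:", fixity_ok(tmp.name, stored))
    os.remove(tmp.name)
```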
The question, then, is how to parallelise such a massive workflow efficiently while minimising cost. According to their findings, parallelisation makes it possible to work around the performance problems of tools such as DROID and JHOVE; contrary to their expectations, overall software performance was not a problem. The bigger problems are in the hardware, which has to be able to write fast enough.
In other words, the bottleneck was in the hardware and in moving data from place to place rather than in the performance of the digital preservation tools.
Benefit for NK
No need to fear the performance of software such as DROID or JHOVE.
Ross King: Evolving domains, problems and solutions for LT DP
- information about the SCAPE project and others
- programme: http://cordis.europa.eu/fp7/ict/telearn-digicult/report-research-digital-preservation_en.pdf (Stephan Strodl, Petar Petrov and Andreas Rauber, Vienna University of Technology, Austria). A nice timeline of preservation projects; a white paper about the past of European DP
- funding spent on DP research is gradually growing; projects and funding alone solve nothing
- ARCOMEM: archiving of web archives; socially driven web preservation model; social web analysis; archive enrichment
- ENSURE: evaluation between cost and value; automation of the preservation cycle; testbeds: healthcare, clinical trials, financial services
- SCAPE: preservation planning and action workflows and how to make them scalable; building an infrastructure for scalable preservation actions; developing a policy-based preservation planning tool with automatic preservation watch; 3 testbeds: web archives, large-scale repositories, research data sets
- all the projects will produce prototype software
- digital lifecycle approach
- preservation planning plays a role in all of these projects, together with virtualisation
- a slide with DP trends of recent years
- Research on Digital Preservation within projects co-funded by the European Union in the ICT: Ensure, Scape, Wf4Ever (http://www.wf4ever-project.org/about), Timbus (software alone is not enough; it focuses on the context of organisations; LTP is not only about objects but also about services, etc.), Totem
Benefit for NK
Follow projects in the area of long-term preservation of digital data. Recent EU projects such as SCAPE will speed up the development of concrete tools for long-term preservation of digital data.
Record keeping in temporary command settings, Erik Borglund
- preserving documentation of crisis situations arising from the work of the police etc.
- how to capture context: flipcharts, videos and minutes can be kept, but what about the context?
- analogue documents are not a problem; the problem is with digital material and conversations
- the national archive should take care of it, but it only accepts paper documents or, for example, photos from the scene
- the question is whether archiving file records is the same as archiving the course of a meeting in digital form
webarchiving session
BnF
- 200TB of web-archived data, 15 million ARC files
- they have to characterise and validate them, which is time-consuming
- they store them in a shared repository
- SPAR (the LTP system of the National Library of France) has a capacity of 16PB
- they use JHOVE2 for characterisation and are building a module for ARC files
- they do not want to characterise and validate the content of the ARC files, only identify formats
- PREMIS in METS would be too long, so they will record only metadata at the level of the information package (AIP) that is the same for the whole package; that is, a property is expressed once and then only the list of files it applies to is added, instead of repeating the information for every file
- they created a special metadata format
- so they can ask the LTP system "give me all information packages that contain format XY", etc.; there is no need to index the metadata of the contents, which would take a long time
- they take the same approach for digitised books
- different DP policies and validation levels for different types of web archive data: complete harvests vs. thematic harvests
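The package-level approach described for BnF (state a property once, then list the files it applies to) can be sketched as a simple grouping. This does not reproduce BnF's actual metadata format, which the notes do not describe; the function names, the use of MIME types as the "property" and the package identifiers are assumptions for illustration.

```python
from collections import defaultdict


def summarise_package(file_formats):
    """Group files by format so each format is stated once per package.

    file_formats maps a file name to its identified format.
    """
    by_format = defaultdict(list)
    for filename, fmt in sorted(file_formats.items()):
        by_format[fmt].append(filename)
    return dict(by_format)


def packages_with_format(package_summaries, fmt):
    """Answer 'give me all information packages that contain format X'."""
    return sorted(
        package_id
        for package_id, summary in package_summaries.items()
        if fmt in summary
    )


# Hypothetical packages with made-up content.
packages = {
    "aip-001": summarise_package(
        {"a.html": "text/html", "b.html": "text/html", "c.gif": "image/gif"}
    ),
    "aip-002": summarise_package({"d.html": "text/html"}),
}

print(packages["aip-001"])
print(packages_with_format(packages, "image/gif"))
```

The query only touches the per-package summaries, so the contents of the harvested files never need to be indexed.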
NL NZ
- 2 harvests, 20 TB together
- they are dealing with metadata: how much metadata is a lot and how much is too little
- library policy says that as much metadata as possible must be stored, but that would be a problem in terms of metadata size
- for selective web harvesting they have a finished workflow in WCT; everything is catalogued
IA
- 16 billion URLs
- the oldest from 1996
- growth is 3TB a day, 1PB a year
Euan Cochrane, Dirk von Suchodoletz: Replicating Installed Application and Information Environments onto Emulated or Virtualized Hardware
- capturing and keeping a whole environment on emulated hardware
- e.g. take the prime minister's desktop environment and store it in an archive
- rendering problems
- computer forensics
- an option for preserving scientific data and records
- the whole thing is about how to replicate an HDD and run the environment on it in a virtual environment
- solution: they pulled the HDDs out of several old PCs > identify the hardware requirements (HDD analysis > automatic estimate of the requirements; this information is part of every PC environment) > choose emulation/virtualisation software (a tool registry such as TOTEM from the KEEP project) > convert the HDD into a disk image suitable for emulation > try to boot the disk image on the emulated hardware > add drivers
- problems with licences, personal data protection and authenticity (20 things change, colours etc.)
- QEMU, SPARC processor emulator
Klaus Rechert: Remote Emulation for Migration Services in a Distributed Preservation Framework
Using emulation as a migration tool.
Tools for migrating certain formats are often not available. We put the digital object into an emulated environment (a virtual machine), where we then see it inside the emulated system; we can open it in the original or another suitable application, save it in a different format, and store it again in the virtual machine.
Bram Lohman: Emulation as a Business Solution: the Emulation Framework (KEEP project)
- emulation framework: 7 emulators, 6 platforms (x86, Amiga and others), 23 file formats
- a solution for managing emulation tools
- setup of emulation processes
- an environment that contains the emulators; if we load an application or a file into it, it should run as in the original environment
- the environment also contains a tool which, based on PRONOM, shows what format a file is and what environment is needed to run it; the environment can then be prepared straight away and the file run in it
- it loads the software image from the OPF application database, which is being built
Geoffrey Brown: Developing Virtual CD-ROM Collections: The Voyager Company Publications
- publications of one particular publisher on CD, interactive applications for Mac; of 200 published titles only about 50 are available now
- emulation on today's systems
- HDD snapshot directly in the emulator, i.e. it takes one click and is very fast
- SheepShaver emulator
Evaluation of a large Danish migration project
- before 1998 they had no formats laid down by law
- between 2005 and 2008 they introduced standards
- the evaluation concerns the standards that were set and the migration to them in the national archive
- the evaluation was done for the funder
- between 2005 and 2008 they spent 30 person-years on migration with 10-15 people working on it; 190 thousand USD was invested; total cost 26 million USD
- it is not much data: what they actually migrated was about 1777GB
- different parts of the archive: tapes, population data, data on CD-R, registries and electronically filled-in data
- they could not read all the files, especially on tapes
- 5 different tape types
- some had to be rescued at great expense
Total cost of producing the preservation standard: 10 man-years, 12 thousand USD, including the manual and implementation recommendations.
Pilot: project planning and management, and verifying the information packages.
The aim of the pilot was to get a better budget and project plan.
Some problematic data in old formats, such as old databases, needs smart people doing repetitive work that takes a long time; they needed good knowledge management to make this efficient.
Migration method: they wrote the requirements for the tool and a description of how manual migration should be done.
Data preparation (restructuring the data and registering the metadata of the IPs) and preparation of documentation for the migrated IPs.
Software development: in-house development.
Pot ebovali 50 person years na 1
Conclusions
- migrating standard data is cheaper; migrating from some standard tapes is cheaper, etc.
- most of the cost, 80%, went on non-standardised data, in building the migration software; developing a tool for migrating heterogeneous or non-standard data is the most expensive part
- what they learned: the old data had not been analysed sufficiently
- their project management was loose, and they lost money
- knowledge management: a good description of old data and all its types, generations, locations, etc. does not exist at NK and we will have difficulties with it; migrating old data at NK will be problematic
Angela Dappert: robust migration workflow for offline media
- what is an archival object (nice slide): a CD is not an archival object, for them it is a hand-held carrier; better is a bit-stable object, which can have a backup etc., and on top of it an archival object that carries further metadata (logical preservation)
- a CD is not searchable, cannot easily be replicated, has a large manual overhead, and its rendering technology becomes obsolete very quickly
- Endangered Archives project: optical discs, CD-R, external HDs, tapes; 67 terabytes in total
- the offline hand-held carriers in the Endangered Archives project were very varied; they contained data with DRM, under copyright, and with a range of such problems
- options they chose between for each data source:
- a disk image: one file that contains everything on the carrier
- or extraction of selected files only
- how important is the carrier itself? We need some information about it; there may be traces of deleted data that we might want to keep. They made disk images of everything possible: hybrid DVDs, audio discs that also held data, etc.
- which disk image should they use? Not a single disk image format for all data, but a specific disk image format for each kind
- they did it with robots (disk copying robots); sometimes large-scale disk copying robots could not be used: they are good at producing CDs but not at ripping data from CDs
- they built their own application with disk stacks and some smaller robots; they tried LIFO and FIFO and ended up using FIFO, because LIFO had problems with the CD lifter
In the KB they thought through a fairly complex workflow, how to describe it, etc.
They had a PC at each robot.
They had problems with a number of things; see the presentation.
They did not find good software for managing the images, only command-line tools, and non-technical staff would make a lot of mistakes with those.
It takes a lot of people before the material gets online.
Staff have to be well trained and flexible, but also able to do tedious jobs: systematic and patient.
NOTE: important for NK, where transferring data from discs will also have to be dealt with, and already has been.
KEEP project, Antonio Ciuffreda: towards an integrated migration environment
- Disk transfer tool: converts a disk into an image file. The result also carries additional metadata about the file system, the MD5 of the file, etc.
- KEEP is producing a Transfer Tool Framework.
- Magnetic media: disk transfer tools for floppy disks, commercial and open source.
- Disk2FDI: commercial DOS tool; produces a very accurate floppy disk image, but takes 1 hour, and the result is very large, ten times larger than the floppy disk itself. They tested about 2260 disks and tested emulation.
- Catweasel: commercial tool, a PCI card; runs on Linux and Windows XP, has a GUI. High error rate but faster; the quality of the image file was low.
- Nibtools: free tool; G64 and D64 formats; covers only the C64; runs on DOS, Windows and Linux, but command line only. It needs a Commodore disk drive and special cables. They tested a few disks; about half of them then did not work in the emulator.
- Optical media: they used 5 transfer tools, all on the same CDs, DVDs and games.
1. Alcohol 120%: commercial; can bypass DRM etc.; supports Windows systems. 12 of 13 worked; 2 MB per second.
2. Daemon Tools: commercial; several types of image files (ISO, MDS, MDF); supports Windows; three of 13 did not work.
3. CloneCD: commercial; uses IMG or ISO; bypasses SafeDisc 3 protection; supports Windows; has a GUI; 11 of 13 were OK.
4. BlindWrite: commercial; supports DVD, Blu-ray, WM, Xbox and other special discs. Generates ISO and some proprietary ISO image formats; one did not work; transfer speed (no figure noted).
5. ImgBurn: does not read subchannel information into the image file (a film cannot be seeked, etc.); it is open source; generates DVD, BIN, CUE, IMG; Windows and Linux; 4 did not work; it is fast.
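The failure counts noted for the five optical tools (12 of 13 worked, three of 13 failed, 11 of 13 OK, one failed, four failed) can be turned into comparable success rates. The sketch below assumes that every tool was run on the same 13 discs; the notes state this explicitly for only three of the tools, so the constant `DISCS_PER_TOOL` is an assumption.

```python
DISCS_PER_TOOL = 13  # assumption: each tool was tested on the same 13 discs

# Number of discs that failed per tool, as recorded in the notes.
failures = {
    "Alcohol 120%": 1,
    "Daemon Tools": 3,
    "CloneCD": 2,
    "BlindWrite": 1,
    "ImgBurn": 4,
}


def success_rate(failed, total=DISCS_PER_TOOL):
    """Fraction of discs that were imaged successfully."""
    return (total - failed) / total


rates = {tool: success_rate(failed) for tool, failed in failures.items()}
for tool, rate in sorted(rates.items(), key=lambda item: item[1], reverse=True):
    print(f"{tool:13s} {rate:.0%}")
```

Under that assumption the tools range from roughly 69% to 92% success, which is consistent with the later remark that images will always contain some errors.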
Conclusions
For magnetic media, commercial and non-commercial tools do not differ in performance; Disk2FDI is accurate but very slow. KEEP will use NibTools.
Optical media: bear copy protection in mind. The tools perform similarly; there will always be errors in the images, between 30 and 10 percent. BlindWrite can handle game discs (Xbox etc.). KEEP will use ImgBurn because it is open source.
For complex images BlindWrite is better.
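The notes mention that a disk transfer tool stores metadata such as an MD5 checksum alongside the image file. A minimal sketch of that idea follows, using only the Python standard library. The function name `describe_image` and the metadata fields are hypothetical; this is not the KEEP Transfer Tool Framework, and a small temporary file stands in for a real disk image.

```python
import hashlib
import json
import os
import tempfile


def describe_image(path, chunk_size=1024 * 1024):
    """Compute the MD5 of an image file and return basic metadata about it."""
    md5 = hashlib.md5()
    with open(path, "rb") as handle:
        # Read in chunks so that large disk images do not have to fit in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            md5.update(chunk)
    return {
        "file": os.path.basename(path),
        "size_bytes": os.path.getsize(path),
        "md5": md5.hexdigest(),
    }


# Demonstration with a tiny stand-in file instead of a real disk image.
with tempfile.NamedTemporaryFile(suffix=".img", delete=False) as tmp:
    tmp.write(b"hello")
record = describe_image(tmp.name)
os.remove(tmp.name)
print(json.dumps(record, indent=2))
```

Storing such a record next to each image makes it possible to check later that the image has not changed since the transfer.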
Benefit for NK
Consider whether NK should really set up a project to migrate the content of CDs and DVDs to online media. These talks present concrete experience with robotic processing and show what problems the teams had in designing the workflow, choosing the type of ISO image, etc.
BnF: web archiving
They have three layers:
Harvest definition: collection
Harvest instance: crawling metadata
ARC files
A collection would be, for example, the selective "Elections 2000" crawl; below it are the individual harvest instances, and below those the ARC files. They collect the logs, configuration and reports, and store these in ARC as well: a special metadata ARC for every crawl instance.
PREMIS
Object, agent, event
Objects: 1. ARC files and metadata ARCs; 2. harvest instances
Harvest event in PREMIS: event "creation of content files"
Events: reports as extensions of the event (host report and harvest report)
Agents: administrator, software, and the institutions or organizations that perform the harvest
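The layering described above (collection, harvest instance, ARC files, with PREMIS-style objects, events and agents) can be sketched as a small data model. The class and field names below are hypothetical illustrations of the notes; they are not the PREMIS schema, not containerMD, and not BnF's implementation.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Agent:
    name: str
    kind: str  # e.g. "administrator", "software" or "organization"


@dataclass
class Event:
    event_type: str
    agents: List[Agent]
    reports: List[str] = field(default_factory=list)  # event extensions


@dataclass
class HarvestInstance:
    label: str
    arc_files: List[str]  # content ARC files
    metadata_arc: str     # ARC holding logs, configuration and reports
    events: List[Event] = field(default_factory=list)


@dataclass
class Collection:
    name: str
    instances: List[HarvestInstance] = field(default_factory=list)


crawler = Agent("crawler", "software")  # hypothetical agent
harvest = HarvestInstance(
    label="instance-1",
    arc_files=["content-0001.arc"],
    metadata_arc="metadata-0001.arc",
    events=[Event("creation of content files", [crawler],
                  ["host report", "harvest report"])],
)
collection = Collection("Elections (selective)", [harvest])
print(collection.instances[0].events[0].event_type)
```

The point of the sketch is only that reports hang off the harvest event, while the ARC files and the harvest instance are the preserved objects.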
ContainerMD
http://bibnum.bnf.fr/containerMD-v1/documentation/containerMD-v1.html
Special metadata for material from the web archive
http://bibnum.bnf.fr/containerMD-v1
Different SLAs for different types of material and for data from different harvests. From a shared repository they expect various benefits: skills for different formats do not need to be duplicated within the institution.
Next year a JHOVE2 module for WARC should also exist.
Memento: meta search
Benefit for NK
Their web archiving model could be used at NK.
Cost models: Danish national library + TU Wien
- Stephan Strodl, TU Wien: they have their own cost model, but only small scale; automated preservation action cost.
- The Danish national library built its own model, which is meant to be universal and usable anywhere.
- They measured the cost of submission according to the PAIMAS standard.
- When calculating costs they use OAIS and PAIMAS: they map activities onto these models and estimate the cost of processes accordingly.
- costmodelfordigitalpreservation.dk
Benefit for NK
Relevant to project 0136, which dealt with options for estimating the costs of long-term storage.
Meet RODA, a Full-Fledged Digital Repository for Long-Term Preservation
- Originally a project of the Portuguese National Archives; we have been following it for several years. RODA is now supported by an independent company, which also partly continues its development. So far RODA supports only the archival metadata format (EAD), but further development should include library formats as well.
- RODA is now part of the SCAPE project, where it will be possible to develop the system further and scale it for use in mass production.
- http://redmine.keep.pt/projects/roda-public
Benefit for NK
Follow further development, possibly also for the INCAD + KNAV projects; for LTP development at smaller institutions this could be an interesting alternative in the future.
Report on a foreign business trip
Name and surname of the participant: Mgr. Andrea Fojtů (AF)
Workplace per organizational structure: 1.5 Department of Long-Term Preservation of Digital Data (ODODD)
Workplace (unit): 1.5.2 Digital Repository Content Management Department
Reason for the trip: participation in the conference Aligning National Approaches to Digital Preservation
Place (city): Tallinn
Place (country): Estonia
Date (from-to): 22-26 May 2011
Detailed schedule:
Monday 23 May: conference registration; guided tour of the National Library; Keynote Address by Laura Campbell, Library of Congress (USA); Panel 1: Technical Alignment; Panel 2: Organizational Alignment
Tuesday 24 May: Keynote Address by Gunnar Sahlin, National Library of Sweden; Panel 3: Standards Alignment; Panel 4: Legal Alignment; Breakout Sessions for panels 3 & 4
Wednesday 25 May: Panel 5: Education Alignment; Panel 6: Economic Alignment; Breakout Sessions for panels 5 & 6; Synthesis; Closing remarks
Fellow travellers from NK: PhDr. Bohdana Stoklasová (BS), Ing. Tomáš Svoboda (TS)
Funding: IOP NDK
Trip objectives: attendance at a conference with international participation; making contacts in the field of long-term preservation of digital documents and electronic legal deposit; a more detailed insight into long-term preservation of digital documents, particularly in national libraries
Fulfilment of trip objectives (specifically): The conclusions of the conference Aligning National Approaches to Digital Preservation largely mirror the conclusions of the workshop The Future of the Past: Shaping new visions for EU research in digital preservation (report available at http://cordis.europa.eu/fp7/ict/telearn-digicult/future-of-the-past_en.pdf), for example regarding the missing economic model for the commercial sector, which would treat long-term preservation as an integral part of all its processes. Contact was established with staff of the national libraries of Estonia and Finland (staff working on web archiving and on long-term preservation in general).
Programme and further details: The main goal of the conference was to align national approaches to long-term preservation of digital documents across all areas, from technical, organizational and educational through to standardization, economic and financial.
Materials brought back: conference programme, leaflets of exhibiting companies (Tessella, Equella, Guardtime) + other materials, notes
Date of report submission: 8 June 2011
Signature of the submitter:
Signature of the supervisor:
Posted on the intranet:
Received by the international department:
Appendix to this report: conference notes in English
Appendix: Conference notes in English
Exploring What We Can Do Together: Strategic Alliance for International Collaboration, Laura Campbell
- 185 digital preservation partners in more than 25 countries (education, research, LAM)
- strategic goals: National Content Stewardship Network (national digital collection, technical architecture, public policy, outreach)
- NDIIPP Content Domain Map: a mind map of geospatial, audiovisual, image & text, and web content
- then & now: cognitive surplus vs digital libraries/digital preservation
- solution framework: actively working together, special interest groups, establishing a common index, international digital collection (freely available)
PANEL 1: Technical Alignment (The role of testing), Prof. Dr. Michael Seadle (Panel Head)
- to collaborate on requiring and implementing rigorous and independent tests
DNB Contribution to the Tallinn Alignment, Sabine Schrimpf
- key theme is infrastructure: a network of hardware and software that permits the operation of application software
- the question of interoperability is crucial (standards, technical specifications)
- software elements/components of the DP infrastructure were compared to pallets on railroads
o source: PARSE.Insight Roadmap 2020
- persistent ID resolvers, certification process
- kopal (ingest: koLibRI)
- nestor (German Network of Expertise)
- DP4lib: Digital Preservation as a Service; reduce dependency between components
o redundant storage at different locations
o koLibRI modules
- LuKII: set up as an economical LOCKSS network in Germany
- SHAMAN
- APARSEN: wants to bring coherence and cohesion to digital preservation research
o trends in DP research projects
o modular DP systems
o distributed as SOA
o elimination of technology dependencies
EDINA: The UK LOCKSS Alliance, Adam Rusbridge
- EDINA offers underlying technical support & coordination
- threats to digital stewardship
o failure (media, HW, SW, network, format obsolescence, natural disaster, economic/organizational failure)
o attack (insider/outsider)
o operator error
- source: Requirement to Digital Preservation
- projects PEPRS and PECAN help identify coverage and requirements for DP
Public testing, Michael Seadle
- traditional physical archiving relies heavily on trusted institutions
- distrust, not trust, needs to be the basis of digital archiving; testing therefore plays a key role
- goals of testing: demonstrate functionality, reveal weaknesses, provide data for planning improvements
- key issues for testing: integrity, authenticity (can the origin or genuineness be shown?), usability (can migration/emulation be demonstrated?), access, financial integrity
- DrWho (drwho1com)
- bit stream testing is the most important; authenticity and usability may be impaired
o the type of storage media, the number of copies + the frequency of checking and replacement get us to the relevant results
o no reliable metrics exist, however (what is an acceptable loss, etc.)
- without well documented, peer reviewed, publicly available test results, librarians are buying archiving systems on faith
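Bit stream testing of the kind discussed in this talk usually comes down to recomputing checksums for each stored copy and comparing them with a recorded value. The following is a minimal sketch under that assumption; the site names and the in-memory "copies" are hypothetical stand-ins for real storage locations.

```python
import hashlib


def sha256(data: bytes) -> str:
    """Hex digest used as the fixity value of a bit stream."""
    return hashlib.sha256(data).hexdigest()


def audit(copies, expected_digest):
    """Return the names of copies whose digest differs from the recorded one."""
    return [name for name, data in copies.items()
            if sha256(data) != expected_digest]


original = b"archived bit stream"
expected = sha256(original)  # recorded at ingest
copies = {
    "site-a": original,
    "site-b": original,
    "site-c": b"archived bit strean",  # simulated single-character corruption
}
damaged = audit(copies, expected)
print(damaged)
```

How often such an audit runs, and on how many copies, are exactly the parameters for which the talk says no reliable metrics exist yet.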
Presentation without a title, Andy Rauber
- evaluation vs testing vs benchmarking
- DP testing today: evaluation rather than testing, and far from benchmarking (a few tests, but not near a definition of benchmarks)
o existing evaluations are not repeatable
o focus on the simple things
o building the frameworks before having clear test scenarios
- necessary to move towards comparative benchmarking
- what is needed: a commitment that we want a culture of benchmarking and comparative evaluation; an understanding of what we want to benchmark; benchmark data + ground truth; measurement scales and measures that remain constant; knowledge bases to collect these
Organizing digital preservation on an international level, Michelle Gallinger (NDIIPP)
- focus on a national DP agenda
- community driven, action oriented (National Digital Stewardship Alliance)
o present a distributed national digital collection for the benefit of citizens
The European Research Arena, David Giaretta
- technologies: GEANT, EGEE/EGI
- EU research projects: TIMBUS, BLOG4EVER, SCAPE, ENSURE, APARSEN, ARCOMEM, WF4EVER
o SCIDIP-ES (2011-2014)
- Alliance for Permanent Access (APA): formed as a legal entity 3 years ago
o opportunities for networking
- ISO 16363: Audit and Certification of Trustworthy Digital Repositories
- ISO 16919: Requirements for Bodies Providing Audit and Certification
Observation from the MetaArchive Cooperative program, Martin Halbert
- distributed DP programs differ from other programs
o replication of content, distribution of these replicated copies to distinct geographical locations, and network organization to connect these replicated copies
- MetaArchive: established in 2003, funded by NDIIPP
o seeks to foster broader awareness of digital preservation issues
- IIPC: members are all institutions that focus on WA
o 39 members: national and university libraries + other organizations (Internet Archive)
o ISO standard WARC format for web archives + Heritrix and NutchWAX
o growing membership (Africa & South America)
DAY 2
Keynote address: International and National Collaboration in the Digital Age, Gunnar Sahlin
- 2012: a new law for e-deposit
- Samsök (search together) in 2005 (upgrade, new system)
- SwePub and long-term preservation
- Consortium of the Swedish research libraries for licensing e-journals and databases (ICOLC)
- Open Access and e-publishing (all universities have their repositories for e-pubs)
- NL as aggregator for Europeana: TEL, Apres, Athena, EUscreen
o common system for the preservation of digital materials
o common search portal for materials from the Swedish National Library and the Swedish National Archive
Raivo Ruusalepp
- standards: RAC, DSA, CIDOC (CRM), PAIMAS, ISA (DG), DDI
- use of information security standards for digital preservation
- information security: administrative and technical (physical = data security vs IT = communication)
- company implemented security measurement with typical cyber crime scenarios
- survey of security
o provision for information security in national legislation and development plans (12 of the respondents; ISO 27000 series: only 2 formal audits, the rest are looking into it; ½ of the respondents do not use standards or formal measurements to control information security)
o IT & disaster plan: 65 (data recovery from the off-site location tested: 0)
- alignment
o better use of community standards for information security and preservation
o agreement on security requirements
Standards-based approach to preservation planning, Matthew Woollard
- ISO 27001: very expensive implementation, 100 000 pounds
- Basic Data Seal of Approval Guidelines (helps understand your business better)
- Audit and Certification of Trustworthy Digital Repositories
- Memorandum of Understanding to Create a European Framework
- ISO 16363 external, DSA or ISO 16363 self-audit
Best Practices & Standards, Bram van der Werf
- self-assessment trust, audits/certification trust
- ISO 30300 (draft): Record Management
PANEL 4: Legal Alignment
Legal deposit & Web Archiving, Adrienne Muir
- legal deposit provisions: purpose, scope, deposit mechanisms, roles & responsibilities & liability, access provisions, sanctions
- implementation: definitions, scope (offline/online, freely available/pay wall), technology neutral/incremental
- legal deposit vs voluntary (interim/hybrid approach, model agreements and licenses, flexibility)
- other legal issues: intellectual property rights, preservation, access, unlawful material, privacy/data protection
- voluntary approaches have disadvantages but may be necessary and can be useful
Breakout Session
- standards bring along alignment by themselves, but only if you use them to the full, not halfway
- depends on community (users): enforceable or voluntary compliance (standards)
- next steps for standards alignment
o corpora as benchmarks
o export/import completeness
o educational standards
o validation tools
o accredited training courses to accredit auditors
o framework standards
DAY 3
PANEL 5: Educational Alignment
- key elements of the DCC Curation Lifecycle Model
- Framework for the Education Alignment (USA: grad programs for digital preservation, new models for grad programs, internship programs (diverse knowledge), workshops)
o sharing tools
o national programs
- related issues to digital preservation
o nature of costs and business models
o strategies for selection & appraisal
o ground roles and responsibilities
o effectiveness and demand for services
- focus of the panel: factors influencing the actual sustainability of a digital archive
- 2 considerations: collaboration + user demand
- challenges + gaps: span national boundaries, public + private funding, education, exportation, DP certification, competition, funding gaps, policy, selection criteria, roles & responsibilities, standards
- Magazzini Digitali: e-legal deposit in Italy
- PADI: a failed project (discontinued in 2010)
Presentation without a title, Neil Grindley
- how much does it cost to manage information? what institutional financial strategies are required to facilitate effective preservation? what general economic frameworks are required to enable information to persist and be accessible?
- JISC 2010: Infrastructure for Education and Research Programme
- archival storage and preservation activities constitute a very small proportion of the overall costs: 1531 access 55 outreach acquisition ingest
o approx. 333 Euros for a set of 1000 records
- KRDS2 p. 83: future tool development supporting automation of ingest
Sustainable Preservation in North America: ADPNet & Friends, Aaron Trehub
- solution: distributed digital preservation (in at least 3 copies vs LOCKSS 6 copies)
- DDP + LOCKSS = PLN; open SW developed at Stanford
- MetaArchive, COPPUL (Canada), ADPNet (www.adpn.org)
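The reason distributed preservation networks keep several copies (at least 3 in the notes) is that a damaged copy can be identified and repaired by comparison with the others. The sketch below shows that idea as a simple majority vote over content digests. It is a deliberate simplification and not the actual LOCKSS polling protocol; the node names and the function `majority_repair` are hypothetical.

```python
import hashlib
from collections import Counter


def majority_repair(replicas):
    """Replace minority copies with the content held by a strict majority."""
    digests = {site: hashlib.sha256(data).hexdigest()
               for site, data in replicas.items()}
    winner, votes = Counter(digests.values()).most_common(1)[0]
    if votes <= len(replicas) // 2:
        raise ValueError("no majority: copies cannot be repaired by voting")
    good = next(data for site, data in replicas.items()
                if digests[site] == winner)
    repaired_sites = [site for site in replicas if digests[site] != winner]
    return {site: good for site in replicas}, repaired_sites


replicas = {
    "node-1": b"content",
    "node-2": b"content",
    "node-3": b"c0ntent",  # simulated damage on one node
}
repaired, fixed = majority_repair(replicas)
print(fixed)
```

With only two copies a disagreement cannot be resolved this way, which is one motivation for a minimum of three.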
Salt Lake City, USA
Date (from-to):
14 May: departure from Prague; 16 May: PREMIS workshop; 17-19 May: Archiving 2011 conference; 20 May: departure from SLC
15 May: flight Washington-Detroit-Salt Lake City; 15 May: workshops T1D Color in Image Capture Archiving and T2A Scanner & Camera Imaging Performance: Benchmarking, Compliance and Workflow Monitoring; 17-19 May: Archiving 2011 conference
NK, Archiving conference, IOP-NDK
0136
NA, 6 June 2011
Date: 6 June 2011, Signature; Date, Signature
Date, Signature
PREMIS tutorial
Implementation of a High Performance Architecture for Managing and Storing Web-Harvested Collections, Michael Smorul and Joseph JaJa, University of Maryland (USA)
https://wiki.umiacs.umd.edu/adapt/images/6/6b/Archiving11-smorul.pdf
- web archiving
- the WARC Manager application
Using Tape for Large-Scale Digital Preservation, Gary Wright, FamilySearch (USA)
- (The Church of Jesus Christ of Latter-day Saints)
NDK - DRPS, Ingest Tools. Storage type: storage grid, NetApp FAS3170
P information lifecycle management R optimization of the storage layer between the Grid and Rosetta
- human error
- HW errors
- data integrity validation
- maximization
- integrity verification
metadata conten
Moving On: When it is Time to Re-Archive, Michael Selway, Quantum Corporation (USA)
- there is no other way with HDDs
- data migration; how to approach the migration
- HDD: years
- what to do about split migration (full split migration)
FamilySearch: An End-to-End Process for Scanning, Characterizing, Preserving and Providing Access to Very Large Collections of Vital Records, Tom Creighton, FamilySearch (USA), Jonathan Tilbury, Tessella plc (UK), and Mark Evans, Tessella Inc. (USA)
- at FamilySearch: microfilms from the mid-19th century
- the DPS is their
The Audit and Certification of FDsys
- FDsys: audit
TRAC http://www.gpo.gov/fdsys
- TRAC: HathiTrust, UNT, Portico, MetaArchive, Chronopolis
- compliance scale: 1-
How Long is Long-Term Data Storage? (Focal), Barry M. Lunt, Brigham Young University, and Douglas Hansen, Wayne Rust and Mark Worthington, Millenniata Inc. (USA)
- FLASH technology: 10-
- electron leakage
- and yet errors appeared
- Library 1: 21
- Library 2: 18
- 1050 years
- HDD: 1-7 years
Quality Assurance of Digital Information in Long-Term Digital Preservation, University of Technology (Sweden)
- significant properties
- in documents from
Towards Interoperable Preservation Repositories: Repository Exchange Package Use Cases and Best Practices, Joseph Pawletko, New York University, and Priscilla Caplan, Florida Center for Library Automation (USA)
- TIPR
- RXP: METS and PREMIS semantics; metadata files are kept not inside the METS but alongside it
- succession
- disaster recovery
- as AIP
- software migration
- diversification
- data migration
SARKK: Comprehensive Digital Archive Services for Finnish Municipalities
-Savon Tietohallinto Oy (Finland)
- a company in ICT, long-
- preservation planning functionality
Magnetic Tape Technology: economic advantages for preservation, Gary Francis, Oracle (USA)
- Oracle presentation
- Clipper Group 2010, in search for the long-term archiving solution: tape delivers significant TCO advantage over disk
- 5 TB per cartridge (Sun/Oracle)
Color In Digital Preservation, Robert Buckley, University of Rochester/NewMarket Imaging, Steven Puglia, National Archives and Records Administration, and Michael Stelmach, Library of Congress (USA)
- etc.
o color gamut: AdobeRGB, ProPhoto RGB
- and color reproduction
Multispectral Image Archiving of Watermarks in Historical Papers, Peter Meinlschmidt, Wilhelm-Klauditz-Institut, Fraunhofer-Institute for Wood Research, and Volker, Braunschweig (Germany)
- reflected light
- transmitted light
- multispectral imaging
Implementing a Quality Assurance Program for Monitoring Scanner Performance, Michael J. Horsley and John T. Berezich, National Archives and Records Administration (USA)
- microfilm, scan
- NARA 2004 Guidelines: http://www.archives.gov/preservation/technical/guidelines.pdf
- Metamorfoze
- etc., see slides
- quantitative performance: see slides
- web-based database (SharePoint)
Preservation in a Digital Age, Jay Verkler, FamilySearch (USA)
- with Tessella, for SDB
- data loss is intrinsic
- preservation as a service
Curation of the End-of-Term Web Archive: Classification and Metrics, Kathleen Murray, Lauren Ko and Mark Phillips, University of North Texas (USA)
- eotcd archive: http://research.library.unt.edu/eotcd/wiki/Main_Page
- 16 TB of data
- (not WARCs)
- then joined together
DAITSS Grows Up: Migrating to a Second Generation Preservation System (Focal), Priscilla Caplan and Carol Chou, Florida Center for Library Automation (USA)
- DAITSS 1 could not be installed elsewhere
- instructions
- in processes
- PSP
- refresh function
- disseminate: do a refresh and export the new AIP as a DIP
- withdraw: remove the AIP from storage, retaining provenance
- 80 TB
A Community Driven Micro Services Architecture Supporting Long Term Digital Preservation, Mark Evans and Bill Steel, Tessella Inc. (USA), and Robert Sharpe, James Carr, Alan Gairey and Jonathan Tilbury, Tessella plc (UK)
- with NDIIPP, on micro-services
- joined into a process
- workflow
- bit-stream
- cloud (S3, http://aws.amazon.com/s3)
- SaaS functionality; multi-tenancy soon
- policy functionality etc.
Notes from the USA
- 44 TB SIP in test (10 MB JP2 files)
- SDB community: founded 2008
System
Lisa LaPlant and Blake Edwards, US Government Printing Office (USA): FDsys, OAIS, MODS
<part>2</part>
DMD: data model definition
http://www.gpo.gov/fdsys/search/pagedetails.action?st=prague&granuleId=&packageId=DCPD-200900228
Preservation Starts from the Beginning, Michael Wash, US Department of Transportation (USA)
Autographic Kodak 1916
"following the No. 3A Autographic Kodak Special of 1916, which was the first rangefinder camera. It had a Kodak Anastigmat f/6.3 lens and a Kodamatic shutter with speeds from 1/2 to 1/200 sec. plus bulb and time mode." http://camerapedia.wikia.com/wiki/No._1A_Autographic_Kodak_Special
Kodak Advantix
stopped
Colorite: A Flexible Cross-Platform Software Solution for Automatic Image Quality Analysis Using Arbitrary Targets, Henrik Johansson, National Library of Sweden (Sweden)
o the target is detected automatically
o TIFF, JPEG, JP2, PNG
o state-of-the-art feature-based image matching algorithm, ImageMagick
o can be switched off in the future to speed up the process for BATCH processing
What if the Image Quality Analysis Rates My D, Dietmar Wueller, Image Engineering (Germany)
- Golden Thread
- UTT target
- single-lens reflex
Establishing Resolution Requirements for Digitizing Transmissive Content: A Use Case Approach, Michael Stelmach, Library of Congress, Don Williams, Image Science Associates LLC, and Steven Puglia, National Archives and Records Administration (USA)
- based on the 10% SFR limiting-resolution criterion: how much of the image information will be captured
o 1st half of the 20th century: 1200-1600 PPI
o 2nd half of the 20th century: up to 2800 PPI
Digitise More, Pay Less: Optimising the Workprocess for both Heritage Institute and Imaging Provider, Olaf Slijkhuis, Pictura Imaginis (the Netherlands)
- document, quality, parameters, etc.
- image inspection
Debate with FamilySearch, 20 May 2011
- also on the issues of digital preservation
- will have to provide free of charge
- and synchronize
- an effort on their part to cooperate with archives and libraries
Business trip report
Project: Creation of the National Digital Library
CZ.1.06/1.1.00/07.06386
Name and surname of the participant: Jan Hutař
Workplace per organizational structure: ODF 8.1
Position: head of division
Reason for the trip: attending the iPRES 2011 conference
Place (city): Singapore
Place (country): Singapore
Date (from-to): 30 October - 5 November 2011
Detailed schedule: 30-31 October: flight Prague > Dubai > Singapore
1 November: start of the conference, tutorials
2-4 November: conference
4-5 November: return flight Singapore > Dubai > Prague
Fellow traveller from NK: Mgr. Marek Melichar (funded from project 0136)
Funding: IOP, Creation of the National Digital Library
Relation to the project: obtaining new information on digital preservation and on projects in other libraries; consultations with colleagues and companies
Trip objectives: see relation to the project; use all outputs for planning and running the NDK project, and for future handling of digital preservation at NK/NDK
Fulfilment of trip objectives: fulfilled; see the detailed notes below and the proceedings on SPS
Further details: SUMMARY AND BENEFIT FOR THE NDK PROJECT
- a noticeable rise of emulation as an approach to long-term preservation (in previous years it was migration); the two approaches will apparently complement each other
- a shift towards preserving complex data (databases etc.); NK does not deal with this yet
- many contributions usable at NK and in NDK (web archiving and preservation at the national library of France, audits, emulation, information on the SDB system (Tessella) and on the RODA system, certification: see Rouchon, etc.)
- information on problems with, and solutions for, using tools such as JHOVE, PRONOM and others in a real environment
- 2 contributions on backing up optical discs, a current problem at NK too; ideally follow the procedures described in the proceedings
- a clear need for international cooperation and for adherence to standards, so that such cooperation is possible
- for more detail see below
Support of project publicity: NA
Related materials
Material: conference proceedings
Storage location: SPS, folder with business trip reports
Date of report submission: 15 November 2011
Signature of the submitter:
Date, Signature
Signature of the supervisor: 15 November 2011
Posted on the intranet
Received by the international department
Seamus Ross: digital curation and preservation
- preserving data sets: use of statistical databases and research data vs preservation of textual information
- personal data: it is no longer just photos in a box (Flickr etc.)
- banks: lots of personal data, usable in the future by historians; it must be kept, and institutions are doing so
- see mckinsey.com, big data, full report (PDF)
- why do DP at all (slide 16): future generations expect it; for historians and researchers, so that they have sources, a legacy about the present for the future; an information ecosystem to enable storytelling
- the emphasis is shifting from preserving textual information to preserving complex databases
A capability model for DP, Ch. Becker et al.
- research within the SHAMAN and SCAPE projects
- SoS: systems of systems
- 3 kinds of systems (UML)
- DPS, with DP as a functional requirement
- SoS: a business system as a system within a system; the data then flow into a DPS
- a DPS where DP is not a functional requirement but which does it anyway (DP-ready system): a business system with DP functionality
- but how to get DP into enterprise systems? A model for implementing DP in any system, within the SHAMAN project: capability-based reference architecture
- governance, business and technical (operation) capability: a basis for decisions and for assessing the current state
- capability maturity model (CMM): processes for assessment and improvement, as in software development
Olivier Rouchon: certification and quality at CINES
- they store theses, digitized material, multimedia, documents, scientific data sets
- a data centre for the whole of France
- they have experts on formats and XML; 11 people
- 15 TB of data
- certification: by national law CINES is the national centre for DP of theses; they have a department, people and money for it; procedure and preparations see below
- preparation for certification: testing DRAMBORA, DSA, TRAC, ISO 16363 and ISO 16919
- step: 2009 DRAMBORA audit; 2 risk reviews per year on how the risks are being resolved
- step: formalization of business processes, 14 processes per ISO 9001
- management, operational and support processes (presented at iPRES 2010)
- 2009: external pre-audit, 2 people, 19 man-days, based on all available standards, by means of documentation review and interviews
- 2010: SIAF audit, 4 months; the National Archives of France carries it out for every archive that stores public data; they audit every 3 years; the report had 800 pages
CO-FINANCED BY THE EU STRUCTURAL FUNDS (EUROPEAN REGIONAL DEVELOPMENT FUND) THROUGH THE IOP
- 2010: Data Seal of Approval, part of the EU framework for audit and certification of trusted repositories (an MoU between the three certification initiatives)
- 2011: within the APARSEN project they also did ISO 16363; they went through that audit together with DANS and UKDA
- first internal, January and April 2011 (60 man-days), then external with 12 experts (KB, BL, NASA etc.) in June 2011 (3 days)
MoU: DSA > ISO 16363 as the second step (internal) > ISO 16363 extended. The audit gets faster the more of them you do, i.e. if it is done regularly it is not so time-consuming.
The National Library of New Zealand passed TRAC certification in autumn 2011.
Andreas Rauber: the impact of preservation actions on repositories, i.e. what happens to the repository itself
- RepoSim, a repository simulator for analysis and for testing migration: what happens when files grow, what if the repository holds more format types, etc.
- the simulator is flexible and handles irregular patterns; so far an internal version (Hibernate, Java, MySQL)
- you can specify which formats the repository accepts and describe them, the ingest, the settings of hypothetical tools (mainly for migration), and the rules for preservation actions (migration to which format and version, which files, how many times; rules + filters)
- you can run a virtual migration; the resulting graphs show how long it will take, etc.: what to migrate, how, to what and over what period is played out virtually and you see the result
- good for IT and HW planning
- good for planning different scenarios, comparing them with the expected development, and planning HW growth and investment
- still to be added: deletion policies, reports, etc.
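The idea of a virtual migration can be illustrated with a small sketch. This is not RepoSim's code or interface (the notes only mention that the internal version is built on Java, Hibernate and MySQL); the classes `FileGroup` and `MigrationRule`, the function `virtual_migration`, and every figure below are hypothetical and serve only to show how a rule with a filter might turn a repository profile into a time estimate.

```python
from dataclasses import dataclass


@dataclass
class FileGroup:
    fmt: str            # format name, e.g. "TIFF"
    count: int          # number of files of this format
    avg_size_mb: float  # average file size in MB


@dataclass
class MigrationRule:
    source_fmt: str
    target_fmt: str
    throughput_mb_s: float    # assumed speed of a hypothetical migration tool
    min_size_mb: float = 0.0  # filter: only groups with at least this average size


def virtual_migration(repo, rule):
    """Estimate what a rule would touch and how long it would run, without moving data."""
    selected = [
        g for g in repo
        if g.fmt == rule.source_fmt and g.avg_size_mb >= rule.min_size_mb
    ]
    files = sum(g.count for g in selected)
    volume_mb = sum(g.count * g.avg_size_mb for g in selected)
    hours = volume_mb / rule.throughput_mb_s / 3600
    return {"files": files, "volume_mb": volume_mb, "hours": hours}


# Invented repository profile and rule, for illustration only.
repo = [FileGroup("TIFF", 100_000, 50.0), FileGroup("PDF", 400_000, 2.0)]
rule = MigrationRule("TIFF", "JPEG2000", throughput_mb_s=25.0)
result = virtual_migration(repo, rule)
print(result["files"], round(result["hours"], 1))  # 100000 55.6
```

Re-running the function with a grown `repo` or a different `throughput_mb_s` gives the kind of scenario comparison the talk describes for hardware and investment planning.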
José Barateiro: Risk assessment in DP of e-science data and processes
- DP as risk management
- ISO 31000: definition of risk management, similar to DRAMBORA
- there are many standards for risk management
- the ISO 31000 methodology broken down into individual steps
- TIMBUS project, http://timbusproject.net; one of the partners is SAP (Germany)
Mad talks
- RODA, open source software for LTP, is back: it is being developed within the SCAPE project, with new functions, development plans and an emerging user community
- 4 posters on emulation: emulation within KEEP, emulation for library reading rooms, the OPF ecosystem registry
- TOTEM: a metadata standard for describing the technical environment for emulation
ANDS, Ross Wilkinson
- data centres in Australia, at least 3, for different areas of life
- ANDS has existed for almost 3 years, funded by the Australian government
- huge amounts of data will never be used or read by a human, only mined by automated processes
- research data must be stored and protected because it may no longer be possible to recreate them; they must be kept so that they can be reused, so that new conclusions can be drawn from them, and so that scientists have them at hand
- this has to be done in cooperation; a single institution cannot do it on its own
- who deals with the storage of scientific data in the Czech Republic? The Academy of Sciences? CESNET?
- similar data centres exist in Great Britain as well
Rob Sharpe: Considerations for High Throughput Digital Preservation
A presentation by Tessella on their performance testing of ingest into SDB at FamilySearch.
- SDB has been developed since 2002, when the first customer was The National Archives (UK)
- a new customer: the UK Parliament
- test with FamilySearch
- 20 TB ingested per day; scanned material; a workflow with antivirus, characterisation (PRONOM, JHOVE) etc.
- 1 package is roughly 1 GB, 20,000 packages per day
- 2 Dell PowerEdge R710 servers, together costing at most 20,000 pounds
- the limiting factor turned out to be the read speed of the disks holding the data at the start of ingest, so they needed 130 parallel disks (50,000 pounds)
- storing to tape was slow as well, so they needed 8 parallel tape writes (30,000 pounds)
- storage costs 100 pounds per TB
- 7.3 PB per year
- conclusion: writing and reading are slow, tools such as JHOVE and PRONOM are fast enough, and storage costs turned out to be high as well
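The figures in the list above can be cross-checked with a few lines of arithmetic. The sketch below assumes decimal units (1 TB = 10^12 bytes) and a 365-day year; the notes do not say which units the speaker used, and the function names are only illustrative.

```python
TB = 10**12  # bytes; decimal units are an assumption
GB = 10**9
MB = 10**6
SECONDS_PER_DAY = 86_400


def sustained_rate_mb_s(tb_per_day):
    """Average rate needed to move the daily volume within 24 hours."""
    return tb_per_day * TB / SECONDS_PER_DAY / MB


def packages_per_day(tb_per_day, package_gb=1):
    """Number of packages per day for a given package size."""
    return tb_per_day * TB / (package_gb * GB)


def yearly_volume_pb(tb_per_day):
    """Volume accumulated over a 365-day year, in PB."""
    return tb_per_day * 365 / 1000


def yearly_storage_cost_gbp(tb_per_day, gbp_per_tb):
    """Cost of storing one year of ingest at a flat price per TB."""
    return tb_per_day * 365 * gbp_per_tb


print(round(sustained_rate_mb_s(20), 1))  # 231.5 MB/s on average
print(packages_per_day(20))               # 20000.0 packages of 1 GB
print(yearly_volume_pb(20))               # 7.3 PB per year
print(yearly_storage_cost_gbp(20, 100))   # 730000 pounds per year
```

The average of roughly 230 MB/s is what 20 TB per day means over a full 24 hours; a rate several times higher is needed if ingest runs only part of the day or if each package is read more than once during the workflow.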
To ingest the FamilySearch data they had to guarantee a throughput of 20 TB per day while keeping adequate data-processing procedures as required by OAIS and by the customer. The aim of the project was to identify the bottlenecks of ingesting large volumes of data.
Processes such as generating or checking hashes, identifying formats and extracting technical metadata usually demand a lot of performance and, at large volumes, a fast storage system. In the FamilySearch project they want to ingest into SDB at up to 700 MB per second (content acquisition, content preparation, ingest, fixity check, content metadata integrity check, and characterisation, i.e. format identification and validation plus extraction of technical metadata).
The question is how to parallelise such a massive workflow efficiently while minimising costs. They found that parallelisation gets around the performance problems of tools such as DROID and JHOVE; contrary to their expectations, overall software performance was not a problem. The bigger problems are in the hardware, which has to be able to write fast enough.
In other words, the bottleneck was in the hardware and in moving data from place to place rather than in the performance of the digital preservation tools.
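A fixity check run in parallel over many files might look like the following minimal sketch. It is not Tessella's SDB code; the function names, the choice of SHA-256, the chunk size and the worker count are all assumptions made for illustration. Threads are used because the work is dominated by disk reads, which matches the observation that storage, not software, was the bottleneck.

```python
import hashlib
from concurrent.futures import ThreadPoolExecutor
from pathlib import Path
from typing import Dict, List

CHUNK = 1024 * 1024  # read 1 MiB at a time so large packages are never held in memory whole


def sha256_of(path: Path) -> str:
    """Stream a file through SHA-256 and return the hex digest."""
    digest = hashlib.sha256()
    with path.open("rb") as fh:
        for block in iter(lambda: fh.read(CHUNK), b""):
            digest.update(block)
    return digest.hexdigest()


def check_fixity(expected: Dict[Path, str], workers: int = 8) -> List[Path]:
    """Return the paths whose current digest differs from the recorded one."""
    paths = list(expected)
    with ThreadPoolExecutor(max_workers=workers) as pool:
        actual = list(pool.map(sha256_of, paths))  # results come back in input order
    return [p for p, d in zip(paths, actual) if d != expected[p]]
```

Calling `check_fixity` with a mapping of file paths to the digests recorded at ingest returns an empty list when nothing has changed. Raising `workers` only helps until the disks are saturated.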
Benefit for the NK
There is no need to fear the performance of software such as DROID or JHOVE.
Ross King: Evolving domains, problems and solutions for LT DP
- information on projects such as SCAPE
- programme report: http://cordis.europa.eu/fp7/ict/telearn-digicult/report-research-digital-preservation_en.pdf (Stephan Strodl, Petar Petrov, Andreas Rauber, all Vienna University of Technology, Austria). A nice timeline of preservation projects, a white paper about the past of European DP
- funding for DP research is gradually growing; projects and money do not solve anything by themselves
- ARCOMEM: web archiving, a socially driven web preservation model
  o social web analysis
  o archive enrichment
- ENSURE: evaluation between cost and value, automation of the preservation cycle; testbeds: healthcare, clinical trials, financial services
- SCAPE
  o preservation planning and action workflows and how to make them scalable
  o building an infrastructure for scalable preservation actions
  o development of a policy-based preservation planning tool with automated preservation watch
  o 3 testbeds: web archives, large-scale repositories, research data sets
- all the projects will produce prototype software
- digital lifecycle approach
- preservation planning plays a role in all these projects, together with virtualisation
- a slide with DP trends over recent years
- research on digital preservation within projects co-funded by the European Union in ICT:
  o ENSURE
  o SCAPE
  o Wf4Ever, http://www.wf4ever-project.org/about
  o TIMBUS: software is not enough; it focuses on the context of organisations, since LTP is not only about objects but also about services etc.
  o TOTEM
Benefit for the NK
Follow projects in the field of long-term preservation of digital data. Recent EU projects such as SCAPE will speed up the development of concrete tools for long-term preservation of digital data.
Erik Borglund: Record keeping in temporary command settings
- preserving documentation of crisis situations produced by the activities of the police and similar bodies
- how to capture context: flipcharts, videos and minutes can be kept, but what about the context?
- analogue documents are not a problem; the problem lies with digital material and conversations
- the national archive should take care of this, but it only accepts paper documents or, for example, photographs from the meeting place
- open question: is archiving the case files the same as archiving the course of the meeting in digital form?
Web archiving session
BnF
- 200 TB of web-archived data, 15 million ARC files
- they have to characterise and validate them, which is time-consuming
- they store them in a shared repository; SPAR (the LTP system of the French national library) has a capacity of 16 PB
- they use JHOVE2 for characterisation and are building a module for ARC files
- they do not want to characterise and validate the content of the ARC files, only identify formats
- PREMIS in METS would be too long, so they will record only metadata at the level of the information package (AIP) that are the same for the whole package; that is, a property is stated once and only the list of files it applies to is attached, instead of repeating the information for every file
- they created a special metadata format for this
- they can therefore ask the LTP system "give me all information packages that contain format XY" and the like; there is no need to index the metadata of the contents, which would take a long time
- they take the same approach for digitised books
- different DP policies and validation levels for different types of web archive data: complete harvests vs thematic harvests
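The package-level approach can be pictured with a small data-structure sketch. It does not reproduce the BnF metadata format or any PREMIS or METS schema; the classes `PackageProperty` and `InformationPackage`, the function `packages_with`, and the sample identifiers are hypothetical and only show the idea of stating a property once and listing the files it applies to.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class PackageProperty:
    name: str    # e.g. "format"
    value: str   # e.g. "image/gif"
    files: List[str] = field(default_factory=list)  # records the value applies to


@dataclass
class InformationPackage:
    package_id: str
    properties: List[PackageProperty] = field(default_factory=list)


def packages_with(packages, name, value):
    """Answer 'give me all packages that contain format XY' from package-level metadata only."""
    return [
        p.package_id
        for p in packages
        if any(q.name == name and q.value == value for q in p.properties)
    ]


# Invented sample data: the format is stated once per package, not once per file.
aip1 = InformationPackage("aip-001", [
    PackageProperty("format", "text/html", ["rec-1", "rec-2", "rec-3"]),
    PackageProperty("format", "image/gif", ["rec-4"]),
])
aip2 = InformationPackage("aip-002", [
    PackageProperty("format", "text/html", ["rec-1"]),
])
print(packages_with([aip1, aip2], "format", "image/gif"))  # ['aip-001']
```

The query touches only one entry per distinct property value in each package, which is why the contents themselves do not need to be indexed.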
NL NZ
- 2 harvests, 20 TB in total
- they are working on metadata: how much metadata is a lot and how much is too little
- library policy says that as much metadata as possible must be stored, but the sheer size of the metadata would then be a problem
- for selective web harvesting they have a finished WCT workflow; everything is catalogued
IA
- 16 billion URLs, the oldest from 1996
- growth is 3 TB per day, 1 PB per year
Euan Cochrane, Dirk von Suchodoletz: Replicating Installed Application and Information Environments onto Emulated or Virtualized Hardware
- capturing and preserving a complete environment on emulated hardware
- e.g. take the desktop environment of a prime minister and store it in the archive
- rendering problems
- computer forensics
- an option for preserving scientific data and records
- the whole thing is about how to replicate an HDD and run the environment on it in a virtual environment
- solution: they pulled the HDDs out of several old PCs
  > identify the hardware requirements (analysis of the HDD > the requirements are estimated automatically; this information is part of every PC environment)
  > choose the emulation or virtualisation software (a tool registry such as TOTEM from the KEEP project)
  > convert the HDD into a disk image suitable for emulation
  > try to boot the disk image on the emulated hardware
  > add drivers
- problems with licences, personal data protection and authenticity (20 % of things change, colours etc.)
- QEMU, SPARC processor emulator
Klaus Rechert: Remote Emulation for Migration Services in a Distributed Preservation Framework
- emulation used as a migration tool
- migration tools are often not available for certain formats
- the digital object is placed into an emulated environment (a virtual machine), where it becomes visible inside the emulated system; it can be opened in the original or another suitable application, saved in a different format and stored back in the virtual machine
Bram Lohman: Emulation as a Business Solution: the Emulation Framework (KEEP project)
- the Emulation Framework: 7 emulators, 6 platforms (x86, Amiga etc.), 23 file formats
- a solution for managing emulation tools and setting up emulation processes
- an environment that contains emulators; an application or file loaded into it should run as in its original environment
- the environment also contains a tool that, based on PRONOM, shows what format a file is and what environment is needed to run it; that environment can then be prepared straight away and the file run in it
- it loads the software image from the OPF application database, which is being built
Geoffrey Brown: Developing Virtual CD-ROM Collections: The Voyager Company Publications
- CD publications of one particular publisher, interactive applications for the Mac; of about 200 titles published only about 50 are available today
- emulation on today's systems
- HDD snapshot directly in the emulator, so it takes one click and is very fast
- SheepShaver emulator
Evaluation of a large Danish migration project
- before 1998 there were no formats prescribed by law
- between 2005 and 2008 they introduced standards
- the evaluation covers the prescribed standards and the migration to them in the national archive
- the evaluation was done for the funder
- between 2005 and 2008 they spent 30 person-years on migration with 10-15 people; 190 thousand USD was invested; total costs were 26 million USD
- in reality they did not migrate much data, about 1777 GB
- various parts of the archive: tapes, population data, data on CD-R, registries and data filled in electronically
- they could not read all the files, especially those on tape
- 5 different types of tape
- some had to be rescued at great expense
- total cost of producing the preservation standard: 10 man-years, 12 thousand USD, including the manual and implementation recommendations
- pilot: project planning and management, and verification of the information packages
- the aim of the pilot was to obtain a better budget and project plan
- some problematic data in old formats, such as old databases, need smart people who do repetitive, long-running work; they needed good knowledge management to make it efficient
- migration method: they wrote requirements for a tool and a description of how manual migration should be done
- data preparation (restructuring the data and registering the metadata of the IPs) and preparation of documentation for the migrated IPs
- software development: in-house
- they needed 50 person-years per 1
Conclusions
- migrating standard data is cheaper, migrating from some standard tapes is cheaper, etc.
- most of the costs, 80 %, went on non-standardised data, in producing the migration software; developing a tool for migrating heterogeneous or non-standard data is the most expensive part
- what they learned: their old data had not been analysed well enough
- project management was loose and they lost money
- knowledge management: a good description of old data and all their types, generations, locations etc. does not exist at our institution and we will have difficulties with it; migration of old data at the NK will be problematic
Angela Dappert: robust migration workflow for offline media
- what is an archival object (nice slide): a CD is not an archival object, for them it is a hand-held carrier; better is a bit-stable object, which can have a backup etc., and on top of it an archival object that carries further metadata (logical preservation)
- a CD is not searchable, cannot be replicated easily, has a large manual overhead, and its rendering technology becomes obsolete very quickly
- Endangered Archives project: optical disks, CD-R, external HDs, tapes, 67 terabytes in total
- the offline hand-held carriers in the Endangered Archives project were very varied; they contained data with DRM, under copyright, and with a number of similar problems
- options they chose between for each data source:
  o a disk image: a single file containing everything on the carrier
  o or extraction of selected files only
- how important is the carrier itself? We need some information about it; it may hold traces of deleted data that we might want to keep. They made disk images of all sorts of things: hybrid DVDs, audio discs that also contained data, etc.
- which disk image format should they use? Not a single format for all data, but a specific disk image format for each type
- they did it with robots (disk copying robots); sometimes large-scale disk copying robots could not be used, as they are good at producing CDs but not at ripping data from them
- they built their own application with disk stacks and some smaller robots; they tried LIFO and FIFO and ended up with FIFO, because LIFO had problems with the CD lifter
- at the KB they thought through a fairly complex workflow, how to describe it, etc.
- each robot had its own PC
- they had problems with a number of things (see the presentation)
- they did not find good software for managing images, only command-line tools, and non-technical staff would make many mistakes with those
- it takes a lot of people before the content gets online
- the staff must be well trained and flexible, but also able to do tedious jobs: systematic, patient
NOTE: important for the NK, where the transfer of data from disks will also have to be dealt with, and already has been
KEEP project, Antonio Ciuffreda: towards an integrated migration environment
- disk transfer tool: converts a disk into an image file, which also holds further metadata about the file system, the MD5 of the file, etc.
- KEEP is producing a Transfer Tool Framework
- magnetic media: disk transfer tools for floppy disks, commercial and open source
  o Disk2FDI: a commercial DOS tool; makes a very accurate floppy disk image, but takes 1 hour and the result is very large, ten times bigger than the floppy disk itself. About 2260 disks were tested, and they tested emulation
  o Catweasel: a commercial tool, a PCI card; runs on Linux and Windows XP, has a GUI. High error rate but faster; the quality of the image file was low
  o Nibtools: a free tool, G64 and D64 formats, covers only the C64; DOS, Windows, Linux, but command line only. Needs a Commodore disk drive and special cables. They tested a few disks; about half of them then did not work in the emulator
- optical media: they used 5 transfer tools, each on the same CDs, DVDs and games
  1. Alcohol 120%: commercial; can bypass DRM etc.; supports Windows systems. 12 out of 13 worked; 2 MB per second
  2. Daemon Tools: commercial; several image file types (ISO, MDS, MDF); supports Windows; three out of 13 did not work
  3. CloneCD: commercial; uses IMG or ISO; bypasses SafeDisc 3 protection; supports Windows; has a GUI; 11 out of 13 were OK
  4. BlindWrite: commercial; supports DVD, Blu-ray, WM, Xbox and other special discs; generates ISO and some proprietary ISO image formats; one did not work; download speed
  5. ImgBurn: does not read subchannel information into the image file (a film cannot be seeked, etc.); open source; generates DVD, BIN, CUE, IMG; Windows and Linux; 4 did not work; it is fast
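The optical-media results can be put on one scale with a short calculation. The sketch assumes that every tool was run against the same set of 13 discs; the notes give the denominator explicitly only for some tools, so the figures for BlindWrite and ImgBurn rest on that assumption.

```python
TOTAL_DISCS = 13  # assumed to be the same test set for every tool

# Number of discs that failed, as recorded in the notes.
failed = {
    "Alcohol 120%": 1,   # 12 of 13 worked
    "Daemon Tools": 3,   # three of 13 did not work
    "CloneCD": 2,        # 11 of 13 were OK
    "BlindWrite": 1,     # one did not work (denominator assumed)
    "ImgBurn": 4,        # 4 did not work (denominator assumed)
}


def failure_rate_percent(n_failed, total=TOTAL_DISCS):
    """Share of discs that could not be imaged, in percent."""
    return round(100 * n_failed / total, 1)


rates = {tool: failure_rate_percent(n) for tool, n in failed.items()}
for tool, rate in sorted(rates.items(), key=lambda item: item[1]):
    print(f"{tool}: {rate} %")
```

Under this assumption the failure rates run from about 8 % (Alcohol 120%, BlindWrite) to about 31 % (ImgBurn), with Daemon Tools at about 23 % and CloneCD at about 15 %.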
Conclusions
Magnetic media: commercial and non-commercial tools do not differ in performance; Disk2FDI is accurate but very slow. KEEP will use NibTools.
Optical media: bear copy protection in mind; the tools perform similarly and there will always be errors in the images, between 10 and 30 percent. BlindWrite can handle game discs (Xbox etc.). KEEP will use ImgBurn because it is open source.
For complex images BlindWrite is better.
Benefit for the NK
Consider whether it would be appropriate for the NK to actually run a project to migrate the content of CDs and DVDs to online media. The speakers presented concrete experience with robotic processing and showed the problems they had with designing the workflow, choosing the type of ISO image, etc.
BnF web archiving
They have three layers:
- Harvest definition (collection)
- Harvest instance (crawling metadata)
- ARC files
A collection will be, for example, the selective harvest "elections 2000"; then come the individual harvest instances and then the ARC files. They collect logs, configuration and reports. These are stored in ARC as well: a special metadata ARC for each crawling instance.
PREMIS
Object, agent, event
Objects:
1. ARC files and metadata ARCs
2. harvest instances
Harvest event in PREMIS: an event of creation of content files
Events: reports as an extension of the event (host report and harvest report)
Agents: administrators, software, institutions, organisations that perform the harvest
ContainerMD
http://bibnum.bnf.fr/containerMD-v1/documentation/containerMD-v1.html
special metadata for web archive material
http://bibnum.bnf.fr/containerMD-v1
Different SLAs for different types of material and for data from different harvests. The shared repository is expected to bring various benefits: skills for different formats need not be duplicated within the institution.
Next year there should also be a JHOVE2 module for WARC.
Memento: meta search
Benefit for the NK
Their web archiving model could be used at the NK.
Cost models: Danish national library + TU Wien
- Stephan Strodl (TU Wien): they have their own cost model, but it seems to cover only the cost of small-scale automated preservation actions
- the Danish national library built its own model, which is meant to be universal and usable anywhere
- they measured the cost of submission according to the PAIMAS standard
- when calculating costs they use OAIS and PAIMAS: they map activities onto these models and estimate process costs accordingly
- costmodelfordigitalpreservation.dk
Benefit for the NK
Relevant to project 0136, which dealt with ways of estimating the costs of long-term storage.
Meet RODA, a Full-Fledged Digital Repository for Long-Term Preservation
- originally a project of the Portuguese national archive, which we have been following for several years. RODA is now supported, and partly further developed, by an independent company. So far RODA supports only an archival metadata format (EAD), but further development should include library formats as well
- RODA is now part of the SCAPE project, where the system can be developed further and scaled for use in massive production
- http://redmine.keep.pt/projects/roda-public
Benefit for the NK
Follow further development, possibly also for the INCAD + KNAV projects; for developing LTP for smaller institutions this could be an interesting alternative in the future.
Report on a business trip abroad
Participant's name: Mgr. Andrea Fojtů (AF)
Workplace according to the organisational structure: 1.5 Department of Long-Term Preservation of Digital Data (ODODD)
Workplace assignment: 1.5.2 Department of Digital Repository Content Management
Reason for the trip: participation in the conference Aligning National Approaches to Digital Preservation
Place, city: Tallinn
Place, country: Estonia
Date (from-to): 22-26 May 2011
Detailed schedule:
Monday 23 May: conference registration; guided tour of the National Library; Keynote Address by Laura Campbell (Library of Congress, USA); Panel 1: Technical Alignment; Panel 2: Organizational Alignment
Tuesday 24 May: Keynote Address by Gunnar Sahlin (National Library of Sweden); Panel 3: Standards Alignment; Panel 4: Legal Alignment; Breakout Sessions for panels 3 & 4
Wednesday 25 May: Panel 5: Education Alignment; Panel 6: Economic Alignment; Breakout Sessions for panels 5 & 6; Synthesis; Closing remarks
Fellow travellers from the NK: PhDr. Bohdana Stoklasová (BS), Ing. Tomáš Svoboda (TS)
Funding: IOP NDK
Aims of the trip: attendance at a conference with international participation; making contacts in the field of long-term preservation of digital documents and electronic legal deposit; a more detailed insight into the issues of long-term preservation of digital documents, especially in national libraries
Fulfilment of the aims (specifically): The conclusions of the conference Aligning National Approaches to Digital Preservation largely mirror the conclusions of the workshop The Future of the Past: Shaping new visions for EU research in digital preservation (report available at http://cordis.europa.eu/fp7/ict/telearn-digicult/future-of-the-past_en.pdf), for example regarding the missing economic model for the commercial sector, which would treat long-term preservation as an integral part of all its processes. Contact was established with staff of the national libraries of Estonia and Finland (staff working on web archiving and on long-term preservation in general).
Programme and further details: The main aim of the conference was to align national approaches to long-term preservation of digital documents across all areas, from technical, organisational and educational to standardisation, economic and financial.
Materials brought back: conference programme, leaflets of exhibiting companies (Tessella, Equella, Guardtime) + other materials, notes
Date of submission of the report: 8 June 2011
Signature of the person submitting the report:
Signature of the superior:
Posted on the intranet:
Received by the international department:
Attachment to this report: Notes from the conference in English
Attachment: Notes from the conference in English
Exploring What We Can Do Together: Strategic Alliance for International Collaboration (Laura Campbell)
- 185 digital preservation partners in more than 25 countries (education, research, LAM)
- strategic goals: National Content Stewardship Network (national digital collection, technical architecture, public policy, outreach)
- NDIIPP Content Domain Map: a mind map of geospatial, audiovisual, image & text and web content
- then & now: cognitive surplus vs digital libraries/digital preservation
- solution framework: actively working together, special interest groups, establishing a common index, international digital collection (freely available)
PANEL 1: Technical Alignment (The role of testing), Prof. Dr. Michael Seadle (Panel Head)
- to collaborate on requiring and implementing rigorous and independent tests
DNB Contribution to the Tallinn Alignment (Sabine Schrimpf)
- key theme is infrastructure
- a network of hardware and software that permits the operation of application software
- the question of interoperability is crucial (standards, technical specifications)
- software elements
- components of the DP infrastructure were compared to the pallets at railroads
  o source: PARSE.Insight Roadmap 2020
- persistent ID resolvers
- certification process
- kopal (ingest, koLibRI)
- nestor (German Network of Expertise)
- DP4Lib, Digital Preservation as a Service: reduce dependency between components
  o redundant storage at different locations
  o koLibRI modules
- LuKII: set up as an economical LOCKSS network in Germany
- SHAMAN
- APARSEN: wants to bring coherence and cohesion to digital preservation research
  o trends in DP research projects
  o modular DP systems
  o distributed as SOA
  o elimination of technology dependencies
EDINA: The UK LOCKSS Alliance (Adam Rusbridge)
- EDINA offers underlying technical support & coordination
- threats to digital stewardship:
  o failure (media, HW, SW, network, format obsolescence, natural disaster, economic/organization failure)
  o attack (insider/outsider)
  o operator error
- source: Requirement to Digital Preservation
- projects PEPRS and PECAN help identify coverage and requirements for DP
Public testing (Michael Seadle)
- traditional physical archiving relies heavily on trusted institutions
- distrust, not trust, needs to be the basis of digital archiving; testing therefore plays a key role
- goals of testing: demonstrate functionality, reveal weaknesses, provide data for planning improvements
- key issues for testing: integrity, authenticity (can the origin or genuineness be shown?), usability (can migration/emulation be demonstrated?), access, financial integrity
- DrWho (drwho1com)
- bit stream testing is the most important; authenticity and usability may be impaired
  o the type of storage media, the number of copies + the frequency of checking and replacement get us to the relevant results
  o no reliable metrics exist, however (what is an acceptable loss, etc.)
- without well-documented, peer-reviewed, publicly available test results, librarians are buying archiving systems on faith
Presentation without a title (Andy Rauber)
- evaluation vs testing vs benchmarking
- DP: evaluation rather than testing, and far from benchmarking (a few tests, but nowhere near a definition of benchmarks)
  o existing evaluations are not repeatable
  o focus on the simple things
  o building the frameworks before having clear test scenarios
- necessary to move towards comparative benchmarking
- what is needed: a commitment that we want a culture of benchmarking and comparative evaluation; an understanding of what we want to benchmark; benchmark data + ground truth; measurement scales and measures that remain constant; knowledge bases to collect these
Organizing digital preservation on an international level (Michelle Gallinger, NDIIPP)
- focus on a national DP agenda
- community-driven, action-oriented (National Digital Stewardship Alliance)
  o present a distributed national digital collection for the benefit of citizens
The European Research Arena (David Giaretta)
- technologies: GEANT, EGEE/EGI
- EU research projects: TIMBUS, BLOG4EVER, SCAPE, ENSURE, APARSEN, ARCOMEM, WF4EVER
  o SCIDIP-ES (2011-2014)
- Alliance for Permanent Access (APA) formed as a legal entity 3 years ago
  o opportunities for networking
- ISO 16363: Audit and Certification of Trustworthy Digital Repositories
- ISO 16919: Requirements for Bodies Providing Audit and Certification
Observation from the MetaArchive Cooperative program (Martin Halbert)
- distributed DP programs differ from other programs
  o replication of content, distribution of these replicated copies to distinct geographical locations, and a network organization to connect these replicated copies
- MetaArchive established in 2003, funded by NDIIPP
  o seeks to foster broader awareness of digital preservation issues
- IIPC: members are all institutions that focus on WA
  o 39 members: national and university libraries + other organizations (Internet Archive)
  o ISO standard WARC format for web archives + Heritrix and NutchWAX
  o growing membership (Africa & South America)
DAY 2
Keynote address: International and National Collaboration in the digital age (Gunnar Sahlin)
- 2012: a new law for e-deposit
- Samsök (search together) in 2005 (upgrade, new system)
- SwePub and long-term preservation
- consortium of the Swedish research libraries for licensing e-journals and databases (ICOLC)
- Open Access and e-publishing (all universities have their repositories for e-pubs)
- NL as aggregator for Europeana: TEL, Apres, Athena, EU screen
  o common system for the preservation of digital materials
  o common search portal for materials from the Swedish National Library and Swedish National Archive
Raivo Ruusalepp
- standards: RAC, DSA, CIDOC (CRM), PAIMAS, ISAD(G), DDI
- use of information security standards for digital preservation
- information security: administrative and technical (physical = data security vs IT = communication)
- a company implemented security measurement with typical cyber-crime scenarios
- survey of security
  o provision for information security in national legislation and development plans (12 of the respondents: ISO 27000 series, only 2 formal audits, the rest are looking into it; ½ of the respondents do not use standards or formal measurements to control information security)
  o IT & disaster plan: 65 (data recovery from the off-site location tested: 0)
- alignment
  o better use of community standards for information security and preservation
  o agreement on security requirements
Standards based approach to preservation planning MatthewWoollard
ISO 27001 very expensive implementation 100 000 poundBasic Data Seal of Approval Guidelines (helps understand your business better)Audit And Certification of Trustworthy Digital RepositoriesMemorandum of Understanding to Create a European FrameworkISO 16363 external DSA or ISO 16363 self audit
Best Practices amp Standards Bram van der Werf
self assessment trust auditscertification trustISO 30300 (draft) Record Management
PANEL 4 Legal Alignment
Legal Deposit & Web Archiving, Adrienne Muir
legal deposit provisions: purpose, scope, deposit mechanisms, roles & responsibilities & liability, access provisions, sanctions
implementation: definitions, scope (offline/online, freely available/pay wall), technology-neutral/incremental
legal deposit vs voluntary (interim/hybrid approach, model agreements and licenses, flexibility)
other legal issues: intellectual property rights, preservation, access, unlawful material, privacy/data protection
voluntary approaches have disadvantages but may be necessary and can be useful
Breakout Session
standards bring along alignment by themselves, but only if you use them to the full, not halfway
depends on community (users): enforceable or voluntary compliance (standards)
next steps for standard alignment
o corpora as benchmarks
o export/import completeness
o educational standards
o validation tools
o accredited training courses to accredit auditors
o framework standards
DAY 3
PANEL 5 Educational Alignment
key elements of the DCC Curation Lifecycle Model
Framework for the Education Alignment (USA: grad programs for digital preservation, new models for grad programs, internship programs (diverse knowledge), workshops)
o sharing tools
o national programs
issues related to digital preservation
o nature of costs and business models
o strategies for selection & appraisal
o ground roles and responsibilities
o effectiveness and demand for services
focus of the panel: factors influencing the actual sustainability of a digital archive
2 considerations: collaboration + user demand
challenges + gaps span national boundaries: public + private funding, education, exportation, DP certification, competition, funding gaps, policy, selection criteria, roles & responsibilities, standards
Magazzini Digitali and legal deposit in Italy
PADI: a failed project (discontinued in 2010)
Presentation without a title, Neil Grindley
how much does it cost to manage information? what institutional financial strategies are required to facilitate effective preservation? what general economic frameworks are required to enable information to persist and be accessible?
JISC 2010 Infrastructure for Education and Research Programme
archival storage and preservation activities constitute a very small proportion of the overall costs: 15%; access 31%; outreach, acquisition, ingest 55%
o approx. 333 Euros for a set of 1000 records
KRDS2, p. 83: future tool development supporting automation of ingest
Sustainable Preservation in North America: ADPNet & Friends, Aaron Trehub
solution: distributed digital preservation (in at least 3 copies vs LOCKSS 6 copies)
DDP + LOCKSS = PLN, open SW developed at Stanford
MetaArchive, COPPUL (Canada), ADPNet (www.adpn.org)
Salt Lake City, USA
Dates (from-to): -
14.5. departure from Prague; 16.5. PREMIS workshop; 17.-19.5. Archiving 2011 conference; 20.5. departure from SLC
15.5. departure Washington-Detroit-Salt Lake City; 15.5. workshops T1D Color in Image Capture Archiving and T2A Scanner & Camera Imaging Performance: Benchmarking, Compliance and Workflow Monitoring; 17.-19.5. Archiving 2011 conference
NK, Archiving conference, IOP-NDK
0136
N/A, 6.6.2011
Date: 6.6.2011, signature; date, signature
Date, signature
PREMIS tutorial
----------------------------------------------------------
Implementation of a High Performance Architecture for Managing and Storing Web-Harvested Collections, Michael Smorul and Joseph JaJa, University of Maryland (USA)
https://wiki.umiacs.umd.edu/adapt/images/6/6b/Archiving11-smorul.pdf
- web archiving
- WARC Manager application
Using Tape for Large-Scale Digital Preservation, Gary Wright, FamilySearch (USA)
- (The Church of Jesus Christ of Latter-day Saints)
NDK - DRPS Ingest Tools. Storage type: storage grid, NetApp FAS3170
information lifecycle management; optimization of the storage layer between the Grid and Rosetta
- human error
- HW errors
- data integrity validation
- maximization
- integrity verification
metadata conten
Moving On: When it is Time to Re-Archive, Michael Selway, Quantum Corporation (USA)
there is no other way with HDDs
Data migration: how to approach it
- HDD: years
split migration (full split migration)
FamilySearch: An End-to-End Process for Scanning, Characterizing, Preserving and Providing Access to Very Large Collections of Vital Records, Tom Creighton, FamilySearch (USA), Jonathan Tilbury, Tessella plc (UK), and Mark Evans, Tessella Inc. (USA)
- at FamilySearch: microfilms from the mid-19th century
- DPS je jejich s - -
The Audit and Certification of FDsys
- FDsys audit
TRAC, http://www.gpo.gov/fdsys
- TRAC: HathiTrust, UNT, Portico, MetaArchive, Chronopolis
- compliance scale 1-
How Long is Long-Term Data Storage? (Focal), Barry M. Lunt, Brigham Young University, and Douglas Hansen, Wayne Rust and Mark Worthington, Millenniata Inc. (USA)
- FLASH technology 10-
- electron leakage
- and yet errors appeared
- Library 1: 21 - Library 2: 18
- 1050 years - HDD 1-7 years
Quality Assurance of Digital Information in Long-Term Digital Preservation University of Technology (Sweden)
- significant properties - in documents
Towards Interoperable Preservation Repositories: Repository Exchange Package Use Cases and Best Practices, Joseph Pawletko, New York University, and Priscilla Caplan, Florida Center for Library Automation (USA)
- TIPR
- RXP: METS and PREMIS semantics; metadata files are not inside the METS but alongside it
- succession
- disaster recovery
- as AIP
- SW migration
- diversification
- data migration
SARKK Comprehensive Digital Archive Services for Finnish Municipalities
-Savon Tietohallinto Oy (Finland)
- a company in ICT
- preservation planning functionality
Magnetic Tape Technology: economic advantages for preservation, Gary Francis, Oracle (USA)
- Oracle presentation
- Clipper Group 2010: "In Search of the Long-Term Archiving Solution: Tape Delivers Significant TCO Advantage over Disk"
- 5 TB per cartridge (Sun/Oracle)
-----------------------------------------
Color In Digital Preservation, Robert Buckley, University of Rochester/NewMarket Imaging, Steven Puglia, National Archives and Records Administration, and Michael Stelmach, Library of Congress (USA)
o colour gamut
AdobeRGB, ProPhoto RGB
- and colour reproduction
Multispectral Image Archiving of Watermarks in Historical Papers Peter Meinlschmidt Wilhelm-Klauditz-Institut Fraunhofer-Institute for Wood Research and Volker
Braunschweig (Germany)
- Reflected light
- Transmitted light
- multispectral imaging
Implementing a Quality Assurance Program for Monitoring Scanner Performance, Michael J. Horsley and John T. Berezich, National Archives and Records Administration (USA)
- microfilm, scan
- NARA 2004 Guidelines
https://docs.google.com/viewer?url=http%3A%2F%2Fwww.archives.gov%2Fpreservation%2Ftechnical%2Fguidelines.pdf
- Metamorfoze
- etc., see slides
- quantitative performance slides
- web-based database, SharePoint
Preservation in a Digital Age, Jay Verkler, FamilySearch (USA)
- with Tessella for SDB
- data loss is intrinsic
- preservation as a service
Curation of the End-of-Term Web Archive: Classification and Metrics, Kathleen Murray, Lauren Ko and Mark Phillips, University of North Texas (USA)
- eotcd archive: http://research.library.unt.edu/eotcd/wiki/Main_Page
- 16TB of data
- (not WARCs)
- then joined together
DAITSS Grows Up: Migrating to a Second Generation Preservation System (Focal), Priscilla Caplan and Carol Chou, Florida Center for Library Automation (USA)
- DAITSS 1 could not be installed elsewhere
- instructions
- in processes
- PSP
- refresh function
- disseminate: do a refresh and export the new AIP as a DIP
- withdraw: remove the AIP from storage, retaining provenance
- 80TB
A Community Driven Micro Services Architecture Supporting Long Term Digital Preservation, Mark Evans and Bill Steel, Tessella Inc. (USA), and Robert Sharpe, James Carr, Alan Gairey and Jonathan Tilbury, Tessella plc (UK)
- NDIIPP, micro-services
- joined into a process
- workflow
- bit-stream
- cloud (S3, http://aws.amazon.com/s3)
- SaaS functionality; multi-tenancy coming soon
- functionality, policy etc.
Note from the USA
- 44TB SIP in testing (10MB JP2 files)
- SDB community, founded 2008
System
Lisa LaPlant and Blake Edwards, US Government Printing Office (USA)
FDsys, OAIS, MODS
<part>2</part>
DMD: data model definition
http://www.gpo.gov/fdsys/search/pagedetails.action?st=prague&granuleId=&packageId=DCPD-200900228
-----------------------------------------
Preservation Starts from the Beginning, Michael Wash, US Department of Transportation (USA)
Autographic Kodak 1916
"following the No. 3A Autographic Kodak Special of 1916, which was the first rangefinder camera. It had a Kodak Anastigmat f/6.3 lens and a Kodamatic shutter with speeds from 1/2 to 1/200 sec, plus bulb and time mode" (http://camerapedia.wikia.com/wiki/No_1A_Autographic_Kodak_Special)
Kodak Advantix
Colorite: A Flexible Cross-Platform Software Solution for Automatic Image Quality Analysis Using Arbitrary Targets, Henrik Johansson, National Library of Sweden (Sweden)
o the target is detected automatically
o TIFF, JPEG, JP2, PNG
o state-of-the-art feature-based image matching algorithm, ImageMagick
o can be switched off in the future to speed up the process for BATCH processing
What if the Image Quality Analysis Rates My D, Dietmar Wueller, Image Engineering (Germany)
- Golden Thread
- UTT target
o
-lens reflex
Establishing Resolution Requirements for Digitizing Transmissive Content: A Use Case Approach, Michael Stelmach, Library of Congress, Don Williams, Image Science Associates LLC, and Steven Puglia, National Archives and Records Administration (USA)
- Based on 10% SFR limiting resolution criteria: how much of the image information will be captured
o 1st half of the 20th century: 1200-1600 PPI
o 2nd half of the 20th century: up to 2800 PPI
Digitise More, Pay Less: Optimising the Workprocess for both Heritage Institute and Imaging Provider, Olaf Slijkhuis, Pictura Imaginis (the Netherlands)
- document, quality, parameters etc.
- image control
Discussion with FamilySearch, 20.5.2011 -------
- also on the topic of Digital Preservation
- will have to provide free of charge
- and synchronize
- an effort on their part to cooperate with archives and libraries
CO-FINANCED BY THE EU STRUCTURAL FUNDS (EUROPEAN REGIONAL DEVELOPMENT FUND) THROUGH THE IOP
Business Trip Report
Project: Creation of the National Digital Library
CZ.1.06/1.1.00/07.06386
Name and surname of traveller: Jan Hutař
Workplace per organizational structure: ODF 81
Position: head of department
Purpose of trip: attending the iPRES 2011 conference
Place, city: Singapore
Place, country: Singapore
Dates (from-to): 30.10.-5.11.2011
Detailed schedule: 30.-31.10. flight Prague > Dubai > Singapore
1.11. start of conference, tutorials
2.11.-4.11. conference
4.-5.11. return flight Singapore > Dubai > Prague
Fellow travellers from NK: Mgr. Marek Melichar (funded from project 0136)
Funding: IOP, Creation of the National Digital Library
Relation to project: gaining new information on digital preservation and on projects in other libraries; consultations with colleagues and companies
Trip objectives: see relation to project; use all outputs for planning and running the NDK project and for future handling of digital preservation at NK/NDK
Fulfilment of trip objectives: met; see the detailed notes below and the proceedings at SPS
Further details: SUMMARY AND BENEFIT TO THE NDK PROJECT
a noticeable rise of emulation as an approach to long-term preservation (in previous years it was migration); the two approaches will apparently complement each other
a shift toward preserving complex data (databases etc.); NK does not address this yet
many papers applicable to NK and NDK as well (web archiving and preservation at the National Library of France, audits, emulation, information on the SDB system (Tessella) and on the RODA system, certification, see Rouchon, etc.)
information on problems and solutions when using tools such as JHOVE, PRONOM and others in a real environment
2 papers on backing up optical discs, a current problem at NK too; ideally follow the procedures described in the proceedings
a clear need for international cooperation and adherence to standards so that such cooperation is possible
see below for details
Project publicity support: N/A
Related materials
Material / Storage location
conference proceedings / SPS, folder with business trip reports
Report submission date: 15.11.2011
Signature of report submitter
Date, signature
Supervisor's signature: 15.11.2011
Posted on intranet
Received by the international department
Seamus Ross: digital curation and preservation
- preserving data sets: use of statistical databases and research data vs preservation of textual information
- personal data is no longer just photos in a box (Flickr etc.)
- banks: lots of personal data, future use for historians, must be kept; institutions do this too
- see mckinsey.com, big data, full report (pdf)
- so why do DP (slide 16): future generations expect it; so that historians and scientists have some sources; a legacy about the present for the future; an information ecosystem to enable storytelling
- emphasis shifts from preserving textual information to preserving complex databases
A capability model for DP, Ch. Becker et al.
- research within the SHAMAN and SCAPE projects
- SoS: systems of systems
- 3 kinds of systems (UML)
- DPS as a functional requirement
- SoS: a business system, a system within a system; data then flows into the DPS
- a DPS where DP is not a functional requirement but which does it anyway (DP-ready system): a business system with DP functionality
- but how to get DP into enterprise systems: a model for implementing DP in any system, within the SHAMAN project: capability-based reference architecture
- governance, business and technical (operation) capabilities: a basis for decisions and for assessing the current state
- capability maturity model (CMM): processes for assessment and improvement, as in SW development
Olivier Rouchon: certification and quality at CINES
- they store theses, digitized items, multimedia documents, scientific data sets
- data centre for the whole of France
- they have format experts (XML); 11 people
- 15TB of data
- certification: national law; CINES is the national centre for DP of theses; they have a department, people and money for it; procedure and preparation see below
- preparing for certification: testing DRAMBORA, DSA, TRAC, ISO 16363 and ISO 16919
- step: 2009 DRAMBORA audit; 2 risk reviews per year on how their resolution is progressing
- step: formalization of business processes, 14 processes per ISO 9001
- management, operational and support processes (presented at iPRES 2010)
- 2009 external pre-audit: 2 people, 19 man-days, based on all available standards, by means of documentation review and interviews
- 2010 SIAF audit: 4 months; carried out by the National Archives of France for every archive that stores public data; they audit every 3 years; the report had 800 pages
- 2010 Data Seal of Approval: part of the EU framework for audit and certification of trusted repositories (MoU between the three certification activities)
- 2011 within the APARSEN project they also did ISO 16363; together with DANS and UKDA they went through that audit
- first internal, January and April 2011 (60 man-days), then external: 12 experts (KB, BL, NASA etc.) in June 2011 (3 days)
MoU: DSA > ISO 16363 as the second step (internal) > ISO 16363 extended
the audit gets faster the more of them you do, i.e. if it is done regularly it is not so time-consuming
The National Library of New Zealand passed TRAC certification in autumn 2011
Andreas Rauber: impact of preservation actions on the repository
- what happens to the repository itself
- repository simulation: RepoSim
- for analysis and for testing migration: what happens when files grow, what if the repository holds more format types, etc.
- RepoSim: a flexible simulator, irregular patterns
- so far an internal version: Hibernate, Java, MySQL
- one can specify which formats it accepts, their description, ingest, settings of hypothetical tools (mainly for migration), and rules for preservation actions (migration to which format, which version, which files, how many times; rules + filters)
- a virtual migration can be run, producing graphs that show how long it will take, etc.
- what to migrate, how, to what and for how long: it runs virtually and we see the result
- good for IT and HW planning
- good for planning various scenarios, comparison with the expected development, planning HW growth and investments
- still to be added: the option to enter deletion policies, reports, etc.
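The idea of a virtual migration run can be illustrated with a toy model. This is only a sketch and not RepoSim itself (the notes describe RepoSim solely as an internal Hibernate/Java/MySQL tool). The names `MigrationRule` and `simulate`, the uniform growth rate and the per-gigabyte tool speed are all hypothetical assumptions made for illustration.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass
class MigrationRule:
    """Hypothetical rule: migrate one format to another at an assumed tool speed."""
    source_format: str
    target_format: str
    seconds_per_gb: float  # assumed throughput of a hypothetical migration tool


def simulate(holdings_gb: dict[str, float], yearly_growth: float,
             rules: list[MigrationRule], years: int) -> list[dict]:
    """For each future year, estimate how long a full migration would take."""
    current = dict(holdings_gb)
    report = []
    for year in range(1, years + 1):
        # Assumption: every format grows by the same yearly rate.
        current = {fmt: size * (1 + yearly_growth) for fmt, size in current.items()}
        # Only formats covered by a rule contribute to the migration time.
        seconds = sum(current.get(r.source_format, 0.0) * r.seconds_per_gb for r in rules)
        report.append({"year": year, "migration_hours": seconds / 3600})
    return report


if __name__ == "__main__":
    rules = [MigrationRule("tiff", "jp2", seconds_per_gb=36.0)]
    for row in simulate({"tiff": 1000.0, "pdf": 200.0}, 0.25, rules, years=3):
        print(row)
```

Even a model this small shows why such a simulator is useful for HW planning: the estimated migration time grows with the holdings, so postponing a migration has a measurable cost.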
José Barateiro: Risk assessment in DP of e-science data and processes
- DP as risk management
- ISO 31000: definition of risk management
- similar to DRAMBORA
- there are many standards for risk management
- elaboration of the ISO 31000 methodology into individual steps
- TIMBUS project, http://timbusproject.net; one of the partners is SAP (Germany)
mad talks
- open-source SW for LTP: RODA is back, being developed within the SCAPE project; new functions, plans for further development and for building a user community
- 4 posters on emulation: emulation within KEEP, emulation for library reading rooms, OPF ecosystem, registry
- TOTEM: a metadata standard for describing the technical environment for emulation
ANDS, Ross Wilkinson
- data centres in Australia, at least 3 for different areas of life
- ANDS has existed for almost 3 years; funding from the Australian government
- a huge amount of data will never be used or read by a human, only by automated processes (data mining)
- research data must be stored and preserved because it may no longer be possible to recreate them, so that they can be reused, new conclusions can be drawn from them, and scientists have them available
- this must be done in cooperation; it cannot be done by one institution alone
- who handles the storage of scientific data in the Czech Republic: the Academy of Sciences, CESNET
- similar data centres also exist in Great Britain
Rob Sharpe: Considerations for High Throughput Digital Preservation
A presentation by Tessella on their performance testing of ingest into SDB at FamilySearch
- SDB has been developed since 2002, when the first customer was the National Archives (UK)
- new customer: UK Parliament
- test with FamilySearch
- 20TB ingested per day, scanned materials; workflow with antivirus, characterization (PRONOM, JHOVE) etc.
- 1 package is roughly 1GB; 20,000 packages per day
- 2 Dell PowerEdge R710 servers, total price at most 20,000 pounds
- the limiting factor turned out to be the read speed of the disks on which the data are stored at the start of ingest; they therefore needed 130 parallel disks (50,000 pounds)
- storing to tape is also slow; they therefore needed 8 parallel tape writes (30,000 pounds)
- storage costs 100 pounds per TB
- 7.3 PB per year
- conclusion: writing and reading are slow; tools such as JHOVE and PRONOM are fast enough; storage costs also proved high
For ingesting data from the FamilySearch project they needed to ensure a throughput of 20TB of data per day while keeping adequate procedures for processing the data according to the requirements of OAIS and of the client. The project aimed to identify the bottlenecks in ingesting large volumes of data.
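The figures quoted in these notes can be cross-checked with a little arithmetic. The sketch below assumes decimal units (1 TB = 10^12 bytes), which the notes do not state, and uses only numbers taken from the notes: 20 TB per day, packages of roughly 1 GB, and 100 pounds per TB of storage. The variable and function names are illustrative.

```python
SECONDS_PER_DAY = 24 * 60 * 60
TB = 10**12  # assumption: decimal terabyte; the notes do not say which unit was meant
GB = 10**9
MB = 10**6


def sustained_rate_mb_s(bytes_per_day: int) -> float:
    """Average transfer rate needed to move a daily volume in 24 hours."""
    return bytes_per_day / SECONDS_PER_DAY / MB


daily_bytes = 20 * TB                          # 20 TB ingested per day
packages_per_day = daily_bytes // (1 * GB)     # packages of roughly 1 GB each
average_rate = sustained_rate_mb_s(daily_bytes)
yearly_pb = 20 * 365 / 1000                    # TB per year converted to PB
yearly_storage_cost_gbp = 20 * 365 * 100       # at 100 pounds per TB

if __name__ == "__main__":
    print(f"packages per day:    {packages_per_day}")
    print(f"average rate:        {average_rate:.1f} MB/s")
    print(f"volume per year:     {yearly_pb} PB")
    print(f"storage cost / year: {yearly_storage_cost_gbp} GBP")
```

The results agree with the notes (20,000 packages per day, 7.3 PB per year). The average rate is only a lower bound on what the disks must deliver: every byte is read more than once during ingest (fixity check, characterization, copy to storage), so the required read throughput is a multiple of it.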
Processes such as generating or checking hashes, identifying formats and extracting technical metadata usually require a fast storage system when volumes are large. In the FamilySearch project they want to ingest into SDB (content acquisition, content preparation, ingest, fixity check, content/metadata integrity check, characterization, i.e. identification and validation of formats and extraction of technical MD) at up to 700MB per second.
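A fixity check of the kind mentioned here can be sketched in a few lines. This is a generic illustration using Python's standard `hashlib`, not the implementation used in SDB; the function names `file_digest` and `verify_fixity`, the choice of SHA-256 and the 1 MiB chunk size are assumptions.

```python
import hashlib


def file_digest(path: str, algorithm: str = "sha256", chunk_size: int = 1024 * 1024) -> str:
    """Hash a file in fixed-size chunks so large files never sit in memory at once."""
    digest = hashlib.new(algorithm)
    with open(path, "rb") as handle:
        # iter() with a sentinel stops at the first empty read, i.e. end of file.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_fixity(path: str, expected_hex: str, algorithm: str = "sha256") -> bool:
    """Return True if the file's current digest matches the recorded one."""
    return file_digest(path, algorithm) == expected_hex.lower()
```

The hashing itself is cheap compared with reading the file from disk, which matches the observation in these notes that storage speed, not the tools, tends to limit ingest.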
In other words, how to parallelize such a massive workflow efficiently while minimizing costs. According to their findings, parallelization makes it possible to get around the performance problems of tools such as DROID and JHOVE; overall, software performance was not a problem, contrary to their expectations. The bigger problems lie in the HW being able to write fast enough.
That is, the bottleneck was in the HW and in moving data from place to place rather than in the performance of digital preservation tools.
Benefit for NK
No need to fear the performance of SW such as DROID or JHOVE
Ross King: Evolving domains, problems and solutions for LT DP
- information on projects such as SCAPE etc.
- programme: http://cordis.europa.eu/fp7/ict/telearn-digicult/report-research-digital-preservation_en.pdf (Stephan Strodl, Petar Petrov and Andreas Rauber, Vienna University of Technology, Austria); a nice timeline of preservation projects, a whitepaper about the past of European DP
- funding spent on DP research is gradually growing; projects and funding alone will not solve anything
- ARCOMEM: archiving of web archives, socially driven web preservation model
- social web analysis
- archive enrichment
- ENSURE: evaluation between cost and value, automation of the preservation cycle; testbeds: healthcare, clinical trials, financial services
- SCAPE
- preservation planning and action workflows: how to make them scalable
- creating an infrastructure for scalable preservation actions
- development of a policy-based preservation planning tool with automatic preservation watch
- 3 testbeds: WA, large-scale repositories, research data sets
- all projects will produce prototype SW
- digital lifecycle approach
- preservation planning plays a role in all these projects, together with virtualization
- slide with trends in DP over recent years
- Research on Digital Preservation within projects co-funded by the European Union in the ICT
- ENSURE
- SCAPE
- Wf4Ever, http://www.wf4ever-project.org/about
- TIMBUS: SW is not enough; it focuses on the context of organizations; LTP is not only about objects but also about services etc.
- TOTEM
Benefit for NK
Follow projects in the field of long-term preservation of digital data. Recent EU projects such as SCAPE will accelerate the development of concrete tools for long-term preservation of digital data.
Record keeping in temporary command settings, Erik Borglund
- preservation of documentation on crisis situations arising from the activities of the police etc.
- how to capture context: flipcharts, videos and notes can be kept, but what about context; analogue documents are not a problem, the problem is with digital items and conversations
- the national archive should take care of it, but it only accepts paper documents or e.g. photos from the meeting venue
- question: is archiving the file material the same as archiving the course of the proceedings in digital form
webarchiving session
BnF
- 200TB of web-archived data
- 15 million ARC files; they must characterize and validate them, which is time-consuming
- they store them in a shared repository
- SPAR (the LTP system of the French national library) has a capacity of 16PB
- they use JHOVE2 for characterization and are building a module for ARC files
- they do not want to characterize and validate the content of the ARC files, only identify formats
- PREMIS in METS would be too long, so they will record only metadata at the level of the information package (AIP) that are the same for the whole package; that is, a property is expressed once and then only the information on which files match it is added, instead of repeating the information for every file
- they created a special metadata format
- i.e. they can ask the LTP system "give me all information packages that contain format XY" etc.; there is no need to index the metadata of the contents, which would take a long time; they take the same approach for digitized books
- different DP policies and validation levels for different types of WA data: complete harvests vs thematic harvests
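The space-saving idea described for BnF, stating a property once and listing the files it applies to, can be sketched as a simple grouping step. This is an illustration only and does not reproduce BnF's special metadata format; the function names and the sample identifiers are hypothetical.

```python
from collections import defaultdict


def group_files_by_value(file_values: dict) -> dict:
    """Turn {file: value} into {value: [files]} so each value is stated only once."""
    groups = defaultdict(list)
    for file_id, value in file_values.items():
        groups[value].append(file_id)
    return {value: sorted(ids) for value, ids in groups.items()}


def packages_containing(format_index: dict, wanted_format: str) -> list:
    """Answer 'which packages contain format XY' from a package-level index."""
    return sorted(pkg for pkg, formats in format_index.items() if wanted_format in formats)


if __name__ == "__main__":
    per_file = {"f1": "text/html", "f2": "image/gif", "f3": "text/html"}
    print(group_files_by_value(per_file))

    index = {"aip1": {"text/html"}, "aip2": {"image/gif", "text/html"}}
    print(packages_containing(index, "image/gif"))
```

With millions of files per package sharing a handful of formats, the grouped form stays short, and the package-level index is enough to answer format queries without indexing the metadata of every contained file.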
NL NZ
- 2 harvests, 20 TB in total
- they are dealing with metadata: how much metadata is a lot and how much is too little
- library policy says that as much metadata as possible must be stored, but that would be a problem in terms of metadata size
- for selective web harvests they have a finished workflow: WCT; everything is catalogued
IA
- 16 billion URLs
- oldest from 1996
- 3TB per day; growth is 1PB per year
Euan Cochrane, Dirk von Suchodoletz: Replicating Installed Application and Information Environments onto Emulated or Virtualized Hardware
- capturing and preserving a complete environment on emulated HW
- e.g. take the prime minister's desktop environment and store it in an archive
- display problems
- computer forensics
- an option for preserving scientific data and records
- it is all about how to replicate an HDD and run the environment on it in a virtual environment
- solution
- they pulled the HDDs out of several old PCs > identify HW requirements (HDD analysis > automatic estimate of requirements; it is part of every PC environment) > choose emulation/virtualization SW (tool registry such as TOTEM from the KEEP project) > convert the HDD to a disk image suitable for emulation > try to boot the disk image on the emulated HW > add drivers
- problems with licences, personal data protection, authenticity (20% of things change, colours etc.)
- QEMU, SPARC processor emulator
Klaus Rechert: Remote Emulation for Migration Services in a Distributed Preservation Framework
- using emulation as a tool for migration
- often no tools are available for migrating certain formats
- we insert the digital object into the emulated environment (virtual machine), then see it within the emulated system, can open it in the original or another suitable application, save it in a different format, and store it back in the virtual machine
Bram Lohman: Emulation as a Business Solution: the Emulation Framework (KEEP project)
- emulation framework: 7 emulators, 6 platforms (x86, Amiga etc.), 23 file formats
- a solution for managing emulation tools
- setup of emulation processes
- an environment that contains emulators; if we load an application or file into it, it should run as in the original environment
- the environment also contains a tool that, based on PRONOM, shows what format a file is and what environment is needed to run it; that environment can be prepared right away and the file run in it; it loads a SW image from the OPF application database, which is being built
Geoffrey Brown: Developing Virtual CD-ROM Collections: The Voyager Company Publications
- publications of one particular publisher on CD, interactive applications for Mac; of 200 published, only about 50 are available today
- emulation on today's systems
- HDD snapshot directly in the emulator, i.e. it takes one click and is very fast
- SheepShaver emulator
Evaluation of the Danish large migration project
Before 1998 they had no formats set by law
Between 2005 and 2008 they introduced standards
The evaluation concerns the established standards and the migration to them at the national archive
The evaluation was done for the funder
Between 2005 and 2008 they spent 30 person-years on the migration; they had 10-15 people on it; 190 thousand USD was invested; total costs 26 million USD
It is not much data; what they actually migrated was about 1777GB
Various parts of the archive: tapes, population data, data on CD-R, registries and electronically populated data
They could not read all the files, especially on tapes
5 different types of tapes
Some had to be rescued at great expense
Total cost of producing the preservation standard: 10 man-years, 12 thousand USD, including the manual and implementation recommendations
Pilot: planning and management of the project, and verifying the information packages
The aim of the pilot was to obtain a better budget and project plan
Some problematic data in old formats, such as old databases, need smart people doing repetitive, long-running work; they needed good knowledge management to make it efficient
Migration method: they wrote requirements for a tool and a description of how manual migration should be done
Data preparation (restructure the data and register the metadata of the IPs) and preparation of documentation for the migrated IPs
Software development: in-house development
They needed 50 person-years for 1
Conclusions
migration of standard data is cheaper; migration from some standard tapes is cheaper, etc.
Most of the costs (80%) fell on non-standardized data, in producing the migration software. Developing a tool for migrating heterogeneous or non-standard data is the most expensive part
What they learned: the old data had not been analysed sufficiently
Project management was loose; they lost money
Knowledge management: a good description of old data and of all their types, generations, locations etc. does not exist at our institution and we will have difficulties with it; migration of old data at NK will be problematic
Angela Dappert: robust migration workflow for offline media
- What is an archival object (nice slide): a CD is not an archival object; for them it is a hand-held carrier.
- Better is a bit-stable object, which can have a backup etc., leading to an archival object that carries additional metadata: logical preservation.
- A CD is not searchable, cannot be easily replicated, and has a large manual overhead; rendering technology becomes obsolete very quickly.
- The Endangered Archives project: optical disks, CD-R, external HDs, tapes; 67 terabytes in total.
- The offline hand-held carriers in the Endangered Archives project were highly variable; they contained data with DRM, data under copyright, and a number of similar problems.
- Options they chose between for each data source:
- a disk image: a single file containing everything on the carrier
- or extraction of only some files
- How important is the carrier itself? We need some information about it; it may hold traces of deleted data and we may want to keep them. They made disk images of everything possible: hybrid DVDs, audio discs that also contained data, etc.
- Which disk image format should they use? Not a single disk image format for all data; a special disk image format for each.
- They did it with robots (disk copying robots); sometimes large-scale disk copying robots could not be used, because they are good at producing CDs but not at ripping data from CDs.
- They built their own application with disk stacks and some smaller robots; these work as LIFO or FIFO; in the end they used FIFO, because LIFO had problems with the CD lifter.
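The difference between the two stack policies mentioned above can be illustrated with a minimal sketch. The function name and disc labels are hypothetical and are not taken from the presentation; the sketch only shows the order in which discs would be picked under each policy.

```python
from collections import deque


def processing_order(loaded_discs, policy):
    """Return the order in which a (hypothetical) robot picks discs.

    FIFO takes the disc that was loaded first; LIFO takes the disc
    that was loaded last (the top of the pile).
    """
    if policy not in ("FIFO", "LIFO"):
        raise ValueError("policy must be 'FIFO' or 'LIFO'")
    pile = deque(loaded_discs)
    order = []
    while pile:
        order.append(pile.popleft() if policy == "FIFO" else pile.pop())
    return order


# Hypothetical disc labels, in the order they were loaded.
discs = ["disc-001", "disc-002", "disc-003"]
fifo_order = processing_order(discs, "FIFO")
lifo_order = processing_order(discs, "LIFO")
print(fifo_order)
print(lifo_order)
```

With FIFO the output pile keeps the loading order, which makes it easier to match physical discs to the images produced from them.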
At the KB they thought through a fairly complex workflow, how to describe it, etc.
Each robot had its own PC.
They had problems with a number of things; see the presentation.
They did not find good software for managing images, only command-line tools, with which non-technical staff would make a lot of mistakes.
It takes a lot of people before the content gets online.
Staff must be well trained and flexible, but also able to do tedious jobs: systematic and patient.
NOTE: important for NK, where the transfer of data from disks will also have to be addressed, and has already been addressed.
KEEP project, Antonio Ciuffreda: towards an integrated migration environment
- Disk transfer tool: converts a disk into an image file. It also contains additional metadata about the file system, the MD5 of the files, etc.
- KEEP is producing a Transfer Tool Framework.
- Magnetic media: disk transfer tools for floppy disks, commercial and open source.
- Disk2FDI: commercial DOS tool, produces a very precise floppy disk image; it takes 1 hour and the result is very large, ten times larger than the floppy disk itself. About 2260 disks were tested; emulation was tested.
- Catweasel: commercial tool, a PCI card, runs on Linux and Win XP, has a GUI. High error rate, but faster; the image file quality was low.
- Nibtools: free tool, G64 and D64, covers only the C64; DOS, Win, Linux, but command line only. It needs a Commodore disk drive and special cables. They tested a few disks; about half of them then did not work in the emulator.
- Optical media: they used 5 transfer tools, with the same CDs, DVDs and games for all of them:
1. Alcohol 120%: commercial, can bypass DRM etc., supports Windows systems. 12 of 13 worked; 2 MB per second.
2. Daemon Tools: commercial, several image file types (ISO, MDS, MDF), supports Windows; three of 13 did not work.
3. CloneCD: commercial, uses IMG or ISO, bypasses SafeDisc 3 protection, supports Windows, has a GUI; 11 of 13 were OK.
4. BlindWrite: commercial, supports DVD, Blu-ray, WM, Xbox and other special discs. It generates ISO and some proprietary ISO image formats; one did not work; download speed
5. ImgBurn: does not read subchannel information into the image file (a film cannot be seeked, etc.); it is open source, generates DVD, BIN, CUE, IMG; Win and Linux; 4 did not work; it is fast.
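The first item above describes a transfer tool that stores an image file together with metadata such as MD5 checksums. The following is a minimal sketch of that idea, not the KEEP tool itself: the function name, the record fields and the demo file name are assumptions made for illustration.

```python
import hashlib
import json
import os
import tempfile


def describe_image(path, chunk_size=1024 * 1024):
    """Compute size and MD5 of an image file, reading it in chunks.

    The returned record layout is hypothetical; a real transfer tool
    would also record file-system metadata.
    """
    md5 = hashlib.md5()
    size = 0
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            md5.update(chunk)
            size += len(chunk)
    return {
        "file": os.path.basename(path),
        "size_bytes": size,
        "md5": md5.hexdigest(),
    }


# Demo on a small stand-in "image" (1 KiB of zero bytes).
with tempfile.TemporaryDirectory() as tmp:
    demo_path = os.path.join(tmp, "floppy.img")
    with open(demo_path, "wb") as fh:
        fh.write(b"\x00" * 1024)
    record = describe_image(demo_path)

print(json.dumps(record, indent=2))
```

Reading in chunks keeps memory use constant, which matters for DVD-sized images; the stored checksum can later be recomputed to confirm that the image has not changed.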
Conclusions
For magnetic media, commercial and non-commercial tools do not differ in performance; Disk2FDI is precise but very slow. KEEP will use NibTools.
Optical media: copy protection has to be taken into account; the tools perform similarly; there will always be errors in the images, between 30 and 10 percent; BlindWrite can handle game discs (Xbox etc.). KEEP will use ImgBurn because it is open source.
For complex images BlindWrite is better.
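The failure counts reported above for the five optical-media tools can be turned into failure rates with a short calculation. This is a sketch under one assumption: that BlindWrite was also tested on the same 13 discs (the notes only say that one disc did not work).

```python
# Failed discs per tool, taken from the notes above.
# Assumption: every tool, including BlindWrite, was tested on 13 discs.
TOTAL_DISCS = 13
failed = {
    "Alcohol 120%": 1,   # 12 of 13 worked
    "Daemon Tools": 3,   # three of 13 did not work
    "CloneCD": 2,        # 11 of 13 were OK
    "BlindWrite": 1,     # one did not work
    "ImgBurn": 4,        # 4 did not work
}

failure_rate = {tool: count / TOTAL_DISCS for tool, count in failed.items()}

for tool, rate in sorted(failure_rate.items(), key=lambda item: item[1]):
    print(f"{tool:<14} {rate:6.1%}")
```

Under that assumption the rates range from roughly 8 to roughly 31 percent, which is consistent with the "between 30 and 10 percent" summary in the conclusions.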
Benefit for NK
Consider whether NK should indeed run a project to migrate the content of CDs and DVDs to online media. The presentations give concrete experience with robotic processing and show the problems the teams had with designing the workflow, choosing the type of ISO image, etc.
BnF web archiving
They have three layers:
Harvest definition: collection
Harvest instance: crawling metadata
ARC files
A collection would be, for example, the selective crawl "elections 2000", followed by the individual harvest instances and then the ARCs. They collect logs, config and reports. These are also stored in ARC: special metadata ARCs for each crawling instance.
PREMIS
Object, agent, event
Objects: 1. ARC files and metadata ARCs; 2. harvest instances
Harvest event in PREMIS: event = creation of content files
Events: reports as extensions of the event (host report and harvest report)
Agents: administrator, SW, and the institutions or organizations that perform the harvest
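The mapping of a harvest onto PREMIS objects, events and agents described above can be sketched as a small data model. This is an illustration only, not the BnF implementation: class names, field names and all identifiers are hypothetical.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class PremisObject:
    identifier: str
    kind: str  # e.g. "ARC file", "metadata ARC", "harvest instance"


@dataclass
class Agent:
    identifier: str
    kind: str  # e.g. "administrator", "software", "organization"


@dataclass
class Event:
    identifier: str
    event_type: str
    objects: List[str] = field(default_factory=list)   # linked object ids
    agents: List[str] = field(default_factory=list)    # linked agent ids
    reports: List[str] = field(default_factory=list)   # event extensions


# Hypothetical harvest instance with one content ARC and one metadata ARC.
objects = [
    PremisObject("harvest-0001", "harvest instance"),
    PremisObject("harvest-0001-00001.arc", "ARC file"),
    PremisObject("harvest-0001-metadata.arc", "metadata ARC"),
]
agents = [
    Agent("crawler-software", "software"),
    Agent("web-archive-admin", "administrator"),
    Agent("harvesting-library", "organization"),
]
harvest_event = Event(
    identifier="event-0001",
    event_type="creation of content files",
    objects=[o.identifier for o in objects],
    agents=[a.identifier for a in agents],
    reports=["host report", "harvest report"],
)
print(harvest_event)
```

Keeping the reports as extensions of the event, rather than as separate objects, follows the description in the notes above.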
ContainerMD
http://bibnum.bnf.fr/containerMD-v1/documentation/containerMD-v1.html
special metadata for material from the web archive
http://bibnum.bnf.fr/containerMD-v1
Different SLAs for different types of material and for different data from different harvests; shared repository; different benefits are expected; skills for different formats do not need to be duplicated within the institution.
Next year there should also be a JHOVE2 module for WARC.
Memento: meta search
Benefit for NK
Their web archiving model could be used at NK.
Cost models: Danish national library + TU Wien
- Stephan Strodl, TU Wien: they have their own cost model, but only for small-scale automated preservation actions
- The Danish national library built its own model, which should be universal and usable anywhere
- They measured the cost of submission according to the PAIMAS standard
- When calculating costs they use OAIS and PAIMAS, map activities onto these models and then estimate the process costs accordingly
- costmodelfordigitalpreservation.dk
Benefit for NK
Relevant to project 0136, which dealt with options for estimating the costs of long-term storage.
Meet RODA, a Full-Fledged Digital Repository for Long-Term Preservation
- Originally a project of the Portuguese national archive; we have been following it for several years. The RODA system is now supported by an independent company, which also partly continues its development. So far RODA supports only the archival metadata format (EAD), but further development should also include library formats.
- RODA is now part of the SCAPE project, where it will be possible to develop the system further and scale it for use in mass production.
- http://redmine.keep.pt/projects/roda-public
Benefit for NK
Follow further development, possibly also for the INCAD + KNAV projects; for developing LTP for smaller institutions this could be an interesting alternative in the future.
Report from a foreign business trip
Name and surname of the participant: Mgr. Andrea Fojtů (AF)
Workplace by organizational structure: 1.5 Department of Long-Term Preservation of Digital Data (ODODD)
Workplace assignment: 1.5.2 Department of Digital Repository Content Management
Reason for the trip: Participation in the conference Aligning National Approaches to Digital Preservation
Place (city): Tallinn
Place (country): Estonia
Date (from-to): 22-26 May 2011
Detailed schedule:
Monday 23 May: conference registration, guided tour of the National Library; Keynote Address by Laura Campbell, Library of Congress, USA; Panel 1: Technical Alignment; Panel 2: Organizational Alignment
Tuesday 24 May: Keynote Address by Gunnar Sahlin, National Library of Sweden; Panel 3: Standards Alignment; Panel 4: Legal Alignment; Breakout Sessions for panels 3 & 4
Wednesday 25 May: Panel 5: Education Alignment; Panel 6: Economic Alignment; Breakout Sessions for panels 5 & 6; Synthesis; Closing remarks
Fellow travellers from NK: PhDr. Bohdana Stoklasová (BS), Ing. Tomáš Svoboda (TS)
Funding: IOP NDK
Trip objectives: Attendance at a conference with international participation; gaining contacts in the area of long-term preservation of digital documents and electronic legal deposit; a more detailed insight into the issues of long-term preservation of digital documents (especially) in national libraries
Fulfilment of trip objectives (specifically): The conclusions of the conference Aligning National Approaches to Digital Preservation largely mirror the conclusions of the workshop The Future of the Past: Shaping new visions for EU research in digital preservation (report available at http://cordis.europa.eu/fp7/ict/telearn-digicult/future-of-the-past_en.pdf), e.g. regarding the missing economic model for the commercial sector, which would treat long-term preservation as an integral part of all its processes. Contact was established with staff of the national libraries of Estonia and Finland (staff for web archiving and for long-term preservation in general).
Program and further details: The main goal of the conference was to align national approaches to long-term preservation of digital documents across all areas, from technical, organizational and educational to standardization, economic and financial.
Materials brought back: conference program, leaflets of exhibiting companies (Tessella, Equella, Guardtime) + further materials, notes
Date of report submission: 8 June 2011
Signature of the submitter:
Signature of the superior:
Posted on the intranet:
Received by the international department:
Attachment to this report: Conference notes in English
Attachment: Conference notes in English
Exploring What We Can Do Together: Strategic Alliance for International Collaboration, Laura Campbell
- 185 digital preservation partners in more than 25 countries (education, research, LAM)
- strategic goals: National Content Stewardship Network (national digital collection, technical architecture, public policy, outreach)
- NDIIPP Content Domain Map: a mind map of geospatial, audiovisual, image & text, and web content
- then & now: cognitive surplus vs digital libraries/digital preservation
- solution framework: actively working together, special interest groups, establishing a common index, international digital collection (freely available)
PANEL 1: Technical Alignment (The role of testing), Prof. Dr. Michael Seadle (Panel Head)
- to collaborate on requiring and implementing rigorous and independent tests
DNB Contribution to the Tallinn Alignment, Sabine Schrimpf
- key theme is infrastructure
- network of hardware and software that permits operation of application SW
- question of interoperability is crucial (standards, technical specifications)
- SW elements
- components of the DP infrastructure were compared to the pallets at railroads
  o source: PARSE.Insight Roadmap 2020
- persistent ID resolvers, certification process
- kopal (ingest: koLibRI)
- nestor (German Network of Expertise)
- DP4Lib: Digital Preservation as a Service; reduce dependency between components
  o redundant storage at different locations
  o koLibRI modules
- LuKII: set up as an economical LOCKSS network in Germany
- SHAMAN
- APARSEN: wants to bring coherence and cohesion to digital preservation research
  o trends in DP research projects
  o modular DP systems
  o distributed as SOA
  o elimination of technology dependencies
EDINA: The UK LOCKSS Alliance, Adam Rusbridge
- EDINA offers underlying technical support & coordination
- threats to digital stewardship
  o failure (media, HW, SW, network, format obsolescence, natural disaster, economic/organizational failure)
  o attack (insider/outsider)
  o operator error
- source: Requirements for Digital Preservation
- projects PEPRS and PECAN help identify coverage and requirements for DP
Public testing, Michael Seadle
- traditional physical archiving relies heavily on trusted institutions
- distrust, not trust, needs to be the basis of digital archiving; testing therefore plays a key role
- goals of testing: demonstrate functionality, reveal weaknesses, provide data for planning improvements
- key issues for testing: integrity, authenticity (can the origin or genuineness be shown), usability (can migration/emulation be demonstrated), access, financial integrity
- DrWho (drwho1com)
- bit stream testing is the most important; authenticity and usability may be impaired
  o the type of storage media, the number of copies, and the frequency of checking and replacement get us to the relevant results
  o no reliable metrics exist, however (what is an acceptable loss, etc.)
- without well documented, peer reviewed, publicly available test results, librarians are buying archiving systems on faith
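Bit stream testing across several copies, as discussed above, can be illustrated with a minimal fixity audit. This is a simplified sketch, not the LOCKSS polling protocol or any tool named in the talk: the function name, site names and sample data are hypothetical, and the majority vote is an assumption about how disagreement would be resolved.

```python
import hashlib
from collections import Counter


def audit_copies(copies):
    """Compare SHA-256 digests of all copies and flag the minority.

    `copies` maps a (hypothetical) storage site name to the bytes held
    there. Returns the majority digest, its vote count, and the sites
    whose copy disagrees with the majority.
    """
    digests = {site: hashlib.sha256(data).hexdigest()
               for site, data in copies.items()}
    majority_digest, votes = Counter(digests.values()).most_common(1)[0]
    damaged = sorted(site for site, digest in digests.items()
                     if digest != majority_digest)
    return majority_digest, votes, damaged


original = b"archival bit stream"
copies = {
    "site-a": original,
    "site-b": original,
    "site-c": b"archival bit strean",  # simulated single-character damage
}
majority_digest, votes, damaged = audit_copies(copies)
print(votes, damaged)
```

With only two copies a disagreement cannot be resolved by voting, which is one reason the number of copies and the checking frequency matter for the results.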
Presentation without a title, Andy Rauber
- evaluation vs testing vs benchmarking
- DP testing: evaluation rather than testing, far from benchmarking (few tests, but not near a definition of benchmarks)
  o existing evaluations are not repeatable
  o focus on the simple things
  o building the frameworks before having clear test scenarios
- necessary to move towards comparative benchmarking
- what is needed: a commitment to a culture of benchmarking and comparative evaluation; an understanding of what we want to benchmark; benchmark data + ground truth; measurement scales and measures that remain constant; knowledge bases to collect these
Organizing digital preservation on an international level, Michelle Gallinger (NDIIPP)
- focus on a national DP agenda
- community driven, action oriented (National Digital Stewardship Alliance)
  o present a distributed national digital collection for the benefit of citizens
The European Research Arena, David Giaretta
- technologies: GEANT, EGEE/EGI
- EU research projects: TIMBUS, BLOG4EVER, SCAPE, ENSURE, APARSEN, ARCOMEM, WF4EVER
  o SCIDIP-ES (2011-2014)
- Alliance for Permanent Access (APA) formed as a legal entity 3 years ago
  o opportunities for networking
- ISO 16363: Audit and Certification of Trustworthy Digital Repositories
- ISO 16919: Requirements for Bodies Providing Audit and Certification
Observations from the MetaArchive Cooperative program, Martin Halbert
- distributed DP programs differ from other programs
  o replication of content, distribution of these replicated copies to distinct geographical locations, and network organization to connect these replicated copies
- MetaArchive established in 2003, funded by NDIIPP
  o seeks to foster broader awareness of digital preservation issues
- IIPC: members are all institutions that focus on WA
  o 39 members: national and university libraries + other organizations (Internet Archive)
  o ISO standard WARC format for web archives + Heritrix and NutchWAX
  o growing membership (Africa & South America)
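For readers unfamiliar with the WARC format mentioned above, the following sketch parses the header of a single uncompressed WARC record using only the standard library. It assumes the basic record layout (a version line, "Name: value" header lines, a blank line, then a content block of Content-Length bytes); the sample record, its UUID and its target URI are made up, and header continuation lines and gzip compression are deliberately not handled.

```python
def parse_warc_record(record: bytes):
    """Split one uncompressed WARC record into version, fields, payload."""
    head, _, rest = record.partition(b"\r\n\r\n")
    lines = head.decode("utf-8").split("\r\n")
    version = lines[0]
    fields = {}
    for line in lines[1:]:
        # Split on the first colon only; values such as dates and
        # record IDs contain colons themselves.
        name, _, value = line.partition(":")
        fields[name.strip()] = value.strip()
    length = int(fields["Content-Length"])
    return version, fields, rest[:length]


# Made-up sample record for illustration.
sample = (
    b"WARC/1.0\r\n"
    b"WARC-Type: resource\r\n"
    b"WARC-Record-ID: <urn:uuid:12345678-1234-1234-1234-123456789abc>\r\n"
    b"WARC-Date: 2011-05-12T09:30:00Z\r\n"
    b"WARC-Target-URI: http://example.org/\r\n"
    b"Content-Type: text/plain\r\n"
    b"Content-Length: 5\r\n"
    b"\r\n"
    b"hello"
    b"\r\n\r\n"
)
version, fields, payload = parse_warc_record(sample)
print(version, fields["WARC-Type"], payload)
```

Production archives should be read with a dedicated WARC library or validated with a tool such as JHOVE2; this sketch only shows why the per-record headers make the format self-describing.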
DAY 2
Keynote address: International and National Collaboration in the Digital Age, Gunnar Sahlin
- 2012: a new law for e-deposit
- Samsök (search together) in 2005 (upgrade, new system)
- SwePub and long-term preservation
- Consortium of the Swedish research libraries for licensing e-journals and databases (ICOLC)
- Open Access and e-publishing (all universities have their repositories for e-pubs)
- NL aggregator for Europeana: TEL, Apres, Athena, EUscreen
  o common system for the preservation of digital materials
  o common search portal for materials from the Swedish National Library and the Swedish National Archive
Raivo Ruusalepp
- standards: RAC, DSA, CIDOC (CRM), PAIMAS, ISAD(G), DDI
- use of information security standards for digital preservation
- information security: administrative and technical (physical = data security vs IT = communication)
- company implemented security measurement with typical cyber crime scenarios
- survey of security
  o provision for information security in national legislation and development plans (12 of the respondents: ISO 27000 series; only 2 formal audits, the rest are looking into it; ½ of the respondents do not use standards or formal measurements to control information security)
  o IT & disaster plan: 65 (data recovery from the off-site location tested: 0)
- alignment
  o better use of community standards for information security and preservation
  o agreement on security requirements
Standards-based approach to preservation planning, Matthew Woollard
- ISO 27001: very expensive implementation, 100 000 pounds
- basic: Data Seal of Approval Guidelines (helps understand your business better)
- Audit and Certification of Trustworthy Digital Repositories
- Memorandum of Understanding to Create a European Framework
- ISO 16363 external; DSA or ISO 16363 self-audit
Best Practices & Standards, Bram van der Werf
- self-assessment trust; audits/certification trust
- ISO 30300 (draft): Records Management
PANEL 4: Legal Alignment
Legal deposit & Web Archiving, Adrienne Muir
- legal deposit provisions: purpose, scope, deposit mechanisms, roles & responsibilities & liability, access provisions, sanctions
- implementation: definitions, scope (offline/online, freely available/pay wall), technology neutral, incremental
- legal deposit vs voluntary (interim/hybrid approach, model agreements and licenses, flexibility)
- other legal issues: intellectual property rights (preservation, access), unlawful material, privacy/data protection
- voluntary approaches have disadvantages but may be necessary and can be useful
Breakout Session
- standards bring along alignment by themselves, but only if you use them to the full, not halfway
- depends on community (users); enforceable or voluntary compliance (standards)
- next steps for standards alignment
  o corpora as benchmarks
  o export/import completeness
  o educational standards
  o validation tools
  o accredited training courses to accredit auditors
  o framework standards
DAY 3
PANEL 5: Educational Alignment
- key elements of the DCC Curation Lifecycle Model
- Framework for the Education Alignment (USA: grad programs for digital preservation, new models for grad programs, internship programs (diverse knowledge), workshops)
  o sharing tools
  o national programs
- related issues to digital preservation
  o nature of costs and business models
  o strategies for selection & appraisal
  o ground roles and responsibilities
  o effectiveness and demand for services
- focus of the panel: factors influencing the actual sustainability of a digital archive
- 2 considerations: collaboration + user demand
- challenges + gaps: span national boundaries, public + private funding, education, exportation, DP certification, competition, funding gaps, policy, selection criteria, roles & responsibilities, standards
- Magazzini Digitali: e-legal deposit in Italy
- PADI: a failed project (discontinued in 2010)
Presentation without a title, Neil Grindley
- how much does it cost to manage information? what institutional financial strategies are required to facilitate effective preservation? what general economic frameworks are required to enable information to persist and be accessible?
- JISC 2010 Infrastructure for Education and Research Programme
- archival storage and preservation activities constitute a very small proportion of the overall costs: 1531 access, 55 outreach, acquisition, ingest
  o approx. 333 Euros for a set of 1000 records
- KRDS2 p. 83: future tool development supporting automation of ingest
Sustainable Preservation in North America: ADPNet & Friends, Aaron Trehub
- solution: distributed digital preservation (in at least 3 copies vs LOCKSS 6 copies)
- DDP + LOCKSS = PLN; open SW developed at Stanford
- MetaArchive, COPPUL (Canada), ADPNet (www.adpn.org)
Salt Lake City, USA
Date (from-to):
14.5. departure from Prague; 16.5. PREMIS workshop; 17-19.5. Archiving 2011 conference; 20.5. departure from SLC
15.5. departure Washington-Detroit-Salt Lake City; 15.5. workshops T1D Color in Image Capture Archiving and T2A Scanner & Camera Imaging Performance: Benchmarking, Compliance and Workflow Monitoring; 17-19.5. Archiving 2011 conference
NK; Archiving conference; IOP-NDK
0136
NA 6.6.2011
Date: 6.6.2011 Signature: Date: Signature:
Date: Signature:
PREMIS tutorial
Implementation of a High Performance Architecture for Managing and Storing Web-Harvested Collections; Michael Smorul and Joseph JaJa, University of Maryland (USA)
https://wiki.umiacs.umd.edu/adapt/images/6/6b/Archiving11-smorul.pdf
- web archiving
- WARC manager application
Using Tape for Large-Scale Digital Preservation; Gary Wright, FamilySearch (USA)
- (The Church of Jesus Christ of Latter-day Saints)
NDK: DRPS, Ingest Tools; storage type: storage grid, NetApp FAS3170
- information lifecycle management; optimization of the storage layer between the Grid and Rosetta
- human error
- HW errors
- data integrity validation
- maximization
- integrity verification
- metadata, content
Moving On: When it is Time to Re-Archive; Michael Selway, Quantum Corporation (USA)
- with HDD there is no other way
- data migration: how to approach the migration
- HDD: years
- split migration (full split migration)
FamilySearch: An End-to-End Process for Scanning, Characterizing, Preserving and Providing Access to Very Large Collections of Vital Records; Tom Creighton, FamilySearch (USA), Jonathan Tilbury, Tessella plc (UK), and Mark Evans, Tessella Inc. (USA)
- at FamilySearch: microfilms from the mid-19th century onwards
- DPS
The Audit and Certification of FDsys
- FDsys audit
- TRAC, http://www.gpo.gov/fdsys
- TRAC: HathiTrust, UNT, Portico, MetaArchive, Chronopolis
- compliance scale from 1
How Long is Long-Term Data Storage? (Focal); Barry M. Lunt, Brigham Young University, and Douglas Hansen, Wayne Rust, and Mark Worthington, Millenniata Inc. (USA)
- FLASH technology: electron drain
- and yet errors appeared
- Library 1: 21
- Library 2: 18
- 1050 years
- HDD: 1-7 years
Quality Assurance of Digital Information in Long-Term Digital Preservation; University of Technology (Sweden)
- significant properties in documents
Towards Interoperable Preservation Repositories: Repository Exchange Package Use Cases and Best Practices; Joseph Pawletko, New York University, and Priscilla Caplan, Florida Center for Library Automation (USA)
- TIPR
- RXP: METS and PREMIS semantics; the metadata files are not inside the METS but alongside it
- succession
- disaster recovery
- as AIP
- SW migration
- diversification
- data migration
SARKK: Comprehensive Digital Archive Services for Finnish Municipalities
- Savon Tietohallinto Oy (Finland)
- a company in ICT
- with preservation planning functionality
Magnetic Tape Technology: economic advantages for preservation; Gary Francis, Oracle (USA)
- Oracle presentation
- Clipper Group 2010, In Search of the Long-Term Archiving Solution: tape delivers significant TCO advantage over disk
- 5 TB on 1 cartridge (Sun/Oracle)
Color In Digital Preservation; Robert Buckley, University of Rochester/NewMarket Imaging, Steven Puglia, National Archives and Records Administration, and Michael Stelmach, Library of Congress (USA)
- color gamut
- AdobeRGB, ProPhoto RGB
- color reproduction
Multispectral Image Archiving of Watermarks in Historical Papers; Peter Meinlschmidt, Wilhelm-Klauditz-Institut, Fraunhofer-Institute for Wood Research, and Volker, Braunschweig (Germany)
- reflected light
- transmitted light
- multispectral imaging
Implementing a Quality Assurance Program for Monitoring Scanner Performance; Michael J. Horsley and John T. Berezich, National Archives and Records Administration (USA)
- microfilm, scan
- NARA 2004 Guidelines: https://docs.google.com/viewer?url=http%3A%2F%2Fwww.archives.gov%2Fpreservation%2Ftechnical%2Fguidelines.pdf
- Metamorfoze
- etc., see slides
- quantitative performance slides
- web-based database (SharePoint)
Preservation in a Digital Age; Jay Verkler, FamilySearch (USA)
- with Tessella for SDB
- data loss is intrinsic
- preservation as a service
Curation of the End-of-Term Web Archive: Classification and Metrics; Kathleen Murray, Lauren Ko, and Mark Phillips, University of North Texas (USA)
- eotcd archive: http://research.library.unt.edu/eotcd/wiki/Main_Page
- 16 TB of data
- (not WARCs)
DAITSS Grows Up: Migrating to a Second Generation Preservation System (Focal); Priscilla Caplan and Carol Chou, Florida Center for Library Automation (USA)
- DAITSS 1 could not be installed elsewhere
- instructions
- in processes
- PSP
- refresh function
- disseminate: do a refresh and export new AIP as DIP
- withdraw: remove AIP from storage, retaining provenance
- 80 TB
A Community Driven Micro Services Architecture Supporting Long Term Digital Preservation; Mark Evans and Bill Steel, Tessella Inc. (USA), and Robert Sharpe, James Carr, Alan Gairey, and Jonathan Tilbury, Tessella plc (UK)
- with NDIIPP, in micro-services
- combined into a process
- workflow
- bit-stream
- cloud (S3, http://aws.amazon.com/s3), SaaS functionality; multi-tenancy coming soon
- functionality, policy, etc.
Note from the USA
- 44 TB of SIPs in a test (10 MB JP2 files)
- SDB community: founded 2008
System; Lisa LaPlant and Blake Edwards, US Government Printing Office (USA)
- FDsys, OAIS, MODS
- <part>2</part>
- DMD: data model definition
- http://www.gpo.gov/fdsys/search/pagedetails.action?st=prague&granuleId=&packageId=DCPD-200900228
Preservation Starts from the Beginning; Michael Wash, US Department of Transportation (USA)
- Autographic Kodak, 1916
- "following the No. 3A Autographic Kodak Special of 1916, which was the first rangefinder camera. It had a Kodak Anastigmat f/6.3 lens and a Kodamatic shutter with speeds from 1/2 to 1/200 sec, plus bulb and time mode." (http://camerapedia.wikia.com/wiki/No_1A_Autographic_Kodak_Special)
- Kodak Advantix
Colorite: A Flexible Cross-Platform Software Solution for Automatic Image Quality Analysis Using Arbitrary Targets; Henrik Johansson, National Library of Sweden (Sweden)
  o the target is detected automatically
  o TIFF, JPEG, JP2, PNG
  o state-of-the-art feature-based image matching algorithm, ImageMagick
  o can be switched off in the future to speed up the process for BATCH processing
What if the Image Quality Analysis Rates My D; Dietmar Wueller, Image Engineering (Germany)
- Golden Thread
- UTT target
- lens reflex
Establishing Resolution Requirements for Digitizing Transmissive Content: A Use Case Approach; Michael Stelmach, Library of Congress, Don Williams, Image Science Associates LLC, and Steven Puglia, National Archives and Records Administration (USA)
- based on the 10% SFR limiting resolution criterion: how much of the image information will be captured
  o 1st half of the 20th century: 1200-1600 PPI
  o 2nd half of the 20th century: up to 2800 PPI
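To give a sense of what the PPI figures above mean in practice, the sketch below converts a film frame size and a scanning resolution into pixel dimensions. The 36 x 24 mm frame (a standard 35 mm still frame) is an assumption chosen for illustration; the talk itself does not name a frame size in these notes.

```python
MM_PER_INCH = 25.4


def pixel_dimensions(width_mm, height_mm, ppi):
    """Pixel width and height of a scan of the given size at `ppi`."""
    return (round(width_mm / MM_PER_INCH * ppi),
            round(height_mm / MM_PER_INCH * ppi))


# Assumed frame size: 36 x 24 mm (35 mm still film).
FRAME_MM = (36, 24)
scans = {ppi: pixel_dimensions(*FRAME_MM, ppi) for ppi in (1200, 1600, 2800)}

for ppi, (w, h) in scans.items():
    print(f"{ppi} PPI -> {w} x {h} px ({w * h / 1e6:.1f} MP)")
```

The pixel count grows with the square of the resolution, so moving from 1200 to 2800 PPI multiplies the uncompressed image size by more than five.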
Digitise More, Pay Less: Optimising the Workprocess for both Heritage Institute and Imaging Provider; Olaf Slijkhuis, Pictura Imaginis (the Netherlands)
- document, quality, parameters, etc.
- image control
Debate with FamilySearch, 20.5.2011
- also on the issue of Digital Preservation
- will have to be provided free of charge
- and synchronize
- an effort on their side to cooperate with archives and libraries
Business trip report
Project: Creation of the National Digital Library (Vytvoření Národní digitální knihovny)
CZ.1.06/1.1.00/07.06386
Name and surname of the participant: Jan Hutař
Workplace by organizational structure: ODF 8.1
Workplace assignment: head of department
Reason for the trip: attending the iPRES 2011 conference
Place (city): Singapore
Place (country): Singapore
Date (from-to): 30.10.-5.11.2011
Detailed schedule: 30-31.10. flight Prague > Dubai > Singapore
1.11. start of the conference, tutorials
2.11.-4.11. conference
4-5.11. return flight Singapore > Dubai > Prague
Fellow travellers from NK: Mgr. Marek Melichar (paid from project 0136)
Funding: IOP, Creation of the National Digital Library
Relation to the project: gaining new information on digital preservation and on projects in other libraries; consultations with colleagues and companies
Trip objectives: see relation to the project; use all outputs for planning and running the NDK project; use them for future solutions to digital preservation at NK/NDK
Fulfilment of trip objectives: fulfilled; see the detailed notes below and the proceedings on SPS
Further details: SUMMARY AND BENEFIT FOR THE NDK PROJECT
- a noticeable rise of emulation as an approach to long-term preservation (in previous years it was migration) > the two approaches will apparently complement each other
- a shift towards preservation of complex data (databases etc.); NK does not address this yet
- many contributions applicable to NK and NDK (web archiving and preservation at the national library of France, audits, emulation, information on the SDB system (Tessella) and on the RODA system, certification: see Rouchon, etc.)
- information on problems and solutions in using tools such as JHOVE, PRONOM, etc. in a real environment
- 2 contributions on backing up optical discs: a current problem at NK as well; ideally follow the procedures described in the proceedings
- a clear need for international cooperation and adherence to standards, so that such cooperation is possible
- for details see below
Project publicity support: NA
Related materials
Material: Storage location
conference proceedings: SPS, folder with business trip reports
Date of report submission: 15.11.2011
Signature of the submitter:
Date: Signature:
Signature of the superior: 15.11.2011
Posted on the intranet:
Received by the international department
Seamus Ross: digital curation and preservation
- preserving data sets: use of statistical databases and research data vs preservation of textual information
- personal data: no longer just photos in a box (Flickr etc.)
- banks: a large amount of personal data that historians will use in the future; it has to be kept, and institutions already do so
- see mckinsey.com, big data full report (pdf)
- so why do DP (slide 16): future generations expect it; historians and scientists need sources, a legacy about the present for the future; an information ecosystem to enable storytelling
- the emphasis moves from preserving textual information to preserving complex databases
A capability model for DP, Ch. Becker et al.
- research within the SHAMAN and SCAPE projects
- SoS: systems of systems
- 3 kinds of systems (UML)
- DPS where DP is a functional requirement
- SoS business system: a system within a system; the data then flows into the DPS
- DPS where DP is not a functional requirement but the system does it anyway (a DP-ready system): a business system with DP functionality
- but how to get DP into enterprise systems: a model for implementing DP in any system; within the SHAMAN project, a capability-based reference architecture
- governance, business and technical (operation) capability: a basis for decisions and for assessing the current state
- capability maturity model (CMM): processes, assessment and improvement, taken from software development
Olivier Rouchon: certification and quality at CINES
- they store theses, digitized material, multimedia, documents and scientific data sets
- a data centre for the whole of France
- they have format and XML specialists; 11 people
- 15TB of data
- certification: under national law CINES is the national centre for DP of theses; they have a department, staff and funding for it; procedure and preparations below
- preparation for certification: testing DRAMBORA, DSA, TRAC, ISO 16363 and ISO 16919
- step: 2009 DRAMBORA audit; risks are reviewed twice a year to check how their mitigation is progressing
- step: formalization of business processes, 14 processes according to ISO 9001
- management, operational and support processes (presented at iPRES 2010)
- 2009 external pre-audit: 2 people, 19 man-days, based on all available standards, carried out through documentation review and interviews
- 2010 SIAF audit: 4 months; carried out by the National Archives of France for every archive that stores public data; the audit is repeated every 3 years; the report had 800 pages
- 2010 Data Seal of Approval: part of the EU framework for audit and certification of trusted repositories (MoU between the three certification activities)
- 2011: within the APARSEN project they also did ISO 16363; they went through that audit together with DANS and UKDA
- first internal, January and April 2011 (60 man-days), then external with 12 experts (KB, BL, NASA etc.) in June 2011 (3 days)
MoU: DSA > ISO 16363 as the second step (internal) > ISO 16363 extended. An audit gets faster the more of them you do, i.e. if it is done regularly it is not so time-consuming
The National Library of New Zealand passed TRAC certification in autumn 2011
Andreas Rauber: the impact of preservation actions on repositories
- what happens to the repository itself
- repository simulation with RepoSim, for analysis and for testing migration: what happens when files grow, what happens when the repository holds more format types, etc.
- RepoSim: a flexible simulator that handles irregular patterns; so far an internal version (Hibernate, Java, MySQL)
- it is possible to specify which formats the repository accepts and their description, ingest settings, hypothetical tools (mainly for migration), and rules for preservation actions (migration to which format and version, which files, how many times; rules + filters)
- a virtual migration can be run; it produces graphs that show how long it will take etc.
- what to migrate, how, into what and over what period is all carried out virtually and the result can be inspected
- useful for IT and HW planning
- useful for planning different scenarios, comparing them with the expected development, and planning HW growth and investments
- still to be finished: the option to define deletion policies, reports, etc.
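The kind of question such a simulator answers can be illustrated with a toy model. The sketch below is not RepoSim and does not use its data model; the function name, the linear growth assumption and all figures are hypothetical. It only shows the idea of a "virtual migration": estimating how many months a migration needs to catch up with a collection that keeps growing.

```python
def simulate_migration(initial_tb, monthly_ingest_tb, migration_tb_per_month, max_months=600):
    """Return the number of months until the migration backlog is cleared.

    Hypothetical model: every month `monthly_ingest_tb` of new content arrives
    in the old format, and up to `migration_tb_per_month` is migrated.
    Returns None if the backlog is not cleared within `max_months`.
    """
    backlog = float(initial_tb)
    for month in range(1, max_months + 1):
        backlog += monthly_ingest_tb                      # new content in the old format
        backlog -= min(backlog, migration_tb_per_month)   # migrate what capacity allows
        if backlog == 0:
            return month
    return None


# Illustrative figures only: 100 TB to migrate, 5 TB/month growth, 15 TB/month capacity.
print(simulate_migration(100, 5, 15))  # -> 10
print(simulate_migration(100, 5, 5))   # -> None (capacity never outpaces growth)
```

Running several such scenarios side by side is the planning use the talk describes; a real simulator would additionally model formats, rules and filters.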
José Barateiro: Risk assessment in DP of e-science data and processes
- DP as risk management
- ISO 31000: a definition of risk management, similar to DRAMBORA
- there are many standards for risk management
- the ISO 31000 methodology broken down into individual steps
- TIMBUS project, http://timbusproject.net; one of the partners is SAP (Germany)
Mad talks
- the open source LTP software RODA is back; it is being developed within the SCAPE project, with new functions and plans for further development and for building a user community
- 4 posters on emulation: emulation within KEEP, emulation for library reading rooms, OPF ecosystem registry
- TOTEM: a metadata standard for describing the technical environment for emulation
ANDS, Ross Wilkinson
- data centres in Australia, at least 3, for different areas of life
- ANDS has existed for almost 3 years; funded by the Australian government
- a huge amount of data will never be used or read by a human, only by automated mining processes
- research data must be stored and preserved because it may no longer be possible to recreate it; it has to be kept so that it can be reused, so that new conclusions can be drawn from it, and so that scientists have access to it
- this has to be done in cooperation; one institution cannot do it on its own
- who deals with the storage of scientific data in the Czech Republic? The Academy of Sciences? CESNET?
- similar data centres also exist in Great Britain
Rob Sharpe: Considerations for High Throughput Digital Preservation
A presentation by Tessella on their performance testing of ingest into SDB at FamilySearch
- SDB has been developed since 2002, when the first customer was the National Archives UK
- new customer: the UK Parliament
- test with FamilySearch
- 20TB ingested per day; scanned materials; workflow with antivirus, characterization (PRONOM, JHOVE) etc.
- 1 package is roughly 1GB, 20 thousand packages per day
- 2 Dell PowerEdge R710 servers, together costing at most 20000 pounds
- the limiting factor turned out to be the read speed of the disks on which the data sits at the start of ingest; they therefore needed 130 parallel disks (50 thousand pounds)
- writing to tape is slow as well; they therefore needed 8 parallel tape writes (30 thousand pounds)
- storage costs 100 pounds per TB
- 7.3 PB per year
- conclusion: writing and reading are slow; tools such as JHOVE and PRONOM are fast enough; storage costs also turned out to be high
For ingesting the FamilySearch data they needed a throughput of 20TB per day while keeping adequate data processing procedures as required by OAIS and by the customer. The aim of the project was to identify the bottlenecks of ingesting large volumes of data.
Processes such as generating or checking hashes, identifying formats and extracting technical metadata usually require, at large volumes, a fast storage system. In the FamilySearch project they want to ingest into SDB (content acquisition, content preparation, ingest, fixity check, content metadata integrity check, characterization, i.e. format identification and validation and extraction of technical metadata) at up to 700MB per second.
The question is how to parallelize such a massive workflow efficiently while minimizing costs. According to their findings, parallelization makes it possible to get around the performance problems of tools such as DROID and JHOVE; contrary to their expectations, overall software performance was not a problem. The bigger problems are in the hardware, which has to be able to write fast enough.
In other words, the bottleneck was in the hardware and in moving data from place to place rather than in the performance of the digital preservation tools.
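The figures quoted in the talk can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes decimal units (1 TB = 10**6 MB) and ingest spread evenly over 24 hours, and the function name and dictionary keys are made up for this example. The yearly storage cost is a derived figure, not one stated in the presentation.

```python
SECONDS_PER_DAY = 24 * 60 * 60


def ingest_figures(tb_per_day=20, package_gb=1, price_per_tb_gbp=100):
    """Derive average rates from the daily ingest volume (decimal units)."""
    mb_per_day = tb_per_day * 10**6
    return {
        # average sustained rate needed to ingest the daily volume
        "avg_mb_per_s": mb_per_day / SECONDS_PER_DAY,
        # number of packages per day at the given package size
        "packages_per_day": tb_per_day * 1000 / package_gb,
        # yearly volume if the daily rate is kept up all year
        "pb_per_year": tb_per_day * 365 / 1000,
        # derived: storage cost of one year of ingest at the quoted price per TB
        "storage_gbp_per_year": tb_per_day * 365 * price_per_tb_gbp,
    }


figures = ingest_figures()
print(round(figures["avg_mb_per_s"], 1))   # -> 231.5
print(figures["packages_per_day"])         # -> 20000.0
print(figures["pb_per_year"])              # -> 7.3
print(figures["storage_gbp_per_year"])     # -> 730000
```

The average of roughly 230 MB/s is consistent with the stated 20 thousand packages per day and 7.3 PB per year; the 700MB per second mentioned above is a peak target, not an average.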
Benefit for NK
There is no need to worry about the performance of software such as DROID or JHOVE
Ross King: Evolving domains, problems and solutions for LT DP
- information on projects such as SCAPE
- programme: http://cordis.europa.eu/fp7/ict/telearn-digicult/report-research-digital-preservation_en.pdf; Stephan Strodl, Petar Petrov and Andreas Rauber (Vienna University of Technology, Austria); a nice timeline of preservation projects, a whitepaper about the past of European DP
- funding spent on DP research is gradually growing; projects and funding alone will not solve anything
- ARCOMEM: archiving of web archives, a socially driven web preservation model
- social web analysis
- archive enrichment
- ENSURE: evaluation between cost and value, automation of the preservation cycle; testbeds: healthcare, clinical trials, financial services
- SCAPE
- preservation planning and action workflows and how to make them scalable
- building an infrastructure for scalable preservation actions
- development of a policy-based preservation planning tool with automated preservation watch
- 3 testbeds: web archives, large-scale repositories, research data sets
- all projects will produce prototype software
- digital lifecycle approach
- preservation planning plays a role in all of these projects, together with virtualization
- a slide with trends in DP over recent years
- Research on Digital Preservation within projects co-funded by the European Union in the ICT
- ENSURE
- SCAPE
- Wf4Ever, http://www.wf4ever-project.org/about
- TIMBUS: software is not enough; the project focuses on the context of organizations; LTP is not only about objects but also about services etc.
- TOTEM
Benefit for NK
Follow projects in the area of long-term preservation of digital data. The latest EU projects such as SCAPE will speed up the development of concrete tools for the long-term preservation of digital data
Record keeping in temporary command settings, Erik Borglund
- preservation of documentation on crisis situations produced by the activities of the police and similar bodies
- how to capture the context: flipcharts, videos and minutes can be kept, but what about the context?
- analogue documents are not a problem; the problem lies with digital material and conversations
- the national archive should take care of this, but it accepts only paper documents or, for example, photos from the meeting place
- open question: is archiving file records the same as archiving the course of a meeting in digital form?
Web archiving session
BnF
- 200TB of web-archived data, 15 million ARC files
- they have to characterize and validate them, which is time-consuming
- they store them in a shared repository
- SPAR (the LTP system of the National Library of France) has a capacity of 16PB
- they use JHOVE2 for characterization and are building a module for ARC files
- for the content of ARC files they do not want to do characterization and validation, only format identification
- PREMIS in METS would be too long, so they will record only metadata at the level of the information package (AIP) that is the same for the whole package; that is, a property is expressed once and then only the information about which files it applies to is added, instead of repeating the information for every file
- they created a special metadata format
- they are thus able to ask the LTP system, for example, "give me all information packages that contain format XY"; there is no need to index the metadata of the contents, which would take a long time
- they use the same approach for digitized books
- different DP policies and validation levels for different types of web archive data: complete harvests vs thematic harvests
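The package-level approach described above can be sketched in a few lines. This is an illustration of the idea only, not the BnF metadata format: the function names, file names, package identifiers and the use of MIME types as format labels are all assumptions made for this example.

```python
from collections import defaultdict


def group_by_property(file_formats):
    """Express each distinct format once, followed by the files it applies to.

    `file_formats` maps a file name to its identified format, i.e. the
    per-file statement that would otherwise be repeated for every file.
    """
    grouped = defaultdict(list)
    for name, fmt in sorted(file_formats.items()):
        grouped[fmt].append(name)
    return dict(grouped)


def packages_containing(packages, fmt):
    """Return the ids of the packages whose package-level metadata lists `fmt`."""
    return sorted(pid for pid, formats in packages.items() if fmt in formats)


# Hypothetical example data: two information packages.
packages = {
    "aip-1": group_by_property(
        {"a.html": "text/html", "b.html": "text/html", "c.gif": "image/gif"}
    ),
    "aip-2": group_by_property({"d.pdf": "application/pdf"}),
}

print(packages["aip-1"])
# -> {'text/html': ['a.html', 'b.html'], 'image/gif': ['c.gif']}
print(packages_containing(packages, "image/gif"))  # -> ['aip-1']
```

The query works on the package-level summary alone, which is why the contents themselves do not need to be indexed.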
NL NZ
- 2 harvests, 20 TB in total
- they are dealing with metadata: how much metadata is a lot and how much is too little
- library policy says that as much metadata as possible must be stored, but that would be a problem in terms of metadata size
- for selective web harvesting they have a finished workflow in WCT; everything is catalogued
IA
- 16 billion URLs
- the oldest from 1996
- growth of 3TB per day, 1PB per year
Euan Cochrane, Dirk von Suchodoletz: Replicating Installed Application and Information Environments onto Emulated or Virtualized Hardware
- capturing and keeping a complete environment on emulated HW
- e.g. take the desktop environment of the prime minister and store it in an archive
- display problems
- computer forensics
- an option for preserving scientific data and records
- the whole thing is about how to replicate an HDD and run the environment it contains in a virtual environment
- solution
- they pulled the HDDs out of several old PCs > identify the HW requirements (HDD analysis > the requirements are estimated automatically; the information is part of every PC environment) > choose emulation/virtualization SW (a tool registry such as TOTEM from the KEEP project) > convert the HDD into a disk image suitable for emulation > try to boot the disk image on the emulated HW > add drivers
- problems with licences, personal data protection and authenticity (20 % of things change, colours etc.)
- QEMU, SPARC processor emulator
Klaus Rechert: Remote Emulation for Migration Services in a Distributed Preservation Framework
Use of emulation as a tool for migration.
Migration tools for certain formats are often not available. The digital object is placed into an emulated environment (a virtual machine); it is then visible inside the emulated system, where it can be opened in the original or another suitable application, saved in a different format and stored again in the virtual machine
Bram Lohman: Emulation as a Business Solution: the Emulation Framework (KEEP project)
- emulation framework: 7 emulators, 6 platforms (x86, Amiga etc.), 23 file formats
- a solution for managing emulation tools
- setup of emulation processes
- an environment that contains emulators; when an application or file is loaded into it, it should run as in its original environment
- the environment also contains a tool that, based on PRONOM, shows the format of a file and the environment needed to run it; the environment can be prepared straight away and the file opened in it
- it loads the SW image from an application database that OPF is building
Geoffrey Brown: Developing Virtual CD-ROM Collections: The Voyager Company Publications
- publications of one particular publisher on CD, interactive applications for the Mac; of the 200 published, only about 50 are available now
- emulation on today's systems
- an HDD snapshot directly in the emulator, i.e. it takes one click and is very fast
- SheepShaver emulator
Evaluation of a large Danish migration project
Before 1998 the formats were not set by law.
Between 2005 and 2008 they introduced standards.
The evaluation concerns the defined standards and the migration into them at the national archive.
The evaluation was done for the funder.
Between 2005 and 2008 they spent 30 person-years on the migration, with 10-15 people working on it; 190 thousand USD was invested, and the total cost was 26 million USD.
That is not much data; what they actually migrated was about 1777GB.
Different parts of the archive: tapes, population data, data on CD-R, registries, and data filled in electronically.
They could not read all the files, especially on tapes.
5 different types of tapes.
Some had to be rescued at great expense.
Total cost of producing the preservation standard: 10 person-years, 12 thousand USD, including the manual and implementation recommendations.
Pilot: project planning and management, and verification of the information packages.
The aim of the pilot was to obtain a better budget and project plan.
Some problematic data in old formats, such as old databases, need smart people who do repetitive, long-running work; they needed good knowledge management to make this efficient.
Migration method: they wrote the requirements for the tool and a description of how manual migration should be done.
Data preparation (restructuring the data and registering the metadata of the IPs) and preparation of the documentation of the migrated IPs.
Software development: in-house development.
They needed 50 person-years per 1
Conclusions
Migration of standard data is cheaper, migration from some standard tapes is cheaper, etc.
Most of the costs, 80 %, went on non-standardized data, in developing the migration software. Developing a tool for migrating heterogeneous or non-standard data is the most expensive part.
What they learned: the old data had not been analysed sufficiently.
Their project management was loose and they lost money.
Knowledge management: a good description of the old data and of all its types, generations, locations etc. does not exist at our institution and this will cause difficulties; the migration of old data at NK will be problematic
Angela Dappert: robust migration workflow for offline media
- What is an archival object (a nice slide): a CD is not an archival object; for them it is a hand-held carrier. A bit-stable object is better, since it can be backed up etc.; then comes the archival object, which carries further metadata (logical preservation)
- A CD is not searchable, cannot be replicated easily, has a large manual overhead, and its rendering technology becomes obsolete very quickly
- The Endangered Archives project: optical disks, CD-R, external HDs, tapes; 67 terabytes in total
- The offline hand-held carriers in the Endangered Archives project were very variable; they contained data with DRM, under copyright, and with a number of such problems
- The options they chose between for each data source:
- Disk image: one file that contains everything on the carrier
- Or extraction of only some files
- How important is the carrier itself? We need some information about it; it may hold traces of deleted data that we might want to have. They made disk images of everything possible: hybrid DVDs, audio discs that also held data, etc.
- Which disk image format should they use? Not a single disk image format for all data; each special case gets its own disk image format
- They did it with robots (disk copying robots); in some cases large-scale disk copying robots could not be used: they are good at producing CDs but not at ripping data from CDs
- They built their own application with disk stacks and some smaller robots; they tried LIFO and FIFO and ended up with FIFO, because LIFO had problems with the CD lifter
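The difference between the two stack policies mentioned above can be shown with a short sketch. It is purely illustrative: the function name and disc labels are hypothetical and the code has nothing to do with the actual robot software.

```python
from collections import deque


def processing_order(discs, policy):
    """Return the order in which a loaded stack of discs would be processed."""
    if policy not in ("FIFO", "LIFO"):
        raise ValueError("policy must be 'FIFO' or 'LIFO'")
    queue = deque(discs)
    order = []
    while queue:
        # FIFO takes the disc that was loaded first, LIFO the one loaded last.
        order.append(queue.popleft() if policy == "FIFO" else queue.pop())
    return order


print(processing_order(["disc-1", "disc-2", "disc-3"], "FIFO"))
# -> ['disc-1', 'disc-2', 'disc-3']
print(processing_order(["disc-1", "disc-2", "disc-3"], "LIFO"))
# -> ['disc-3', 'disc-2', 'disc-1']
```

With FIFO the processing order matches the loading order, which also makes it easier to pair each image with the description recorded when the disc was loaded.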
At KB they thought through a fairly complex workflow for how to describe it etc.
Each robot had its own PC.
They had problems with a number of things; see the presentation.
They did not find good software for managing the images, only command-line tools, and non-technical staff would make a lot of mistakes with those.
It takes a lot of people before the content gets online.
The staff must be well trained and flexible, but also able to do tedious jobs: systematic, patient.
NOTE: important for NK, where the transfer of data from discs will also have to be dealt with and has in fact already been dealt with
KEEP project, Antonio Ciuffreda: towards an integrated migration environment
- Disk transfer tool: converts a disk into an image file. The image also carries further metadata about the file system, the MD5 of the file, etc.
- KEEP is producing a Transfer Tool Framework
- Magnetic media: disk transfer tools for floppy disks, commercial and open source
- Disk2FDI: a commercial DOS tool; produces a very accurate floppy disk image but takes 1 hour, and the result is very large, ten times bigger than the floppy disk itself. They tested about 2260 disks and tested emulation
- Catweasel: a commercial tool; it is a PCI card, runs on Linux and Windows XP, has a GUI. High error rate but faster; the quality of the image file was low
- NibTools: a free tool, G64 and D64, covers only the C64; DOS, Windows, Linux, but command line only. It needs a Commodore disk drive and special cables. They tested a few disks; about half of them then did not work in the emulator
- Optical media: they used 5 transfer tools, with the same CDs, DVDs and games for all of them
1 Alcohol 120%: commercial, can bypass DRM etc., supports Windows systems. 12 out of 13 worked; 2MB per second
2 Daemon Tools: commercial, several types of image files (ISO, MDS, MDF), supports Windows; three out of 13 did not work
3 CloneCD: commercial, uses IMG or ISO, bypasses SafeDisc 3 protection, supports Windows, has a GUI; 11 out of 13 were OK
4 BlindWrite: commercial, supports DVD, Blu-ray, WM, Xbox and other special discs. It generates ISO and some proprietary ISO image formats; one did not work; download speed
5 ImgBurn: does not read subchannel information into the image file (it is not possible to seek within a film etc.); it is open source, generates DVD, BIN, CUE and IMG; Windows and Linux; 4 did not work; it is fast
Conclusions
For magnetic media, commercial and non-commercial tools do not differ in performance; Disk2FDI is accurate but very slow. KEEP will use NibTools.
Optical media: copy protection has to be taken into account; the tools perform similarly; there will always be errors in the images, between 30 and 10 percent; BlindWrite can handle game discs (Xbox etc.). KEEP will use ImgBurn because it is open source.
For complex images BlindWrite is better
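The kind of fixity metadata that the disk transfer tool is said to record alongside an image (an MD5 of the file, among other things) can be produced with standard library calls. The sketch below is an assumption about what such a record might look like, not the KEEP tool's actual output; the function name and the dictionary keys are hypothetical.

```python
import hashlib
import os


def describe_image(path, chunk_size=1024 * 1024):
    """Return basic fixity metadata (name, size, MD5) for a disk image file."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read in chunks so that large images do not have to fit in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return {
        "file": os.path.basename(path),
        "size_bytes": os.path.getsize(path),
        "md5": digest.hexdigest(),
    }
```

Recomputing the checksum after every copy and comparing it with the stored value is the usual way to confirm that an image has not changed in transfer.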
Benefit for NK
Consider whether NK should actually run a project to migrate the content of CDs and DVDs to online media. The talks presented concrete experience with robotic processing and showed the problems encountered in designing the workflow, choosing the type of ISO image, etc.
BnF web archiving
They have three layers:
Harvest definition: collection
Harvest instance: crawling metadata
ARC files
A collection will be, for example, selective: elections 2000; then the individual harvest instances, and then the ARC files.
They collect logs, config and reports.
These are also stored in ARC, as special ARC metadata for each crawling instance.
PREMIS
Object, agent, event
Objects: 1 ARC files and metadata ARCs, 2 harvest instances
Harvest event in PREMIS: event creation of content files
Events: reports as an extension of the event, host report and harvest report
Agents: administrator, software, institutions, organizations that perform the harvest
ContainerMD
httpbibnumbnffrcontainerMD v1documentationcontainerMD v1html
special metadata for material from the web archive
httpbibnumbnffrcontainerMD v1
different SLAs for different types of material and for different data from different harvests; shared repository: different benefits are expected; skills for different formats do not need to be duplicated within the institution
next year a JHOVE2 module for WARC should exist as well
Memento: a meta search engine
Benefit for NK
Their web archiving model could be used at NK
Cost models: Danish National Library + TU Wien
- Stephan Strodl, TU Wien: they have their own cost model, but apparently only for small-scale automated preservation action costs
- Danish National Library: they built their own model, which is meant to be universal and usable anywhere
- They measured the cost of submission according to the PAIMAS standard
- When calculating costs they use OAIS and PAIMAS; they map activities onto these models and then estimate process costs accordingly
- costmodelfordigitalpreservation.dk
Benefit for NK
Relevant to project 0136, which dealt with options for estimating the costs of long-term storage
Meet RODA, a Full-Fledged Digital Repository for Long-Term Preservation
- Originally a project of the Portuguese national archive, which we have been following for several years. RODA is now supported, and partly further developed, by an independent company. So far RODA supports only the archival metadata format (EAD), but further development should include library formats as well
- RODA is now part of the SCAPE project, where it will be possible to develop the system further and scale it for use in mass production
- http://redmine.keep.pt/projects/roda-public
Benefit for NK
Follow further development, possibly also for the INCAD + KNAV projects; for LTP development in smaller institutions this could be an interesting alternative in the future
Report from a business trip abroad
Name and surname of the participant: Mgr. Andrea Fojtů (AF)
Workplace according to the organizational structure: 1.5 Department of Long-Term Preservation of Digital Data (ODODD)
Workplace, position: 1.5.2 Department of Digital Repository Content Management
Reason for the trip: participation in the conference Aligning National Approaches to Digital Preservation
Place, city: Tallinn
Place, country: Estonia
Date (from-to): 22.-26.5.2011
Detailed schedule:
Monday 23.5.: conference registration; guided tour of the National Library; Keynote Address by Laura Campbell, Library of Congress, USA; Panel 1: Technical Alignment; Panel 2: Organizational Alignment
Tuesday 24.5.: Keynote Address by Gunnar Sahlin, National Library of Sweden; Panel 3: Standards Alignment; Panel 4: Legal Alignment; Breakout Sessions for panels 3 & 4
Wednesday 25.5.: Panel 5: Education Alignment; Panel 6: Economic Alignment; Breakout Sessions for panels 5 & 6; Synthesis; Closing remarks
Fellow travellers from NK: PhDr. Bohdana Stoklasová (BS), Ing. Tomáš Svoboda (TS)
Funding: IOP NDK
Trip objectives: attendance at a conference with international participation; making contacts in the area of long-term preservation of digital documents and electronic legal deposit; a more detailed insight into the long-term preservation of digital documents, (especially) in national libraries
Fulfilment of trip objectives (specifically): The conclusions of the conference Aligning National Approaches to Digital Preservation largely repeat the conclusions of the workshop The Future of the Past: Shaping new visions for EU research in digital preservation (report available at http://cordis.europa.eu/fp7/ict/telearn-digicult/future-of-the-past_en.pdf), for example regarding the missing economic model for the commercial sector, which would treat long-term preservation as an inseparable part of all its processes. Contact was made with staff of the national libraries of Estonia and Finland (staff working on web archiving and on long-term preservation in general)
Programme and further detailed information: The main goal of the conference was to align national approaches to the long-term preservation of digital documents across all areas, from technical, organizational and educational to standardization, economic and financial
Materials brought back: conference programme, leaflets of the exhibiting companies (Tessella, Equella, Guardtime) + further materials, notes
Date the report was submitted: 8.6.2011
Signature of the submitter
Signature of the supervisor
Posted on the intranet
Received by the international department
Annex to this report: conference notes in English
Annex: conference notes in English
Exploring What We Can Do Together: Strategic Alliance for International Collaboration (Laura Campbell)
- 185 digital preservation partners in more than 25 countries (education, research, LAM)
- strategic goals: National Content Stewardship Network (national digital collection, technical architecture, public policy, outreach)
- NDIIPP Content Domain Map: a mind map of geospatial, audiovisual, image & text, and web content
- then & now: cognitive surplus vs digital libraries/digital preservation
- solution framework: actively working together, special interest groups, establishing a common index, international digital collection (freely available)
PANEL 1: Technical Alignment (The role of testing), Prof. Dr. Michael Seadle (Panel Head)
- to collaborate on requiring and implementing rigorous and independent tests
DNB Contribution to the Tallinn Alignment (Sabine Schrimpf)
- key theme is infrastructure: a network of hardware and software that permits the operation of application software
- the question of interoperability is crucial (standards, technical specifications)
- SW elements
- components of the DP infrastructure were compared to the pallets at railroads
o Source: PARSE.Insight Roadmap 2020
- persistent ID resolvers, certification process
- kopal (ingest: koLibRI)
- nestor (German Network of Expertise)
- DP4Lib: Digital Preservation as a Service; reduce dependency between components
o redundant storage at different locations
o koLibRI modules
- LUKII: set up as an economical LOCKSS network in Germany
- SHAMAN
- APARSEN: wants to bring coherence and cohesion to digital preservation research
o trends in DP research projects
o modular DP systems
o distributed as SOA
o elimination of technology dependencies
EDINA: The UK LOCKSS Alliance (Adam Rusbridge)
- EDINA offers underlying technical support & coordination
- threats to digital stewardship
o failure (media, HW, SW, network, format obsolescence, natural disaster, economic/organizational failure)
o attack (insider/outsider)
o operator error
- source: Requirement to Digital Preservation
- the projects PEPRS and PECAN help identify coverage and requirements for DP
Public testing (Michael Seadle)
- traditional physical archiving relies heavily on trusted institutions
- distrust, not trust, needs to be the basis of digital archiving; testing therefore plays a key role
- goals of testing: demonstrate functionality, reveal weaknesses, provide data for planning improvements
- key issues for testing: integrity, authenticity (can the origin or genuineness be shown), usability (can migration/emulation be demonstrated), access, financial integrity
- DrWho (drwho1com)
- bit stream testing is the most important; authenticity and usability may be impaired
o the type of storage media, the number of copies + the frequency of checking and replacement get us to the relevant results
o no reliable metrics exist, however (what is an acceptable loss etc.)
- without well documented, peer reviewed, publicly available test results, librarians are buying archiving systems on faith
Presentation without a title (Andy Rauber)
- evaluation vs testing vs benchmarking
- DP testing is evaluation rather than testing, and far from benchmarking (a few tests, but nowhere near a definition of benchmarks)
o existing evaluations are not repeatable
o focus on the simple things
o building the frameworks before having clear test scenarios
- necessary to move towards comparative benchmarking
- what is needed: a commitment to a culture of benchmarking and comparative evaluation; an understanding of what we want to benchmark; benchmark data + ground truth; measurement scales and measures that remain constant; knowledge bases to collect these
Organizing digital preservation on an international level (Michelle Gallinger, NDIIPP)
- focus on a national DP agenda
- community driven, action oriented (National Digital Stewardship Alliance)
o present a distributed national digital collection for the benefit of citizens
The European Research Arena, David Giaretta
technologies: GEANT, EGEE/EGI
EU research projects: TIMBUS, BLOG4EVER, SCAPE, ENSURE, APARSEN, ARCOMEM, WF4EVER
o SCIDIP-ES (2011-2014)
Alliance for Permanent Access (APA) formed as a legal entity 3 years ago
o opportunities for networking
ISO 16363: Audit and Certification of Trustworthy Digital Repositories
ISO 16919: Requirements for Bodies Providing Audit and Certification
Observation from the MetaArchive Cooperative program, Martin Halbert
distributed DP programs are different from other programs
o replication of content, distribution of these replicated copies to distinct geographical locations, and network organization to connect these replicated copies
MetaArchive established in 2003, funded by NDIIPP
o seeks to foster broader awareness of digital preservation issues
IIPC members are all institutions that focus on WA
o 39 members: national and university libraries + other organizations (Internet Archive)
o ISO standard WARC format for web archives + Heritrix and NutchWAX
o growing membership (Africa & South America)
DAY 2
Keynote address: International and National Collaboration in the digital age, Gunnar Sahlin
2012: a new law for e-deposit
Samsök (search together) in 2005 (upgrade, new system)
Swepub and long-term preservation
Consortium of the Swedish research libraries for licensing e-journals and databases (ICOLC)
Open Access and e-publishing (all universities have their repositories for e-pubs)
NL: aggregator for Europeana, TEL, Apres, Athena, EUscreen
o common system for the preservation of digital materials
o common search portal for materials from the Swedish National Library and Swedish National Archive
Raivo Ruusalepp
standards: RAC, DSA, CIDOC (CRM), PAIMAS, ISA (DG), DDI
use of information security standards for digital preservation
information security: administrative and technical (physical = data security vs IT = communication)
company implemented security measurement with typical cyber crime scenarios
survey of security:
o provision for information security in national legislation and development plans (12 of the respondents: ISO 27000 series, only 2 formal audits, the rest are looking into it; ½ of the respondents do not use standards or formal measurements to control information security)
o IT & disaster plan: 65 (data recovery from the off-site location tested: 0)
alignment:
o better use of community standards for information security and preservation
o agreement on security requirements
Standards based approach to preservation planning, Matthew Woollard
ISO 27001: very expensive implementation, 100 000 pounds
Basic: Data Seal of Approval Guidelines (helps understand your business better)
Audit and Certification of Trustworthy Digital Repositories
Memorandum of Understanding to Create a European Framework
ISO 16363 external; DSA or ISO 16363 self audit
Best Practices & Standards, Bram van der Werf
self assessment: trust
audits/certification: trust
ISO 30300 (draft): Record Management
PANEL 4: Legal Alignment
Legal deposit & Web Archiving, Adrienne Muir
legal deposit provisions: purpose, scope, deposit mechanisms, roles & responsibilities & liability, access provisions, sanctions
implementation: definitions, scope, offline/online, freely available/pay wall, technology neutral, incremental
legal deposit vs voluntary (interim/hybrid approach, model agreements and licenses, flexibility)
other legal issues: intellectual property rights, preservation, access, unlawful material, privacy/data protection
voluntary approaches have disadvantages but may be necessary and can be useful
Breakout Session
standards bring along alignment by themselves, but only if you use them to the full, not halfway
depends on community (users): enforceable or voluntary compliance (standards)
next steps for standard alignment:
o corpora as benchmarks
o export/import completeness
o educational standards
o validation tools
o accredited training courses to accredit auditors
o framework standards
DAY 3
PANEL 5: Educational Alignment
key elements of the DCC Curation Lifecycle Model
Framework for the Education Alignment (USA: grad programs for digital preservation, new models for grad programs, internship programs (diverse knowledge), workshops)
o sharing tools
o national programs
related issues to digital preservation:
o nature of costs and business models
o strategies for selection & appraisal
o ground roles and responsibilities
o effectiveness and demand for services
focus of the panel: factors influencing the actual sustainability of a digital archive
2 considerations: collaboration + user demand
challenges + gaps: span national boundaries, public + private funding, education, exportation, DP certification, competition, funding gaps, policy, selection criteria, roles & responsibilities, standards
Magazzini Digitali: e-legal deposit in Italy
PADI: a failed project (discontinued in 2010)
Presentation without a title, Neil Grindley
how much does it cost to manage information? what institutional financial strategies are required to facilitate effective preservation? what general economic frameworks are required to enable information to persist and be accessible?
JISC 2010 Infrastructure for Education and Research Programme
archival storage and preservation activities constitute a very small proportion of the overall costs
1531 access 55 outreach acquisition ingest
o approx 333 Euros for a set of 1000 records
KRDS2 p 83: future tool development supporting automation of ingest
Sustainable Preservation in North America: ADPNet & Friends, Aaron Trehub
solution: distributed digital preservation (in at least 3 copies vs LOCKSS 6 copies)
DDP + LOCKSS = PLN; open SW developed at Stanford
MetaArchive, COPPUL (Canada), ADPNet: www.adpn.org
Salt Lake City, USA
Dates (from-to): -
14.5. departure from Prague; 16.5. PREMIS workshop; 17.-19.5. Archiving 2011 conference; 20.5. departure from SLC
15.5. departure Washington-Detroit-Salt Lake City; 15.5. workshops T1D Color in Image Capture Archiving and T2A Scanner & Camera Imaging Performance Benchmarking, Compliance and Workflow Monitoring; 17.-19.5. Archiving 2011 conference
NK: Archiving conference, IOP-NDK
0136
NA 6.6.2011
Date: 6.6.2011 Signature: Date: Signature:
Date: Signature:
PREMIS tutorial
----------------------------------------------------------
Implementation of a High Performance Architecture for Managing and Storing Web-Harvested Collections, Michael Smorul and Joseph JaJa, University of Maryland (USA)
httpswikiumiacsumdeduadaptimages66bArchiving11-smorulpdf
- web archiving
- WARC manager application
Using Tape for Large-Scale Digital Preservation, Gary Wright, FamilySearch (USA)
- (The Church of Jesus Christ of Latter-day Saints)
NDK: DRPS, Ingest Tools; storage type: storage grid, NetApp FAS3170
information lifecycle management; optimization of the storage layer between the Grid and Rosetta
- human error
- HW errors
- data integrity validation
- maximization
- integrity verification
metadata, content
Moving On: When it is Time to Re-Archive, Michael Selway, Quantum Corporation (USA)
with HDDs there is no other way
Data migration: how to go about the migration
- HDD: years
what about split migration (full split migration)
FamilySearch: An End-to-End Process for Scanning, Characterizing, Preserving and Providing Access to Very Large Collections of Vital Records, Tom Creighton, FamilySearch (USA), Jonathan Tilbury, Tessella plc (UK), and Mark Evans, Tessella Inc (USA)
- at FamilySearch: microfilms from the mid-19th century
- DPS is their [...]
The Audit and Certification of FDsys
- FDsys [...] audit
TRAC: http://www.gpo.gov/fdsys
- TRAC: HathiTrust, UNT, Portico, MetaArchive, Chronopolis
- compliance scale 1-
How Long is Long-Term Data Storage? (Focal), Barry M Lunt, Brigham Young University, and Douglas Hansen, Wayne Rust and Mark Worthington, Millenniata Inc (USA)
- FLASH technology: 10- [...], electron leakage
- and yet errors appeared
- Library 1: 21
- Library 2: 18
- 1050 years
- HDD: 1-7 years
Quality Assurance of Digital Information in Long-Term Digital Preservation, University of Technology (Sweden)
- significant properties in documents
Towards Interoperable Preservation Repositories: Repository Exchange Package Use Cases and Best Practices, Joseph Pawletko, New York University, and Priscilla Caplan, Florida Center for Library Automation (USA)
- TIPR
- RXP: METS and PREMIS semantics; metadata files are not inside the METS but alongside it
- succession
- disaster recovery
- as AIP
- SW migration
- diversification
- data migration
SARKK: Comprehensive Digital Archive Services for Finnish Municipalities
-Savon Tietohallinto Oy (Finland)
- a company [...] ICT, long-
- preservation planning functionality
Magnetic Tape Technology: economic advantages for preservation, Gary Francis, Oracle (USA)
- Oracle presentation
- Clipper Group 2010, in search for the long-term archiving solution: tape delivers significant TCO advantage over disk
- 5 TB per cartridge (Sun/Oracle)
-----------------------------------------
Color In Digital Preservation, Robert Buckley, University of Rochester/NewMarket Imaging, Steven Puglia, National Archives and Records Administration, and Michael Stelmach, Library of Congress (USA)
o color gamut: AdobeRGB, ProPhoto RGB
- color reproduction
Multispectral Image Archiving of Watermarks in Historical Papers, Peter Meinlschmidt, Wilhelm-Klauditz-Institut, Fraunhofer-Institute for Wood Research and Volker
Braunschweig (Germany)
- reflected light
- transmitted light
-spectral imaging
Implementing a Quality Assurance Program for Monitoring Scanner Performance, Michael J Horsley and John T Berezich, National Archives and Records Administration (USA)
- microfilm, scan
- NARA 2004 Guidelines
httpsdocsgooglecomviewerurl=http3A2F2Fwwwarchivesgov2Fpreservation2Ftechnical2Fguidelinespdf
- Metamorfoze
- etc., see slides
- quantitative performance slides
- web-based database, SharePoint
Preservation in a Digital Age, Jay Verkler, FamilySearch (USA)
- with Tessella, for SDB
- data loss is intrinsic
- preservation as a service
Curation of the End-of-Term Web Archive: Classification and Metrics, Kathleen Murray, Lauren Ko and Mark Phillips, University of North Texas (USA)
- eotcd archive: httpresearchlibraryuntedueotcdwikiMain_Page
- 16 TB of data
- (not WARCs)
DAITSS Grows Up: Migrating to a Second Generation Preservation System (Focal), Priscilla Caplan and Carol Chou, Florida Center for Library Automation (USA)
- DAITSS 1 could not be installed anywhere else
- instructions
- in processes
- PSP
- refresh function
- disseminate: do a refresh and export new AIP as DIP
- withdraw: remove AIP from storage, retaining provenance
- 80 TB
A Community Driven Micro Services Architecture Supporting Long Term Digital Preservation, Mark Evans and Bill Steel, Tessella Inc (USA), and Robert Sharpe, James Carr, Alan Gairey and Jonathan Tilbury, Tessella plc (UK)
- with NDIIPP; micro-services
- combined into a process, which can then be arbitrarily [...]
- workflow
- bit-stream
- cloud (S3, httpawsamazoncoms3); SaaS functionality, multi-tenancy coming soon
- policy functionality etc.
Note from the USA
- 44 TB SIP in a test (10 MB JP2 files)
- SDB community: founded 2008
System
Lisa LaPlant and Blake Edwards, US Government Printing Office (USA): FDsys, OAIS, MODS
<part>2</part>
DMD: data model definition
httpwwwgpogovfdsyssearchpagedetailsactionst=pragueampgranuleId=amppackageId=DCPD-200900228
-----------------------------------------
Preservation Starts from the Beginning, Michael Wash, US Department of Transportation (USA)
Autographic Kodak 1916
following the No 3A Autographic Kodak Special of 1916, which was the first rangefinder camera. It had a Kodak Anastigmat f/6.3 lens and a Kodamatic shutter with speeds from 1/2 to 1/200 sec plus bulb and time mode. httpcamerapediawikiacomwikiNo_1A_Autographic_Kodak_Special
Kodak Advantix
Colorite: A Flexible Cross-Platform Software Solution for Automatic Image Quality Analysis Using Arbitrary Targets, Henrik Johansson, National Library of Sweden (Sweden)
o the target is detected automatically
o TIFF, JPEG, JP2, PNG
o state-of-the-art feature-based image matching algorithm; ImageMagick
o can be switched off in the future to speed up the process for BATCH processing
What if the Image Quality Analysis Rates My D, Dietmar Wueller, Image Engineering (Germany)
- Golden Thread
- UTT target
-lens reflex
Establishing Resolution Requirements for Digitizing Transmissive Content: A Use Case Approach, Michael Stelmach, Library of Congress, Don Williams, Image Science Associates LLC, and Steven Puglia, National Archives and Records Administration (USA)
- based on 10% SFR limiting resolution criteria: how much of the image information will be captured
o 1st half of the 20th century: 1200-1600 PPI
o 2nd half of the 20th century: up to 2800 PPI
Digitise More, Pay Less: Optimising the Workprocess for both Heritage Institute and Imaging Provider, Olaf Slijkhuis, Pictura Imaginis (the Netherlands)
- document quality, parameters etc.
- image control
Debate with FamilySearch, 20.5.2011
-------
- also on Digital Preservation issues
- will have to provide free of charge
- and synchronize
- an effort on their part to cooperate with archives and libraries
CO-FINANCED FROM EU STRUCTURAL FUNDS (EUROPEAN REGIONAL DEVELOPMENT FUND) THROUGH THE IOP
Business trip report. Project: Creation of the National Digital Library
CZ 10611000706386
Name and surname of the traveller: Jan Hutař
Workplace according to the organizational structure: ODF 81
Workplace, position: head of department
Reason for the trip: attending the iPRES 2011 conference
Place, city: Singapore
Place, country: Singapore
Dates (from-to): 30.10.-5.11.2011
Detailed schedule: 30.-31.10. flight Prague > Dubai > Singapore
1.11. start of the conference, tutorials
2.11.-4.11. conference
4.-5.11. return flight Singapore > Dubai > Prague
Fellow travellers from NK: Mgr. Marek Melichar (paid from project 0136)
Funding: IOP, Creation of the National Digital Library
Relation to the project: gaining new information on digital preservation issues and on projects in other libraries; consultations with colleagues and companies
Goals of the trip: see relation to the project; use all outputs for planning and running the NDK project and for future handling of digital preservation in NK/NDK
Fulfilment of the goals: fulfilled, see the detailed notes below and the proceedings on SPS
Further details: SUMMARY AND BENEFIT FOR THE NDK PROJECT
- noticeable rise of emulation as an approach to long-term preservation (in previous years, migration); the two approaches look set to complement each other
- shift towards preservation of complex data, databases etc.; NK does not deal with this yet
- many papers usable for NK and NDK (web archiving and preservation at the National Library of France, audits, emulation, information on the SDB system (Tessella) and on the RODA system, certification, see Rouchon, etc.)
- information on problems and solutions when using tools such as JHOVE, PRONOM etc. in a real environment
- 2 papers on backing up optical discs: a current problem in NK too; ideally follow the procedures described in the proceedings
- clear need for international cooperation and for adherence to standards so that such cooperation is possible
- for more detail see below
Project publicity support: NA
Related materials
Material: conference proceedings. Storage location: SPS, folder with business trip reports
Report submitted: 15.11.2011
Signature of the submitter:
Date: Signature:
Signature of the superior: 15.11.2011
Posted on the intranet:
Received by the international department:
Seamus Ross: digital curation and preservation
- preserving data sets: use of statistical databases and research data vs preservation of textual information
- personal data are no longer just photos in a box (Flickr etc.)
- banks: lots of personal data, future use by historians, must be preserved; institutions do this too
- see mckinseycom big data full report pdf
- so why do DP (slide 16): future generations expect it; for historians and scientists, so that they have some sources; a reference about the present for the future; an information ecosystem to enable storytelling
- emphasis is shifting from preservation of textual information to preservation of complex databases
A capability model for DP, Ch Becker et al
- research within the SHAMAN and SCAPE projects
- SoS: systems of systems
- 3 kinds of systems (UML):
- DPS as a functional requirement
- SoS: a business system, a system within a system; data then flow into the DPS
- DPS where DP is not a functional requirement but the system does it anyway (DP-ready system): a business system with DP functionality
- but how to get DP into enterprise systems: a model for implementing DP in any system; within the SHAMAN project, a capability-based reference architecture
- governance, business and technical (operation) capabilities: a basis for decisions and for assessing the current state
- capability maturity model (CMM): processes, assessment and improvement, as in SW development
Olivier Rouchon: certification and quality at CINES
- they store theses, digitized items, multimedia, documents, scientific data sets
- data centre for the whole of France
- they have format experts (XML); 11 people
- 15 TB of data
- certification: by national law CINES is the national centre for DP of theses; they have a department, staff and money for it; for the procedure and preparations see below
- preparation for certification: testing DRAMBORA, DSA, TRAC, ISO 16363 and ISO 16919
- step: 2009, DRAMBORA audit; 2 risk reviews per year, including how the resolution of the risks is progressing
- step: formalization of business processes, 14 processes according to ISO 9001
- management, operational and support processes (presented at iPRES 2010)
- 2009: external pre-audit, 2 people, 19 man-days, based on all available standards, by reviewing documentation and conducting interviews
- 2010: SIAF audit, 4 months; carried out by the National Archives of France for every archive that stores public data; they audit every 3 years; the report had 800 pages
- 2010: Data Seal of Approval, part of the EU framework for audit and certification of trusted repositories (MoU between the three certification activities)
- 2011: within the APARSEN project they also did ISO 16363; DANS and UKDA went through the same audit
- first internal, January and April 2011 (60 man-days), then external, 12 experts (KB, BL, NASA etc.) in June 2011 (3 days)
MoU: DSA > ISO 16363 as the second step (internal) > ISO 16363 extended
an audit gets faster the more of them you do, i.e. when done regularly it is not so time-consuming
The National Library of New Zealand passed TRAC certification in autumn 2011
Andreas Rauber: impact of preservation actions on the repository
- what happens to the repository itself
- repository simulation: RepoSim
- for analysis and for testing migration: what happens when files grow, what happens when the repository holds more format types, etc.
- RepoSim: a flexible simulator, irregular patterns
- so far an internal version: Hibernate, Java, MySQL
- one can specify which formats the repository accepts, their description, ingest settings, hypothetical tools (mainly for migration), and rules for preservation actions (migration to which format and version, which files, how many times; rules + filters)
- a virtual migration can be run; it produces graphs that show how long it will take etc.
- what to migrate, how, to what and over what period: it runs virtually and we see the result
- good for IT and HW planning
- good for planning different scenarios, comparison with the expected development, planning HW growth and investment
- still to be added: the option to define deletion policies, reports etc.
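The kind of question such a simulator answers can be illustrated with a toy calculation. The Python sketch below is not RepoSim (which the notes describe as an internal Java/Hibernate/MySQL tool); the class names, the tool throughput and the size factor are all hypothetical and only show how a rule-driven virtual migration yields a duration and a new format profile.

```python
from dataclasses import dataclass


@dataclass
class Holding:
    """Files of one format held by the simulated repository."""
    fmt: str
    file_count: int
    avg_size_mb: float


@dataclass
class MigrationRule:
    """Hypothetical preservation rule: migrate one format into another."""
    source_fmt: str
    target_fmt: str
    tool_mb_per_s: float  # assumed throughput of the migration tool
    size_factor: float    # assumed ratio of target size to source size


def simulate(holdings, rule):
    """Apply the rule virtually; return (duration in hours, new holdings)."""
    seconds = 0.0
    result = []
    for h in holdings:
        if h.fmt == rule.source_fmt:
            # Time is the migrated volume divided by the tool throughput.
            seconds += h.file_count * h.avg_size_mb / rule.tool_mb_per_s
            result.append(Holding(rule.target_fmt, h.file_count,
                                  h.avg_size_mb * rule.size_factor))
        else:
            result.append(h)
    return seconds / 3600, result


# Made-up repository profile and rule, for illustration only.
repo = [Holding("TIFF", 1000, 50.0), Holding("PDF", 200, 2.0)]
hours, after = simulate(repo, MigrationRule("TIFF", "JP2", 25.0, 0.5))
print(f"virtual migration takes about {hours:.2f} h")
```

Running several rules or growth scenarios through the same function gives the comparison of scenarios that the talk recommends for HW and investment planning.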
José Barateiro: Risk assessment in DP of e-science data and processes
- DP as risk management
- ISO 31000: definition of risk management
- similar to DRAMBORA
- there are many standards for risk management
- elaboration of the ISO 31000 methodology into individual steps
- TIMBUS project, http://timbusproject.net; one of the partners is SAP (Germany)
mad talks
- open source SW for LTP: RODA is back and is being developed within the SCAPE project; new functions, development plans and the start of a user community
- 4 posters on emulation: emulation within KEEP, emulation for library reading rooms, OPF ecosystem registry
- TOTEM: a metadata standard for describing the technical environment for emulation
ANDS, Ross Wilkinson
- data centres in Australia: at least 3, for different areas of life
- ANDS has existed for almost 3 years; funded by the Australian government
- huge amounts of data will never be used or read by a human, only by automated mining processes
- research data must be stored and protected because it may no longer be possible to recreate them; they must be kept so that they can be reused, new conclusions can be drawn from them, and scientists have them available
- this must be done in cooperation; a single institution cannot do it alone
- who handles the storage of scientific data in the Czech Republic? The Academy of Sciences, CESNET
- similar data centres also exist in the UK
Rob Sharpe: Considerations for High Throughput Digital Preservation
Presentation by Tessella: their performance testing of ingest into SDB at FamilySearch
- SDB has been developed since 2002, when the first customer was The National Archives (UK)
- new customer: UK Parliament
- test with FamilySearch
- 20 TB ingested per day; scanned materials; workflow with antivirus, characterization (PRONOM, JHOVE) etc.
- 1 package is roughly 1 GB; 20 thousand packages per day
- 2 Dell PowerEdge R710 servers, total price max. 20,000 pounds
- the limiting factor turned out to be the read speed of the disks that hold the data at the start of ingest; they therefore needed 130 parallel disks (50 thousand pounds)
- writing to tape is also slow; they therefore needed 8 parallel tape writes (30 thousand pounds)
- storage costs 100 pounds per TB
- 7.3 PB per year
- conclusion: writing and reading are slow; tools like JHOVE and PRONOM are fast enough; storage costs also proved to be high
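The figures in this list can be cross-checked with a few lines of arithmetic. The Python sketch below uses only numbers from the notes (20 TB per day, roughly 1 GB per package, 100 pounds per TB); the use of decimal units (1 TB = 10**12 bytes) and the variable names are assumptions.

```python
# Figures from the notes; decimal units (1 TB = 10**12 bytes) are assumed.
TB = 10**12
DAILY_INGEST_BYTES = 20 * TB
PACKAGE_BYTES = 10**9        # "roughly 1 GB" per package
STORAGE_GBP_PER_TB = 100     # quoted storage price

# Number of 1 GB packages needed to carry 20 TB.
packages_per_day = DAILY_INGEST_BYTES // PACKAGE_BYTES

# Average rate that must be sustained around the clock, in MB/s.
sustained_mb_per_s = DAILY_INGEST_BYTES / (24 * 60 * 60) / 10**6

# Yearly volume in PB and the resulting yearly storage bill in pounds.
yearly_pb = DAILY_INGEST_BYTES * 365 / 10**15
yearly_storage_gbp = 20 * 365 * STORAGE_GBP_PER_TB

print(packages_per_day)
print(round(sustained_mb_per_s, 1))
print(yearly_pb)
print(yearly_storage_gbp)
```

The results agree with the notes: 20 thousand packages per day and 7.3 PB per year. The sustained rate works out to roughly 231 MB/s, and at the quoted price the yearly storage bill alone is 730 thousand pounds, which is consistent with the conclusion that storage costs are high.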
To ingest data from the FamilySearch project they needed to sustain a throughput of 20 TB of data per day while keeping adequate procedures for processing the data according to the requirements of OAIS and of the client. The project was about identifying the bottlenecks of ingesting large volumes of data.
Processes such as generating or checking hashes, identifying formats and extracting technical metadata usually require a fast storage system at large volumes. In the FamilySearch project they want to ingest into SDB (content acquisition, content preparation, ingest, fixity check, content metadata integrity check, characterization, i.e. identification and validation of formats and extraction of technical MD) at most 700 MB per second.
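The hash generation and fixity check mentioned here are simple to express; what makes them expensive at this scale is that every byte has to be read from storage. A minimal Python sketch, assuming SHA-256 and a 1 MiB read size (neither is stated in the notes, and the function names are illustrative, not part of SDB):

```python
import hashlib
import io


def digest_stream(stream, algorithm="sha256", chunk_size=1 << 20):
    """Hash a binary stream in chunks so large files never sit in memory."""
    h = hashlib.new(algorithm)
    for chunk in iter(lambda: stream.read(chunk_size), b""):
        h.update(chunk)
    return h.hexdigest()


def fixity_ok(stream, expected_hex, algorithm="sha256"):
    """Return True when the recomputed digest matches the recorded one."""
    return digest_stream(stream, algorithm) == expected_hex.lower()


# Usage with in-memory stand-ins for files opened in binary mode.
recorded = digest_stream(io.BytesIO(b"scanned page"))
print(fixity_ok(io.BytesIO(b"scanned page"), recorded))
print(fixity_ok(io.BytesIO(b"scanned pagE"), recorded))
```

Because the CPU work per chunk is small compared with the time needed to read it, a check like this tends to be limited by disk or tape throughput, which matches the bottleneck reported in the talk.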
They addressed how to parallelize such a massive workflow efficiently while minimizing costs. According to their findings, parallelization makes it possible to work around the performance problems of tools such as DROID and JHOVE; contrary to their expectations, overall software performance was not a problem. The bigger problems are in the HW, which has to be able to write fast enough.
I.e. the bottleneck was in the HW and in moving data from place to place rather than in the performance of digital preservation tools.
Benefit for NK
No need to fear the performance of SW such as DROID or JHOVE.
Ross King: Evolving domains, problems and solutions for LT DP
- information on projects such as SCAPE etc.
- programme: httpcordiseuropaeufp7icttelearn digicultreport research digital preservation_enpdf
- Stephan Strodl, Petar Petrov, Andreas Rauber (Vienna University of Technology, Austria): a nice timeline for preservation projects, a whitepaper about the past of European DP
- funding spent on DP research is gradually growing; projects and funding alone will not solve anything
- ARCOMEM: archiving of web archives, socially driven web preservation model
- social web analysis
- archive enrichment
- ENSURE: evaluation between cost and value, automation of the preservation cycle; testbeds: healthcare, clinical trials, financial services
- SCAPE
- preservation planning and action workflows: how to make them scalable
- creating an infrastructure for scalable preservation actions
- development of policy-based preservation planning tools with automatic preservation watch
- 3 testbeds: WA, large-scale repositories, research data sets
- all projects will produce prototype SW
- digital lifecycle approach
- preservation planning plays a role in all these projects, together with virtualization
- slide with trends in DP over recent years
- Research on Digital Preservation within projects co-funded by the European Union in the ICT
- Ensure
- Scape
- Wf4Ever: httpwwwwf4ever projectorgabout
- Timbus: SW alone is not enough; focuses on the context of organizations; LTP is not only about objects but also about services etc.
- Totem
Benefit for NK
Follow projects in the area of long-term preservation of digital data. The latest EU projects such as SCAPE will speed up the development of concrete tools for long-term preservation of digital data.
Record keeping in temporary command settings, Erik Borglund
- preservation of documentation on crisis situations arising from the work of the police etc.
- how to capture context: flipcharts, videos and notes can be kept; the context of analogue documents is not a problem, the problem is with digital material and conversations
- the national archive should take care of this, but it only accepts paper documents or e.g. photos from the scene
- question: is archiving the file material the same as archiving the course of the proceedings in digital form?
webarchiving session
BnF
- 200 TB of web-archived data
- 15 million ARCs; they have to be characterized and validated, which is time-consuming
- stored in a shared repository
- SPAR (the LTP system of the National Library of France) has a capacity of 16 PB
- they use JHOVE2 for characterization and are building a module for ARCs
- they do not want to characterize and validate the content of the ARCs, only identify formats
- PREMIS in METS would be too long, so they will record only metadata at the level of the information package (AIP) that are the same for the whole package; i.e. a property is stated once and then only the information about which files it applies to is added, instead of repeating the information for every file
- they created a special metadata format
- i.e. they can ask the LTP system "give me all information packages that contain format XY" etc.; there is no need to index the metadata of the contents, which would take a long time; they take the same approach for digitized books
- different DP policies and validation levels for different types of WA data: complete harvests vs thematic harvests
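The package-level approach described for BnF can be pictured as a small data structure: each property value is recorded once per package together with the files it applies to, and a query for a format only touches that summary. The Python sketch below is purely illustrative; the package identifiers, the dictionary layout and the function names are hypothetical and do not reflect the actual SPAR metadata format.

```python
from collections import defaultdict

# Hypothetical package descriptions: each format is stated once,
# followed by the files it applies to (not repeated per file).
packages = {
    "aip-0001": {"format": {"text/html": ["a.html", "b.html"],
                            "image/gif": ["logo.gif"]}},
    "aip-0002": {"format": {"text/html": ["index.html"]}},
}


def build_format_index(pkgs):
    """Map each format to the set of package ids that contain it."""
    index = defaultdict(set)
    for pkg_id, props in pkgs.items():
        for fmt in props["format"]:
            index[fmt].add(pkg_id)
    return index


def packages_with_format(index, fmt):
    """Answer 'give me all packages that contain format fmt'."""
    return sorted(index.get(fmt, ()))


index = build_format_index(packages)
print(packages_with_format(index, "image/gif"))
```

The index grows with the number of distinct formats per package rather than with the number of harvested files, which is the point of not indexing the metadata of every object inside an ARC.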
NL NZ
- 2 harvests, 20 TB in total
- they are working out metadata: how much metadata is a lot and how much is too little
- library policy says that as much metadata as possible must be stored; that would, however, be a problem in terms of metadata size
- for selective web harvesting they have a finished workflow in WCT; everything is catalogued
IA
- 16 billion URLs
- the oldest from 1996
- growth is 3 TB per day, 1 PB per year
Euan Cochrane, Dirk von Suchodoletz: Replicating Installed Application and Information Environments onto Emulated or Virtualized Hardware
- capturing and preserving a complete environment on emulated HW
- e.g. take the desktop environment of the prime minister and store it in the archive
- display problems
- computer forensics
- an option for preserving scientific data and records
- it is all about replicating an HDD and running the environment it holds in a virtual environment
- solution:
- they pulled HDDs out of several old PCs > identify the HW requirements (HDD analysis > automatic estimate of the requirements; this is part of every PC environment) > choose emulation/virtualization SW (tool registry such as TOTEM from the KEEP project) > convert the HDD into a disk image suitable for emulation > try to boot the disk image on the emulated HW > add drivers
- problems with licences, personal data protection, authenticity (20% of things change, colours etc.)
- QEMU, SPARC processor emulator
Klaus Rechert: Remote Emulation for Migration Services in a Distributed Preservation Framework
- use of emulation as a tool for migration
- migration tools for certain formats are often not available
- we put the digital object into an emulated environment (virtual machine), where we see it in the environment of the emulated system; we can open it in the original or another suitable application, save it in a different format and store it back in the virtual machine
Bram Lohman: Emulation as a Business Solution: the Emulation Framework, KEEP project
- emulation framework: 7 emulators, 6 platforms (x86, Amiga etc.), 23 file formats
- a solution for managing emulation tools
- setup of emulation processes
- an environment that contains emulators; if we load an application or a file into it, it should run as in the original environment
- the environment also contains a tool that shows, for a given file, which format it is and which environment is needed to run it, based on PRONOM; the environment can be prepared right away and the file run in it; it loads the SW image from the OPF application database, which is being built
Geoffrey Brown: Developing Virtual CD-ROM Collections: The Voyager Company Publications
- publications of one specific publisher on CD, interactive applications for Mac; of 200 published, only about 50 are available today
- emulation on today's systems
- HDD snapshot directly in the emulator, i.e. it takes one click and is very fast
- SheepShaver emulator
Evaluation of a large Danish migration project
Before 1998 there were no formats prescribed by law. Between 2005 and 2008 they introduced standards.
The evaluation covers the prescribed standards and the migration to them at the national archive. It was carried out for the body that funded the work.
Between 2005 and 2008 they spent 30 person-years on migration with 10-15 people assigned to it; 190 thousand USD was invested, total costs 26 million USD.
It is not much data: what they actually migrated was about 1777 GB.
Various parts of the archive: tapes, population data, data on CD-R, registries and data filled in electronically.
They could not read all the files, especially those on tapes. There were 5 different types of tape. Some had to be rescued at great expense.
Total cost of producing the preservation standard: 10 man-years, 12 thousand USD, including the manual and implementation recommendations.
Pilot: project planning and management, and verification of the information packages.
The aim of the pilot was to arrive at a better budget and project plan.
Some problematic data in old formats, such as old databases, need smart people doing repetitive, long-running work; they needed good knowledge management to make this efficient.
Migration method: they wrote requirements for the tool and a description of how manual migration should be done.
Data preparation (restructuring the data and registering the metadata of the IPs) and preparation of documentation for the migrated IPs.
Software development: in-house development.
They needed 50 person-years per 1
Conclusions
Migrating standard data is cheaper, migrating from some standard tapes is cheaper, etc.
Most of the costs (80%) went on non-standardized data, in building the migration software. Developing a tool for migrating heterogeneous or non-standard data is the most expensive part.
What they learned: the old data had not been analysed sufficiently.
Project management was loose and they lost money.
Knowledge management: a good description of old data and of all their types, generations, locations etc. does not exist at our institution and will cause us difficulties; migrating old data at NK will be problematic.
Angela Dappert: robust migration workflow for offline media
- What is an archival object? (nice slide): a CD is not an archival object; for them it is a hand-held carrier. Better is a bit-stable object, which can have backups etc., and beyond that an archival object that carries additional metadata (logical preservation).
- A CD is not searchable, cannot be replicated easily, has a large manual overhead, and its rendering technology becomes obsolete very quickly.
- Endangered Archives project: optical disks, CD-R, external HDs, tapes; 67 terabytes in total.
- The offline hand-held carriers in the Endangered Archives project were very varied; they contained data with DRM, data under copyright and a range of similar problems.
- Options they chose between for each data source: a disk image (a single file containing everything on the carrier) or extraction of only some files.
- How important is the carrier itself? We need some information about it; it may hold traces of deleted data that we may want to keep. They made disk images of everything possible: hybrid DVDs, audio discs that also held data, etc.
- Which disk image format should they use? Not a single disk image format for all data; each special case gets its own disk image format.
- They did it with disk-copying robots; large-scale disk-copying robots sometimes could not be used, because they are good at producing CDs but not at ripping data from them.
- They built their own application with disk stacks and some smaller robots; the robots worked in LIFO or FIFO order and in the end they used FIFO, because LIFO had problems with the CD lifter.
At KB they worked out a fairly complex workflow, how to describe it, etc.
Each robot had its own PC.
They had problems with a number of things (see the presentation).
They did not find good software for managing images, only command-line tools, with which non-technical staff would make a lot of mistakes.
It takes a lot of people before the content gets online.
Staff must be well trained and flexible, but also able to do tedious jobs: systematic, patient.
NOTE: important for NK, where transferring data from discs will also have to be addressed and has already been addressed.
KEEP project, Antonio Ciuffreda: towards an integrated migration environment
- Disk transfer tool: converts a disk to an image file. The output also includes additional metadata about the file system, the MD5 of the file, etc.
- KEEP is producing a Transfer Tool Framework.
- Magnetic media: disk transfer tools for floppy disks, commercial and open source.
- Disk2FDI: commercial DOS tool; very accurate floppy disk image, but it takes 1 hour and the result is very large, ten times larger than the floppy disk itself. They tested about 2260 disks and tested emulation.
- Catweasel: commercial tool, a PCI card; runs on Linux and Win XP, has a GUI. High error rate but faster; image file quality was low.
- NibTools: free tool, G64 and D64, covers only the C64; DOS, Win, Linux, but command line only. Needs a Commodore disk drive and special cables. They tested a few disks; about half did not work in the emulator afterwards.
- Optical media: they used 5 transfer tools, all on the same CDs, DVDs and games:
1. Alcohol 120%: commercial, can circumvent DRM etc., supports Windows systems. 12 of 13 worked; 2 MB per second.
2. Daemon Tools: commercial, several types of image file (ISO, MDS, MDF), supports Windows; three of 13 did not work.
3. CloneCD: commercial, uses IMG or ISO, circumvents SafeDisc 3 protection, supports Windows, has a GUI; 11 of 13 were OK.
4. BlindWrite: commercial, supports DVD, Blu-ray, WM, Xbox and other special discs. Generates ISO and some proprietary ISO image formats; one did not work; transfer speed.
5. ImgBurn: does not read subchannel information into the image file (a film cannot be seeked, etc.); it is open source, generates DVD, BIN, CUE, IMG; Windows and Linux; 4 did not work; it is fast.
Conclusions
Magnetic media: commercial and non-commercial tools do not differ in performance; Disk2FDI is accurate but very slow. KEEP will use NibTools.
Optical media: copy protection has to be taken into account; the tools perform similarly; there will always be errors in the images, between 30 and 10 percent; BlindWrite handles game discs (Xbox etc.). KEEP will use ImgBurn because it is open source.
For complex images BlindWrite is the better choice.
Benefit for NK
Consider whether NK should actually run a project to migrate the content of CDs and DVDs to online media. The speakers present concrete experience with robotic processing and show what problems they had in designing the workflow, choosing the type of ISO image, etc.
BNF web archiving
They have three layers:
Harvest definition: collection
Harvest instance: crawling metadata
ARC files
A collection is, for example, the selective collection "elections 2000"; below it are the individual harvest instances and then the ARCs.
They collect logs, config and reports. These are also stored in an ARC: a special metadata ARC for each crawling instance.
PREMIS
Entities: object, agent, event
Objects: 1. ARC files and metadata ARCs; 2. harvest instances
Harvest event in PREMIS: an event of creation of content files
Events: reports as extensions of the event (host report and harvest report)
Agents: administrator, software, institutions, organizations that perform the harvest
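The object, agent and event entities described above can be pictured as a small data structure. The following is only an illustrative sketch in Python: the class names, field names and identifiers are assumptions made for this example and do not reproduce the PREMIS schema or BNF's implementation.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Agent:
    name: str
    role: str  # e.g. "administrator", "software", "institution"


@dataclass
class ArchivedObject:
    identifier: str
    kind: str  # "arc-file", "metadata-arc" or "harvest-instance"


@dataclass
class HarvestEvent:
    event_type: str
    objects: List[ArchivedObject] = field(default_factory=list)
    agents: List[Agent] = field(default_factory=list)
    # Reports attached as extensions of the event (host report, harvest report).
    reports: List[str] = field(default_factory=list)


def describe_harvest(instance_id, arc_names, agents, reports):
    """Build one harvest event linking a harvest instance, its ARC files and agents."""
    objects = [ArchivedObject(instance_id, "harvest-instance")]
    objects += [ArchivedObject(name, "arc-file") for name in arc_names]
    return HarvestEvent("creation of content files", objects, list(agents), list(reports))


if __name__ == "__main__":
    event = describe_harvest(
        "harvest-001",  # hypothetical identifiers, for illustration only
        ["crawl-0001.arc", "crawl-0002.arc"],
        [Agent("crawler", "software"), Agent("operator", "administrator")],
        ["host report", "harvest report"],
    )
    print(event.event_type, len(event.objects), len(event.agents))
```

Keeping the reports on the event rather than on the objects mirrors the note that host and harvest reports are modelled as extensions of the event.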
ContainerMD
httpbibnumbnffrcontainerMD v1documentationcontainerMD v1html
special metadata for material from the web archive
httpbibnumbnffrcontainerMD v1
Different SLAs for different types of material and for data from different harvests; shared repository: the partners expect different benefits; skills for different formats do not need to be duplicated within the institution.
Next year there should also be a JHOVE2 module for WARC.
Memento: meta search
Benefit for NK
Their web archiving model could be used at NK.
Cost models: Danish national library + TU Wien
- Stephan Strodl, TU Wien: they have their own cost model, but it appears to cover only small-scale automated preservation action costs.
- Danish national library: they built their own model, which is meant to be universal and usable anywhere.
- They measured the cost of submission according to the PAIMAS standard.
- When calculating costs they use OAIS and PAIMAS: activities are mapped onto these models and process costs are estimated on that basis.
- costmodelfordigitalpreservation.dk
Benefit for NK
Relevant to project 0136, which dealt with ways of estimating the costs of long-term storage.
Meet RODA, a Full Fledged Digital Repository for Long Term Preservation
- Originally a project of the Portuguese national archive, which we have been following for several years. RODA is now supported, and partly further developed, by an independent company. So far RODA supports only the archival metadata format (EAD), but further development should include library formats as well.
- RODA is now part of the SCAPE project, where it will be possible to develop the system further and scale it for use in mass production.
- httpredminekeepptprojectsroda public
Benefit for NK
Follow further development, possibly also for the INCAD + KNAV projects; for LTP development in smaller institutions this could be an interesting alternative in the future.
Report from a business trip abroad
Name and surname of the traveller: Mgr. Andrea Fojtů (AF)
Workplace according to organizational structure: 1.5 Department of Long-term Preservation of Digital Data (ODODD)
Workplace, position: 1.5.2 Department of Digital Repository Content Management
Reason for the trip: participation in the conference Aligning National Approaches to Digital Preservation
Place (city): Tallinn
Place (country): Estonia
Date (from-to): 22.-26.5.2011
Detailed schedule:
Monday 23.5.: conference registration, guided tour of the National Library; Keynote Address by Laura Campbell, Library of Congress, USA; Panel 1: Technical Alignment; Panel 2: Organizational Alignment
Tuesday 24.5.: Keynote Address by Gunnar Sahlin, National Library of Sweden; Panel 3: Standards Alignment; Panel 4: Legal Alignment; Breakout Sessions for panels 3 & 4
Wednesday 25.5.: Panel 5: Education Alignment; Panel 6: Economic Alignment; Breakout Sessions for panels 5 & 6; Synthesis; Closing remarks
Fellow travellers from NK: PhDr. Bohdana Stoklasová (BS), Ing. Tomáš Svoboda (TS)
Funding: IOP NDK
Goals of the trip: attending a conference with international participation; establishing contacts in the area of long-term preservation of digital documents and electronic legal deposit; gaining deeper insight into long-term preservation of digital documents, especially in national libraries
Fulfilment of the goals (specifically): The conclusions of the conference Aligning National Approaches to Digital Preservation largely repeat the conclusions of the workshop The Future of the Past: Shaping new visions for EU research in digital preservation (report available at httpcordiseuropaeufp7icttelearn digicultfuture of thepast_enpdf), for example regarding the missing economic model for the commercial sector, which would treat long-term preservation as an integral part of all its processes. Contact was established with staff of the national libraries of Estonia and Finland (staff responsible for web archiving and for long-term preservation in general).
Programme and further details: The main aim of the conference was to align national approaches to long-term preservation of digital documents across all areas, from technical, organizational and educational to standardization, economic and financial.
Materials brought back: conference programme, leaflets of exhibiting companies (Tessella, Equella, Guardtime) + other materials, notes
Date of report submission: 8.6.2011
Signature of the person submitting the report
Signature of superior
Posted on the intranet
Received by the international department
Appendix to this report: Conference notes in English
Appendix: Conference notes in English
Exploring What We Can Do Together: Strategic Alliance for International Collaboration (Laura Campbell)
- 185 digital preservation partners in more than 25 countries (education, research, LAM)
- strategic goals: National Content Stewardship Network (national digital collection, technical architecture, public policy, outreach)
- NDIIPP Content Domain Map: a mind map of geospatial, audiovisual, image & text and web content
- then & now: cognitive surplus vs digital libraries / digital preservation
- solution framework: actively working together, special interest groups, establishing a common index, international digital collection (freely available)
PANEL 1: Technical Alignment (The role of testing), Prof. Dr. Michael Seadle (Panel Head)
- to collaborate on requiring and implementing rigorous and independent tests
DNB Contribution to the Tallinn Alignment (Sabine Schrimpf)
- key theme is infrastructure: a network of hardware and software that permits the operation of application software
- the question of interoperability is crucial (standards, technical specifications)
- software elements
- components of the DP infrastructure were compared to the pallets at railroads
  o source: PARSE.Insight Roadmap 2020
- persistent ID resolvers, certification process
- kopal (ingest: koLibRI)
- nestor (German Network of Expertise)
- DP4lib (Digital Preservation as a Service): reduce dependency between components
  o redundant storage at different locations
  o koLibRI modules
- LuKII: set up as an economical LOCKSS network in Germany
- SHAMAN
- APARSEN: wants to bring coherence and cohesion to digital preservation research
  o trends in DP research projects
  o modular DP systems
  o distributed as SOA
  o elimination of technology dependencies
EDINA: The UK LOCKSS Alliance (Adam Rusbridge)
- EDINA offers underlying technical support & coordination
- threats to digital stewardship:
  o failure (media, HW, SW, network, format obsolescence, natural disaster, economic/organizational failure)
  o attack (insider/outsider)
  o operator error
- source: Requirements for Digital Preservation
- projects PEPRS and PECAN help identify coverage and requirements for DP
Public testing (Michael Seadle)
- traditional physical archiving relies heavily on trusted institutions
- distrust, not trust, needs to be the basis of digital archiving; testing therefore plays a key role
- goals of testing: demonstrate functionality, reveal weaknesses, provide data for planning improvements
- key issues for testing: integrity; authenticity (can the origin or genuineness be shown?); usability (can migration/emulation be demonstrated?); access; financial integrity
- DrWho (drwho1com)
- bit-stream testing is the most important; authenticity and usability may be impaired
  o the type of storage media, the number of copies and the frequency of checking and replacement get us to the relevant results
  o no reliable metrics exist, however (what is an acceptable loss, etc.)
- without well-documented, peer-reviewed, publicly available test results, librarians are buying archiving systems on faith
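The point about the number of copies and the frequency of checking can be made concrete with a small bit-stream audit. The sketch below is an illustration under stated assumptions, not a method presented at the conference: it takes several copies of one object as in-memory bytes, uses SHA-256 from Python's hashlib, and treats the digest shared by a strict majority of copies as the reference. The function name `audit_copies` and the location names are hypothetical.

```python
import hashlib
from collections import Counter


def audit_copies(copies):
    """Compare replicas of one object.

    copies: mapping of location name -> bytes of that replica.
    Returns (majority_digest, sorted list of locations that disagree with it).
    """
    if not copies:
        raise ValueError("no copies to audit")
    digests = {loc: hashlib.sha256(data).hexdigest() for loc, data in copies.items()}
    majority, votes = Counter(digests.values()).most_common(1)[0]
    # Without a strict majority there is no trustworthy reference copy.
    if votes <= len(copies) / 2:
        raise ValueError("no majority among copies")
    damaged = sorted(loc for loc, digest in digests.items() if digest != majority)
    return majority, damaged


if __name__ == "__main__":
    original = b"archived bit stream"
    replicas = {
        "site_a": original,
        "site_b": original,
        "site_c": b"archived bit strean",  # simulated corruption
    }
    reference, damaged = audit_copies(replicas)
    print(damaged)
```

Running such an audit on a schedule, and counting how often a replica has to be replaced, would produce the kind of publicly reportable data the talk asks for.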
Presentation without a title (Andy Rauber)
- evaluation vs testing vs benchmarking
- DP today: evaluation rather than testing, and far from benchmarking (a few tests, but nowhere near a definition of benchmarks)
  o existing evaluations are not repeatable
  o focus on the simple things
  o building the frameworks before having clear test scenarios
- necessary to move towards comparative benchmarking
- what is needed: a commitment to a culture of benchmarking and comparative evaluation; an understanding of what we want to benchmark; benchmark data + ground truth; measurement scales and measures that remain constant; knowledge bases to collect these
Organizing digital preservation on an international level (Michelle Gallinger, NDIIPP)
- focus on a national DP agenda
- community driven, action oriented (National Digital Stewardship Alliance)
  o present a distributed national digital collection for the benefit of citizens
The European Research Arena (David Giaretta)
- technologies: GEANT, EGEE/EGI
- EU research projects: TIMBUS, BLOG4EVER, SCAPE, ENSURE, APARSEN, ARCOMEM, WF4EVER
  o SCIDIP-ES (2011-2014)
- Alliance for Permanent Access (APA) formed as a legal entity 3 years ago
  o opportunities for networking
- ISO 16363: Audit and Certification of Trustworthy Digital Repositories
- ISO 16919: Requirements for Bodies Providing Audit and Certification
Observation from the MetaArchive Cooperative program (Martin Halbert)
- distributed DP programs differ from other programs
  o replication of content, distribution of the replicated copies to distinct geographical locations, and a network organization to connect these replicated copies
- MetaArchive established in 2003, funded by NDIIPP
  o seeks to foster broader awareness of digital preservation issues
- IIPC members are all institutions that focus on WA
  o 39 members: national and university libraries + other organizations (Internet Archive)
  o ISO standard WARC format for web archives + Heritrix and NutchWAX
  o growing membership (Africa & South America)
DAY 2
Keynote address: International and National Collaboration in the Digital Age (Gunnar Sahlin)
- 2012: a new law for e-deposit
- Samsök (search together) in 2005 (upgrade, new system)
- Swepub and long-term preservation
- Consortium of the Swedish research libraries for licensing e-journals and databases (ICOLC)
- Open Access and e-publishing (all universities have their repositories for e-pubs)
- NL as aggregator for Europeana, TEL, Apres, Athena, EUscreen
  o common system for the preservation of digital materials
  o common search portal for materials from the Swedish National Library and the Swedish National Archive
Raivo Ruusalepp
- standards: RAC, DSA, CIDOC (CRM), PAIMAS, ISAD(G), DDI
- use of information security standards for digital preservation
- information security: administrative and technical (physical = data security vs IT = communication)
- a company implemented security measurement with typical cyber-crime scenarios
- survey of security:
  o provision for information security in national legislation and development plans (12 of the respondents: ISO 27000 series, only 2 formal audits, the rest are looking into it; ½ of the respondents do not use standards or formal measurements to control information security)
  o IT & disaster plan: 65% (data recovery from the off-site location tested: 0%)
- alignment:
  o better use of community standards for information security and preservation
  o agreement on security requirements
Standards-based approach to preservation planning (Matthew Woollard)
- ISO 27001: very expensive implementation, 100,000 pounds
- Basic: Data Seal of Approval Guidelines (helps understand your business better)
- Audit and Certification of Trustworthy Digital Repositories
- Memorandum of Understanding to Create a European Framework
- ISO 16363 external; DSA or ISO 16363 self-audit
Best Practices & Standards (Bram van der Werf)
- self-assessment trust, audits/certification trust
- ISO 30300 (draft): Record Management
PANEL 4: Legal Alignment
Legal deposit & Web Archiving (Adrienne Muir)
- legal deposit provisions: purpose, scope, deposit mechanisms, roles & responsibilities & liability, access provisions, sanctions
- implementation: definitions, scope, offline/online, freely available/pay wall, technology neutral, incremental
- legal deposit vs voluntary (interim/hybrid approach, model agreements and licenses, flexibility)
- other legal issues: intellectual property rights, preservation, access, unlawful material, privacy/data protection
- voluntary approaches have disadvantages but may be necessary and can be useful
Breakout Session
- standards bring about alignment by themselves, but only if you use them to the full, not halfway
- depends on the community (users): enforceable or voluntary compliance (standards)
- next steps for standards alignment:
  o corpora as benchmarks
  o export/import completeness
  o educational standards
  o validation tools
  o accredited training courses to accredit auditors
  o framework standards
DAY 3
PANEL 5: Educational Alignment
- key elements of the DCC Curation Lifecycle Model
- Framework for the Education Alignment (USA: grad programs for digital preservation, new models for grad programs, internship programs (diverse knowledge), workshops)
  o sharing tools
  o national programs
- related issues to digital preservation:
  o nature of costs and business models
  o strategies for selection & appraisal
  o ground roles and responsibilities
  o effectiveness and demand for services
- focus of the panel: factors influencing the actual sustainability of a digital archive
- 2 considerations: collaboration + user demand
- challenges + gaps: span national boundaries, public + private funding, education, exportation, DP certification, competition, funding gaps, policy, selection criteria, roles & responsibilities, standards
- Magazzini Digitali: e-legal deposit in Italy
- PADI: a failed project (discontinued in 2010)
Presentation without a title (Neil Grindley)
- how much does it cost to manage information? what institutional financial strategies are required to facilitate effective preservation? what general economic frameworks are required to enable information to persist and be accessible?
- JISC 2010 Infrastructure for Education and Research Programme
- archival storage and preservation activities constitute a very small proportion of the overall costs: 1531 access 55 outreach acquisition ingest
  o approx. 333 Euros for a set of 1000 records
- KRDS2, p. 83: future tool development, supporting automation of ingest
Sustainable Preservation in North America: ADPNet & Friends (Aaron Trehub)
- solution: distributed digital preservation (in at least 3 copies vs LOCKSS 6 copies)
- DPP + LOCKSS = PLN; open software developed at Stanford
- MetaArchive, COPPUL (Canada), ADPNet: wwwadpnorg
6
V - -depotu
digitalizace v
-
-
-
xy atd) tak arconu process data store i pro management a CRM
Data model - Sledovat OPF toho co vyvinou
NTACE
filtrov f -
Projekty SCAPE a KEEP - ti v
EU ever
ISO standard pro WA kolik obsahu WA v CR
COST Recollection tool loc gov SEE WEB Quality assurence v kontextu WA
Marketing pro WA -
7
archiv le ights speci
-
Popularizace WA mezi techniky a IT komunitami
collection policy digital strategy
chaosu daleko
a pak j v v KB
8
-
Salt Lake City USA
Date (from-to) -
14.5. departure Prague- - 16.5. PREMIS workshop-+ 17.-19.5. Archiving 2011 conference 20.5. departure SLC- -
15.5. departure Washington-Detroit-Salt Lake City 15.5. workshops T1D Color in Image Capture Archiving, T2A Scanner & Camera Imaging Performance: Benchmarking, Compliance and Workflow Monitoring 17.-19.5. Archiving 2011 conference
NK Archiving conference IOP-NDK
0136
N/A 6.6.2011
Date 6.6.2011 Signature Date Signature
Date Signature
tutorial PREMIS
---------------------------------------------------------- Implementation of a High Performance Architecture for Managing and Storing Web-Harvested Collections Michael Smorul and Joseph JaJa University of Maryland (USA)
httpswikiumiacsumdeduadaptimages66bArchiving11-smorulpdf
- web archiving - WARC manager application
- -
-
-
- P) -
Using Tape for Large-Scale Digital Preservation Gary Wright FamilySearch (USA)
- (The Church of Jesus Christ of Latter-day Saints)
-
NDK - DRPS Ingest Tools Typ storage storage grid NetApp FAS3170
P information lifecycle management R optimalizace storage layer mezi Grid a rosettu
- - na -
- human error - - - chyby HW - validace integrity dat - - - - maximalizace - verifikace integrity
metadata conten
Moving On When it is Time to Re-Archive Michael Selway Quantum Corporation (USA) CHW) D
with HDDs there is no other way
Migrace dat z Jak se na to migrac
- HDD - - roky
a mohl by s
- - - - -
co s split migration (full split migrace)
- - - - potaz
FamilySearch An End-to-End Process for Scanning Characterizing Preserving and Providing Access to Very Large Collections of Vital Records Tom Creighton FamilySearch (USA) Jonathan Tilbury Tessella plc (UK) and Mark Evans Tessella Inc (USA)
- Ve FamilySearch mikrofilmy od poloviny 19 st - - - - - s -
- DPS je jejich s - -
The Audit and Certification of FDsys
- FDsys -auditem
TRAC httpwwwgpogovfdsys
- - TRAC hathi UNT Portico metaarchive chronopolis v
- stupnice compliancy 1- -
How Long is Long-Term Data Storage (Focal) Barry M Lunt Brigham Young University and Douglas Hansen Wayne Rust and Mark Worthington Millenniata Inc (USA)
- - FLASH technologie 10- plny elektron
odlivu elektronu a
-
and yet errors appeared
- Library 1 21 - Library 2 18
- 1050 let - HDD 1-7 let -
Quality Assurance of Digital Information in Long-Term Digital Preservation University of Technology (Sweden)
- significant properties - dokumentech z -
Towards Interoperable Preservation Repositories Repository Exchange Package Use Cases and Best Practices Joseph Pawletko New York University and Priscilla Caplan Florida Center for Library Automation (USA)
- TIPR
- - - TIPR
-
- RXP mets a premis semantika soubory metadat ne v metsu ale vedle v
- - succession - - - - disaster recovery a v
jako aip - migrace SW - - diversifikace - migrace dat v -
SARKK Comprehensive Digital Archive Services for Finnish Municipalities
-Savon Tietohallinto Oy (Finland)
- firma kterou v ICT long-
- - - preserv planning funkcionalitou -
-
Magnetic Tape Technology economic advantages for preservation Gary Francis Oracle USA
- Oracle prezentace - - clipper group 2010 in search for the long-term archiving solution tape delivers significant
TCO advantage over disk - 5TB na 1 cartridge sunoracle
-----------------------------------------
Color In Digital Preservation Robert Buckley University of RochesterNewMarket Imaging Steven Puglia National Archives and Records Administration and Michael Stelmach Library of Congress (USA)
-
atd -
o barev GAMUTem
AdobeRGB Pro FotoRGB
- a reprodukci barvy
Multispectral Image Archiving of Watermarks in Historical Papers Peter Meinlschmidt Wilhelm-Klauditz-Institut Fraunhofer-Institute for Wood Research and Volker
Braunschweig (Germany)
- Reflected light - Transmited light
-spectral imaging -
o
o o o o
Implementing a Quality Assurance Program for Monitoring Scanner Performance Michael J Horsley and John T Berezich National Archives and Records Administration (USA) DAITSS
- -
mikrofilm sken a r - NARA 2004 Guidelines
httpsdocsgooglecomviewerurl=http3A2F2Fwwwarchivesgov2Fpreservation2Ftechnical2Fguidelinespdf
- Metamorfose - Atd viz sildes - Quantitive performance slides - Web based database sharepoint zace -
Preservation in a Digital Age Jay Verkler FamilySearch (USA)
- Tessellou pro SDB budou k 1 - - - data loss is intrinsic gt - - - och
- preservation as a service
Curation of the End-of-Term Web Archive Classification and Metrics Kathleen Murray Lauren Ko and Mark Phillips University of North Texas (USA)
- - eotcd archiv httpresearchlibraryuntedueotcdwikiMain_Page - - pro - 16TB dat - - (ne Warcy)
pak to pospojo
DAITSS Grows Up Migrating to a Second Generation Preservation System (Focal) Priscilla Caplan and Carol Chou Florida Center for Library Automation (USA)
- Daitss 1 se nedal nainstalovat jinde si -) -
instructions
- v procesech
- - PSP
- refresh funkce - disseminate- do a refresh and export new AIP as DIP - withdraw remove AIP from storage retaining provenance
-
1
- - 80TB -
A Community Driven Micro Services Architecture Supporting Long Term Digital Preservation Mark Evans and Bill Steel Tessella Inc (USA) and Robert Sharpe James Carr Alan Gairey and Jonathan Tilbury Tessella plc (UK)
s NDIIPP v micro-services
spojeno do procesu- lze pak libovol
- - - -
-
workflow - -
- bit-streamu - cloudu (S3
httpawsamazoncoms3) SaaS funkcionalitu bude brzy ti tenancy
funkcionalitou policy apod
Pozn z USA
v 44TB SIP v testu (10MB JP2 soubory)
komunita SDB - - vznik 2008 - -
System
Lisa LaPlant and Blake Edwards US Government Printing Office (USA) FDsys OAIS dig MODS - z
ltpartgt2ltpartgt
DMD data model definition z
httpwwwgpogovfdsyssearchpagedetailsactionst=pragueampgranuleId=amppackageId=DCPD-200900228
----------------------------------------- Preservation Starts from the Beginning Michael Wash US Department of Transportation (USA)
Autographic Kodak 1916 na druhou
following the No. 3A Autographic Kodak Special of 1916, which was the first rangefinder camera. It had a Kodak Anastigmat f/6.3 lens and a Kodamatic shutter with speeds from 1/2 to 1/200 sec plus bulb and time mode. httpcamerapediawikiacomwikiNo_1A_Autographic_Kodak_Special Kodak Advantix
zastavil
Colorite A Flexible Cross-Platform Software Solution for Automatic Image QualityAnalysis Using Arbitrary Targets Henrik Johansson National Library of Sweden (Sweden)
- a - -
o Target se detekuje automa se aby tento proce
o TIFF JPEG JP2 PNG o o -of-art feature based image matching algoritm ImageMagic
o Alg
t o o v budoucnu vypnout aby se proces urychlil pro BATCH
proce o
Henrika Johanssona What if the Image Quality Analysis Rates My D Dietmar Wueller Image Engineering (Germany)
- Golden thread - UTT targer -
o
-lens reflex
Establishing Resolution Requirements for Digitizing Transmissive Content A Use Case Approach Michael Stelmach Library of Congress Don Williams Image Science Associates LLC and Steven Puglia National Archives and Records Administration (USA)
-
- Based on 10 SFR limiting resolution criteria how much the image information will be captured
o first half of the 20th century: 1200-1600 PPI o second half of the 20th century: up to 2800 PPI o o
Digitise More Pay Less Optimising the Workprocess for both Heritage Institute and Imaging Provider Olaf Slijkhuis Pictura Imaginis (the Netherlands)
- ument kvalita parametry atd - -
- trolu
obrazu
N debata s familysearch 2052011 -------
i na problematice Digital Preservation
bude muset poskytnout zdarma
a synchronizovat
snaha spolupracovat archivy a knihovnami jejich strany
Business trip report
Project: Creation of the National Digital Library
CZ 10611000706386
Name and surname of the traveller: Jan Hutař
Workplace according to organizational structure: ODF 81
Position: head of division
Reason for the trip: attending the iPRES 2011 conference
Place (city): Singapore
Place (country): Singapore
Date (from-to): 30.10.-5.11.2011
Detailed schedule: 30.-31.10. flight Prague > Dubai > Singapore
1.11. start of the conference, tutorials
2.11.-4.11. conference
4.-5.11. return flight Singapore > Dubai > Prague
Fellow travellers from NK: Mgr. Marek Melichar (funded from project 0136)
Funding: IOP, Creation of the National Digital Library
Relation to the project: obtaining new information on digital preservation and on projects in other libraries; consultations with colleagues and companies
Goals of the trip: see relation to the project; use all outputs for the planning and running of the NDK project and for future handling of digital preservation at NK/NDK
Fulfilment of the goals: fulfilled; see the detailed notes below and the proceedings on SPS
Further detailed information: SUMMARY AND BENEFIT FOR THE NDK PROJECT
- a noticeable rise of emulation as an approach to long-term preservation (in previous years it was migration); the two approaches will apparently complement each other
- a shift towards preserving complex data (databases etc.), which NK does not address yet
- many contributions usable in NK and NDK (web archiving and preservation at the National Library of France, audits, emulation, information on the SDB system (Tessella) and on the RODA system, certification (see Rouchon), etc.)
- information on problems and solutions when using tools such as JHOVE, PRONOM, etc. in a real environment
- 2 contributions on backing up optical discs, a current problem in NK as well; ideally follow the procedures described in the proceedings
- a clear need for international cooperation and for adherence to standards, so that such cooperation is possible
- for details see below
Project publicity support: N/A
Related materials:
Material | Storage location
conference proceedings | SPS, folder with business trip (SC) reports
Date of report submission: 15. 11. 2011
Signature of the person submitting the report:
Date / Signature
Signature of the superior: 15. 11. 2011
Posted on the intranet
Received by the international department
Seamus Ross: digital curation and preservation
- preserving data sets: use of statistical databases and research data vs. preservation of textual information
- personal data are no longer just photos in a box (Flickr etc.)
- banks: a lot of personal data, future use by historians, must be preserved; institutions are doing so too
- see mckinsey.com, big data full report (PDF)
- so why do DP (slide 16): future generations expect it; so that historians and scientists have some sources; a record of the present for the future; an information ecosystem to enable storytelling
- the emphasis is shifting from preserving textual information to preserving complex databases
A capability model for DP, Ch. Becker et al.
- research within the SHAMAN and SCAPE projects
- SoS: systems of systems
- 3 kinds of systems (UML):
- DPS, with DP as a functional requirement
- SoS: a business system, a system within a system; the data then flow into a DPS
- a system where DP is not a functional requirement but which does it anyway (DP-ready system): a business system with DP functionality
- but how to get DP into enterprise systems: a model for implementing DP in any system
- within the SHAMAN project: capability-based reference architecture
- governance, business and technical (operation) capability: a basis for decisions and for assessing the current state
- capability maturity model (CMM): process assessment and improvement, as in SW development
Olivier Rouchon: certification and quality at CINES
- they store theses, digitised items, multimedia, documents, scientific data sets
- a data centre for the whole of France
- they have format experts (XML); 11 people
- 15 TB of data
- certification: by national law CINES is the national centre for DP of theses; they have a department, people and money for it; procedure and preparations see below
- preparation for certification: testing DRAMBORA, DSA, TRAC, ISO 16363 and ISO 16919
- step: 2009 DRAMBORA audit; 2 risk reviews per year, checking how the risks are being dealt with
- step: formalisation of business processes, 14 processes following ISO 9001: management, operational and support processes (presented at iPRES 2010)
- 2009: external pre-audit, 2 people, 19 man-days, based on all available standards, carried out through documentation checks and interviews
- 2010: SIAF audit, 4 months; done by the National Archives of France for every archive that stores public data; they audit every 3 years; the report had 800 pages
- 2010: Data Seal of Approval, part of the EU framework for audit and certification of trusted repositories (MoU between the three certification activities)
- 2011: within the APARSEN project they also did ISO 16363; they went through this audit together with DANS and UKDA
- first internal, in January and April 2011 (60 man-days), then external, by 12 experts (KB, BL, NASA, etc.), in June 2011 (3 days)
MoU: DSA > ISO 16363 as the second step (internal) > ISO 16363 extended. An audit gets faster the more of them you do, i.e. if it is done regularly it is not so time-consuming.
The National Library of New Zealand went through TRAC certification in autumn 2011.
Andreas Rauber: impact of preservation actions on the repository
- what happens to the repository itself; repository simulation, RepoSim
- for analysis and for testing migration: what happens when files grow, what if the repository holds more format types, etc.
- RepoSim simulator: flexible, irregular patterns
- so far an internal version: Hibernate, Java, MySQL
- one can specify which formats it accepts and their description, ingest, the settings of hypothetical tools (mainly for migration), and the rules for preservation activities (migration to which format, which version, which files, how many times; rules + filters)
- a virtual migration can be run; it produces graphs that say how long it will take, etc.
- what to migrate, how, to what and over what period is run virtually and we see the result
- good for IT and HW planning
- good for planning various scenarios, comparing them with the expected development, and planning HW growth and investments
- they still have to add the option to enter deletion policies, reports, etc.
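The kind of "virtual migration" described above can be illustrated with a very small sketch. This is not RepoSim (which, according to the talk, is an internal Java/Hibernate/MySQL tool); the class, the function names, the growth rate and the throughput figure are all hypothetical and only show how growth of a holding translates into migration time.

```python
from dataclasses import dataclass


@dataclass
class FormatHolding:
    name: str
    files: int            # number of files currently held
    avg_size_mb: float    # average file size in MB
    yearly_growth: float  # fractional growth of the file count per year (assumed)


def project(holding, years):
    """Return the holding after compound growth over the given number of years."""
    files = round(holding.files * (1 + holding.yearly_growth) ** years)
    return FormatHolding(holding.name, files, holding.avg_size_mb, holding.yearly_growth)


def migration_hours(holding, throughput_mb_s):
    """Estimate the hours needed to push every file through a migration tool."""
    total_mb = holding.files * holding.avg_size_mb
    return total_mb / throughput_mb_s / 3600


# Hypothetical holding: one million 20 MB TIFF files growing by 10 % a year.
tiff = FormatHolding("TIFF", files=1_000_000, avg_size_mb=20.0, yearly_growth=0.10)
for year in (0, 1, 5):
    future = project(tiff, year)
    print(year, future.files, round(migration_hours(future, throughput_mb_s=50.0), 1))
```

Varying the growth rate or the tool throughput gives the scenario comparison that the notes mention as useful for HW and investment planning.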
José Barateiro: Risk assessment in DP of e-science data and processes
- DP as risk management
- ISO 31000: definition of risk management
- similar to DRAMBORA
- there are many standards for risk management
- the ISO 31000 methodology broken down into individual steps
- TIMBUS project, http://timbusproject.net; one of the partners is SAP (Germany)
Mad talks
- open-source SW for LTP: RODA is back and is being developed within the SCAPE project; new functions, plans for further development and for building a user community
- 4 posters on emulation: emulation within KEEP, emulation for reading rooms in libraries, OPF ecosystem, registry
- TOTEM: a metadata standard for describing the technical environment for emulation
ANDS, Ross Wilkinson
- data centres in Australia: at least 3, for different areas of life
- ANDS has existed for almost 3 years; funded by the Australian government
- a huge amount of data will never be used or read by a human, only mined by automated processes
- research data must be stored and preserved, because it may no longer be possible to create them again; they must be kept so that they can be reused, so that new conclusions can be drawn from them, and so that scientists have them available
- this has to be done in cooperation; it cannot be done by a single institution alone
- who deals with the storage of scientific data in the Czech Republic? The Academy of Sciences, CESNET?
- similar data centres also exist in Great Britain
Rob Sharpe: Considerations for High Throughput Digital Preservation
Presentation by Tessella: their performance testing of ingest into SDB at FamilySearch.
- SDB has been developed since 2002, when the first customer was The National Archives (UK)
- new customer: UK Parliament
- test with FamilySearch
- 20 TB ingested per day; scanned materials; workflow with antivirus, characterisation (PRONOM, JHOVE), etc.
- 1 package is roughly 1 GB; 20,000 packages per day
- 2 Dell PowerEdge R710 servers, together costing at most GBP 20,000
- the limiting factor turned out to be the read speed of the disks on which the data are stored at the start of ingest; they therefore needed 130 parallel disks (GBP 50,000)
- storing on tape is also slow; they therefore needed 8 parallel tape writes (GBP 30,000)
- storage costs GBP 100 per TB
- 7.3 PB per year
- conclusion: writing and reading are slow; tools such as JHOVE and PRONOM are fast enough; storage costs also turned out to be high
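The figures in the list above can be cross-checked with a few lines of arithmetic. The sketch below assumes decimal units (1 TB = 10^12 bytes), a 365-day year and uninterrupted ingest; the variable names are illustrative only.

```python
TB = 10**12                      # decimal terabyte in bytes (assumption)
SECONDS_PER_DAY = 24 * 60 * 60

daily_ingest_tb = 20             # from the talk: 20 TB per day
package_size_gb = 1              # from the talk: roughly 1 GB per package
storage_cost_gbp_per_tb = 100    # from the talk: GBP 100 per TB

# Average sustained rate needed to move 20 TB in 24 hours, in MB/s.
sustained_mb_s = daily_ingest_tb * TB / SECONDS_PER_DAY / 10**6

# Packages per day at ~1 GB each, and the yearly volume in PB.
packages_per_day = daily_ingest_tb * 1000 // package_size_gb
yearly_pb = daily_ingest_tb * 365 / 1000

# Cost of storing one year of ingest once, at the quoted price per TB.
yearly_storage_cost_gbp = daily_ingest_tb * 365 * storage_cost_gbp_per_tb

print(round(sustained_mb_s, 1), packages_per_day, yearly_pb, yearly_storage_cost_gbp)
```

Under these assumptions the daily volume corresponds to an average of about 231 MB/s, 20,000 packages per day and 7.3 PB per year, which agrees with the numbers noted from the talk.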
To ingest the data from the FamilySearch project they needed to guarantee a throughput of 20 TB of data per day while keeping adequate procedures for processing the data as required by OAIS and by the client. The aim of the project was to identify the bottlenecks of ingesting large volumes of data.
Processes such as generating hashes or verifying them, identifying formats and extracting technical metadata usually require a large and, at large volumes, fast storage system. In the FamilySearch project they want to ingest into SDB (content acquisition, content preparation, ingest: fixity check, content/metadata integrity check, characterisation, i.e. format identification and validation and extraction of technical metadata) at most 700 MB per second.
The question is how to parallelise such a massive workflow efficiently while minimising costs. According to their findings, parallelisation makes it possible to get around the performance problems of tools such as DROID and JHOVE; contrary to their expectations, the overall performance of the software was not a problem. The bigger problems are in the HW, which has to be able to write fast enough.
I.e. the bottleneck was in the HW and in moving data from place to place rather than in the performance of the digital preservation tools.
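A minimal sketch of the parallelisation idea, applied to fixity checking only. It is not Tessella's or SDB's implementation; the function names and the worker count are assumptions. Threads are used because reading files is I/O-bound and `hashlib` releases the interpreter lock while hashing larger buffers.

```python
import hashlib
from concurrent.futures import ThreadPoolExecutor


def sha256_of(path, chunk_size=1024 * 1024):
    """Hash one file in fixed-size chunks so large files never sit fully in memory."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def fixity_report(paths, workers=8):
    """Hash many files concurrently and return a {path: hex digest} mapping."""
    paths = list(paths)
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return dict(zip(paths, pool.map(sha256_of, paths)))
```

Calling `fixity_report(["a.tif", "b.tif"])` on existing files returns one digest per file. As the notes stress, beyond a certain worker count the disks rather than the hashing become the limit, so adding workers only helps as long as the storage can deliver the data.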
Benefit for NK
There is no need to worry about the performance of SW such as DROID or JHOVE.
Ross King: Evolving domains, problems and solutions for LT DP
- information on SCAPE and other projects
- programme: http://cordis.europa.eu/fp7/ict/telearn-digicult/report-research-digital-preservation_en.pdf (Stephan Strodl, Petar Petrov, Andreas Rauber, Vienna University of Technology, Austria); a nice timeline of preservation projects, a white paper about the past of European DP
- funding spent on DP research is gradually growing; projects and funding by themselves solve nothing
- ARCOMEM: web archiving, socially driven web preservation model
- social web analysis
- archive enrichment
- ENSURE: evaluation between cost and value, automation of the preservation cycle; testbeds: healthcare, clinical trials, financial services
- SCAPE
- preservation planning and action workflows, and how to make them scalable
- building an infrastructure for scalable preservation actions
- development of a policy-based preservation planning tool with automated preservation watch
- 3 testbeds: WA, large-scale repositories, research data sets
- all projects will produce prototype SW
- digital lifecycle approach
- preservation planning plays a role in all of these projects, together with virtualisation
- slide with trends in DP over recent years
- Research on Digital Preservation within projects co-funded by the European Union in the ICT programme:
- ENSURE
- SCAPE
- Wf4Ever, http://www.wf4ever-project.org/about
- TIMBUS: SW is not enough; it focuses on the context of organisations; LTP is not only about objects but also about services etc.
- TOTEM
Benefit for NK
Follow projects in the area of long-term preservation of digital data. The latest EU projects such as SCAPE will speed up the development of specific tools for long-term preservation of digital data.
Record keeping in temporary command settings, Erik Borglund
- preserving documentation of crisis situations arising from the activities of the police etc.
- how to capture context: flipcharts, videos and notes can be kept; the context of analogue documents is not a problem, the problem is with digital items and conversations
- the national archive should take care of this, but it only accepts paper documents or, for example, photos from the place of the meeting
- open question: is archiving the records material the same as archiving the course of the meeting in digital form?
Web archiving session
BnF
- 200 TB of web-archived data
- 15 million ARC files
- they have to characterise and validate them, which is time-consuming
- they store them in a shared repository
- SPAR (the LTP system of the French national library) has a capacity of 16 PB
- they use JHOVE2 for characterisation and are building a module for ARC files
- they do not want to characterise and validate the content of the ARC files, only identify formats
- PREMIS in METS would be too long, so they will record only metadata at the level of the information package (AIP) that are the same for the whole package; more precisely, a property is expressed once and then only the information about which files it applies to is added, instead of repeating the information for every file
- they created a special metadata format
- i.e. they are able to ask the LTP system "give me all information packages that contain format XY" etc.; there is no need to index the metadata of the contents, which would take a long time; they take the same approach for digitised books
- different DP policies and validation levels for different types of WA data: complete harvests vs. thematic harvests
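The "express a property once, then list the files it applies to" idea can be sketched as follows. This is only an illustration of the principle, not BnF's metadata format or the SPAR query interface; the function names, package identifiers and format labels are hypothetical.

```python
from collections import defaultdict


def summarise_formats(files):
    """Group one package's (file_name, format_id) pairs so each format is stated once."""
    by_format = defaultdict(list)
    for file_name, format_id in files:
        by_format[format_id].append(file_name)
    return dict(by_format)


def packages_containing(index, format_id):
    """Answer 'which packages contain format X' from package-level summaries only."""
    return sorted(pkg for pkg, formats in index.items() if format_id in formats)


# Hypothetical package-level index: one summary per information package.
index = {
    "aip-001": summarise_formats(
        [("a.html", "text/html"), ("b.html", "text/html"), ("c.gif", "image/gif")]
    ),
    "aip-002": summarise_formats([("d.pdf", "application/pdf")]),
}

print(packages_containing(index, "text/html"))
```

The query touches only one small summary per package, which is the reason given in the talk for not indexing the metadata of every harvested file.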
NL NZ
- 2 harvests, 20 TB altogether
- they are dealing with metadata: how much metadata is a lot and how much is too little
- the library's policy says that as much metadata as possible must be stored, but that would be a problem in terms of the size of the metadata
- for selective web harvesting they have a finished workflow: WCT; everything is catalogued
IA
- 16 billion URLs
- the oldest from 1996
- growth is 3 TB per day, 1 PB per year
Euan Cochrane, Dirk von Suchodoletz: Replicating Installed Application and Information Environments onto Emulated or Virtualized Hardware
- capturing and preserving a complete environment on emulated HW
- e.g. taking the prime minister's desktop environment and storing it in an archive
- problems with rendering
- computer forensics
- an option for preserving scientific data and records
- it is all about how to replicate an HDD and run the environment stored on it in a virtual environment
- solution: they pulled the HDDs out of several old PCs > identify the HW requirements (HDD analysis > automatic estimate of the requirements; this is part of every PC environment) > choose the emulation/virtualisation SW (tool registry such as TOTEM from the KEEP project) > convert the HDD into a disk image suitable for emulation > try to boot the disk image on the emulated HW > add drivers
- problems with licences, personal data protection and authenticity (20 % of things change: colours etc.)
- QEMU, SPARC processor emulator
Klaus Rechert: Remote Emulation for Migration Services in a Distributed Preservation Framework
Use of emulation as a migration tool.
Migration tools for certain formats are often not available. We put the digital object into an emulated environment (virtual machine), where we see it within the emulated system; we can open it in the original or another suitable application, save it in a different format and store it back in the virtual machine.
Bram Lohman: Emulation as a Business Solution: the KEEP Emulation Framework (KEEP project)
- emulation framework: 7 emulators, 6 platforms (x86, Amiga, etc.), 23 file formats
- a solution for managing emulation tools
- setup of emulation processes
- an environment that contains emulators; if we load an application or a file into it, it should run as in the original environment
- the environment also contains a tool that shows, for a file, what format it is and what environment is needed to run it, based on PRONOM; the environment can be prepared straight away and the file run in it; it loads the SW image from the OPF application database, which is being built
Geoffrey Brown: Developing Virtual CD-ROM Collections: The Voyager Company Publications
- publications of one particular publisher on CD, interactive applications for Mac; of the 200 published, only about 50 are available today
- emulation on today's systems
- HDD snapshot directly in the emulator, i.e. it takes one click and is very fast
- SheepShaver emulator
Evaluation of a large Danish migration project
- Before 1998 they had no formats laid down by law.
- Between 2005 and 2008 they introduced standards.
- The evaluation concerns the adopted standards and the migration to them in the national archive.
- The evaluation was done for the funder.
- Between 2005 and 2008 they spent 30 person-years on migration; they had 10-15 people on it; 190 thousand USD was invested; total costs 26 million USD.
- It is not much data; what they actually migrated was about 1777 GB.
- Different parts of the archive: tapes, population data, data on CD-R, registries and electronically populated data. They could not read all the files, especially those on tapes. 5 different types of tape. Some had to be rescued at great expense.
- Total cost of producing the preservation standard: 10 man-years, 12 thousand USD, including the manual and implementation recommendations.
- Pilot: project planning and management, and verifying the information packages.
- The aim of the pilot was to obtain a better budget and project plan.
- Some problematic data in old formats, such as old databases, need smart people doing repetitive work that takes a long time; they needed good knowledge management for it to be efficient.
- Migration method: they wrote requirements for the tool and a description of how manual migration should be done.
- Data preparation (restructuring the data and registering the metadata of the IPs) and preparing documentation of the migrated IPs.
- Software development: in-house development.
- They needed 50 person-years per 1
Conclusions
- Migrating standard data is cheaper, migrating from some standard tapes is cheaper, etc.
- Most of the costs (80 %) fell on non-standardised data, in producing the migration software. Developing a tool for migrating heterogeneous or non-standard data is the most expensive part.
- What they learned: the old data had not been analysed sufficiently.
- Project management was loose; they lost money.
- Knowledge management: a good description of the old data and of all their types, generations, locations, etc. does not exist at our institution, and we will have difficulties with that; migrating old data in NK will be problematic.
Angela Dappert: robust migration workflow for offline media
- What is an archival object (nice slide): a CD is not an archival object, for them it is a hand-held carrier; better is a bit-stable object, which can have a backup etc., leading to an archival object that has additional metadata and logical preservation
- A CD is not searchable, cannot be replicated easily, has a large manual overhead, and the rendering technology becomes obsolete very quickly
- Endangered Archives project: optical discs, CD-R, external HDs, tapes; 67 terabytes in total
- The offline hand-held carriers in the Endangered Archives project were very varied; they contained data with DRM, under copyright, and a number of similar problems
- Options they chose between for each data source:
- Disk image: a single file that contains everything on the carrier
- Or extraction of only some of the files
- How important is the carrier itself? We need some information about it; it may hold traces of deleted data and we may want to keep them. They made disk images of everything possible: hybrid DVDs, audio discs that also carried data, etc.
- Which disk image should they use? Not a single disk image format for all data, but a specific disk image format for each type
- They did it with robots (disk copying robots); large-scale disk copying robots sometimes could not be used: they are good at producing CDs but not at ripping data from CDs
- They built their own application with disk stacks and some smaller robots; the robots used LIFO or FIFO, and in the end they used FIFO, because LIFO had problems with the CD picker
At KB they thought through a fairly complex workflow, how to describe it, etc.
Each robot had its own PC.
They had problems with a number of things; see the presentation.
They did not find good SW for managing the images, only command-line tools, with which non-technical staff would make a lot of mistakes.
It takes a lot of people before the content gets online.
They must be well trained and flexible, but also able to do tedious jobs: systematic, patient.
NOTE: important for NK, where the transfer of data from discs will also have to be addressed and has already been addressed.
KEEP project, Antonio Ciuffreda: towards an integrated migration environment
- Disk transfer tool: converts a disk into an image file. It also contains additional metadata about the file system, the MD5 of the file, etc.
- KEEP is producing the Transfer Tool Framework
- Magnetic media: disk transfer tools for floppy disks, commercial and open source
- Disk2FDI: commercial DOS tool; a very precise floppy disk image; it takes 1 hour and the whole result is very large, ten times larger than the floppy disk itself. Tested on about 2260 disks; they tested emulation
- Catweasel: commercial tool, a PCI card; runs on Linux and Win XP, has a GUI. High error rate but faster; the quality of the image file was low
- Nibtools: free tool; G64 and D64, covers only C64; DOS, Win, Linux, but command line. Needs a Commodore disk drive and special cables. They tested a few disks; about half of them then did not work in the emulator
- Optical media: they used 5 transfer tools, all on the same CDs, DVDs and games
1. Alcohol 120%: commercial; can bypass DRM etc.; supports Win systems. 12 of 13 worked; 2 MB per second
2. DAEMON Tools: commercial; several types of image files (ISO, MDS, MDF); supports Win; three of 13 did not work
3. CloneCD: commercial; uses IMG or ISO; bypasses SafeDisc 3 protection; supports Win, has a GUI; 11 of 13 were OK
4. BlindWrite: commercial; supports DVD, Blu-ray, WM, Xbox and other special discs. Generates ISO and some proprietary image formats; one did not work
5. ImgBurn: does not read subchannel information into the image file (a film cannot be seeked, etc.); it is open source; generates DVD, BIN, CUE, IMG; Win and Linux; 4 did not work; it is fast
Conclusions
For magnetic media, commercial and non-commercial tools do not differ in performance; Disk2FDI is precise but very slow. KEEP will use NibTools.
Optical media: copy protection has to be considered; the tools perform similarly; there will always be errors in the images, between 30 and 10 per cent; BlindWrite handles game discs (Xbox etc.). KEEP will use ImgBurn because it is open source.
For complex images BlindWrite is better.
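The notes say that the disk transfer tool stores metadata such as the file system and an MD5 next to the image file. A minimal sketch of that idea follows; it is not the KEEP Transfer Tool Framework, and the function names, the JSON layout and the `.json` sidecar naming are assumptions made for illustration.

```python
import hashlib
import json
import os


def describe_image(image_path, file_system, chunk_size=1024 * 1024):
    """Build a small metadata record (name, size, file system, MD5) for a disk image."""
    md5 = hashlib.md5()
    with open(image_path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            md5.update(chunk)
    return {
        "image": os.path.basename(image_path),
        "size_bytes": os.path.getsize(image_path),
        "file_system": file_system,  # supplied by the operator, not detected here
        "md5": md5.hexdigest(),
    }


def write_sidecar(image_path, file_system):
    """Write the record as a JSON file next to the image and return its path."""
    record = describe_image(image_path, file_system)
    sidecar_path = image_path + ".json"
    with open(sidecar_path, "w", encoding="utf-8") as handle:
        json.dump(record, handle, indent=2)
    return sidecar_path
```

Recomputing the MD5 later and comparing it with the stored value shows whether the image has changed since the transfer, whichever imaging tool produced it.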
Benefit for NK
Consider whether it would be appropriate for NK to actually run a project to migrate the content of CDs and DVDs to online media. The presentations here give concrete experience with robotic processing and show what problems the teams had with designing the workflow, choosing the type of ISO image, etc.
BnF: web archiving
They have three layers:
- Harvest definition: collection
- Harvest instance: crawling metadata
- ARC files
A collection will be, for example, the selective "elections 2000" collection, then the individual harvest instances and then the ARC files. They collect logs, configuration and reports. These are also stored in an ARC file, a special metadata ARC for each crawling instance.
PREMIS
Object, agent, event
Objects: 1. ARC files and metadata ARCs; 2. harvest instances
Harvest event in PREMIS: event = creation of content files
Events: reports as extensions of the event: host report and harvest report
Agents: administrator, SW, institutions and organisations that perform the harvest
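The three layers and the object/agent/event split noted above can be sketched as plain data classes. This is a reading aid only, not the PREMIS schema and not BnF's implementation; all class names, field names and sample values are hypothetical.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Agent:
    name: str
    kind: str  # e.g. "administrator", "software", "organisation"


@dataclass
class HarvestEvent:
    event_type: str                                    # e.g. "creation of content files"
    agents: List[Agent]
    reports: List[str] = field(default_factory=list)   # event extensions


@dataclass
class HarvestInstance:
    identifier: str
    arc_files: List[str]   # content ARC files produced by the crawl
    metadata_arc: str      # ARC holding logs, configuration and reports
    event: HarvestEvent


@dataclass
class HarvestDefinition:
    collection: str
    instances: List[HarvestInstance] = field(default_factory=list)


# Hypothetical example: one collection with a single harvest instance.
crawler = Agent("crawler", "software")
operator = Agent("web archive team", "organisation")
event = HarvestEvent(
    "creation of content files",
    agents=[crawler, operator],
    reports=["host report", "harvest report"],
)
instance = HarvestInstance(
    "harvest-0001", ["part-1.arc", "part-2.arc"], "harvest-0001-metadata.arc", event
)
definition = HarvestDefinition("elections 2000", [instance])
print(definition.collection, len(definition.instances[0].arc_files))
```

Keeping the reports on the event and the logs in a separate metadata ARC mirrors the separation described in the talk between what was harvested and how the harvest was performed.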
ContainerMD
http://bibnum.bnf.fr/containerMD-v1/documentation/containerMD-v1.html
special metadata for items from the web archive
http://bibnum.bnf.fr/containerMD-v1
Different SLAs for different types of material and for data from different harvests; in the shared repository they expect various benefits; skills for different formats do not have to be duplicated within the institution.
Next year a JHOVE2 module for WARC should also exist.
Memento: meta search engine
Benefit for NK
Their web archiving model could be used in NK.
Cost models: Danish national library + TU Wien
- Stephan Strodl, TU Wien: they have their own cost model, but only a small-scale one, apparently for the cost of automated preservation actions
- Danish national library: they built their own model, which should be universal and usable anywhere
- They measured the cost of submission according to the PAIMAS standard
- When calculating costs they use OAIS and PAIMAS, map activities onto these models and then estimate the costs of the processes accordingly
- costmodelfordigitalpreservation.dk
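The approach of mapping activities onto OAIS and PAIMAS and then summing their costs can be illustrated with a toy calculation. It does not reproduce the Danish cost model; the activities, hours and hourly rates below are invented for the example, and only the phase names follow OAIS functional entities and PAIMAS phases.

```python
from collections import defaultdict

# (activity, reference-model phase, estimated hours, hourly rate); all values hypothetical.
ACTIVITIES = [
    ("negotiate submission agreement", "PAIMAS: Formal Definition", 40, 30.0),
    ("transfer and validate SIP", "PAIMAS: Transfer and Validation", 25, 30.0),
    ("generate AIP", "OAIS: Ingest", 60, 35.0),
    ("fixity monitoring", "OAIS: Archival Storage", 15, 35.0),
]


def cost_by_phase(activities):
    """Sum hours * rate for every activity mapped to the same reference-model phase."""
    totals = defaultdict(float)
    for _name, phase, hours, rate in activities:
        totals[phase] += hours * rate
    return dict(totals)


totals = cost_by_phase(ACTIVITIES)
for phase, cost in totals.items():
    print(f"{phase}: {cost:.2f}")
print("total:", sum(totals.values()))
```

Because the costs are grouped by phases of a shared reference model, estimates from different institutions can in principle be compared phase by phase, which is the universality the Danish model aims for.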
Benefit for NK
Relevant to project 0136, which dealt with options for estimating the costs of long-term storage.
Meet RODA, a Full-Fledged Digital Repository for Long-Term Preservation
- Originally a project of the Portuguese national archive; we have been following it for several years. The RODA system is now supported by an independent company, which also partly continues its development. So far RODA supports only an archival metadata format (EAD), but further development should cover library formats as well
- RODA is now part of the SCAPE project, where the system can be developed further and scaled for use in mass production
- http://redmine.keep.pt/projects/roda-public
Benefit for NK
Follow further development, possibly also for the INCAD + KNAV projects; for developing LTP for smaller institutions this could be an interesting alternative in the future.
Foreign Business Trip Report
Name and surname of the participant: Mgr. Andrea Fojtů (AF)
Workplace (per organisational structure): 1.5 Department of Long-term Preservation of Digital Data (ODODD)
Workplace (position): 1.5.2 Department of Digital Repository Content Management
Reason for the trip: attending the conference Aligning National Approaches to Digital Preservation
Place (city): Tallinn
Place (country): Estonia
Dates (from-to): 22.-26. 5. 2011
Detailed schedule:
Monday 23. 5.: conference registration; guided tour of the National Library; Keynote Address by Laura Campbell, Library of Congress, USA; Panel 1: Technical Alignment; Panel 2: Organizational Alignment
Tuesday 24. 5.: Keynote Address by Gunnar Sahlin, National Library of Sweden; Panel 3: Standards Alignment; Panel 4: Legal Alignment; Breakout Sessions for panels 3 & 4
Wednesday 25. 5.: Panel 5: Education Alignment; Panel 6: Economic Alignment; Breakout Sessions for panels 5 & 6; Synthesis; Closing remarks
Fellow travellers from NK: PhDr. Bohdana Stoklasová (BS), Ing. Tomáš Svoboda (TS)
Funding: IOP NDK
Trip objectives: attending a conference with international participation; obtaining contacts in the area of long-term preservation of digital documents and electronic legal deposit; a more detailed insight into the issues of long-term preservation of digital documents, (especially) in national libraries
Fulfilment of the trip objectives (specifically): The conclusions of the conference Aligning National Approaches to Digital Preservation largely mirror the conclusions of the workshop The Future of the Past: Shaping new visions for EU research in digital preservation (report available at http://cordis.europa.eu/fp7/ict/telearn-digicult/future-of-the-past_en.pdf), e.g. regarding the missing economic model for the commercial sector, which would treat long-term preservation as an integral part of all its processes. Contact was established with staff of the national libraries of Estonia and Finland (staff working on web archiving and on long-term preservation in general).
Programme and further details: The main aim of the conference was to align national approaches to long-term preservation of digital documents across all areas, from technical, organisational and educational to standardisation, economic and financial.
Materials brought back: conference programme, leaflets of the exhibiting companies (Tessella, Equella, Guardtime) + other materials, notes
Date of report submission: 8. 6. 2011
Signature of the person submitting the report:
Signature of the superior:
Posted on the intranet
Received by the international department
Appendix to this report: Conference notes in English
Appendix: Conference notes in English
Exploring What We Can Do Together: Strategic Alliance for International Collaboration, Laura Campbell
- 185 digital preservation partners in more than 25 countries (education, research, LAM)
- strategic goals: National Content Stewardship Network (national digital collection, technical architecture, public policy, outreach)
- NDIIPP Content Domain Map: a mind map of geospatial, audiovisual, image & text and web content
- then & now: cognitive surplus vs. digital libraries / digital preservation
- solution framework: actively working together, special interest groups, establishing a common index, international digital collection (freely available)
PANEL 1: Technical Alignment (The role of testing), Prof. Dr. Michael Seadle (Panel Head)
- to collaborate on requiring and implementing rigorous and independent tests
DNB Contribution to the Tallinn Alignment, Sabine Schrimpf
- key theme is infrastructure
- a network of hardware and software that permits the operation of application SW
- the question of interoperability is crucial (standards, technical specifications)
- SW elements
- components of the DP infrastructure were compared to pallets on railroads
o source: PARSE.Insight Roadmap 2020
- persistent ID resolvers, certification process
- kopal (ingest, koLibRI)
- nestor (German Network of Expertise)
- DP4lib: Digital Preservation as a Service; reduce dependency between components
o redundant storage at different locations
o koLibRI modules
- LuKII: set up as an economical LOCKSS network in Germany
- SHAMAN
- APARSEN: wants to bring coherence and cohesion to digital preservation research
o trends in DP research projects
o modular DP systems
o distributed as SOA
o elimination of technology dependencies
EDINA: the UK LOCKSS Alliance, Adam Rusbridge
- EDINA offers underlying technical support & coordination
- threats to digital stewardship:
o failure (media, HW, SW, network, format obsolescence, natural disaster, economic/organizational failure)
o attack (insider/outsider)
o operator error
- source: Requirement to Digital Preservation
- projects PEPRS and PECAN help identify coverage and requirements for DP
Public testing, Michael Seadle
- traditional physical archiving relies heavily on trusted institutions
- distrust, not trust, needs to be the basis of digital archiving; testing therefore plays a key role
- goals of testing: demonstrate functionality, reveal weaknesses, provide data for planning improvements
- key issues for testing: integrity; authenticity (can the origin or genuineness be shown?); usability (can migration/emulation be demonstrated?); access; financial integrity
- DrWho (drwho1.com)
- bit-stream testing is the most important; authenticity and usability may be impaired
o the type of storage media, the number of copies and the frequency of checking and replacement get us to the relevant results
o no reliable metrics exist, however (what is an acceptable loss, etc.)
- without well-documented, peer-reviewed, publicly available test results, librarians are buying archiving systems on faith
Presentation without a title, Andy Rauber
- evaluation vs. testing vs. benchmarking
- DP testing today is evaluation rather than testing, and far from benchmarking (a few tests, but nowhere near a definition of benchmarks)
o existing evaluations are not repeatable
o focus on the simple things
o building the frameworks before having clear test scenarios
- necessary to move towards comparative benchmarking
- what is needed: a commitment that we want a culture of benchmarking and comparative evaluation; an understanding of what we want to benchmark; benchmark data + ground truth; measurement scales and measures that remain constant; knowledge bases to collect these
Organizing digital preservation on an international level, Michelle Gallinger (NDIIPP)
- focus on a national DP agenda
- community driven, action oriented (National Digital Stewardship Alliance)
o present a distributed national digital collection for the benefit of citizens
The European Research Arena, David Giaretta
- technologies: GEANT, EGEE/EGI
- EU research projects: TIMBUS, BLOG4EVER, SCAPE, ENSURE, APARSEN, ARCOMEM, WF4EVER
o SCIDIP-ES (2011-2014)
- Alliance for Permanent Access (APA) formed as a legal entity 3 years ago
o opportunities for networking
- ISO 16363: Audit and Certification of Trustworthy Digital Repositories
- ISO 16919: Requirements for Bodies Providing Audit and Certification
Observation from the MetaArchive Cooperative program, Martin Halbert
- distributed DP programs differ from other programs
o replication of content, distribution of these replicated copies to distinct geographical locations, and network organization to connect these replicated copies
- MetaArchive established in 2003, funded by NDIIPP
o seeks to foster broader awareness of digital preservation issues
- IIPC members are all institutions that focus on WA
o 39 members: national and university libraries + other organizations (Internet Archive)
o ISO standard WARC format for web archives + Heritrix and NutchWAX
o growing membership (Africa & South America)
DAY 2
Keynote address: International and National Collaboration in the Digital Age, Gunnar Sahlin
- 2012: a new law for e-deposit
- Samsök (search together) in 2005 (upgrade, new system)
- SwePub and long-term preservation
- consortium of the Swedish research libraries for licensing e-journals and databases (ICOLC)
- Open Access and e-publishing (all universities have their repositories for e-pubs)
- NL aggregator for Europeana: TEL, Apres, Athena, EUscreen
o common system for the preservation of digital materials
o common search portal for materials from the Swedish National Library and Swedish National Archive
Raivo Ruusalepp
- standards: RAC, DSA, CIDOC (CRM), PAIMAS, ISAD(G), DDI
- use of information security standards for digital preservation
- information security: administrative and technical (physical = data security vs. IT = communication)
- company-implemented security measurement with typical cyber crime scenarios
- survey of security
o provision for information security in national legislation and development plans (12 of the respondents: ISO 27000 series; only 2 formal audits, the rest are looking into it; ½ of the respondents do not use standards or formal measurements to control information security)
o IT & disaster plan: 65 (data recovery from the off-site location tested: 0)
- alignment
o better use of community standards for information security and preservation
o agreement on security requirements
Standards-based approach to preservation planning, Matthew Woollard
- ISO 27001: very expensive implementation (100 000 pounds)
- Basic: Data Seal of Approval Guidelines (helps understand your business better)
- Audit and Certification of Trustworthy Digital Repositories
- Memorandum of Understanding to Create a European Framework
- ISO 16363 external; DSA or ISO 16363 self-audit
Best Practices & Standards, Bram van der Werf
- self-assessment: trust
- audits/certification: trust
- ISO 30300 (draft): Record Management
PANEL 4: Legal Alignment
Legal deposit & Web Archiving, Adrienne Muir
- legal deposit provisions: purpose, scope, deposit mechanisms, roles & responsibilities & liability, access provisions, sanctions
- implementation: definitions, scope (offline/online, freely available/pay wall), technology-neutral, incremental
- legal deposit vs. voluntary (interim/hybrid approach, model agreements and licenses, flexibility)
- other legal issues: intellectual property rights, preservation, access, unlawful material, privacy/data protection
- voluntary approaches have disadvantages but may be necessary and can be useful
Breakout Session
- standards bring along alignment by themselves, but only if you use them to the full, not halfway
- depends on community (users); enforceable or voluntary compliance (standards)
- next steps for standard alignment
o corpora as benchmarks
o export/import completeness
o educational standards
o validation tools
o accredited training courses to accredit auditors
o framework standards
DAY 3
PANEL 5: Educational Alignment
- key elements of the DCC Curation Lifecycle Model
- framework for the education alignment (USA: grad programs for digital preservation, new models for grad programs, internship programs (diverse knowledge), workshops)
o sharing tools
o national programs
- issues related to digital preservation
o nature of costs and business models
o strategies for selection & appraisal
o ground roles and responsibilities
o effectiveness and demand for services
- focus of the panel: factors influencing the actual sustainability of a digital archive
- 2 considerations: collaboration + user demand
- challenges + gaps: span national boundaries, public + private funding, education, exportation, DP certification, competition, funding gaps, policy, selection criteria, roles & responsibilities, standards
- Magazzini Digitali: e-legal deposit in Italy
- PADI: a failed project (discontinued in 2010)
Presentation without a title, Neil Grindley
- how much does it cost to manage information? what institutional financial strategies are required to facilitate effective preservation? what general economic frameworks are required to enable information to persist and be accessible?
- JISC 2010 Infrastructure for Education and Research Programme
- archival storage and preservation activities constitute a very small proportion of the overall costs: 1531 access 55 outreach acquisition ingest
o approx. 333 Euros for a set of 1000 records
- KRDS2, p. 83: future tool development supporting automation of ingest
Sustainable Preservation in North America: ADPNet & Friends, Aaron Trehub
- solution: distributed digital preservation (in at least 3 copies vs. LOCKSS 6 copies)
- DDP + LOCKSS = PLN; open SW developed at Stanford
- MetaArchive, COPPUL (Canada), ADPNet (wwwadpnorg)
Salt Lake City, USA
Dates (from-to) -
14.5. departure Prague - - 16.5. PREMIS workshop -+ 17-19.5. Archiving 2011 conference, 20.5. departure SLC - -
15.5. departure Washington-Detroit-Salt Lake City, 15.5. workshops T1D Color in Image Capture Archiving and T2A Scanner & Camera Imaging Performance: Benchmarking, Compliance and Workflow Monitoring, 17-19.5. Archiving 2011 conference
NK: Archiving conference, IOP-NDK
0136
NA 6.6.2011
Date 6.6.2011 Signature Date Signature
Date Signature
PREMIS tutorial
----------------------------------------------------------
Implementation of a High Performance Architecture for Managing and Storing Web-Harvested Collections, Michael Smorul and Joseph JaJa, University of Maryland (USA)
httpswikiumiacsumdeduadaptimages66bArchiving11-smorulpdf
- web archiving
- WARC Manager application
Using Tape for Large-Scale Digital Preservation Gary Wright FamilySearch (USA)
- (The Church of Jesus Christ of Latter-day Saints)
-
NDK - DRPS, Ingest Tools; storage type: storage grid, NetApp FAS3170
information lifecycle management; optimization of the storage layer between the Grid and Rosetta
- human error - HW errors - data integrity validation - maximization - integrity verification
metadata conten
Moving On When it is Time to Re-Archive Michael Selway Quantum Corporation (USA) CHW) D
there is no other way with HDDs
Data migration: how to approach the migration
- HDD - - years
and could
- - - - -
split migration (full split migration)
- - - - potaz
FamilySearch An End-to-End Process for Scanning Characterizing Preserving and Providing Access to Very Large Collections of Vital Records Tom Creighton FamilySearch (USA) Jonathan Tilbury Tessella plc (UK) and Mark Evans Tessella Inc (USA)
- at FamilySearch: microfilms from the mid-19th century - - - - - -
- DPS je jejich s - -
The Audit and Certification of FDsys
- FDsys: audit
TRAC httpwwwgpogovfdsys
- - TRAC: HathiTrust, UNT, Portico, MetaArchive, Chronopolis
- compliance scale 1- -
How Long is Long-Term Data Storage (Focal) Barry M Lunt Brigham Young University and Douglas Hansen Wayne Rust and Mark Worthington Millenniata Inc (USA)
- - FLASH technology 10- full of electrons
electron leakage and
-
and yet errors appeared
- Library 1 21 - Library 2 18
- 1050 years - HDD 1-7 years -
Quality Assurance of Digital Information in Long-Term Digital Preservation University of Technology (Sweden)
- significant properties - in documents from -
Towards Interoperable Preservation Repositories Repository Exchange Package Use Cases and Best Practices Joseph Pawletko New York University and Priscilla Caplan Florida Center for Library Automation (USA)
- TIPR
- - - TIPR
-
- RXP: METS and PREMIS semantics; metadata files not inside the METS but alongside it
- - succession - - - - disaster recovery and
as AIP - SW migration - - diversification - data migration -
SARKK Comprehensive Digital Archive Services for Finnish Municipalities
-Savon Tietohallinto Oy (Finland)
- firma kterou v ICT long-
- - - preserv planning funkcionalitou -
-
Magnetic Tape Technology economic advantages for preservation Gary Francis Oracle USA
- Oracle presentation - - Clipper Group 2010, "In Search of the Long-Term Archiving Solution: Tape Delivers Significant TCO Advantage over Disk" - 5 TB per cartridge (Sun/Oracle)
-----------------------------------------
Color In Digital Preservation Robert Buckley University of RochesterNewMarket Imaging Steven Puglia National Archives and Records Administration and Michael Stelmach Library of Congress (USA)
-
atd -
o color gamut
AdobeRGB, ProPhoto RGB
- and color reproduction
Multispectral Image Archiving of Watermarks in Historical Papers Peter Meinlschmidt Wilhelm-Klauditz-Institut Fraunhofer-Institute for Wood Research and Volker
Braunschweig (Germany)
- Reflected light - Transmitted light
-spectral imaging -
o
o o o o
Implementing a Quality Assurance Program for Monitoring Scanner Performance Michael J Horsley and John T Berezich National Archives and Records Administration (USA) DAITSS
- -
microfilm, scan - NARA 2004 Guidelines
httpsdocsgooglecomviewerurl=http3A2F2Fwwwarchivesgov2Fpreservation2Ftechnical2Fguidelinespdf
- Metamorfoze - etc., see slides - quantitative performance slides - web-based database, SharePoint -
Preservation in a Digital Age Jay Verkler FamilySearch (USA)
- with Tessella for SDB, will be - - - data loss is intrinsic > - - -
- preservation as a service
Curation of the End-of-Term Web Archive Classification and Metrics Kathleen Murray Lauren Ko and Mark Phillips University of North Texas (USA)
- - EOTCD archive httpresearchlibraryuntedueotcdwikiMain_Page - - 16 TB of data - - (not WARCs)
then joined together
DAITSS Grows Up Migrating to a Second Generation Preservation System (Focal) Priscilla Caplan and Carol Chou Florida Center for Library Automation (USA)
- DAITSS 1 could not be installed anywhere else -) -
instructions
- in processes
- - PSP
- refresh function - disseminate: do a refresh and export the new AIP as a DIP - withdraw: remove the AIP from storage, retaining provenance
-
1
- - 80TB -
A Community Driven Micro Services Architecture Supporting Long Term Digital Preservation Mark Evans and Bill Steel Tessella Inc (USA) and Robert Sharpe James Carr Alan Gairey and Jonathan Tilbury Tessella plc (UK)
with NDIIPP, micro-services
joined into a process; they can then be arbitrarily
- - - -
-
workflow - -
- bit-stream - cloud (S3, httpawsamazoncoms3); SaaS functionality will come soon, multi-tenancy
functionality, policy etc.
Note from the USA
44 TB of SIPs in the test (10 MB JP2 files)
SDB community - - founded 2008 - -
System
Lisa LaPlant and Blake Edwards US Government Printing Office (USA) FDsys OAIS dig MODS - z
<part>2</part>
DMD data model definition z
httpwwwgpogovfdsyssearchpagedetailsactionst=pragueampgranuleId=amppackageId=DCPD-200900228
----------------------------------------- Preservation Starts from the Beginning Michael Wash US Department of Transportation (USA)
Autographic Kodak 1916 na druhou
following the No. 3A Autographic Kodak Special of 1916, which was the first rangefinder camera. It had a Kodak Anastigmat f/6.3 lens and a Kodamatic shutter with speeds from 1/2 to 1/200 sec, plus bulb and time mode. httpcamerapediawikiacomwikiNo_1A_Autographic_Kodak_Special Kodak Advantix
zastavil
Colorite: A Flexible Cross-Platform Software Solution for Automatic Image Quality Analysis Using Arbitrary Targets, Henrik Johansson, National Library of Sweden (Sweden)
- a - -
o the target is detected automatically so that this process
o TIFF, JPEG, JP2, PNG o o state-of-the-art feature-based image matching algorithm, ImageMagick
o Alg
o o can be switched off in the future to speed up the process for BATCH processing
o
Henrika Johanssona What if the Image Quality Analysis Rates My D Dietmar Wueller Image Engineering (Germany)
- Golden thread - UTT target -
o
-lens reflex
Establishing Resolution Requirements for Digitizing Transmissive Content A Use Case Approach Michael Stelmach Library of Congress Don Williams Image Science Associates LLC and Steven Puglia National Archives and Records Administration (USA)
-
- based on the 10 SFR limiting resolution criterion: how much of the image information will be captured
o 1st half of the 20th century: 1200-1600 PPI o 2nd half of the 20th century: up to 2800 PPI o o
Digitise More Pay Less Optimising the Workprocess for both Heritage Institute and Imaging Provider Olaf Slijkhuis Pictura Imaginis (the Netherlands)
- ument kvalita parametry atd - -
- trolu
obrazu
Discussion with FamilySearch, 20.5.2011 -------
also on digital preservation issues
will have to provide free of charge
and synchronize
an effort on their part to cooperate with archives and libraries
CO-FINANCED FROM EU STRUCTURAL FUNDS (EUROPEAN REGIONAL DEVELOPMENT FUND) THROUGH THE IOP
Business trip report
Project: Creation of the National Digital Library
CZ 10611000706386
Name and surname of the participant: Jan Hutař
Workplace according to the organizational structure: ODF 81
Position: head of department
Reason for the trip: attendance at the iPRES 2011 conference
Place (city): Singapore
Place (country): Singapore
Dates (from-to): 30.10.-5.11.2011
Detailed schedule: 30-31.10. flight Prague > Dubai > Singapore
1.11. start of the conference, tutorials
2.11.-4.11. conference
4-5.11. return flight Singapore > Dubai > Prague
Fellow travellers from NK: Mgr. Marek Melichar (funded from project 0136)
Funding: IOP, Creation of the National Digital Library
Relation to the project: obtaining new information on digital preservation and on projects in other libraries; consultations with colleagues and companies
Trip objectives: see relation to the project; use all outputs for the planning and running of the NDK project and for future solutions to digital preservation issues at NK/NDK
Fulfilment of the trip objectives: fulfilled; see the detailed notes below and the proceedings on SPS
Further, more detailed information: SUMMARY AND BENEFIT FOR THE NDK PROJECT
a noticeable rise of emulation as an approach to long-term preservation (in past years it was migration) > it seems the two approaches will complement each other
a shift towards the preservation of complex data, databases etc.; NK does not address this yet
many contributions usable at NK and in the NDK (web archiving and preservation at the National Library of France, audits, emulation, information on the SDB system (Tessella) and on the RODA system, certification, see Rouchon, etc.)
information on problems and solutions when using tools such as JHOVE, PRONOM etc. in a real environment
2 papers on backing up optical discs, a current problem at NK as well; ideally follow the procedures described in the proceedings
a clear need for international cooperation and for adherence to standards so that such cooperation is possible
for more detail see below
Publicity support for the project: NA
Related materials
Material: Storage location
conference proceedings: SPS, folder with business trip reports
Date of report submission: 15.11.2011
Signature of the submitter
Date Signature
Signature of the superior 15.11.2011
Posted on the intranet
Accepted at the international department
Seamus Ross: digital curation and preservation
- preserving data sets: use of statistical databases and research data vs. preservation of textual information
- personal data are no longer just photos in a box (Flickr etc.)
- banks: lots of personal data, usable in the future by historians; they need to be preserved, and institutions are doing so too
- see mckinseycom big data full report pdf
- why do DP, then (slide 16): future generations expect it; for historians and scientists, so that they have some sources; a message about the present for the future; an information ecosystem to enable storytelling
- emphasis shifts from the preservation of textual information to the preservation of complex databases
A capability model for DP, Ch. Becker et al.
- research within the SHAMAN and SCAPE projects
- SoS: systems of systems
- 3 kinds of systems (UML)
- DPS as a functional requirement
- SoS: business system, a system within a system; data then flow into the DPS
- DPS where DP is not a functional requirement but the system does it anyway (DP-ready system): a business system with DP functionality
- but how to get DP into enterprise systems? a model for implementing DP in any system; within the SHAMAN project: a capability-based reference architecture
- governance, business and technical (operation) capability: a basis for decisions and for assessing the current state
- capability maturity model (CMM): processes, assessment and improvement, as in SW development
Olivier Rouchon: certification and quality at CINES
- they store theses, digitized items, multimedia, documents, scientific data sets
- a data centre for the whole of France
- they have format experts (XML); 11 people
- 15 TB of data
- certification: national law; CINES is the national centre for DP of theses; they have a department, people and money for it; for the procedure and preparations see below
- preparation for certification: testing DRAMBORA, DSA, TRAC, ISO 16363 and ISO 16919
- step: 2009 DRAMBORA audit; 2 risk reviews per year, checking how the risks are being resolved
- step: formalization of business processes, 14 processes according to ISO 9001
- management, operational and support processes (presented at iPRES 2010)
- 2009: external pre-audit, 2 people, 19 man-days, based on all available standards, by means of documentation review and interviews
- 2010: SIAF audit, 4 months; carried out by the National Archives of France for every archive that stores public data; they audit every 3 years; the report had 800 pages
- 2010: Data Seal of Approval, part of the EU framework for audit and certification of trusted repositories (MoU between the three certification activities)
- 2011: within the APARSEN project they also did ISO 16363; together with DANS and UKDA they went through that audit
- first internal, January and April 2011 (60 man-days), then external, 12 experts (KB, BL, NASA etc.) in June 2011 (3 days)
MoU: DSA > ISO 16363 as the second step (internal) > ISO 16363 extended
the audit gets faster the more of them you do, i.e. if it is done regularly it is not so time-consuming
The National Library of New Zealand passed TRAC certification in autumn 2011
Andreas Rauber: impact of preservation actions on the repository
- what happens to the repository itself
- repository simulation: RepoSim
- for analysis and for testing migration: what happens when files grow, what if we have more format types in the repository, etc.
- RepoSim: a flexible simulator, irregular patterns
- so far an internal version (Hibernate, Java, MySQL)
- one can specify which formats it accepts, their description, ingest settings, hypothetical tools (mainly for migration), and rules for preservation actions (migration to which format, which versions, which files, how many times; rules + filters)
- a virtual migration can be run; it produces graphs that say how long it will take, etc.
- what to migrate, how, to what and over what period: it runs virtually and we see the result
- good for IT and HW planning
- good for planning various scenarios, comparison with the expected development, planning HW development and investments
- still to be completed: the option to specify deletion policies, reports, etc.
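The idea of a "virtual migration" noted for RepoSim can be illustrated with a very small sketch. This is not RepoSim itself (which the notes describe only as an internal Java/Hibernate/MySQL tool); the class name, function name and the per-file tool speed below are hypothetical and chosen purely for illustration.

```python
from dataclasses import dataclass


@dataclass
class MigrationRule:
    """Hypothetical rule: migrate every file of one format to another."""
    source_format: str
    target_format: str
    seconds_per_file: float  # assumed speed of an imaginary migration tool


def simulate(holdings, yearly_ingest, rules, years):
    """Return one (year, holdings snapshot, migration hours) row per year.

    holdings and yearly_ingest map a format name to a number of files.
    Each simulated year first ingests new files, then applies every rule.
    """
    holdings = dict(holdings)
    rows = []
    for year in range(1, years + 1):
        for fmt, count in yearly_ingest.items():
            holdings[fmt] = holdings.get(fmt, 0) + count
        hours = 0.0
        for rule in rules:
            count = holdings.pop(rule.source_format, 0)
            if count:
                holdings[rule.target_format] = (
                    holdings.get(rule.target_format, 0) + count
                )
                hours += count * rule.seconds_per_file / 3600
        rows.append((year, dict(holdings), hours))
    return rows


if __name__ == "__main__":
    rule = MigrationRule("TIFF", "JP2", seconds_per_file=2.0)
    for row in simulate({"TIFF": 1000}, {"TIFF": 500}, [rule], years=3):
        print(row)
```

Even a toy model like this shows the planning use mentioned in the notes: changing the ingest rate or the assumed tool speed immediately changes the projected migration time per year.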
José Barateiro: Risk assessment in DP of e-science data and processes
- DP as risk management
- ISO 31000: definition of risk management
- similar to DRAMBORA
- there are many standards for risk management
- elaboration of the ISO 31000 methodology into individual steps
- TIMBUS project httptimbusprojectnet; one of the partners is SAP (Germany)
mad talks
- open source SW for LTP: RODA is back and is being developed within the SCAPE project; new functions, plans for development and for building a user community
- 4 posters on emulation: emulation within KEEP, emulation for reading rooms in libraries, OPF ecosystem, registries
- TOTEM: a metadata standard for describing the technical environment for emulation
ANDS, Ross Wilkinson
- data centres in Australia: at least 3, for different areas of life
- ANDS has existed for almost 3 years; funded by the Australian government
- an enormous amount of data will never be used or read by a human, only by automatic data-mining processes
- research data must be stored and preserved because it may no longer be possible to re-create them; they must be kept so that they can be reused, so that new conclusions can be drawn from them, and so that scientists have them available
- this must be done in cooperation; it cannot be done by a single institution alone
- who deals with the storage of scientific data in the Czech Republic? The Academy of Sciences? CESNET?
- similar data centres also exist in Great Britain
Rob Sharpe: Considerations for High Throughput Digital Preservation
Presentation by Tessella: their performance testing of ingest into SDB at FamilySearch
- SDB has been developed since 2002, when the first customer was the National Archives UK
- new customer: UK Parliament
- test with FamilySearch
- 20 TB ingested per day, scanned materials, workflow with antivirus, characterization (PRONOM, JHOVE) etc.
- 1 package is roughly 1 GB; 20 thousand packages per day
- 2 Dell PowerEdge R710 servers, total price at most 20,000 pounds
- it turned out that the limiting factor is the read speed of the disks on which the data are stored at the start of ingest; they therefore needed 130 parallel disks (50 thousand pounds)
- storage on tape is also slow; they therefore needed 8 parallel tape writes (30 thousand pounds)
- storage costs 100 pounds per TB
- 7.3 PB per year
- conclusion: writing and reading are slow; tools such as JHOVE and PRONOM are fast enough; storage costs also turned out to be high
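The figures in the list above are mutually consistent, which a short calculation can confirm. The sketch below assumes decimal units (1 TB = 10^12 bytes) and a 365-day year; the function names are hypothetical, and only the input values (20 TB per day, about 1 GB per package, 100 pounds per TB) come from the notes.

```python
SECONDS_PER_DAY = 86_400
BYTES_PER_TB = 10**12  # assumption: decimal terabytes
BYTES_PER_MB = 10**6


def sustained_rate_mb_s(tb_per_day):
    """Average rate in MB/s needed to move tb_per_day within 24 hours."""
    return tb_per_day * BYTES_PER_TB / SECONDS_PER_DAY / BYTES_PER_MB


def yearly_volume_pb(tb_per_day, days=365):
    """Volume accumulated over a year, in petabytes."""
    return tb_per_day * days / 1000


def packages_per_day(tb_per_day, package_gb=1.0):
    """Number of packages per day for a given average package size."""
    return tb_per_day * 1000 / package_gb


def yearly_storage_cost_pounds(tb_per_day, pounds_per_tb=100, days=365):
    """Storage cost of one year of ingest at a flat price per TB."""
    return tb_per_day * days * pounds_per_tb


if __name__ == "__main__":
    print(round(sustained_rate_mb_s(20), 1), "MB/s sustained")
    print(yearly_volume_pb(20), "PB per year")
    print(packages_per_day(20), "packages per day")
    print(yearly_storage_cost_pounds(20), "pounds per year for storage")
```

Under these assumptions, 20 TB per day corresponds to roughly 231 MB/s sustained around the clock, 20,000 one-gigabyte packages per day and 7.3 PB per year, matching the numbers quoted in the talk.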
To ingest data from the FamilySearch project they needed to ensure a throughput of 20 TB of data per day while retaining adequate procedures for processing the data according to the requirements of OAIS and of the customer. The project was about identifying the bottlenecks in ingesting large volumes of data.
Processes such as generating or checking hashes, identifying formats and extracting technical metadata usually require high performance and, at large volumes, a fast storage system. In the FamilySearch project they want to ingest into SDB (content acquisition, content preparation, ingest, fixity check, content/metadata integrity check, characterization, i.e. identification and validation of formats and extraction of technical metadata) at most 700 MB per second.
In other words, the question is how to parallelize such a massive workflow efficiently while minimizing costs. According to their findings, parallelization makes it possible to work around performance problems of tools such as DROID and JHOVE; contrary to their expectations, overall software performance was not a problem. The bigger problems are in the HW, which has to be able to write fast enough.
That is, the bottleneck was in the HW and in moving data from place to place rather than in the performance of the digital preservation tools.
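A parallel fixity check of the kind mentioned in these paragraphs can be sketched as follows. This is only an illustration, not the SDB implementation: the choice of SHA-256, the thread pool, the manifest layout and the function names are all assumptions.

```python
import hashlib
from concurrent.futures import ThreadPoolExecutor


def sha256_of(path, chunk_size=1024 * 1024):
    """Hash a file in fixed-size chunks so large files never sit in memory."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def check_fixity(manifest, workers=4):
    """Return the paths whose current digest differs from the manifest.

    manifest maps a file path to its expected hex digest (hypothetical layout).
    Files are hashed concurrently by a pool of worker threads.
    """
    paths = list(manifest)
    with ThreadPoolExecutor(max_workers=workers) as pool:
        actual = list(pool.map(sha256_of, paths))
    return [path for path, digest in zip(paths, actual) if digest != manifest[path]]
```

As the notes stress, adding workers only helps while the underlying storage can deliver data fast enough; once reads saturate the disks or tapes, the hashing tool is no longer the limiting factor.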
Benefit for NK
Do not be afraid of the performance of SW such as DROID or JHOVE
Ross King: Evolving domains, problems and solutions for LT DP
- information on projects such as SCAPE etc.
- programme httpcordiseuropaeufp7icttelearn digicultreport research digital preservation_enpdf
- Stephan Strodl (Vienna University of Technology, Austria), Petar Petrov (Vienna University of Technology, Austria), Andreas Rauber (Vienna University of Technology, Austria): a nice timeline for preservation projects, a whitepaper about the past of European DP
- funding spent on DP research is gradually growing; projects and funding alone will not solve anything
- ARCOMEM: archiving of web archives, socially driven web preservation model
- social web analysis
- archive enrichment
- ENSURE: evaluation between cost and value, automation of the preservation cycle; testbeds: healthcare, clinical trials, financial services
- SCAPE
- preservation planning and action workflows: how to make them scalable
- creation of an infrastructure for scalable preservation actions
- development of a policy-based preservation planning tool with automatic preservation watch
- 3 testbeds: WA, large-scale repositories, research data sets
- all projects will produce prototype SW
- digital lifecycle approach
- preservation planning plays a role in all these projects, together with virtualization
- slide with trends in DP over recent years
- Research on Digital Preservation within projects co-funded by the European Union in the ICT
- ENSURE
- SCAPE
- Wf4Ever httpwwwwf4ever projectorgabout
- TIMBUS: SW is not enough; it focuses on the context of organizations; LTP is not only about objects but also about services etc.
- TOTEM
Benefit for NK
Follow projects in the area of long-term preservation of digital data. The latest EU projects such as SCAPE will lead to faster development of concrete tools for the long-term preservation of digital data.
Record keeping in temporary command settings, Erik Borglund
- preservation of documentation of crisis situations arising from the activities of the police etc.
- how to capture context: flipcharts, videos and notes can be preserved; the context of analogue documents is not a problem, the problem is with digital items and conversations
- the national archive should take care of it, but it only takes paper documents or, for example, photos from the meeting place
- open question: is archiving records material the same as archiving the course of a meeting in digital form?
webarchiving session
BnF
- 200 TB of web-archived data
- 15 million ARC files
- they must characterize and validate them, which is time-consuming
- they store them in a shared repository
- SPAR (the LTP system of the National Library of France) has a capacity of 16 PB
- they use JHOVE2 for characterization and are creating a module for ARCs
- they do not want to characterize and validate the content of the ARCs, only identify formats
- PREMIS in METS would be too long; they will therefore record only metadata at the level of the information package (AIP) that are the same for the whole package, i.e. a property is expressed once and then only the information about which files it applies to is added, instead of repeating that information for every file
- they created a special metadata format
- i.e. they can ask the LTP system "give me all information packages that contain format XY" etc.; there is no need to index the metadata of the contents, which would take a long time; they take the same approach for digitized books
- different DP policies and validation levels for different types of WA data: complete harvests vs. thematic harvests
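The package-level approach noted for BnF (state a property once and list the files it applies to, rather than repeating it per file) can be sketched in a few lines. This is not BnF's metadata format, which the notes do not describe in detail; the data layout, function names and format labels are hypothetical.

```python
from collections import defaultdict


def group_by_format(file_formats):
    """Turn per-file records into one entry per format listing its files.

    file_formats maps a file name to a format label (hypothetical layout).
    """
    grouped = defaultdict(list)
    for name, fmt in file_formats.items():
        grouped[fmt].append(name)
    return {fmt: sorted(names) for fmt, names in grouped.items()}


def packages_containing(packages, fmt):
    """Answer "give me all packages that contain format XY".

    packages maps a package id to the grouped metadata of that package.
    """
    return sorted(pid for pid, grouped in packages.items() if fmt in grouped)


if __name__ == "__main__":
    packages = {
        "aip-1": group_by_format({"a.html": "HTML", "b.html": "HTML", "c.gif": "GIF"}),
        "aip-2": group_by_format({"d.pdf": "PDF"}),
    }
    print(packages["aip-1"])
    print(packages_containing(packages, "GIF"))
```

The query only inspects the keys of each package-level record, which mirrors the point in the notes that the metadata of individual contents does not need to be indexed.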
NL NZ
- 2 harvests, 20 TB in total
- they are working out metadata: how much metadata is a lot and how much is too little
- library policy says that as much metadata as possible must be stored; in terms of metadata size, however, that would be a problem
- for selective web harvests they have a finished workflow (WCT); everything is catalogued
IA
- 16 billion URLs
- the oldest from 1996
- growth is 3 TB per day, 1 PB per year
Euan Cochrane, Dirk von Suchodoletz: Replicating Installed Application and Information Environments onto Emulated or Virtualized Hardware
- capturing and preserving a complete environment on emulated HW
- e.g. take the desktop environment of the prime minister and store it in the archive
- display problems
- computer forensics
- an option for preserving scientific data and records
- it is all about how to replicate an HDD and run the environment stored on it in a virtual environment
- solution:
- they pulled HDDs out of several old PCs > identify HW requirements (HDD analysis > estimate of requirements; automatic, it is part of every PC environment) > choose emulation/virtualization SW (tool registry such as TOTEM from the KEEP project) > convert the HDD to a disk image suitable for emulation > try to boot the disk image on emulated HW > add drivers
- problems with licences, personal data protection, authenticity (20 of things change, colours etc.)
- QEMU, SPARC processor emulator
Klaus Rechert: Remote Emulation for Migration Services in a Distributed Preservation Framework
- use of emulation as a tool for migration
- migration tools are often not available for certain formats
- we put the digital object into an emulated environment (virtual machine); we then see it in the environment of the emulated system, can open it in the original or another suitable application, save it in another format and store it again in the virtual machine
Bram Lohman: Emulation as a Business Solution: the Emulation Framework (KEEP project)
- emulation framework: 7 emulators, 6 platforms (x86, Amiga etc.), 23 file formats
- a solution for managing emulation tools
- setup of emulation processes
- an environment that contains emulators; if we load an application or a file into it, it should run as in the original environment
- the environment also contains a tool that shows what format a file is and what environment is needed to run it, based on PRONOM; the environment can then be prepared directly and the file run in it; it loads the SW image from the OPF application database that is being built
Geoffrey Brown: Developing Virtual CD-ROM Collections: The Voyager Company Publications
- publications of one particular publisher on CD, interactive applications for the Mac; of the 200 published titles only about 50 are available today
- emulation on today's systems
- an HDD snapshot directly in the emulator, i.e. it takes one click and is very fast
- SheepShaver emulator
Evaluation of a large Danish migration project
Before 1998 formats were not prescribed by law.
Between 2005 and 2008 they introduced standards.
The evaluation covers the prescribed standards and the migration into them at the national archive.
The evaluation was done for the funder.
Between 2005 and 2008 they spent 30 person-years on migration with 10-15 people working on it; 190 thousand USD was invested, total costs 26 million USD.
In real terms they did not migrate much data, about 1777 GB.
Various parts of the archive: tapes, population data, data on CD-R, registries and electronically filled-in data.
They could not read all the files, especially those on tapes; there were 5 different tape types.
Some had to be rescued at great expense.
Total cost of producing the preservation standard: 10 man-years, 12 thousand USD, including the manual and implementation recommendations.
Pilot: project planning and management, and verifying the information packages.
The aim of the pilot was to arrive at a better budget and project plan.
Some problematic data in old formats, such as old databases, need smart people doing repetitive work over a long time; they needed good knowledge management to make this efficient.
Migration method: they wrote requirements for a tool and a description of how manual migration should be done.
Data preparation (restructuring the data and registering the metadata of the IPs) and preparation of documentation for the migrated IPs.
Software development: in-house development.
They needed 50 person-years per 1
Conclusions
Migrating standard data is cheaper, migrating from some standard tapes is cheaper, etc.
Most of the costs (80) went into non-standardized data, into building the migration software. Developing a tool for migrating heterogeneous or non-standard data is the most expensive part.
What they learned: the old data had not been analysed sufficiently.
Project management was loose; they lost money.
Knowledge management: a good description of the old data and of all their types, generations, locations etc. does not exist at our institution and we will have trouble with it; migration of old data at NK will be problematic.
Angela Dappert: robust migration workflow for offline media
- What is an archival object (a nice slide): a CD is not an archival object, for them it is a hand-held carrier; a bit-stable object is better, since it can have a backup etc., leading to an archival object that has further metadata (logical preservation)
- A CD is not searchable, cannot be replicated easily, has a large manual overhead, and its rendering technology becomes obsolete very quickly
- Endangered Archives project: optical disks, CD-R, external HDs, tapes; 67 terabytes in total
- The offline hand-held carriers in the Endangered Archives project were very diverse; they contained data with DRM, under copyright, and with a number of such problems
- Options they chose between for each data source:
- a disk image: a single file containing everything on the carrier
- or extraction of selected files only
- How important is the carrier itself? We need some information about it; it may hold traces of deleted data, and we may want to keep those. They made disk images of everything possible: hybrid DVDs, audio discs that also contained data, etc.
- Which disk image should they use? Not one disk image format for all data; each special case gets its own disk image format
- They did it with disk copying robots; in some cases large-scale disk copying robots could not be used, because they are good at producing CDs but not at ripping data from CDs
- They built their own application with disk stacks and some smaller robots; these used LIFO or FIFO, and in the end they chose FIFO because LIFO had problems with the CD lifter
At the KB they thought through a fairly complex workflow, how to describe it, etc.
Each robot had its own PC.
They had problems with a number of things (see the presentation).
They did not find good SW for managing the images, only command-line tools, and non-technical staff would make a lot of mistakes with those.
It takes a lot of people before the content gets online.
The staff must be well trained and flexible, but also able to do tedious jobs: systematic, patient.
NOTE: important for NK, where the transfer of data from discs will also have to be addressed, and has already been addressed.
KEEP project, Antonio Cuiffreda: Towards integrated migration environment
- Disk transfer tool: converts a disk into an image file. The image also carries further metadata about the file system, the MD5 of the file, etc.
- KEEP is building a Transfer Tool Framework
- Magnetic media: disk transfer tools for floppy disks, commercial and open source
- Disk2FDI: commercial DOS tool; a very precise image of the floppy disk, but it takes 1 hour and the result is very large, ten times bigger than the floppy disk itself. Tested on about 2260 disks; they tested emulation
- Catweasel: commercial tool, a PCI card; runs on Linux and Win XP, has a GUI. High error rate but faster; the quality of the image file was low
- Nibtools: free tool, G64 and D64, covers only the C64; DOS, Win, Linux, but command line only. Needs a Commodore disk drive and special cables. They tested a few disks; about half of them then did not work in the emulator
- Optical media: they used 5 transfer tools, each on the same CDs, DVDs and games
1 Alcohol 120%: commercial, can bypass DRM etc., supports Win systems. 12 of 13 worked; 2 MB per second
2 Daemon Tools: commercial, several image file types (ISO, MDS, MDF), supports Win; three of 13 did not work
3 CloneCD: commercial, uses IMG or ISO, bypasses SafeDisc 3 protection, supports Win, has a GUI; 11 of 13 were OK
4 BlindWrite: commercial, supports DVD, Blu-ray, WM, Xbox and other special discs. Generates ISO and some proprietary ISO image formats; one did not work; ripping speed
5 ImgBurn: does not read subchannel information into the image file (a film cannot be seeked, etc.); open source; generates DVD, BIN, CUE, IMG; Win and Linux; 4 did not work; it is fast
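The disk transfer step noted at the start of this list (image file plus file-system metadata and an MD5) can be sketched as follows. This is a minimal illustration, not the KEEP tool: the function name, the record fields and the file-system label are assumptions, and the file system is supplied by the operator rather than detected.

```python
import hashlib
import json
import os
import tempfile


def describe_image(path, filesystem="unknown", chunk_size=1 << 20):
    """Return a small metadata record for a disk image file (hypothetical layout)."""
    md5 = hashlib.md5()
    size = 0
    with open(path, "rb") as handle:
        # Read in chunks so that large images do not have to fit in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            md5.update(chunk)
            size += len(chunk)
    return {
        "image_file": os.path.basename(path),
        "size_bytes": size,
        "md5": md5.hexdigest(),
        "filesystem": filesystem,  # supplied by the operator, not detected
    }


if __name__ == "__main__":
    # Stand-in for a real floppy image: 1440 KiB of zero bytes.
    with tempfile.NamedTemporaryFile(suffix=".img", delete=False) as tmp:
        tmp.write(b"\x00" * 1440 * 1024)
    print(json.dumps(describe_image(tmp.name, filesystem="FAT12"), indent=2))
    os.remove(tmp.name)
```

Storing the checksum next to the image makes it possible to verify later that the image has not changed since the transfer.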
Conclusions
Magnetic media: commercial and non-commercial tools do not differ in performance; Disk2FDI is precise but very slow. KEEP will use NibTools.
Optical media: copy protection has to be considered; the tools perform similarly; there will always be errors in the images, between 30 and 10 percent; BlindWrite can handle game discs (Xbox etc.). KEEP will use ImgBurn because it is open source.
For complex images BlindWrite is better.
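The "between 30 and 10 percent" figure can be checked against the per-tool counts recorded in the notes. The sketch below is only that arithmetic; it assumes that every tool, including BlindWrite, was run on the same 13 discs, and that CloneCD's "11 of 13 OK" means two failures.

```python
# Failure counts as recorded in the notes; 13 test discs per tool (assumed for all tools).
TOTAL_DISCS = 13
failures = {
    "Alcohol 120%": 1,  # 12 of 13 worked
    "Daemon Tools": 3,  # three of 13 did not work
    "CloneCD": 2,       # 11 of 13 were OK
    "BlindWrite": 1,    # one did not work
    "ImgBurn": 4,       # four did not work
}


def failure_rate(failed, total=TOTAL_DISCS):
    """Share of discs that could not be imaged correctly, in percent."""
    return 100.0 * failed / total


rates = {tool: failure_rate(count) for tool, count in failures.items()}

if __name__ == "__main__":
    for tool, rate in sorted(rates.items(), key=lambda item: item[1]):
        print(f"{tool:13s} {rate:5.1f} % failed")
```

Under these assumptions the failure rates run from about 8 % (Alcohol 120%, BlindWrite) to about 31 % (ImgBurn), which matches the range given in the conclusions.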
Benefit for NK
Consider whether NK should actually run a project to migrate the content of CDs and DVDs to online media. The talks present concrete experience with robotic processing and show the problems they had with designing the workflow, choosing the type of ISO image, etc.
BnF web archiving
They have three layers:
Harvest definition: collection
Harvest instance: crawling metadata
ARC files
A collection is, for example, the selective "elections 2000" collection, then the individual harvest instances, and then the ARCs.
They collect logs, config and reports. These are also stored in an ARC: a special metadata ARC for each crawling instance.
PREMIS
Object, agent, event
Objects: 1 ARC files and metadata ARCs; 2 harvest instances
Harvest event in PREMIS: event "creation of content files"
Events: reports as extensions of the event (host report and harvest report)
Agents: administrator, SW, institutions, organizations that perform the harvest
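The mapping of a harvest onto PREMIS entities described above can be sketched with plain data classes. This is a simplified illustration and not the PREMIS schema or BnF's implementation: all class names, field names, identifiers and the file-naming convention are assumptions.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Agent:
    name: str
    kind: str  # e.g. "administrator", "software", "organization"


@dataclass
class PremisObject:
    identifier: str
    kind: str  # "harvest instance", "ARC file" or "metadata ARC"


@dataclass
class Event:
    event_type: str
    objects: List[PremisObject]
    agents: List[Agent]
    reports: List[str] = field(default_factory=list)  # event extensions


def harvest_event(instance_id, arc_names, agents):
    """Describe one harvest instance as a 'creation of content files' event."""
    objects = [PremisObject(instance_id, "harvest instance")]
    objects += [PremisObject(name, "ARC file") for name in arc_names]
    # Hypothetical naming: one metadata ARC (logs, config, reports) per instance.
    objects.append(PremisObject(f"{instance_id}-metadata.arc", "metadata ARC"))
    return Event(
        event_type="creation of content files",
        objects=objects,
        agents=list(agents),
        reports=["host report", "harvest report"],
    )


if __name__ == "__main__":
    event = harvest_event(
        "elections-2000-run1",
        ["part-0001.arc", "part-0002.arc"],
        [Agent("crawler", "software"), Agent("web archive team", "organization")],
    )
    for obj in event.objects:
        print(obj.kind, obj.identifier)
```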
ContainerMD
httpbibnumbnffrcontainerMD v1documentationcontainerMD v1html
special metadata for material from the Web Archive
httpbibnumbnffrcontainerMD v1
- a different SLA for different types of material and for data from different harvests
- shared repository: they expect various benefits; skills for individual formats need not be duplicated within the institution
- a JHOVE2 module for WARC should also exist next year
Memento: a meta search engine
Benefit for NK
Their web archiving model could be used at NK.
Cost models: Danish national library + TU Wien
- Stephan Strodl, TU Wien: they have their own cost model, but it seems to cover only the cost of small-scale automated preservation actions
- The Danish national library built its own model, which is meant to be universal and usable anywhere
- They measured the cost of submission according to the PAIMAS standard
- When calculating costs they use OAIS and PAIMAS: they map activities onto these models and estimate process costs accordingly
- Costmodelfordigitalpreservationdk
Benefit for NK
For project 0136: options for estimating the costs of long-term storage were discussed there.
Meet RODA, a Full-Fledged Digital Repository for Long-Term Preservation
- Originally a project of the Portuguese national archive; we have been following it for several years. The RODA system is now supported, and partly also further developed, by an independent company. So far RODA supports only the archival metadata format (EAD), but further development should include library formats as well
- RODA is now part of the SCAPE project, where it will be possible to develop the system further and scale it for use in mass production
- httpredminekeepptprojectsroda public
Benefit for NK
Follow further development, possibly also for the INCAD + KNAV projects; for LTP development at smaller institutions this could be an interesting alternative in the future.
Report on a foreign business trip
Name and surname of the participant: Mgr. Andrea Fojtů (AF)
Workplace by organizational structure: 1.5 Department of Long-Term Preservation of Digital Data (ODODD)
Unit of assignment: 1.5.2 Digital Repository Content Management Department
Reason for the trip: participation in the conference Aligning National Approaches to Digital Preservation
Place (city): Tallinn
Place (country): Estonia
Date (from-to): 22.-26.5.2011
Detailed schedule:
Monday 23.5.: conference registration; guided tour of the National Library; Keynote Address by Laura Campbell, Library of Congress (USA); Panel 1: Technical Alignment; Panel 2: Organizational Alignment
Tuesday 24.5.: Keynote Address by Gunnar Sahlin, National Library of Sweden; Panel 3: Standards Alignment; Panel 4: Legal Alignment; Breakout Sessions for panels 3 & 4
Wednesday 25.5.: Panel 5: Education Alignment; Panel 6: Economic Alignment; Breakout Sessions for panels 5 & 6; Synthesis; Closing remarks
Fellow travellers from NK: PhDr. Bohdana Stoklasová (BS), Ing. Tomáš Svoboda (TS)
Funding: IOP NDK
Aims of the trip: attending a conference with international participation; gaining contacts in the area of long-term preservation of digital documents and electronic legal deposit; a more detailed insight into long-term preservation of digital documents (especially) in national libraries
Fulfilment of the aims (specifically): The conclusions of the conference Aligning National Approaches to Digital Preservation largely repeat the conclusions of the workshop The Future of the Past: Shaping new visions for EU research in digital preservation (report available at httpcordiseuropaeufp7icttelearn digicultfuture of thepast_enpdf), e.g. regarding the missing economic model for the commercial sector, which would treat long-term preservation as an inseparable part of all its processes. Contact was established with staff of the national libraries of Estonia and Finland (staff responsible for web archiving and for long-term preservation in general).
Programme and further details: The main aim of the conference was to align national approaches to long-term preservation of digital documents across all areas, from technical, organizational and educational to standardization, economic and financial.
Materials brought back: conference programme, leaflets of the exhibiting companies (Tessella, Equella, Guardtime) + other materials, notes
Date of submission of the report: 8.6.2011
Signature of the submitter
Signature of the supervisor
Posted on the Intranet
Received by the International Department
Appendix to this report: conference notes in English

Appendix: Conference notes in English
Exploring What We Can Do Together: Strategic Alliance for International Collaboration, Laura Campbell
- 185 digital preservation partners in more than 25 countries (education, research, LAM)
- strategic goals: National Content Stewardship Network (national digital collection, technical architecture, public policy, outreach)
- NDIIPP Content Domain Map: a mind map of geospatial, audiovisual, image & text and web content
- then & now: cognitive surplus vs digital libraries / digital preservation
- solution framework: actively working together, special interest groups, establishing a common index, international digital collection (freely available)
PANEL 1: Technical Alignment (The role of testing), Prof. Dr. Michael Seadle (Panel Head)
- to collaborate on requiring and implementing rigorous and independent tests

DNB Contribution to the Tallinn Alignment, Sabine Schrimpf
- key theme is infrastructure
- a network of hardware and software that permits the operation of application SW
- the question of interoperability is crucial (standards, technical specifications)
- SW elements
- components of the DP infrastructure were compared to the pallets used on railroads
  o source: PARSE.Insight Roadmap 2020
- PersistentID resolvers, certification process
- kopal (ingest, koLibRI)
- nestor (German Network of Expertise)
- DP4Lib (Digital Preservation as a Service): reduce dependency between components
  o redundant storage at different locations
  o koLibRI modules
- LUKII: set up as an economical LOCKSS network in Germany
- SHAMAN
- APARSEN wants to bring coherence and cohesion to digital preservation research
  o trends in DP research projects
  o modular DP systems
  o distributed as SOA
  o elimination of technology dependencies
EDINA: The UK LOCKSS Alliance, Adam Russbridge
- EDINA offers underlying technical support & coordination
- threats to digital stewardship
  o failure (media, HW, SW, network, format obsolescence, natural disaster, economic/organizational failure)
  o attack (insider/outsider)
  o operator error
- source: Requirement to Digital Preservation
- the projects PEPRS and PECAN help identify coverage and requirements for DP
Public testing, Michael Seadle
- traditional physical archiving relies heavily on trusted institutions
- distrust, not trust, needs to be the basis of digital archiving; testing therefore plays a key role
- goals of testing: demonstrate functionality, reveal weaknesses, provide data for planning improvements
- key issues for testing: integrity, authenticity (can the origin or genuineness be shown), usability (can migration/emulation be demonstrated), access, financial integrity
- DrWho (drwho1com)
- bit stream testing is the most important; authenticity and usability may be impaired
  o the type of storage media, the number of copies + the frequency of checking and replacement get us to the relevant results
  o no reliable metrics exist, however (what is an acceptable loss, etc.)
- without well documented, peer reviewed, publicly available test results, librarians are buying archiving systems on faith
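Bit stream testing across several copies, as discussed above, can be sketched as a periodic fixity audit. This is a minimal illustration rather than any system presented in the panel: the copy names, the in-memory byte strings and the choice of SHA-256 are assumptions.

```python
import hashlib


def sha256_of(data: bytes) -> str:
    """Hex digest used as the fixity value of one copy."""
    return hashlib.sha256(data).hexdigest()


def audit_copies(copies, expected_digest):
    """Return the names of copies whose digest differs from the recorded one."""
    return [
        name
        for name, data in copies.items()
        if sha256_of(data) != expected_digest
    ]


if __name__ == "__main__":
    original = b"archived bit stream"
    recorded = sha256_of(original)  # digest stored at ingest time
    copies = {
        "site-a": original,
        "site-b": b"archived bit strean",  # one corrupted byte
        "site-c": original,
    }
    damaged = audit_copies(copies, recorded)
    print("copies to replace:", damaged)
```

How often such an audit runs and how many copies it covers are exactly the parameters for which, according to the notes, no reliable metrics exist yet.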
Presentation without a title, Andy Rauber
- evaluation vs testing vs benchmarking
- DP testing: evaluation rather than testing, far from benchmarking (a few tests, but not near a definition of benchmarks)
  o existing evaluations are not repeatable
  o focus on the simple things
  o building the frameworks before having clear test scenarios
- necessary to move towards comparative benchmarking
- what is needed: a commitment that we want a culture of benchmarking and comparative evaluation; an understanding of what we want to benchmark; benchmark data + ground truth; measurement scales and measures that remain constant; knowledge bases to collect these
Organizing digital preservation on an international level, Michelle Gallinger (NDIIPP)
- focus on a national DP agenda
- community driven, action oriented (National Digital Stewardship Alliance)
  o present a distributed national digital collection for the benefit of citizens

The European Research Arena, David Giaretta
- technologies: GEANT, EGEE/EGI
- EU research projects: TIMBUS, BLOG4EVER, SCAPE, ENSURE, APARSEN, ARCOMEM, WF4EVER
  o SCIDIP-ES (2011-2014)
- Alliance for Permanent Access (APA): formed as a legal entity 3 years ago
  o opportunities for networking
- ISO 16363 Audit and Certification of Trustworthy Digital Repositories
- ISO 16919 Requirements for Bodies Providing Audit and Certification
Observation from the MetaArchive Cooperative program, Martin Halbert
- distributed DP programs differ from other programs
  o replication of content, distribution of these replicated copies to distinct geographical locations, and a network organization to connect these replicated copies
- MetaArchive: established in 2003, funded by NDIIPP
  o seeks to foster broader awareness of digital preservation issues
- IIPC: members are all institutions that focus on WA
  o 39 members: national and university libraries + other organizations (Internet Archive)
  o ISO standard WARC format for web archives + Heritrix and NutchWAX
  o growing membership (Africa & South America)
DAY 2

Keynote address: International and National Collaboration in the digital age, Gunnar Sahlin
- 2012: a new law for e-deposit
- Samsök (search together) in 2005 (upgrade, new system)
- Swepub and long-term preservation
- Consortium of the Swedish research libraries for licensing e-journals and databases (ICOLC)
- Open Access and e-publishing (all universities have their repositories for e-pubs)
- NL as aggregator for Europeana: TEL, Apres, Athena, EU screen
  o common system for the preservation of digital materials
  o common search portal for materials from the Swedish National Library and the Swedish National Archive
Raivo Ruusalepp
- standards: RAC, DSA, CIDOC (CRM), PAIMAS, ISA (DG), DDI
- use of information security standards for digital preservation
- information security: administrative and technical (physical = data security vs IT = communication)
- a company implemented security measurement with typical cyber crime scenarios
- survey of security
  o provision for information security in national legislation and development plan (12 of the respondents: ISO 27000 series; only 2 formal audits, the rest are looking into it; ½ of the respondents do not use standards or formal measurements to control information security)
  o IT & disaster plan: 65 (data recovery from the off-site location tested: 0)
- alignment
  o better use of community standards for information security and preservation
  o agreement on security requirements
Standards based approach to preservation planning, Matthew Woollard
- ISO 27001: very expensive implementation, 100 000 pounds
- Basic: Data Seal of Approval Guidelines (helps you understand your business better)
- Audit and Certification of Trustworthy Digital Repositories
- Memorandum of Understanding to Create a European Framework
- ISO 16363 external; DSA or ISO 16363 self-audit

Best Practices & Standards, Bram van der Werf
- self-assessment: trust; audits/certification: trust
- ISO 30300 (draft) Record Management
PANEL 4: Legal Alignment

Legal deposit & Web Archiving, Adrienne Muir
- legal deposit provisions: purpose, scope, deposit mechanisms, roles & responsibilities & liability, access provisions, sanctions
- implementation: definitions, scope, offline/online, freely available/paywall, technology neutral, incremental
- legal deposit vs voluntary (interim/hybrid approach, model agreements and licenses, flexibility)
- other legal issues: intellectual property rights, preservation, access, unlawful material, privacy/data protection
- voluntary approaches have disadvantages but may be necessary and can be useful

Breakout Session
- standards bring along alignment by themselves, but only if you use them to the full, not halfway
- depends on the community (users): enforceable or voluntary compliance (standards)
- next steps for standards alignment
  o corpora as benchmarks
  o export/import completeness
  o educational standards
  o validation tools
  o accredited training courses to accredit auditors
  o framework standards
DAY 3
PANEL 5: Educational Alignment
- key elements of the DCC Curation Lifecycle Model
- Framework for the Education Alignment (USA: grad programs for digital preservation, new models for grad programs, internship programs (diverse knowledge), workshops)
  o sharing tools
  o national programs
- related issues to digital preservation
  o nature of costs and business models
  o strategies for selection & appraisal
  o ground roles and responsibilities
  o effectiveness and demand for services
- focus of the panel: factors influencing the actual sustainability of a digital archive
- 2 considerations: collaboration + user demand
- challenges + gaps: span national boundaries, public + private funding, education, exportation, DP certification, competition, funding gaps, policy, selection criteria, roles & responsibilities, standards
- Magazzini Digitali: e-legal deposit in Italy
- PADI: a failed project (discontinued in 2010)
Presentation without a title, Neil Grindley
- how much does it cost to manage information; what institutional financial strategies are required to facilitate effective preservation; what general economic frameworks are required to enable information to persist and be accessible
- JISC 2010 Infrastructure for Education and Research Programme
- archival storage and preservation activities constitute a very small proportion of the overall costs: 15; 31 access; 55 outreach, acquisition, ingest
  o approx. 333 Euros for a set of 1000 records
- KRDS2, p. 83: future tool development supporting automation of ingest
Sustainable Preservation in North America: ADPNet & Friends, Aaron Trehub
- solution: distributed digital preservation (in at least 3 copies vs LOCKSS 6 copies)
- DDP + LOCKSS = PLN; open SW developed at Stanford
- MetaArchive, COPPUL (Canada), ADPNet (wwwadpnorg)
8
-
Salt Lake City USA
Datum (od-do) -
14.5. departure Prague - - 16.5. PREMIS workshop; 17.-19.5. Archiving 2011 conference; 20.5. departure SLC - -
15.5. departure Washington-Detroit-Salt Lake City; 15.5. workshops T1D Color in Image Capture Archiving and T2A Scanner & Camera Imaging Performance: Benchmarking, Compliance and Workflow Monitoring; 17.-19.5. Archiving 2011 conference
NK, Archiving conference, IOP-NDK
0136
NA 6.6.2011
Date 6.6.2011 Signature Date Signature
Date Signature
PREMIS tutorial
---------------------------------------------------------- Implementation of a High Performance Architecture for Managing and Storing Web-Harvested Collections Michael Smorul and Joseph JaJa University of Maryland (USA)
httpswikiumiacsumdeduadaptimages66bArchiving11-smorulpdf
- web archiving
- WARC manager application
- P)
Using Tape for Large-Scale Digital Preservation Gary Wright FamilySearch (USA)
- (The Church of Jesus Christ of Latter-day Saints)
-
NDK - DRPS Ingest Tools. Storage type: storage grid, NetApp FAS3170
P information lifecycle management R optimization of the storage layer between the Grid and Rosetta
- human error
- HW errors
- data integrity validation
- maximization
- integrity verification
metadata conten
Moving On When it is Time to Re-Archive Michael Selway Quantum Corporation (USA) CHW) D
otherwise it cannot be done with HDDs
Data migration from
How to approach the migrat
- HDD
- years
and it could
- what about split migration (full split migration)
- consideration
FamilySearch An End-to-End Process for Scanning Characterizing Preserving and Providing Access to Very Large Collections of Vital Records Tom Creighton FamilySearch (USA) Jonathan Tilbury Tessella plc (UK) and Mark Evans Tessella Inc (USA)
- At FamilySearch: microfilms from the mid-19th century
- DPS is their s
The Audit and Certification of FDsys
- FDsys, audit
TRAC httpwwwgpogovfdsys
- TRAC: Hathi, UNT, Portico, MetaArchive, Chronopolis
- compliance scale 1-
How Long is Long-Term Data Storage (Focal) Barry M Lunt Brigham Young University and Douglas Hansen Wayne Rust and Mark Worthington Millenniata Inc (USA)
- FLASH technology: 10-, full of electrons
outflow of electrons and
and yet errors appeared
- Library 1: 21
- Library 2: 18
- 1050 years
- HDD: 1-7 years
Quality Assurance of Digital Information in Long-Term Digital Preservation University of Technology (Sweden)
- significant properties
- in documents from
Towards Interoperable Preservation Repositories Repository Exchange Package Use Cases and Best Practices Joseph Pawletko New York University and Priscilla Caplan Florida Center for Library Automation (USA)
- TIPR
- - - TIPR
-
- RXP: METS and PREMIS semantics; metadata files not inside the METS but alongside it, in
- succession
- disaster recovery and in
as AIP
- SW migration
- diversification
- data migration in
SARKK Comprehensive Digital Archive Services for Finnish Municipalities
-Savon Tietohallinto Oy (Finland)
- a company which in ICT long-
- with preservation planning functionality
-
Magnetic Tape Technology economic advantages for preservation Gary Francis Oracle USA
- Oracle presentation
- Clipper Group 2010, "In search for the long-term archiving solution: tape delivers significant TCO advantage over disk"
- 5 TB per cartridge (Sun/Oracle)
-----------------------------------------
Color In Digital Preservation Robert Buckley University of RochesterNewMarket Imaging Steven Puglia National Archives and Records Administration and Michael Stelmach Library of Congress (USA)
etc.
o of colours by the GAMUT
AdobeRGB, ProPhoto RGB
- and colour reproduction
Multispectral Image Archiving of Watermarks in Historical Papers Peter Meinlschmidt Wilhelm-Klauditz-Institut Fraunhofer-Institute for Wood Research and Volker
Braunschweig (Germany)
- Reflected light
- Transmitted light
- -spectral imaging
Implementing a Quality Assurance Program for Monitoring Scanner Performance Michael J Horsley and John T Berezich National Archives and Records Administration (USA) DAITSS
microfilm, scan and r
- NARA 2004 Guidelines
httpsdocsgooglecomviewerurl=http3A2F2Fwwwarchivesgov2Fpreservation2Ftechnical2Fguidelinespdf
- Metamorfoze
- etc., see slides
- Quantitative performance slides
- Web-based database, SharePoint zace
Preservation in a Digital Age Jay Verkler FamilySearch (USA)
- with Tessella for SDB, will be by 1
- data loss is intrinsic >
- och
- preservation as a service
Curation of the End-of-Term Web Archive Classification and Metrics Kathleen Murray Lauren Ko and Mark Phillips University of North Texas (USA)
- eotcd archive httpresearchlibraryuntedueotcdwikiMain_Page
- for
- 16 TB of data
- (not WARCs)
then joined it together
DAITSS Grows Up Migrating to a Second Generation Preservation System (Focal) Priscilla Caplan and Carol Chou Florida Center for Library Automation (USA)
- DAITSS 1 could not be installed elsewhere
instructions
- in processes
- PSP
- refresh function
- disseminate: do a refresh and export the new AIP as a DIP
- withdraw: remove the AIP from storage, retaining provenance
1
- 80 TB
A Community Driven Micro Services Architecture Supporting Long Term Digital Preservation Mark Evans and Bill Steel Tessella Inc (USA) and Robert Sharpe James Carr Alan Gairey and Jonathan Tilbury Tessella plc (UK)
with NDIIPP in micro-services
joined into a process; can then be arbitrar
workflow
- bit-stream
- cloud (S3
httpawsamazoncoms3) SaaS functionality; soon there will be ti tenancy
functionality, policy, etc.
Notes from the USA
in 44 TB of SIPs in the test (10 MB JP2 files)
SDB community
- founded 2008
System
Lisa LaPlant and Blake Edwards US Government Printing Office (USA) FDsys OAIS dig MODS - z
ltpartgt2ltpartgt
DMD data model definition z
httpwwwgpogovfdsyssearchpagedetailsactionst=pragueampgranuleId=amppackageId=DCPD-200900228
----------------------------------------- Preservation Starts from the Beginning Michael Wash US Department of Transportation (USA)
Autographic Kodak 1916 na druhou
following the No 3A Autographic Kodak Special of 1916 which was the first rangefinder camera It had a Kodak Anastigmat f63 lens and a Kodamatic shutter with speeds from 12 to 1200 sec plus bulb and time mode httpcamerapediawikiacomwikiNo_1A_Autographic_Kodak_Special Kodak Advantix
zastavil
Colorite A Flexible Cross-Platform Software Solution for Automatic Image QualityAnalysis Using Arbitrary Targets Henrik Johansson National Library of Sweden (Sweden)
o the target is detected automa so that this proce
o TIFF, JPEG, JP2, PNG
o -of-art feature-based image matching algorithm, ImageMagick
o Alg
o switch off in the future to speed the process up for BATCH
proce
of Henrik Johansson
What if the Image Quality Analysis Rates My D, Dietmar Wueller, Image Engineering (Germany)
- Golden Thread
- UTT target
-lens reflex
Establishing Resolution Requirements for Digitizing Transmissive Content A Use Case Approach Michael Stelmach Library of Congress Don Williams Image Science Associates LLC and Steven Puglia National Archives and Records Administration (USA)
- Based on 10 SFR limiting resolution criteria: how much of the image information will be captured
o first half of the 20th century: 1200-1600 PPI
o second half of the 20th century: up to 2800 PPI
Digitise More Pay Less Optimising the Workprocess for both Heritage Institute and Imaging Provider Olaf Slijkhuis Pictura Imaginis (the Netherlands)
- ument kvalita parametry atd - -
- trolu
obrazu
N discussion with FamilySearch, 20.5.2011 -------
also on Digital Preservation issues
will have to provide free of charge
and synchronize
an effort to cooperate with archives and libraries on their part
Business trip report
Project: Creation of the National Digital Library
CZ 10611000706386
Name and surname of the participant: Jan Hutař
Workplace by organizational structure: ODF 8.1
Position: head of division
Reason for the trip: attending the iPRES 2011 conference
Place (city): Singapore
Place (country): Singapore
Date (from-to): 30.10.-5.11.2011
Detailed schedule:
30.-31.10. flight Prague > Dubai > Singapore
1.11. start of the conference, tutorials
2.11.-4.11. conference
4.-5.11. return flight Singapore > Dubai > Prague
Fellow travellers from NK: Mgr. Marek Melichar (funded from project 0136)
Funding: IOP, Creation of the National Digital Library
Relation to the project: obtaining new information on digital preservation and on projects in other libraries; consultations with colleagues and companies
Aims of the trip: see relation to the project; use all outputs for planning and running the NDK project, and for future handling of digital preservation at NK/NDK
Fulfilment of the aims: fulfilled, see the detailed notes below and the proceedings at SPS
Further, more detailed information: SUMMARY AND BENEFIT FOR THE NDK PROJECT
- a noticeable rise of emulation as an approach to long-term preservation (in previous years it was migration) > it seems the two approaches will complement each other
- a shift towards preserving complex data (databases etc.); NK does not address this yet
- many contributions are usable for NK and NDK as well (web archiving and preservation at the national library of France, audits, emulation, information on the SDB system (Tessella) and on the RODA system, certification (see Rouchon), etc.)
- information on problems and solutions when using tools such as JHOVE, PRONOM and others in a real environment
- 2 contributions on backing up optical discs, a current problem at NK as well; ideally follow the procedures described in the proceedings
- a clear need for international cooperation and for adherence to standards so that such cooperation is possible
- for more detail see below
Project publicity support: NA
Related materials
Material: conference proceedings
Storage location: SPS, folder with business trip reports
Date of submission of the report: 15.11.2011
Signature of the submitter
Date, Signature
Signature of the supervisor: 15.11.2011
Posted on the intranet
Received by the international department

Seamus Ross: digital curation and preservation
- preserving data sets: use of statistical databases and research data vs. preservation of textual information
- personal data are no longer just photos in a box (Flickr etc.)
- banks hold a lot of personal data that historians will use in the future; it has to be kept, and the institutions do keep it
- see mckinsey.com, big data full report (PDF)
- so why do DP (slide 16): future generations expect it; historians and scientists need sources; a legacy of the present for the future; an information ecosystem to enable storytelling
- the emphasis is moving from preserving textual information to preserving complex databases
A capability model for DP, Ch. Becker et al.
- research within the SHAMAN and SCAPE projects
- SoS: systems of systems
- 3 kinds of systems (UML)
- DPS as a functional requirement
- SoS: a business system as a system within a system; the data then flow into a DPS
- a DPS where DP is not a functional requirement but which does it anyway (a DP-ready system): a business system with DP functionality
- but how to get DP into enterprise systems? a model for implementing DP in any system; within the SHAMAN project: a capability-based reference architecture
- governance, business and technical (operational) capabilities: a basis for decisions and for assessing the current state
- capability maturity model (CMM): processes for assessment and improvement, as in software development
Olivier Rouchon: certification and quality at CINES
- they store theses, digitized material, multimedia, documents and scientific data sets
- a data centre for the whole of France
- they have experts on formats and XML, 11 people
- 15 TB of data
- certification: under national law CINES is the national centre for DP of theses; they have a department, staff and money for it; for the procedure and preparation see below
- preparation for certification: testing DRAMBORA, DSA, TRAC, ISO 16363 and ISO 16919
- step: 2009 DRAMBORA audit; 2 risk reviews per year to check how the risks are being handled
- step: formalization of business processes, 14 processes according to ISO 9001: management, operational and support processes (presented at iPRES 2010)
- 2009: external pre-audit, 2 people, 19 man-days, based on all available standards, carried out through documentation review and interviews
- 2010: SIAF audit, 4 months; carried out by the National Archives of France for every archive that stores public data; they audit every 3 years; the report had 800 pages
- 2010: Data Seal of Approval, part of the EU framework for audit and certification of trusted repositories (an MoU among the three certification activities)
- 2011: within the APARSEN project they also did ISO 16363; they went through that audit together with DANS and UKDA
- first an internal audit in January and April 2011 (60 man-days), then an external one with 12 experts (KB, BL, NASA etc.) in June 2011 (3 days)
- MoU: DSA > ISO 16363 as the second step (internal) > ISO 16363 extended
- an audit gets faster the more of them you do, i.e. when done regularly it is not that time-consuming
- the National Library of New Zealand passed TRAC certification in autumn 2011
Andreas Rauber: the impact of preservation actions on the repository
- what happens to the repository itself
- repository simulation: RepoSim
- built for analysis and for testing migration: what happens when files get larger, what happens when the repository holds more format types, etc.
- RepoSim is a flexible simulator that handles irregular patterns
- so far an internal version (Hibernate, Java, MySQL)
- it lets you specify which formats are accepted and their description, the ingest settings, hypothetical tools (mainly for migration) and rules for preservation actions (migration to which format, which version, which files, how many times; rules + filters)
- a virtual migration can be run; it produces charts that show how long it will take etc.
- what to migrate, how, to what and over what period is all run virtually, and the result can be inspected
- good for IT and HW planning
- good for planning different scenarios, comparing them with the expected development, and planning HW development and investments
- still to be added: the option to specify deletion policies, reports, etc.
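The kind of question a repository simulator answers (how long would a migration take as the collection grows?) can be illustrated with a toy calculation. This is only a minimal sketch and not RepoSim itself; the `Scenario` fields, the function name and all numbers are hypothetical assumptions chosen for illustration.

```python
from dataclasses import dataclass


@dataclass
class Scenario:
    """Hypothetical planning inputs; none of these values come from RepoSim."""
    files_per_year: int        # number of files ingested each year
    avg_file_mb: float         # average file size in the first year (MB)
    size_growth: float         # yearly growth of the average file size (0.1 = 10 %)
    migration_mb_per_s: float  # assumed throughput of the migration tool


def simulate(scenario: Scenario, years: int) -> list:
    """Return one (year, total files, total MB, hours to migrate all) tuple per year."""
    rows = []
    total_files = 0
    total_mb = 0.0
    file_mb = scenario.avg_file_mb
    for year in range(1, years + 1):
        total_files += scenario.files_per_year
        total_mb += scenario.files_per_year * file_mb
        # Time to migrate everything held at the end of this year.
        hours = total_mb / scenario.migration_mb_per_s / 3600
        rows.append((year, total_files, total_mb, hours))
        file_mb *= 1 + scenario.size_growth  # files ingested next year are larger
    return rows


if __name__ == "__main__":
    plan = Scenario(files_per_year=100_000, avg_file_mb=5.0,
                    size_growth=0.10, migration_mb_per_s=20.0)
    for year, files, mb, hours in simulate(plan, years=5):
        print(f"year {year}: {files} files, {mb / 1e6:.2f} TB, {hours:.1f} h to migrate")
```

Changing a single input (for example the growth rate) and re-running gives a rough feel for how strongly hardware planning depends on that assumption.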
José Barateiro: Risk assessment in DP of e-science data and processes
- DP as risk management
- ISO 31000: the definition of risk management
- similar to DRAMBORA
- there are many standards for risk management
- the ISO 31000 methodology broken down into individual steps
- TIMBUS project, http://timbusproject.net; one of the partners is SAP (Germany)
Mad talks
- open source SW for LTP: RODA is back; it is being developed within the SCAPE project, with new features and plans for further development and for building a user community
- 4 posters on emulation: emulation within KEEP, emulation for library reading rooms, the OPF ecosystem, registries
- TOTEM: a metadata standard for describing the technical environment for emulation
ANDS, Ross Wilkinson
- data centres in Australia, at least 3, for different areas of life
- ANDS has existed for almost 3 years; funded by the Australian government
- a huge amount of data will never be used or read by a human, only mined by automated processes
- research data must be stored and preserved because it may no longer be possible to recreate them; they must be reusable, allow new conclusions to be drawn, and be available to scientists
- this has to be done in cooperation; a single institution cannot do it on its own
- who handles the storage of scientific data in the Czech Republic? The Academy of Sciences, CESNET
- similar data centres also exist in Great Britain
Rob Sharpe: Considerations for High Throughput Digital Preservation
A presentation by Tessella on their performance tests of ingest into SDB at FamilySearch.
- SDB has been in development since 2002, when the first customer was the National Archives (UK)
- a new customer: the UK Parliament
- test with FamilySearch
- 20 TB ingested per day, scanned material, a workflow with antivirus, characterization (PRONOM, JHOVE) etc.
- 1 package is roughly 1 GB, 20 thousand packages per day
- 2 Dell PowerEdge R710 servers, together costing at most 20,000 pounds
- the limiting factor turned out to be the read speed of the disks on which the data sit at the start of ingest, so they needed 130 disks in parallel (50 thousand pounds)
- storing to tape is also slow, so they needed 8 parallel tape writes (30 thousand pounds)
- storage costs 100 pounds per TB
- 7.3 PB per year
- conclusion: writing and reading are slow; tools such as JHOVE and PRONOM are fast enough; storage costs also turned out to be high
For ingesting data from the FamilySearch project they needed to sustain a throughput of 20 TB of data per day while keeping adequate data-processing procedures as required by OAIS and by the customer. The aim of the project was to identify the bottlenecks in ingesting large volumes of data.
Processes such as generating or verifying hashes, identifying formats and extracting technical metadata usually require a fast storage system when volumes are large. In the FamilySearch project they want to ingest into SDB (content acquisition, content preparation, ingest, fixity check, content metadata integrity check, characterization, i.e. format identification and validation and extraction of technical metadata) at up to 700 MB per second.
So the question is how to parallelize such a massive workflow efficiently while keeping costs to a minimum. According to their findings, parallelization makes it possible to work around the performance problems of tools such as DROID and JHOVE; contrary to their expectations, overall software performance was not a problem. The bigger problems are in the hardware, which must be able to write fast enough.
In other words, the bottleneck was in the hardware and in moving data from place to place rather than in the performance of the digital preservation tools.
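The figures quoted in the talk can be cross-checked with simple arithmetic. The sketch below uses only numbers from the notes (20 TB per day, packages of about 1 GB, 100 pounds per TB) and decimal units; the function names are illustrative, and the yearly cost is a derived figure that assumes every ingested terabyte is charged once at the quoted price, which the notes do not state.

```python
SECONDS_PER_DAY = 24 * 60 * 60


def sustained_mb_per_s(tb_per_day: float) -> float:
    """Average rate in MB/s needed to move tb_per_day (1 TB = 1,000,000 MB)."""
    return tb_per_day * 1_000_000 / SECONDS_PER_DAY


def packages_per_day(tb_per_day: float, package_gb: float) -> float:
    """Number of packages per day if each package is package_gb in size."""
    return tb_per_day * 1000 / package_gb


def yearly_volume_pb(tb_per_day: float) -> float:
    """Volume ingested in a 365-day year, in PB."""
    return tb_per_day * 365 / 1000


def yearly_storage_cost_gbp(tb_per_day: float, gbp_per_tb: float) -> float:
    """Assumption: each ingested TB is paid for once at gbp_per_tb."""
    return tb_per_day * 365 * gbp_per_tb


if __name__ == "__main__":
    print(f"average rate: {sustained_mb_per_s(20):.0f} MB/s")
    print(f"packages per day: {packages_per_day(20, 1):.0f}")
    print(f"volume per year: {yearly_volume_pb(20):.1f} PB")
    print(f"storage cost per year: {yearly_storage_cost_gbp(20, 100):.0f} GBP")
```

The results agree with the notes: 20,000 packages per day and 7.3 PB per year. The average rate of roughly 231 MB/s is about a third of the 700 MB/s peak mentioned for the workflow.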
Benefit for NK
There is no need to worry about the performance of software such as DROID or JHOVE.
Ross King: Evolving domains, problems and solutions for LT DP
- information on SCAPE and similar projects
- programme report: http://cordis.europa.eu/fp7/ict/telearn-digicult/report-research-digital-preservation_en.pdf (Stephan Strodl, Petar Petrov, Andreas Rauber, all Vienna University of Technology, Austria). A nice timeline of preservation projects; a whitepaper on the past of European DP
- funding spent on DP research is gradually growing; projects and money will not solve anything by themselves
- ARCOMEM: archiving of web archives, a socially driven web preservation model
  - social web analysis
  - archive enrichment
- ENSURE: evaluation between cost and value, automation of the preservation cycle; testbeds: healthcare, clinical trials, financial services
- SCAPE
  - preservation planning and action workflows and how to make them scalable
  - building an infrastructure for scalable preservation actions
  - developing a policy-based preservation planning tool with automated preservation watch
  - 3 testbeds: web archives, large-scale repositories, research data sets
- all the projects will produce prototype software
- digital lifecycle approach
- preservation planning plays a role in all of these projects, together with virtualization
- a slide with DP trends over recent years
- Research on Digital Preservation within projects co-funded by the European Union in the ICT programme
  - ENSURE
  - SCAPE
  - Wf4Ever, http://www.wf4ever-project.org/about
  - TIMBUS: software alone is not enough; it concentrates on the context of organizations; LTP is not only about objects but also about services etc.
  - TOTEM
Benefit for NK
Follow projects in the area of long-term preservation of digital data. Recent EU projects such as SCAPE will speed up the development of concrete tools for long-term preservation of digital data.
Erik Borglund: Record keeping in temporary command settings
- preserving documentation of crisis situations produced by the work of the police and similar bodies
- how to capture the context: flipcharts, videos and written records can be kept, but what about the context
- analogue documents are not a problem; the problem lies with digital material and conversations
- the national archive should take care of this, but it accepts only paper documents or, for example, photos from the scene
- the question: is archiving records material the same thing as archiving the course of the proceedings in digital form?
Web archiving session

BnF
- 200 TB of web-archived data
- 15 million ARC files; they have to be characterized and validated, which is time-consuming
- stored in a shared repository
- SPAR (the LTP system of the National Library of France) has a capacity of 16 PB
- they use JHOVE2 for characterization and are building a module for ARC files
- they do not want to characterize and validate the content of the ARC files, only identify the formats
- PREMIS inside METS would be too long, so they will record only metadata at the level of the information package (AIP) that are the same for the whole package; that is, a property is stated once and then only the information about which files it applies to is added, instead of repeating the information for every file
- they created a special metadata format
- this means they can ask the LTP system "give me all information packages that contain format XY" and so on; there is no need to index the metadata of the contents, which would take a long time
- they use the same approach for digitized books
- different DP policies and levels of validation for different types of web archive data: complete harvests vs. thematic harvests
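The package-level idea described above (state a property once and list the files it applies to, then query packages by format) can be sketched in a few lines. This is an illustration only and not the BnF metadata format; the function names, package identifiers and format labels are hypothetical.

```python
from collections import defaultdict


def summarize_package(files):
    """Group file names by format so each format is recorded once per package.

    files: iterable of (file_name, format_name) pairs.
    Returns a dict mapping each format to the list of files that have it.
    """
    summary = defaultdict(list)
    for file_name, format_name in files:
        summary[format_name].append(file_name)
    return dict(summary)


def packages_containing(package_summaries, format_name):
    """Return the sorted ids of all packages whose summary mentions format_name."""
    return sorted(
        package_id
        for package_id, summary in package_summaries.items()
        if format_name in summary
    )


if __name__ == "__main__":
    # Hypothetical packages; identifiers and formats are made up.
    summaries = {
        "aip-001": summarize_package([("a.html", "HTML"), ("b.html", "HTML"),
                                      ("logo.png", "PNG")]),
        "aip-002": summarize_package([("report.pdf", "PDF")]),
    }
    print(packages_containing(summaries, "PNG"))
```

The query only touches the per-package summaries, which is the point made in the notes: no index over the metadata of every contained file is needed.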
NLNZ
- 2 harvests, 20 TB in total
- they are working out metadata: how much metadata is a lot and how much is too little
- library policy says that as much metadata as possible must be stored, but that would be a problem in terms of metadata size
- for selective web harvesting they have a finished workflow (WCT); everything is catalogued

IA
- 16 billion URLs
- the oldest from 1996
- the growth is 3 TB per day, 1 PB per year
Euan Cochrane, Dirk von Suchodoletz: Replicating Installed Application and Information Environments onto Emulated or Virtualized Hardware
- capturing and preserving a complete environment on emulated HW
- for example, taking the desktop environment of the prime minister and storing it in the archive
- rendering problems
- computer forensics
- an option for preserving scientific data and records
- it is all about how to replicate an HDD and run the environment it holds inside a virtual environment
- the solution: they pulled the HDDs out of several old PCs > identify the HW requirements (HDD analysis > automatic estimate of the requirements; this information is part of every PC environment) > choose the emulation or virtualization SW (tool registries such as TOTEM from the KEEP project) > convert the HDD into a disk image suitable for emulation > try to boot the disk image on the emulated HW > add drivers
- problems with licences, personal data protection and authenticity (20% of things change: colours etc.)
- QEMU, SPARC processor emulator
Klaus Rechert: Remote Emulation for Migration Services in a Distributed Preservation Framework
- using emulation as a tool for migration
- migration tools for certain formats are often not available
- the digital object is placed into an emulated environment (a virtual machine), where it appears inside the emulated system; it can be opened in the original or another suitable application, saved in a different format and stored again in the virtual machine
Bram Lohman: Emulation as a Business Solution: the Emulation Framework (KEEP project)
- the Emulation Framework: 7 emulators, 6 platforms (x86, Amiga etc.), 23 file formats
- a solution for managing emulation tools
- setup of emulation processes
- an environment that contains emulators; when an application or a file is loaded into it, it should run as it did in its original environment
- the environment also includes a tool that shows, based on PRONOM, what format a file is and what environment is needed to run it; the environment can then be prepared directly and the file opened in it
- it loads the SW image from the OPF application database, which is being built
Geoffrey Brown: Developing Virtual CD-ROM Collections: The Voyager Company Publications
- publications of one particular publisher on CD: interactive applications for Mac; of the 200 published, only about 50 are available today
- emulation on today's systems
- an HDD snapshot directly in the emulator, so it takes one click and is very fast
- the SheepShaver emulator
Evaluation of a large Danish migration project
Before 1998 they had no formats prescribed by law.
Between 2005 and 2008 they introduced standards.
The evaluation covers the prescribed standards and the migration to them at the national archive.
The evaluation was done for the body that funded the work.
Between 2005 and 2008 they spent 30 person-years on migration; they had 10-15 people on it; 190 thousand USD was invested; total costs were 26 million USD.
It is not much data; what they actually migrated was about 1777 GB.
Various parts of the archive: tapes, population data, data on CD-R, registries and electronically populated data.
They could not read all the files, especially on tapes.
5 different types of tape.
Some had to be rescued at great expense.
Total cost of producing the preservation standard: 10 man-years, 12 thousand USD, including the manual and the implementation recommendations.
Pilot: project planning and management, and verification of the information packages.
The goal of the pilot was to arrive at a better budget and project plan.
Some problematic data in old formats, such as old databases, need clever people doing repetitive work that takes a long time; they needed good knowledge management to make this efficient.
Migration method: they wrote the requirements for the tool and a description of how manual migration should be done.
Data preparation (restructuring the data and registering the metadata of the IPs) and preparation of documentation for the migrated IPs.
Software development: in-house.
They needed 50 person-years per 1
Conclusions
Migrating standard data is cheaper; migrating from some standard tapes is cheaper, etc.
Most of the costs (80%) fell on non-standardized data, in building the migration software. Developing a tool for migrating heterogeneous or non-standard data is the most expensive part.
What they learned: the old data had not been analysed sufficiently.
Their project management was loose and they lost money.
Knowledge management: a good description of old data and of all their types, generations, locations etc. does not exist at NK and we will have difficulties with it; migration of old data at NK will be problematic.
Angela Dappert: robust migration workflow for offline media
- What is an archival object? A nice slide: a CD is not an archival object, for them it is a hand-held carrier; better is a bit-stable object, which can have a backup etc., and then an archival object, which has additional metadata (logical preservation)
- A CD is not searchable, cannot be replicated easily, has a large manual overhead, and its rendering technology becomes obsolete very quickly
- The Endangered Archives project: optical discs, CD-R, external HDs, tapes; 67 terabytes in total
- The offline hand-held carriers in the Endangered Archives project were very varied; they held data with DRM, under copyright, and with a range of such problems
- Options they chose between for every data source:
  - a disk image: a single file that contains everything on the carrier
  - or extraction of selected files only
- How important is the carrier itself? We need some information about it; it may hold traces of deleted data that we may want to keep. They made disk images of everything possible: hybrid DVDs, audio discs that also contained data, etc.
- Which disk image format should they use? Not a single disk image format for all data; a dedicated disk image format for each type
- They did it with robots (disk copying robots); sometimes large-scale disk copying robots could not be used, since they are good at producing CDs but not at ripping data from CDs
- They built their own application with disk stacks and some smaller robots; they tried LIFO and FIFO and in the end used FIFO, because LIFO had problems with the CD lifting mechanism
At KB they thought through a fairly complex workflow, how to describe it, etc.
Each robot had its own PC.
They had problems with a number of things; see the presentation.
They did not find good software for managing images, only command-line tools, and non-technical staff would make a lot of mistakes with those.
It takes a lot of people before the content gets online.
Staff must be well trained and flexible, but also able to do tedious jobs: systematic and patient.
NOTE: important for NK, where the transfer of data from discs will also have to be dealt with, and has already been dealt with.
KEEP project, Antonio Ciuffreda: towards an integrated migration environment
- Disk transfer tool: converts a disk into an image file. The image then also carries additional metadata about the file system, the MD5 of the file, etc.
- KEEP is building a Transfer Tool Framework
- Magnetic media: disk transfer tools for floppy disks, commercial and open source
- Disk2FDI: a commercial DOS tool; produces a very accurate image of a floppy disk; it takes 1 hour and the result is very large, ten times larger than the floppy disk itself. Tested on about 2260 disks; they tested emulation
- Catweasel: a commercial tool; it is a PCI card, runs on Linux and Windows XP, has a GUI. High error rate but faster; the quality of the image file was low
- Nibtools: a free tool; G64 and D64; covers only C64; DOS, Windows, Linux, but command line only. Needs a Commodore disk drive and special cables. They tested a few disks; about half of them then did not work in the emulator
- Optical media: they used 5 transfer tools, all on the same set of CDs, DVDs and games
1. Alcohol 120%: commercial; can bypass DRM etc.; supports Windows systems. 12 of 13 worked; 2 MB per second
2. Daemon Tools: commercial; several types of image files (ISO, MDS, MDF); supports Windows; three of 13 did not work
3. CloneCD: commercial; uses IMG or ISO; bypasses SafeDisc 3 protection; supports Windows; has a GUI; 11 of 13 were OK
4. BlindWrite: commercial; supports DVD, Blu-ray, WM, Xbox and other special discs. Generates ISO and some proprietary ISO image formats; one did not work; ripping speed
5. ImgBurn: does not read subchannel information into the image file (so it is not possible to seek within a film etc.); it is open source; generates DVD, BIN, CUE, IMG; Windows and Linux; 4 did not work; it is fast
Conclusions
For magnetic media, commercial and non-commercial tools do not differ in performance; Disk2FDI is accurate but very slow. KEEP will use NibTools.
Optical media: keep copy protection in mind; the tools perform similarly; there will always be errors in the images, between 30 and 10 percent; BlindWrite can handle game discs (Xbox etc.). KEEP will use ImgBurn because it is open source.
For complex images BlindWrite is better.
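Recording an MD5 alongside each disk image, as the transfer tool in the notes does, can be sketched with the Python standard library. This is a minimal illustration and not the KEEP tool; the function names and the structure of the returned record are assumptions.

```python
import hashlib
from pathlib import Path


def md5_of_chunks(chunks):
    """Return the hex MD5 of a sequence of byte chunks, hashed in order."""
    digest = hashlib.md5()
    for chunk in chunks:
        digest.update(chunk)
    return digest.hexdigest()


def read_chunks(path, chunk_size=1024 * 1024):
    """Yield the file at path in chunks so large images never sit fully in memory."""
    with open(path, "rb") as handle:
        while True:
            chunk = handle.read(chunk_size)
            if not chunk:
                break
            yield chunk


def describe_image(path):
    """Build a small fixity record (name, size, MD5) for one image file."""
    image = Path(path)
    return {
        "file": image.name,
        "size_bytes": image.stat().st_size,
        "md5": md5_of_chunks(read_chunks(image)),
    }
```

Calling `describe_image` on an image right after transfer and again later gives a simple fixity check: if the two MD5 values differ, the file has changed.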
Benefit for NK
Consider whether NK should in fact run a project to migrate the content of CDs and DVDs to online media. The speakers presented concrete experience with robotic processing and showed what problems they had with designing the workflow, choosing the type of ISO image, etc.
BnF: web archiving
They have three layers:
Harvest definition (collection)
Harvest instance (crawl metadata)
ARC files
A collection is, for example, the selective "elections 2000" collection; below it are the individual harvest instances and then the ARC files.
They collect logs, configuration and reports.
These are also stored in ARC: special ARC metadata files for each crawl instance.
PREMIS
Object, agent, event
Objects: 1. ARC files and metadata ARCs; 2. harvest instances
Harvest event: in PREMIS, an event of creation of the content files
Events: reports as an extension of the event (host report and harvest report)
Agents: administrators, software, institutions and organizations that perform the harvest
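The object/agent/event breakdown listed above can be pictured as a small data structure. This is only an illustrative sketch, not the PREMIS schema and not BnF's implementation; all class names, field names and identifiers are hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class Agent:
    name: str
    kind: str  # e.g. "administrator", "software" or "organization"


@dataclass
class PreservationObject:
    identifier: str
    kind: str  # "arc_file", "arc_metadata" or "harvest_instance"


@dataclass
class HarvestEvent:
    identifier: str
    event_type: str  # here: creation of the content files
    agents: list = field(default_factory=list)
    created_objects: list = field(default_factory=list)
    reports: dict = field(default_factory=dict)  # e.g. host report, harvest report


def objects_created_by(events, agent_name):
    """Return identifiers of all objects created in events involving agent_name."""
    return [
        obj.identifier
        for event in events
        if any(agent.name == agent_name for agent in event.agents)
        for obj in event.created_objects
    ]


if __name__ == "__main__":
    crawler = Agent(name="crawler", kind="software")            # hypothetical agent
    library = Agent(name="library", kind="organization")        # hypothetical agent
    event = HarvestEvent(
        identifier="harvest-1",
        event_type="creation of content files",
        agents=[crawler, library],
        created_objects=[
            PreservationObject("file-1.arc", "arc_file"),
            PreservationObject("file-1-metadata.arc", "arc_metadata"),
        ],
        reports={"host_report": "hosts.txt", "harvest_report": "harvest.txt"},
    )
    print(objects_created_by([event], "crawler"))
```

Keeping the reports on the event mirrors the note that host and harvest reports are treated as extensions of the harvest event.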
ContainerMD
http://bibnum.bnf.fr/containerMD-v1/documentation/containerMD-v1.html
special metadata for material from the web archive
http://bibnum.bnf.fr/containerMD-v1
Different SLAs for different types of material and for data from different harvests. Shared repository: they expect various benefits; skills for different formats do not need to be duplicated within the institution.
Next year there should also be a JHOVE2 module for WARC.
Memento: meta search
Benefit for NK
Their web archiving model could be used at NK.
Cost models: Danish national library + TU Wien
- Stephan Strodl, TU Wien: they have their own cost model, but apparently only for the cost of small-scale automated preservation actions
- The Danish national library built its own model, which is meant to be universal and usable anywhere
- They measured the cost of submission according to the PAIMAS standard
- When calculating costs they use OAIS and PAIMAS: they map activities onto these models and then estimate the cost of the processes accordingly
- costmodelfordigitalpreservation.dk
Benefit for NK
Relevant to project 0136, which dealt with options for estimating the cost of long-term storage.
Meet RODA, a Full-Fledged Digital Repository for Long-Term Preservation
- Originally a project of the Portuguese national archive; we have been following it for several years. RODA is now supported, and partly also developed further, by an independent company. So far RODA supports only the archival metadata format (EAD), but further development should include library formats as well
- RODA is now part of the SCAPE project, where the system can be developed further and scaled for use in large-scale production
- http://redmine.keep.pt/projects/roda-public
Benefit for NK
Follow further development, possibly also for the INCAD + KNAV projects; for building LTP at smaller institutions this could be an interesting alternative in the future.
Foreign business trip report
Name of the traveller: Mgr. Andrea Fojtů (AF)
Workplace per organizational structure: 15 Department of Long-Term Preservation of Digital Data (ODODD)
Position: 152 Department of Digital Repository Content Management
Purpose of the trip: participation in the conference Aligning National Approaches to Digital Preservation
Place (city): Tallinn
Place (country): Estonia
Dates (from-to): 22-26 May 2011
Detailed schedule:
Monday 23 May: conference registration; guided tour of the National Library; Keynote Address by Laura Campbell, Library of Congress, USA; Panel 1: Technical Alignment; Panel 2: Organizational Alignment
Tuesday 24 May: Keynote Address by Gunnar Sahlin, National Library of Sweden; Panel 3: Standards Alignment; Panel 4: Legal Alignment; Breakout Sessions for panels 3 & 4
Wednesday 25 May: Panel 5: Education Alignment; Panel 6: Economic Alignment; Breakout Sessions for panels 5 & 6; Synthesis; Closing remarks
Fellow travellers from NK: PhDr. Bohdana Stoklasová (BS), Ing. Tomáš Svoboda (TS)
Funding: IOP NDK
Trip objectives: attendance at a conference with international participation; making contacts in the area of long-term preservation of digital documents and electronic legal deposit; a deeper insight into long-term preservation of digital documents, especially in national libraries
Fulfilment of the trip objectives (specifically): The conclusions of the conference Aligning National Approaches to Digital Preservation largely repeat the conclusions of the workshop The Future of the Past: Shaping new visions for EU research in digital preservation (report available at http://cordis.europa.eu/fp7/ict/telearn-digicult/future-of-the-past_en.pdf), for example regarding the missing economic model for the commercial sector, which would treat long-term preservation as an integral part of all its processes. Contact was made with staff of the national libraries of Estonia and Finland (staff working on web archiving and on long-term preservation in general).
Programme and further details: The main goal of the conference was to align national approaches to long-term preservation of digital documents across all areas, from technical, organizational and educational to standardization, economic and financial.
Materials brought back: conference programme, leaflets of exhibiting companies (Tessella, Equella, Guardtime) + other materials, notes
Report submitted on: 8 June 2011
Signature of the submitter
Signature of the supervisor
Posted on the intranet
Received by the international department
Appendix to this report: conference notes in English

Appendix: conference notes in English
Exploring What We Can Do Together: Strategic Alliance for International Collaboration, Laura Campbell

185 digital preservation partners in more than 25 countries (education, research, LAM)
strategic goals: National Content Stewardship Network (national digital collection, technical architecture, public policy, outreach)
NDIIPP Content Domain Map: a mind map of geospatial, audiovisual, image & text, and web content
then & now: cognitive surplus vs. digital libraries/digital preservation
solution framework: actively working together, special interest groups, establishing a common index, international digital collection (freely available)
PANEL 1: Technical Alignment (The role of testing), Prof. Dr. Michael Seadle (Panel Head)
to collaborate on requiring and implementing rigorous and independent tests
DNB Contribution to the Tallinn Alignment, Sabine Schrimpf

key theme is infrastructure
network of hardware and software that permits the operation of applications
the question of interoperability is crucial (standards, technical specifications)
SW elements
components of the DP infrastructure were compared to pallets on railroads
o Source: PARSE.Insight Roadmap 2020
PersistentID resolvers, certification process
kopal (ingest: koLibRI)
nestor (German Network of Expertise)
DP4Lib: Digital Preservation as a Service; reduce dependency between components
o redundant storage at different locations
o koLibRI modules
LUKII: set up as an economical LOCKSS network in Germany
SHAMAN
APARSEN: wants to bring coherence and cohesion to digital preservation research
o trends in DP research projects
o modular DP systems
o distributed as SOA
o elimination of technology dependencies
EDINA: The UK LOCKSS Alliance, Adam Rusbridge

EDINA offers underlying technical support & coordination
threats to digital stewardship
o failure (media, HW, SW, network, format obsolescence, natural disaster, economic/organizational failure)
o attack (insider/outsider)
o operator error
source: Requirement to Digital Preservation
projects PEPRS and PECAN help identify coverage and requirements for DP
Public testing, Michael Seadle
- traditional physical archiving relies heavily on trusted institutions
- distrust, not trust, needs to be the basis of digital archiving; testing therefore plays a key role
- goals of testing: demonstrate functionality, reveal weaknesses, provide data for planning improvements
- key issues for testing: integrity, authenticity (can the origin or genesis be shown?), usability (can migration/emulation be demonstrated?), access, financial integrity
- DrWho (drwho1com)
- bit-stream testing is the most important; authenticity and usability may be impaired
o the type of storage media, the number of copies, and the frequency of checking and replacement get us to the relevant results
o no reliable metrics exist, however (what is an acceptable loss, etc.)
- without well-documented, peer-reviewed, publicly available test results, librarians are buying archiving systems on faith
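The bit-stream testing noted above depends on the number of copies and on how often they are checked. A minimal sketch of one such check follows, assuming each copy can be read as bytes and that SHA-256 is an acceptable digest (the talk names no algorithm); the function names and the majority-vote rule are illustrative assumptions, not something presented at the panel.

```python
import hashlib
from collections import Counter


def digest(data: bytes) -> str:
    """Return the SHA-256 hex digest of one stored copy."""
    return hashlib.sha256(data).hexdigest()


def audit_copies(copies: dict[str, bytes]) -> dict[str, bool]:
    """Compare every copy against the majority digest.

    Returns location -> True if the copy agrees with the majority,
    False if it is a candidate for repair or replacement.
    (Majority voting is an assumption made for this sketch.)
    """
    digests = {location: digest(data) for location, data in copies.items()}
    majority, _count = Counter(digests.values()).most_common(1)[0]
    return {location: d == majority for location, d in digests.items()}


if __name__ == "__main__":
    copies = {
        "site-a": b"archived bit stream",
        "site-b": b"archived bit stream",
        "site-c": b"archived bit strean",  # simulated bit rot
    }
    print(audit_copies(copies))
```

Running the audit on a schedule and logging how many copies fail per run would give exactly the kind of data for which, as the notes say, no agreed metric yet exists.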
Presentation without a title, Andy Rauber
- evaluation vs. testing vs. benchmarking
- DP testing is evaluation rather than testing, and far from benchmarking (a few tests, but not near a definition of benchmarks)
o existing evaluations are not repeatable
o focus on the simple things
o building the frameworks before having clear test scenarios
- necessary to move towards comparative benchmarking
- what is needed: a commitment that we want a culture of benchmarking and comparative evaluation; an understanding of what we want to benchmark; benchmark data + ground truth; measurement scales and measures that remain constant; knowledge bases to collect these
Organizing digital preservation on an international level, Michelle Gallinger (NDIIPP)
- focus on a national DP agenda
- community-driven, action-oriented (National Digital Stewardship Alliance)
o present a distributed national digital collection for the benefit of citizens
The European Research Arena, David Giaretta
- technologies: GEANT, EGEE/EGI
- EU research projects: TIMBUS, BLOG4EVER, SCAPE, ENSURE, APARSEN, ARCOMEM, WF4EVER
o SCIDIP-ES (2011-2014)
- Alliance for Permanent Access (APA): formed as a legal entity 3 years ago
o opportunities for networking
- ISO 16363: Audit and Certification of Trustworthy Digital Repositories
- ISO 16919: Requirements for Bodies Providing Audit and Certification
Observation from the MetaArchive Cooperative program, Martin Halbert
- distributed DP programs differ from other programs
o replication of content, distribution of these replicated copies to distinct geographical locations, and network organization to connect these replicated copies
- MetaArchive: established in 2003, funded by NDIIPP
o seeks to foster broader awareness of digital preservation issues
- IIPC members are all institutions that focus on WA
o 39 members: national and university libraries + other organizations (Internet Archive)
o ISO standard WARC format for web archives + Heritrix and NutchWAX
o growing membership (Africa & South America)
DAY 2
Keynote address: International and National Collaboration in the digital age, Gunnar Sahlin
- 2012: a new law for e-deposit
- Samsök (search together) in 2005 (upgrade, new system)
- Swepub and long-term preservation
- Consortium of the Swedish research libraries for licensing e-journals and databases (ICOLC)
- Open Access and e-publishing (all universities have their repositories for e-pubs)
- NL as aggregator for Europeana: TEL, Apres, Athena, EU screen
o common system for the preservation of digital materials
o common search portal for materials from the Swedish National Library and Swedish National Archive
Raivo Ruusalepp
- standards: RAC, DSA, CIDOC (CRM), PAIMAS, ISA (DG), DDI
- use of information security standards for digital preservation
- information security: administrative and technical (physical = data security vs. IT = communication)
- company-implemented security measurement with typical cyber crime scenarios
- survey of security
o provision for information security in national legislation and development plans (12 of the respondents: ISO 27000 series; only 2 formal audits, the rest are looking into it; ½ of the respondents do not use standards or formal measurements to control information security)
o IT & disaster plan: 65 (data recovery from the off-site location tested: 0)
- alignment
o better use of community standards for information security and preservation
o agreement on security requirements
Standards based approach to preservation planning, Matthew Woollard
- ISO 27001: very expensive implementation, 100 000 pounds
- Basic: Data Seal of Approval Guidelines (helps understand your business better)
- Audit and Certification of Trustworthy Digital Repositories
- Memorandum of Understanding to Create a European Framework
- ISO 16363 external; DSA or ISO 16363 self-audit
Best Practices & Standards, Bram van der Werf
- self-assessment: trust
- audits/certification: trust
- ISO 30300 (draft): Record Management
PANEL 4: Legal Alignment
Legal deposit & Web Archiving, Adrienne Muir
- legal deposit provisions: purpose, scope, deposit mechanisms, roles & responsibilities & liability, access provisions, sanctions
- implementation: definitions, scope (offline/online, freely available/pay wall), technology-neutral, incremental
- legal deposit vs. voluntary (interim/hybrid approach, model agreements and licenses, flexibility)
- other legal issues: intellectual property rights, preservation, access, unlawful material, privacy/data protection
- voluntary approaches have disadvantages but may be necessary and can be useful
Breakout Session
- standards bring along alignment by themselves, but only if you use them to the full, not halfway
- depends on community (users): enforceable or voluntary compliance (standards)
- next steps for standard alignment
o corpora as benchmarks
o export/import completeness
o educational standards
o validation tools
o accredited training courses to accredit auditors
o framework standards
DAY 3
PANEL 5: Educational Alignment
- key elements of the DCC Curation Lifecycle Model
- Framework for the Education Alignment (USA: grad programs for digital preservation, new models for grad programs, internship programs (diverse knowledge), workshops)
o sharing tools
o national programs
- related issues to digital preservation
o nature of costs and business models
o strategies for selection & appraisal
o ground roles and responsibilities
o effectiveness and demand for services
- focus of the panel: factors influencing the actual sustainability of a digital archive
- 2 considerations: collaboration + user demand
- challenges + gaps: span national boundaries, public + private funding, education, exportation, DP certification, competition, funding gaps, policy, selection criteria, roles & responsibilities, standards
- Magazzini Digitali: e-legal deposit in Italy
- PADI: a failed project (discontinued in 2010)
Presentation without a title, Neil Grindley
- how much does it cost to manage information? what institutional financial strategies are required to facilitate effective preservation? what general economic frameworks are required to enable information to persist and be accessible?
- JISC 2010: Infrastructure for Education and Research Programme
- archival storage and preservation activities constitute a very small proportion of the overall costs: 1531 access 55 outreach acquisition ingest
o approx. 333 Euros for a set of 1000 records
- KRDS2 p. 83: future tool development, supporting automation of ingest
Sustainable Preservation in North America: ADPNet & Friends, Aaron Trehub
- solution: distributed digital preservation (in at least 3 copies vs. LOCKSS 6 copies)
- DDP + LOCKSS = PLN; open SW developed at Stanford
- MetaArchive, COPPUL (Canada), ADPNet (www.adpn.org)
Salt Lake City, USA
Dates (from-to) -
14.5. departure from Prague - - 16.5. PREMIS workshop -+ 17.-19.5. Archiving 2011 conference, 20.5. departure from SLC - -
15.5. departure Washington-Detroit-Salt Lake City, 15.5. workshops T1D Color in Image Capture & Archiving, T2A Scanner & Camera Imaging Performance: Benchmarking, Compliance and Workflow Monitoring, 17.-19.5. Archiving 2011 conference
NK, Archiving conference, IOP-NDK
0136
NA 6.6.2011
Date 6.6.2011 Signature, Date Signature
Date Signature
tutorial PREMIS
---------------------------------------------------------- Implementation of a High Performance Architecture for Managing and Storing Web-Harvested Collections Michael Smorul and Joseph JaJa University of Maryland (USA)
https://wiki.umiacs.umd.edu/adapt/images/6/6b/Archiving11-smorul.pdf
- web archiving - WARC manager application
- -
-
-
- P) -
Using Tape for Large-Scale Digital Preservation Gary Wright FamilySearch (USA)
- (The Church of Jesus Christ of Latter-day Saints)
-
NDK - DRPS Ingest Tools; storage type: storage grid, NetApp FAS3170
P information lifecycle management R optimization of the storage layer between the Grid and Rosetta
- - na -
- human error - - - HW errors - data integrity validation - - - - maximization - integrity verification
metadata conten
Moving On When it is Time to Re-Archive Michael Selway Quantum Corporation (USA) CHW) D
with HDDs it cannot be done otherwise
Data migration from   How to approach   migration
- HDD - - years
and could
- - - - -
what about split migration (full split migration)
- - - - potaz
FamilySearch An End-to-End Process for Scanning Characterizing Preserving and Providing Access to Very Large Collections of Vital Records Tom Creighton FamilySearch (USA) Jonathan Tilbury Tessella plc (UK) and Mark Evans Tessella Inc (USA)
- At FamilySearch: microfilms from the mid-19th century - - - - - -
- DPS is their - -
The Audit and Certification of FDsys
- FDsys - audit
TRAC http://www.gpo.gov/fdsys
- - TRAC: HathiTrust, UNT, Portico, MetaArchive, Chronopolis
- compliance scale 1- -
How Long is Long-Term Data Storage (Focal) Barry M Lunt Brigham Young University and Douglas Hansen Wayne Rust and Mark Worthington Millenniata Inc (USA)
- - FLASH technology 10- full electron
outflow of electrons and
-
and yet errors appeared
- Library 1 21 - Library 2 18
- 1050 years - HDD 1-7 years -
Quality Assurance of Digital Information in Long-Term Digital Preservation University of Technology (Sweden)
- significant properties - dokumentech z -
Towards Interoperable Preservation Repositories Repository Exchange Package Use Cases and Best Practices Joseph Pawletko New York University and Priscilla Caplan Florida Center for Library Automation (USA)
- TIPR
- - - TIPR
-
- RXP: METS and PREMIS semantics; metadata files not inside the METS but alongside it in
- - succession - - - - disaster recovery and in
as AIP - SW migration - - diversification - data migration in -
SARKK Comprehensive Digital Archive Services for Finnish Municipalities
-Savon Tietohallinto Oy (Finland)
- a company which in ICT long-
- - - preservation planning functionality -
-
Magnetic Tape Technology economic advantages for preservation Gary Francis Oracle USA
- Oracle presentation - - Clipper Group 2010: in search for the long-term archiving solution, tape delivers significant
TCO advantage over disk - 5 TB per cartridge (Sun/Oracle)
-----------------------------------------
Color In Digital Preservation Robert Buckley University of RochesterNewMarket Imaging Steven Puglia National Archives and Records Administration and Michael Stelmach Library of Congress (USA)
-
atd -
o colours by GAMUT
AdobeRGB, ProPhoto RGB
- and colour reproduction
Multispectral Image Archiving of Watermarks in Historical Papers Peter Meinlschmidt Wilhelm-Klauditz-Institut Fraunhofer-Institute for Wood Research and Volker
Braunschweig (Germany)
- Reflected light - Transmitted light
-spectral imaging -
o
o o o o
Implementing a Quality Assurance Program for Monitoring Scanner Performance Michael J Horsley and John T Berezich National Archives and Records Administration (USA) DAITSS
- -
microfilm, scan and - NARA 2004 Guidelines
http://www.archives.gov/preservation/technical/guidelines.pdf
- Metamorfoze - etc., see slides - quantitative performance slides - web-based database, SharePoint -
Preservation in a Digital Age Jay Verkler FamilySearch (USA)
- with Tessella for SDB, will be 1 - - - data loss is intrinsic > - - -
- preservation as a service
Curation of the End-of-Term Web Archive Classification and Metrics Kathleen Murray Lauren Ko and Mark Phillips University of North Texas (USA)
- - eotcd archive http://research.library.unt.edu/eotcd/wiki/Main_Page - - for - 16 TB of data - - (not WARCs)
then join it together
DAITSS Grows Up Migrating to a Second Generation Preservation System (Focal) Priscilla Caplan and Carol Chou Florida Center for Library Automation (USA)
- DAITSS 1 could not be installed elsewhere -
instructions
- in processes
- - PSP
- refresh funkce - disseminate- do a refresh and export new AIP as DIP - withdraw remove AIP from storage retaining provenance
-
1
- - 80TB -
A Community Driven Micro Services Architecture Supporting Long Term Digital Preservation Mark Evans and Bill Steel Tessella Inc (USA) and Robert Sharpe James Carr Alan Gairey and Jonathan Tilbury Tessella plc (UK)
with NDIIPP in micro-services
joined into a process; can then be arbitrarily
- - - -
-
workflow - -
- bit-stream - cloud (S3
http://aws.amazon.com/s3) SaaS functionality will soon be multi-tenancy
functionality, policy, etc.
Notes from the USA
44 TB SIP in the test (10 MB JP2 files)
SDB community - - founded 2008 - -
System
Lisa LaPlant and Blake Edwards, US Government Printing Office (USA): FDsys, OAIS, dig, MODS - from
<part>2</part>
DMD data model definition from
http://www.gpo.gov/fdsys/search/pagedetails.action?st=prague&granuleId=&packageId=DCPD-200900228
----------------------------------------- Preservation Starts from the Beginning Michael Wash US Department of Transportation (USA)
Autographic Kodak 1916 na druhou
following the No 3A Autographic Kodak Special of 1916, which was the first rangefinder camera. It had a Kodak Anastigmat f/6.3 lens and a Kodamatic shutter with speeds from 1/2 to 1/200 sec plus bulb and time mode. http://camerapedia.wikia.com/wiki/No_1A_Autographic_Kodak_Special Kodak Advantix
stopped
Colorite: A Flexible Cross-Platform Software Solution for Automatic Image Quality Analysis Using Arbitrary Targets, Henrik Johansson, National Library of Sweden (Sweden)
- a - -
o the target is detected automatically so that this process
o TIFF JPEG JP2 PNG o o state-of-the-art feature-based image matching algorithm, ImageMagick
o Alg
t o o switch off in the future so that the process is sped up for BATCH
process o
Henrika Johanssona What if the Image Quality Analysis Rates My D Dietmar Wueller Image Engineering (Germany)
- Golden thread - UTT target -
o
-lens reflex
Establishing Resolution Requirements for Digitizing Transmissive Content A Use Case Approach Michael Stelmach Library of Congress Don Williams Image Science Associates LLC and Steven Puglia National Archives and Records Administration (USA)
-
- Based on 10 SFR limiting resolution criteria how much the image information will be captured
o 1st half of the 20th century: 1200-1600 PPI o 2nd half of the 20th century: up to 2800 PPI o o
Digitise More Pay Less Optimising the Workprocess for both Heritage Institute and Imaging Provider Olaf Slijkhuis Pictura Imaginis (the Netherlands)
- document, quality, parameters, etc. - -
- control
of the image
Debate with FamilySearch, 20.5.2011 -------
also on Digital Preservation issues
will have to provide free of charge
and synchronize
an effort to cooperate with archives and libraries on their side
CO-FINANCED FROM EU STRUCTURAL FUNDS (EUROPEAN REGIONAL DEVELOPMENT FUND) THROUGH THE IOP
Business trip report
Project: Creation of the National Digital Library
CZ 10611000706386
Name and surname of the trip participant: Jan Hutař
Workplace according to the organizational structure: ODF 81
Position: head of department
Purpose of the trip: attending the iPRES 2011 conference
Place (city): Singapore
Place (country): Singapore
Dates (from-to): 30.10.-5.11.2011
Detailed schedule:
30.-31.10. flight Prague > Dubai > Singapore
1.11. start of the conference, tutorials
2.11.-4.11. conference
4.-5.11. return flight Singapore > Dubai > Prague
Fellow travellers from NK: Mgr. Marek Melichar (paid from project 0136)
Funding: IOP, Creation of the National Digital Library
Relation to the project: obtaining new information on digital preservation and on projects in other libraries; consultations with colleagues and companies
Trip goals: see relation to the project; use all outputs for planning and running the NDK project and for future solutions to digital preservation issues at NK/NDK
Fulfilment of trip goals: fulfilled; see the detailed notes below and the proceedings on SPS
Further detailed information: SUMMARY AND CONTRIBUTION TO THE NDK PROJECT
- a noticeable rise of emulation as an approach to long-term preservation (in previous years it was migration) > the two approaches seem likely to complement each other
- a shift toward preserving complex data, databases, etc.; NK does not address this yet
- many contributions are applicable to NK and NDK (web archiving and preservation at the National Library of France, audits, emulation, information on the SDB system (Tessella) and on the RODA system, certification, see Rouchon, etc.)
- information on problems and solutions when using tools such as JHOVE, PRONOM and others in a real environment
- 2 contributions on backing up optical discs, a current problem at NK as well; ideally follow the procedures described in the proceedings
- a clear need for international cooperation and for adherence to standards so that such cooperation is possible
- for more detail see below
Support of project publicity: NA
Related materials
Material / Storage location
conference proceedings / SPS, folder with business trip reports
Date of report submission: 15.11.2011
Signature of the person submitting the report
Date, Signature
Supervisor's signature: 15.11.2011
Posted on the intranet
Received by the international department
Seamus Ross: digital curation and preservation
- preserving data sets: use of statistical databases and research data vs. preservation of textual information
- personal data: no longer just photos in a box (Flickr etc.)
- banks: lots of personal data, future use for historians, must be preserved; institutions do this as well
- see mckinsey.com, big data full report (pdf)
- so why do DP (slide 16): future generations expect it; for historians and scientists, so that they have some sources; a legacy about the present for the future; an information ecosystem to enable storytelling
- emphasis shifts from preserving textual information to preserving complex databases
A capability model for DP, Ch. Becker et al.
- research within the SHAMAN and SCAPE projects
- SoS: systems of systems
- 3 kinds of systems (UML)
- DPS with DP as a functional requirement
- SoS: business system, a system within a system; the data then flows into the DPS
- DPS where DP is not a functional requirement but which does it anyway (DP-ready system): a business system with DP functionality
- but how to get DP into enterprise systems? A model for implementing DP in any system, within the SHAMAN project: capability-based reference architecture
- governance, business and technical (operation) capability: a basis for decisions and for assessing the current state
- capability maturity model (CMM): processes, assessment and improvement, as known from SW development
Olivier Rouchon: certification and quality at CINES
- they store theses, digitized items, multimedia, documents, scientific data sets
- a data centre for the whole of France
- they have experts on formats, XML; 11 people
- 15 TB of data
- certification: national law; CINES is the national centre for DP of theses; they have a department, people and money for it; for the procedure and preparation see below
- preparation for certification: testing DRAMBORA, DSA, TRAC, ISO 16363 and ISO 16919
- step: 2009 DRAMBORA audit, 2 risk reviews per year on how their resolution is progressing
- step: formalization of business processes, 14 processes according to ISO 9001
- management, operational and support processes (presented at iPRES 2010)
- 2009 external pre-audit: 2 people, 19 man-days, based on all available standards, through documentation review and interviews
- 2010 SIAF audit: 4 months; performed by the National Archives of France for every archive that stores public data; they audit every 3 years; the report had 800 pages
- 2010 Data Seal of Approval: part of the EU framework for audit and certification of trusted repositories (MoU among the three certification initiatives)
- 2011 within the APARSEN project they also did ISO 16363; together with DANS and UKDA they went through that audit
- first internal, January and April 2011 (60 man-days), then external, 12 experts (KB, BL, NASA, etc.) in June 2011 (3 days)
MoU: DSA > ISO 16363 as the second step (internal) > ISO 16363 extended
the audit gets faster the more of them you do, i.e. if it is done regularly it is not so time-consuming
The National Library of New Zealand passed TRAC certification in autumn 2011
Andreas Rauber: impact of preservation actions on the repository
- what happens to the repository itself
- repository simulation: RepoSim
- for analysis and testing of migration: what happens when files grow in size, what if the repository holds more types of formats, etc.
- RepoSim: a flexible simulator, irregular patterns
- so far an internal version: Hibernate, Java, MySQL
- one can specify which formats it accepts, their description, ingest settings, hypothetical tools (mainly for migration), and rules for preservation activities (migration into which format, which version, which files, how many times; rules + filters)
- option to run a virtual migration: graphs are produced that show how long it will take, etc.
- what, how, into what and for how long to migrate: it runs virtually and we see the result
- good for IT and HW planning
- good for planning various scenarios, comparison with the expected development, planning HW growth and investments
- they still have to add the option to enter deletion policies, reports, etc.
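The idea of a virtual migration can be illustrated with a small estimate of how long a rule would take to run over the holdings. This is a sketch only, not RepoSim's design or API: the class names, the fields, and the assumption that a migration tool has a constant throughput are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class FormatHolding:
    """Files of one format held in the simulated repository."""
    name: str
    file_count: int
    avg_size_mb: float


@dataclass
class MigrationRule:
    """Hypothetical rule: migrate one format with a tool of a given speed."""
    source_format: str
    target_format: str
    throughput_mb_per_s: float  # assumed constant, which real tools are not


def simulate_migration(holdings, rules):
    """Estimate the duration in hours of each rule that matches a holding."""
    by_format = {h.name: h for h in holdings}
    estimate = {}
    for rule in rules:
        holding = by_format.get(rule.source_format)
        if holding is None:
            continue  # the rule matches nothing in the repository
        volume_mb = holding.file_count * holding.avg_size_mb
        seconds = volume_mb / rule.throughput_mb_per_s
        estimate[(rule.source_format, rule.target_format)] = seconds / 3600
    return estimate


if __name__ == "__main__":
    holdings = [FormatHolding("TIFF", 100_000, 36.0)]
    rules = [MigrationRule("TIFF", "JP2", 10.0)]
    print(simulate_migration(holdings, rules))
```

Varying the file counts and sizes over simulated years would give the kind of growth scenarios the notes mention for HW and investment planning.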
José Barateiro: Risk assessment in DP of e-science data and processes
- DP as risk management
- ISO 31000: definition of risk management
- similar to DRAMBORA
- there are many standards for risk management
- elaboration of the ISO 31000 methodology into individual steps
- TIMBUS project, http://timbusproject.net; one of the partners is SAP (Germany)
mad talks
- open source SW for LTP: RODA is back; it is being developed within the SCAPE project, with new features and plans for further development and for building a user community
- 4 posters on emulation: emulation within KEEP, emulation for reading rooms in libraries, OPF ecosystem registry
- TOTEM: a metadata standard for describing the technical environment for emulation
ANDS, Ross Wilkinson
- data centres in Australia, at least 3 for different areas of life
- ANDS has existed for almost 3 years; funded by the Australian government
- an enormous amount of data will never be used or read by a human, only by automated processes and mining
- research data must be stored and preserved because it may no longer be possible to recreate them; so that they can be reused, so that new conclusions can be drawn from them, so that scientists have them available
- must be done in cooperation; it cannot be done by a single institution alone
- who handles the storage of scientific data in the Czech Republic? Academy of Sciences, CESNET
- similar data centres also exist in Great Britain
Rob Sharpe: Considerations for High Throughput Digital Preservation
Presentation by Tessella: their performance testing of ingest into SDB at FamilySearch
- SDB has been developed since 2002, when the first customer was the National Archives UK
- new customer: UK Parliament
- test with FamilySearch
- 20 TB ingested per day: scanned materials, workflow with antivirus, characterization (PRONOM, JHOVE), etc.
- 1 package is roughly 1 GB; 20 thousand packages per day
- 2 Dell PowerEdge R710 servers, total price at most 20,000 pounds
- the limiting factor turned out to be the read speed of the disks on which the data is stored at the start of ingest; they therefore needed 130 parallel disks (50 thousand pounds)
- storing on tape is also slow; they therefore needed 8 parallel tape writes (30 thousand pounds)
- storage costs 100 pounds per TB
- 7.3 PB per year
- conclusion: writing and reading are slow; tools such as JHOVE and PRONOM are fast enough; storage costs also turned out to be high

For the ingest of data from the FamilySearch project they needed to ensure a throughput of 20 TB of data per day while keeping adequate procedures for processing the data according to the requirements of OAIS and of the client. The project was about identifying the bottlenecks of ingesting large volumes of data.

Processes such as generating or checking hashes, identifying formats and extracting technical metadata usually require, at large volumes, a fast storage system. In the FamilySearch project they want to ingest into SDB (content acquisition, content preparation, ingest, fixity check, content/metadata integrity check, characterization, i.e. format identification and validation and extraction of technical MD) at most 700 MB per second.

The question is how to parallelize such a massive workflow efficiently while minimizing costs. According to their findings, parallelization makes it possible to work around the performance problems of tools such as DROID and JHOVE; overall, software performance was, contrary to their expectations, not a problem. The bigger problems are in HW, which has to be able to write fast enough.

I.e. the bottleneck was in HW and in moving data from place to place rather than in the performance of digital preservation tools.
Benefit for NK
Do not be afraid of the performance of SW such as DROID or JHOVE
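The figures reported for the FamilySearch test can be cross-checked with simple arithmetic. The sketch below assumes decimal units (1 TB = 1000 GB = 1,000,000 MB); the per-device rate passed to `devices_needed` is a hypothetical input, since the notes do not state the speed of a single disk or tape drive.

```python
import math

SECONDS_PER_DAY = 24 * 60 * 60


def ingest_profile(tb_per_day: float, package_gb: float) -> dict:
    """Derive sustained rates from a daily ingest volume (decimal units)."""
    mb_per_day = tb_per_day * 1_000_000
    return {
        "mb_per_second": mb_per_day / SECONDS_PER_DAY,
        "packages_per_day": tb_per_day * 1000 / package_gb,
        "pb_per_year": tb_per_day * 365 / 1000,
    }


def devices_needed(target_mb_per_s: float, per_device_mb_per_s: float) -> int:
    """Parallel devices needed to sustain a target rate (rates are assumptions)."""
    return math.ceil(target_mb_per_s / per_device_mb_per_s)


if __name__ == "__main__":
    profile = ingest_profile(tb_per_day=20, package_gb=1)
    print(profile)
    # Hypothetical example: a 700 MB/s peak served by devices of 100 MB/s each.
    print(devices_needed(700, 100))
```

With 20 TB per day and 1 GB packages this reproduces the 20 thousand packages per day and 7.3 PB per year given in the notes; the sustained average works out to roughly 231 MB per second, below the stated 700 MB per second maximum.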
Ross King: Evolving domains, problems and solutions for LT DP
- info on projects such as SCAPE
- programme: http://cordis.europa.eu/fp7/ict/telearn-digicult/report-research-digital-preservation_en.pdf (Stephan Strodl, Petar Petrov and Andreas Rauber, Vienna University of Technology, Austria): a nice timeline for preservation projects, a whitepaper about the past of European DP
- funding spent on DP research is gradually growing; projects and funding alone will not solve anything
- ARCOMEM: archiving of web archives, socially driven web preservation model
- social web analysis
- archive enrichment
- ENSURE: evaluation between cost and value, automation of the preservation cycle; testbeds: healthcare, clinical trials, financial services
- SCAPE
- preservation planning and action workflows: how to make them scalable
- creating an infrastructure for scalable preservation actions
- development of a policy-based preservation planning tool with automatic preservation watch
- 3 testbeds: WA, large-scale repositories, research data sets
- all projects will produce prototype SW
- digital lifecycle approach
- preservation planning plays a role in all these projects, together with virtualization
- slide with trends in DP over recent years
- Research on Digital Preservation within projects co-funded by the European Union in the ICT
- Ensure
- Scape
- Wf4Ever, http://www.wf4ever-project.org/about
- Timbus: SW is not enough; it focuses on the context of organizations; LTP is not only about objects but also about services, etc.
- Totem
Benefit for NK

Follow projects in the area of long-term preservation of digital data. The latest EU projects such as SCAPE will speed up the development of concrete tools for long-term preservation of digital data.

Record keeping in temporary command settings, Erik Borglund
- preservation of documentation on crisis situations arising from the activities of the police, etc.
- how to capture context: flipcharts, videos and notes can be kept; the context of analogue documents is not a problem, the problem is with digital items and conversations
- the national archive should take care of this, but it only accepts paper documents or, e.g., photos from the meeting place
- question: is archiving the file material the same as archiving the course of the proceedings in digital form?
webarchiving session
BnF
- 200 TB of web-archived data
- 15 million ARCs; they must characterize and validate them, which is time-consuming
- stored in a shared repository
- SPAR (the LTP system of the French national library) has a capacity of 16 PB
- they use JHOVE2 for characterization and are creating a module for ARCs
- they do not want to characterize and validate the content of ARCs, only identify formats
- PREMIS in METS would be too long, so they will record only metadata at the level of the information package (AIP) that are the same for the whole package; i.e. a property is expressed once and then only the information about which files match it is added, instead of repeating that information for every file
- they created a special metadata format
- i.e. they can ask the LTP system "give me all information packages that contain format XY", etc.; there is no need to index the metadata of the contents, which would take a long time; they use the same approach for digitized books
- different DP policies and validation levels for different types of WA data: complete harvests vs. thematic harvests
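The package-level approach described for BnF, where a property is stated once and followed by the list of files that share it, can be sketched as a simple grouping. This illustrates the idea only; it is not the BnF metadata format, and the function names, identifiers and use of MIME types as format labels are assumptions.

```python
from collections import defaultdict


def group_by_format(file_formats: dict[str, str]) -> dict[str, list[str]]:
    """Express each format once and list the files that share it."""
    grouped = defaultdict(list)
    for path, fmt in sorted(file_formats.items()):
        grouped[fmt].append(path)
    return dict(grouped)


def packages_containing(
    packages: dict[str, dict[str, list[str]]], fmt: str
) -> list[str]:
    """Answer 'which packages contain format XY' from package-level metadata."""
    return sorted(pid for pid, formats in packages.items() if fmt in formats)


if __name__ == "__main__":
    aip_1 = group_by_format(
        {"a.html": "text/html", "b.html": "text/html", "c.gif": "image/gif"}
    )
    aip_2 = group_by_format({"x.png": "image/png"})
    print(aip_1)
    print(packages_containing({"aip-1": aip_1, "aip-2": aip_2}, "image/gif"))
```

The query only inspects one small record per package, which is why the contents of each ARC do not need to be indexed.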
NL NZ
- 2 harvests, 20 TB in total
- dealing with metadata: how much metadata is a lot and how much is too little
- library policy says that as much metadata as possible must be stored; that would, however, be a problem in terms of metadata size
- for selective web harvesting they have a finished workflow: WCT; everything is catalogued

IA
- 16 billion URLs
- the oldest from 1996
- growth is 3 TB per day, 1 PB per year
Euan Cochrane, Dirk von Suchodoletz: Replicating Installed Application and Information Environments onto Emulated or Virtualized Hardware
- capturing and preserving a whole environment on emulated HW
- e.g. take the desktop environment of the prime minister and store it in an archive
- display problems
- computer forensics
- an option for preserving scientific data and records
- it is all about how to replicate an HDD and run the environment on it in a virtual environment
- solution:
- they pulled HDDs out of several old PCs > identify the HW requirements (HDD analysis > automatic estimate of requirements; it is part of every PC environment) > choose emulation/virtualization SW (tool registry such as TOTEM from the KEEP project) > convert the HDD into a disk image suitable for emulation > try to boot the disk image on emulated HW > add drivers
- problems with licences, personal data protection, authenticity (20 items change: colours, etc.)
- QEMU, SPARC processor emulator
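The two middle steps of the workflow above, turning an HDD dump into a disk image and booting it on emulated hardware, might look as follows with QEMU, which the notes mention. This is a sketch under assumptions: the source disk is taken to be a raw dump, the target machine is taken to be x86, and the file names are hypothetical. The functions only build the command lines; running them requires QEMU to be installed.

```python
def image_conversion_command(raw_image: str, target_image: str) -> list[str]:
    """Build a qemu-img call converting a raw HDD dump to the qcow2 format."""
    return ["qemu-img", "convert", "-f", "raw", "-O", "qcow2",
            raw_image, target_image]


def boot_command(image: str, memory_mb: int = 256) -> list[str]:
    """Build a QEMU call that boots the image on emulated x86 hardware.

    -snapshot makes QEMU write changes to temporary files, so the
    archived image itself stays unmodified.
    """
    return ["qemu-system-i386", "-m", str(memory_mb), "-hda", image, "-snapshot"]


if __name__ == "__main__":
    # Hypothetical file names, for illustration only.
    print(" ".join(image_conversion_command("old-pc.raw", "old-pc.qcow2")))
    print(" ".join(boot_command("old-pc.qcow2")))
```

Either list can be handed to `subprocess.run(cmd, check=True)`. Keeping the archived image read-only via `-snapshot` is one way to limit the authenticity problems noted above, though it does not address drivers or licences.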
Klaus Rescher Remote Emulation for Migration Services in aDistributed Preservation Frameworkpouitiacute emulace jako naacutestroje pro migraci
mnohdy nejsou dostupneacute naacutestroje pro migraci ur ityacutech formaacutetDig objekt vloiacuteme do emulovaneacuteho prost ediacute (virtuaacutelniacuteho stroje) pak ho vidiacuteme v prost ediacuteemulovaneacuteho systeacutemu m eme ho otev iacutet v p vodniacute nebo vhodneacute aplikaci uloit jako jinyacuteformaacutet a uloit op t do virtuaacutelniacuteho stroje
Bram Lohman Emulation as a Business Solution the EmulationFramework Keep projekt
emulation framework 7 emulaacutetor 6 platforem (x86 Amiga aj) 23 file formaacuteteeniacute pro spraacutevu emula niacutech naacutestrojsetup emula niacutech procesprost ediacute kt obsahuje emulaacutetory a pokud do n j nahrajeme aplikaci nebo soubor m l by sespustit jako v p vodniacutem prost ediacuteprost ediacute obsahuje I naacutestroj kt u soubor ukaacutee jakyacute je to formaacutet a jakeacute prost ediacute je pot eba projeho sput niacute na zaacuteklad PRONOMu rovnou lze to prost ediacute p ipravit a soubor v n m spustit na te SW image z databaacuteze aplikaciacute OPF kteraacute se buduje
Geoffrey Brown: Developing Virtual CD-ROM Collections: The Voyager Company Publications
- CD publications of one particular publisher, interactive applications for the Mac; of the 200 titles published, only about 50 are available today
- emulation on present-day systems
- HDD snapshot directly in the emulator, i.e. it takes a single click and is very fast
- SheepShaver emulator
Evaluation of a large Danish migration project
- before 1998 the formats were not prescribed by law
- between 2005 and 2008 they introduced standards
- the evaluation covers the prescribed standards and the migration into them at the national archive
- the evaluation was carried out for the body that funded the work
- between 2005 and 2008 they spent 30 person-years on migration, with 10-15 people working on it; 190 thousand USD was invested, total costs 26 million USD
- it is not much data: what they actually migrated was about 1777 GB
- various parts of the archive: tapes, population data, data on CD-R, registries and electronically populated data
- they could not read all the files, especially those on tapes
- 5 different types of tape
- some had to be rescued at great expense
- total cost of producing the preservation standard: 10 man-years, 12 thousand USD, including the manual and implementation recommendations
- pilot: project planning and management, and verification of the information packages
- the aim of the pilot was to obtain a better budget and project plan
- some problematic data in old formats, such as old databases, need smart people doing repetitive work over a long time; they needed good knowledge management to make this efficient
- migration method: they wrote requirements for the tool and a description of how manual migration should be done
- data preparation (restructuring the data and registering the metadata of the IPs) and preparation of the documentation for the migrated IPs
- software development: in-house development
- they needed 50 person-years for 1 ...
Conclusions
- migration of standard data is cheaper, migration from some standard tapes is cheaper, etc.
- most of the costs, 80%, went on non-standardized data, in producing the migration software; developing a tool for migrating heterogeneous or non-standard data is the most expensive part
- what they learned: the old data had not been analysed sufficiently
- project management was loose and they lost money
- knowledge management: a good description of the old data and of all their types, generations, locations etc. does not exist at our institution and this will cause us difficulties; migration of old data in the NK will be problematic
Angela Dappert: robust migration workflow for offline media
- what is an archival object (nice slide): a CD is not an archival object, for them it is a hand-held carrier
- a bit-stable object is better: it can have a backup etc.; and then the archival object, which carries further metadata: logical preservation
- a CD is not searchable, cannot be replicated easily, has a large manual overhead, and its rendering technology becomes obsolete very quickly
- Endangered Archives project: optical disks, CD-R, external HDs, tapes, 67 terabytes in total
- the offline hand-held carriers in the Endangered Archives project were very varied; they contained data with DRM, under copyright, and with a number of similar problems
- options they chose between for each data source:
- disk image: one file that contains everything on the carrier
- or extraction of selected files only
- how important is the carrier itself? We need some information about it; it may hold traces of deleted data, which we may want to keep. They made disk images of everything possible: hybrid DVDs, audio discs that also held data, etc.
- which disk image should they use? Not a single disk image format for all data, but a special disk image format for each case
- they used disk copying robots; in some cases large-scale disk copying robots could not be used, because they are good at producing CDs but not at ripping data from CDs
- they built their own application with disk stacks and some smaller robots; these worked as LIFO or FIFO; in the end they used FIFO, because LIFO had problems with the CD lifter
- at the KB they thought through a fairly complex workflow, how to describe it, etc.
- each robot had its own PC
- they had problems with a number of things, see the presentation
- they did not find good SW for managing the images, only command-line tools, and non-technical staff would make a lot of blunders with those
- a lot of people are involved before the content gets online
- the staff must be well trained and flexible, but also able to do tedious jobs: systematic, patient
- NOTE: important for the NK, where the transfer of data from disks will also have to be addressed, and has already been addressed
KEEP project, Antonio Ciuffreda: towards an integrated migration environment
- disk transfer tool: converts a disk into an image file; the result also contains further metadata about the file system, the MD5 of the file, etc.
- KEEP is building a Transfer Tool Framework
- magnetic media: disk transfer tools for floppy disks, commercial and open source
- Disk2FDI: commercial DOS tool; very accurate floppy disk image; it takes 1 hour and the result is very large, ten times larger than the floppy disk itself. He tested about 2260 disks; they tested emulation
- Catweasel: commercial tool; a PCI card, runs on Linux and Windows XP, has a GUI. High error rate, but faster; the quality of the image file was low
- Nibtools: free tool; G64 and D64, covers only the C64; DOS, Windows, Linux, but command line only. Needs a Commodore disk drive and special cables. They tested a few disks; about half of them then did not work in the emulator
- optical media: they used 5 transfer tools, all with the same CDs, DVDs and games
1 Alcohol 120%: commercial; can circumvent DRM etc.; supports Windows systems. 12 of 13 worked; 2 MB per second
2 Daemon Tools: commercial; several types of image files (ISO, MDS, MDF); supports Windows; three of 13 did not work
3 CloneCD: commercial; uses IMG or ISO; circumvents SafeDisc 3 protection; supports Windows; has a GUI; 11 of 13 were OK
4 BlindWrite: commercial; supports DVD, Blu-ray, WM, Xbox and other special discs. Generates ISO and some proprietary ISO image formats; one did not work; transfer speed ...
5 ImgBurn: does not read subchannel information into the image file (a film cannot be seeked, etc.); open source; generates DVD, BIN, CUE, IMG; Windows and Linux; 4 did not work; it is fast
Conclusions
- magnetic media: commercial and non-commercial tools do not differ in performance; Disk2FDI is accurate but very slow. KEEP will use NibTools
- optical media: copy protection has to be considered; the tools perform similarly; there will always be errors in the images, between 30 and 10 per cent; BlindWrite handles game discs (Xbox etc.). KEEP will use ImgBurn because it is open source
- for complex images BlindWrite is better
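The notes above mention that the disk transfer tool stores an MD5 of the image file together with other metadata. A minimal sketch of that idea follows, assuming a plain image file on disk; the function name `describe_image` and the record layout are illustrative and are not the KEEP tool's actual output format.

```python
import hashlib
import json
import os
import tempfile


def describe_image(path: str, chunk_size: int = 1 << 20) -> dict:
    """Return file name, size and MD5 of an image file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        # Read in fixed-size chunks so that large images need not fit into memory.
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return {
        "file": os.path.basename(path),
        "size_bytes": os.path.getsize(path),
        "md5": digest.hexdigest(),
    }


if __name__ == "__main__":
    # Stand-in for a real disk image: 1 KiB of zero bytes in a temporary file.
    with tempfile.NamedTemporaryFile(suffix=".img", delete=False) as tmp:
        tmp.write(b"\x00" * 1024)
    try:
        print(json.dumps(describe_image(tmp.name), indent=2))
    finally:
        os.remove(tmp.name)
```

Recomputing the checksum after each copy and comparing it with the stored value is the simplest way to confirm that an image survived a transfer unchanged.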
Benefit for the NK
Consider whether the NK should indeed run a project to migrate the content of CDs and DVDs to online media. These talks present concrete experience with robotic processing and show the problems encountered in designing the workflow, choosing the type of ISO image, etc.
BnF: web archiving
They have three layers:
- harvest definition: collection
- harvest instance: crawling metadata
- ARC files
A collection is, for example, the selective "elections 2000" collection; below it are the individual harvest instances and then the ARC files. They collect the logs, the config and the report, and store these in ARC as well: a special metadata ARC for each crawling instance.
PREMIS
- entities: object, agent, event
- objects: 1 ARC files and metadata ARCs, 2 harvest instances
- harvest event in PREMIS: the event "creation of content files"
- events: reports as an extension of the event (host report and harvest report)
- agents: administrator, SW, institutions, organizations that perform the harvest
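The object/agent/event structure above can be sketched with a few data classes. This only illustrates the relationships described in the notes; the class and field names are assumptions and do not reproduce the BnF schema or the PREMIS XML element names.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Agent:
    name: str
    agent_type: str  # "software", "person" or "organization"


@dataclass(frozen=True)
class PremisObject:
    identifier: str
    object_type: str  # "arc file", "metadata arc" or "harvest instance"


@dataclass
class Event:
    event_type: str
    objects: list[PremisObject]
    agents: list[Agent]
    # Extension slot for the host report and harvest report (report name -> location).
    reports: dict[str, str] = field(default_factory=dict)


def record_harvest(instance_id: str, arc_ids: list[str], agents: list[Agent]) -> Event:
    """Describe one crawl as a 'creation of content files' event."""
    objects = [PremisObject(instance_id, "harvest instance")]
    objects += [PremisObject(arc_id, "arc file") for arc_id in arc_ids]
    return Event("creation of content files", objects, agents)


if __name__ == "__main__":
    crawler = Agent("crawler (hypothetical)", "software")
    event = record_harvest("harvest-0001", ["a.arc", "b.arc"], [crawler])
    event.reports["harvest report"] = "metadata-0001.arc"  # hypothetical location
    print(event.event_type, "-", len(event.objects), "objects")
```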
ContainerMD
httpbibnumbnffrcontainerMD v1documentationcontainerMD v1html
special metadata for material from the web archive
httpbibnumbnffrcontainerMD v1
- different SLAs for different types of material and for data from different harvests; shared repository; they expect various benefits; skills for different formats need not be duplicated within the institution
- next year there should also be a JHOVE2 module for WARC
- Memento: meta search engine
Benefit for the NK
Their web archiving model could be used in the NK.
Cost models: Danish national library + TU Wien
- Stephan Strodl, TU Wien: they have their own cost model, but apparently only for the cost of small-scale automated preservation actions
- the Danish national library built its own model, which is meant to be universal and usable anywhere
- they measured the cost of submission according to the PAIMAS standard
- when calculating costs they use OAIS and PAIMAS: they map activities onto these models and estimate the cost of the processes from that
- costmodelfordigitalpreservation.dk
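The approach of mapping activities onto OAIS and pricing them can be sketched as below. All activities, hours and the hourly rate are invented for illustration and are not figures from the Danish model; only the OAIS functional entity names (Ingest, Preservation Planning etc.) come from the standard.

```python
from collections import defaultdict


def cost_by_entity(activities, hourly_rate: float) -> dict[str, float]:
    """Sum labour cost per OAIS functional entity.

    `activities` is an iterable of (activity name, OAIS entity, person-hours).
    """
    totals: dict[str, float] = defaultdict(float)
    for name, entity, hours in activities:
        if hours < 0:
            raise ValueError(f"negative hours for activity {name!r}")
        totals[entity] += hours * hourly_rate
    return dict(totals)


if __name__ == "__main__":
    # Hypothetical activity list mapped to OAIS functional entities.
    activities = [
        ("receive submission", "Ingest", 10),
        ("validate files", "Ingest", 5),
        ("migrate formats", "Preservation Planning", 20),
    ]
    for entity, cost in cost_by_entity(activities, hourly_rate=30.0).items():
        print(f"{entity}: {cost:.2f}")
```

A real model would add non-labour costs (storage, licences) and spread them over time; the grouping by functional entity is the part that corresponds to the notes.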
Benefit for the NK
Relevant to project 0136, which dealt with ways of estimating the costs of long-term storage.
Meet RODA, a Full-Fledged Digital Repository for Long-Term Preservation
- originally a project of the Portuguese national archive; we have been following it for several years. RODA is now supported, and partly further developed, by an independent company. So far RODA supports only the archival metadata format (EAD), but further development should also cover library formats
- RODA is now part of the SCAPE project, in which the system can be developed further and scaled for use in mass production
- httpredminekeepptprojectsroda public
Benefit for the NK
Follow further development, possibly also for the INCAD + KNAV projects; for LTP development in smaller institutions this could be an interesting alternative in the future.
Report from a foreign business trip
Name and surname of the traveller: Mgr. Andrea Fojtů (AF)
Workplace according to the organizational structure: 15 Department of Long-Term Preservation of Digital Data (ODODD)
Workplace assignment: 152 Department of Digital Repository Content Management
Reason for the trip: participation in the conference Aligning National Approaches to Digital Preservation
Place, city: Tallinn
Place, country: Estonia
Date (from-to): 22-26 May 2011
Detailed schedule:
Monday 23 May: conference registration; guided tour of the National Library; Keynote Address by Laura Campbell, Library of Congress (USA); Panel 1: Technical Alignment; Panel 2: Organizational Alignment
Tuesday 24 May: Keynote Address by Gunnar Sahlin, National Library of Sweden; Panel 3: Standards Alignment; Panel 4: Legal Alignment; Breakout Sessions for panels 3 & 4
Wednesday 25 May: Panel 5: Education Alignment; Panel 6: Economic Alignment; Breakout Sessions for panels 5 & 6; Synthesis; Closing remarks
Fellow travellers from the NK: PhDr. Bohdana Stoklasová (BS), Ing. Tomáš Svoboda (TS)
Funding: IOP NDK
Goals of the trip: attendance at a conference with international participation; making contacts in the field of long-term preservation of digital documents and electronic legal deposit; a more detailed insight into long-term preservation of digital documents, (particularly) in national libraries
Fulfilment of the goals (specifically): the conclusions of the conference Aligning National Approaches to Digital Preservation largely repeat the conclusions of the workshop The Future of the Past: Shaping new visions for EU research in digital preservation (report available at httpcordiseuropaeufp7icttelearn digicultfuture of thepast_enpdf), for example regarding the missing economic model for the commercial sector, which would treat long-term preservation as an inseparable part of all its processes. Contact was made with staff of the national libraries of Estonia and Finland (staff working on web archiving and on long-term preservation in general).
Programme and further details: the main goal of the conference was to align national approaches to long-term preservation of digital documents across all areas, from technical, organizational and educational to standardization, economic and financial
Materials brought back: conference programme, leaflets of the exhibiting companies (Tessella, Equella, Guardtime) + further materials, notes
Date of submission of the report: 8 June 2011
Signature of the person submitting the report:
Signature of the superior:
Posted on the Intranet
Received by the international department
Appendix to this report: conference notes in English
Appendix: conference notes in English
Exploring What We Can Do Together: Strategic Alliance for International Collaboration, Laura Campbell
- 185 digital preservation partners in more than 25 countries (education, research, LAM)
- strategic goals: National Content Stewardship Network (national digital collection, technical architecture, public policy, outreach)
- NDIIPP Content Domain Map: a mind map of geospatial, audiovisual, image & text and web content
- then & now: cognitive surplus vs digital libraries/digital preservation
- solution framework: actively working together, special interest groups, establishing a common index, international digital collection (freely available)
PANEL 1: Technical Alignment (The role of testing), Prof. Dr. Michael Seadle (Panel Head)
- to collaborate on requiring and implementing rigorous and independent tests
DNB Contribution to the Tallinn Alignment, Sabine Schrimpf
- key theme is infrastructure
- a network of hardware and software that permits the operation of application SW
- the question of interoperability is crucial (standards, technical specifications)
- SW elements
- components of the DP infrastructure were compared to the pallets at railroads
o source: PARSE.Insight Roadmap 2020
- persistent ID resolvers, certification process
- kopal (ingest, koLibRI)
- nestor (German Network of Expertise)
- DP4Lib: Digital Preservation as a Service, reduce dependency between components
o redundant storage at different locations
o koLibRI modules
- LuKII: set up as an economical LOCKSS network in Germany
- SHAMAN
- APARSEN wants to bring coherence and cohesion to digital preservation research
o trends in DP research projects
o modular DP systems
o distributed as SOA
o elimination of technology dependencies
EDINA: The UK LOCKSS Alliance, Adam Rusbridge
- EDINA offers underlying technical support & coordination
- threats to digital stewardship
o failure (media, HW, SW, network, format obsolescence, natural disaster, economic/organizational failure)
o attack (insider/outsider)
o operator error
- source: Requirement to Digital Preservation
- projects PEPRS and PECAN help identify coverage and requirements for DP
Public testing, Michael Seadle
- traditional physical archiving relies heavily on trusted institutions
- distrust, not trust, needs to be the basis of digital archiving; testing therefore plays a key role
- goals of testing: demonstrate functionality, reveal weaknesses, provide data for planning improvements
- key issues for testing: integrity, authenticity (can the origin or genuineness be shown?), usability (can migration/emulation be demonstrated?), access, financial integrity
- DrWho (drwho1com)
- bit stream testing is the most important; authenticity and usability may be impaired
o the type of storage media, the number of copies + the frequency of checking and replacement get us to the relevant results
o no reliable metrics exist, however (what is an acceptable loss etc.)
- without well-documented, peer-reviewed, publicly available test results, librarians are buying archiving systems on faith
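Bit stream testing across several copies can be sketched as a checksum comparison. The majority-vote rule below is an assumption made for this example (loosely in the spirit of replicated networks such as LOCKSS) and is not a procedure described in the talk; replica names and contents are invented.

```python
import hashlib
from collections import Counter


def audit_replicas(replicas: dict[str, bytes]) -> dict[str, bool]:
    """Map each replica name to True if its SHA-256 equals the majority digest."""
    if not replicas:
        raise ValueError("need at least one replica")
    digests = {name: hashlib.sha256(data).hexdigest() for name, data in replicas.items()}
    majority, votes = Counter(digests.values()).most_common(1)[0]
    # Without a strict majority there is no basis for deciding which copy is intact.
    if votes <= len(replicas) / 2:
        raise RuntimeError("no majority digest; replicas cannot be arbitrated")
    return {name: digest == majority for name, digest in digests.items()}


if __name__ == "__main__":
    copies = {
        "site-a": b"archived content",
        "site-b": b"archived content",
        "site-c": b"archived c0ntent",  # simulated bit damage
    }
    for site, intact in audit_replicas(copies).items():
        print(site, "ok" if intact else "DAMAGED")
```

How often such an audit runs, and how many copies are kept, are exactly the parameters for which the notes say no reliable metrics exist yet.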
Presentation without a title, Andy Rauber
- evaluation vs testing vs benchmarking
- current DP work is evaluation rather than testing, and far from benchmarking (a few tests, but nowhere near a definition of benchmarks)
o existing evaluations are not repeatable
o focus on the simple things
o building the frameworks before having clear test scenarios
- it is necessary to move towards comparative benchmarking
- what is needed: a commitment to a culture of benchmarking and comparative evaluation; an understanding of what we want to benchmark; benchmark data + ground truth; measurement scales and measures that remain constant; knowledge bases to collect these
Organizing digital preservation on an international level, Michelle Gallinger (NDIIPP)
- focus on a national DP agenda
- community driven, action oriented (National Digital Stewardship Alliance)
o present a distributed national digital collection for the benefit of citizens
The European Research Arena, David Giaretta
- technologies: GEANT, EGEE/EGI
- EU research projects: TIMBUS, BLOG4EVER, SCAPE, ENSURE, APARSEN, ARCOMEM, WF4EVER
o SCIDIP-ES (2011-2014)
- Alliance for Permanent Access (APA): formed as a legal entity 3 years ago
o opportunities for networking
- ISO 16363: Audit and Certification of Trustworthy Digital Repositories
- ISO 16919: Requirements for Bodies Providing Audit and Certification
Observation from the MetaArchive Cooperative program, Martin Halbert
- distributed DP programs differ from other programs
o replication of content, distribution of the replicated copies to distinct geographical locations, and a network organization to connect these replicated copies
- MetaArchive: established in 2003, funded by NDIIPP
o seeks to foster broader awareness of digital preservation issues
- IIPC members are all institutions that focus on WA
o 39 members: national and university libraries + other organizations (Internet Archive)
o ISO standard WARC format for web archives + Heritrix and NutchWAX
o growing membership (Africa & South America)
DAY 2
Keynote address: International and National Collaboration in the digital age, Gunnar Sahlin
- 2012: a new law for e-deposit
- Samsök (search together) in 2005 (upgrade, new system)
- SwePub and long-term preservation
- consortium of the Swedish research libraries for licensing e-journals and databases (ICOLC)
- Open Access and e-publishing (all universities have their repositories for e-pubs)
- NL aggregator for Europeana: TEL, Apres, Athena, EUscreen
o common system for the preservation of digital materials
o common search portal for materials from the Swedish National Library and the Swedish National Archive
Raivo Ruusalepp
- standards: RAC, DSA, CIDOC (CRM), PAIMAS, ISA (DG), DDI
- use of information security standards for digital preservation
- information security: administrative and technical (physical = data security vs IT = communication)
- a company implemented security measurement with typical cyber crime scenarios
- survey of security
o provision for information security in national legislation and development plans (12 of the respondents use the ISO 27000 series, only 2 have had formal audits, the rest are looking into it; half of the respondents do not use standards or formal measurements to control information security)
o IT & disaster plan: 65 (data recovery from the off-site location tested: 0)
- alignment
o better use of community standards for information security and preservation
o agreement on security requirements
Standards-based approach to preservation planning, Matthew Woollard
- ISO 27001: very expensive implementation, 100 000 pounds
- basic: Data Seal of Approval Guidelines (helps you understand your business better)
- Audit and Certification of Trustworthy Digital Repositories
- Memorandum of Understanding to Create a European Framework
- ISO 16363 external; DSA or ISO 16363 self-audit
Best Practices & Standards, Bram van der Werf
- self-assessment: trust; audits/certification: trust
- ISO 30300 (draft): Record Management
PANEL 4: Legal Alignment
Legal deposit & Web Archiving, Adrienne Muir
- legal deposit provisions: purpose, scope, deposit mechanisms, roles & responsibilities & liability, access provisions, sanctions
- implementation: definitions, scope, offline/online, freely available/pay wall, technology neutral, incremental
- legal deposit vs voluntary (interim/hybrid approach, model agreements and licenses, flexibility)
- other legal issues: intellectual property rights, preservation, access, unlawful material, privacy/data protection
- voluntary approaches have disadvantages but may be necessary and can be useful
Breakout Session
- standards bring along alignment by themselves, but only if you use them to the full, not halfway
- depends on the community (users); enforceable or voluntary compliance (standards)
- next steps for standards alignment
o corpora as benchmarks
o export/import completeness
o educational standards
o validation tools
o accredited training courses to accredit auditors
o framework standards
DAY 3
PANEL 5: Educational Alignment
- key elements of the DCC Curation Lifecycle Model
- Framework for the Education Alignment (USA: grad programs for digital preservation, new models for grad programs, internship programs (diverse knowledge), workshops)
o sharing tools
o national programs
- related issues to digital preservation
o nature of costs and business models
o strategies for selection & appraisal
o ground roles and responsibilities
o effectiveness and demand for services
- focus of the panel: factors influencing the actual sustainability of a digital archive
- 2 considerations: collaboration + user demand
- challenges + gaps: span national boundaries, public + private funding, education, exportation, DP certification, competition, funding gaps, policy, selection criteria, roles & responsibilities, standards
- Magazzini Digitali: e-legal deposit in Italy
- PADI: a failed project (discontinued in 2010)
Presentation without a title, Neil Grindley
- how much does it cost to manage information? what institutional financial strategies are required to facilitate effective preservation? what general economic frameworks are required to enable information to persist and be accessible?
- JISC 2010: Infrastructure for Education and Research Programme
- archival storage and preservation activities constitute a very small proportion of the overall costs: 15%; access 31%; outreach, acquisition and ingest 55%
o approx. 333 Euros for a set of 1000 records
- KRDS2, p. 83: future tool development supporting automation of ingest
Sustainable Preservation in North America: ADPNet & Friends, Aaron Trehub
- solution: distributed digital preservation (in at least 3 copies vs LOCKSS 6 copies)
- DDP + LOCKSS = PLN; open SW developed at Stanford
- MetaArchive, COPPUL (Canada), ADPNet: www.adpn.org
- web archiving
- WARC manager application
Using Tape for Large-Scale Digital Preservation, Gary Wright, FamilySearch (USA)
- (The Church of Jesus Christ of Latter-day Saints)
NDK - DRPS, Ingest Tools; storage type: storage grid, NetApp FAS3170
information lifecycle management; optimization of the storage layer between the Grid and Rosetta
- human error
- HW errors
- data integrity validation
- maximization
- integrity verification
metadata, content
Moving On: When it is Time to Re-Archive, Michael Selway, Quantum Corporation (USA)
- with HDDs there is no other way
- data migration from ...; how to approach the migration
- HDD: years
- what about split migration (full split migration)
FamilySearch: An End-to-End Process for Scanning, Characterizing, Preserving and Providing Access to Very Large Collections of Vital Records, Tom Creighton, FamilySearch (USA); Jonathan Tilbury, Tessella plc (UK); and Mark Evans, Tessella Inc (USA)
- at FamilySearch: microfilms from the mid-19th century
- DPS is their ...
The Audit and Certification of FDsys
- FDsys ... audit
TRAC http://www.gpo.gov/fdsys
- TRAC: HathiTrust, UNT, Portico, MetaArchive, Chronopolis
- compliance scale 1-...
How Long is Long-Term Data Storage? (Focal), Barry M. Lunt, Brigham Young University; and Douglas Hansen, Wayne Rust, and Mark Worthington, Millenniata Inc (USA)
- FLASH technology: 10-...; electron drain
- and yet errors appeared
- Library 1: 21
- Library 2: 18
- 1050 years
- HDD: 1-7 years
Quality Assurance of Digital Information in Long-Term Digital Preservation, University of Technology (Sweden)
- significant properties
- documents from ...
Towards Interoperable Preservation Repositories: Repository Exchange Package Use Cases and Best Practices, Joseph Pawletko, New York University; and Priscilla Caplan, Florida Center for Library Automation (USA)
- TIPR
- RXP: METS and PREMIS semantics; the metadata files are not inside the METS but alongside it
- succession
- disaster recovery ... as AIP
- SW migration
- diversification
- data migration
SARKK: Comprehensive Digital Archive Services for Finnish Municipalities
-Savon Tietohallinto Oy (Finland)
- a company ... ICT long-
- preservation planning functionality
Magnetic Tape Technology: economic advantages for preservation, Gary Francis, Oracle (USA)
- Oracle presentation
- Clipper Group 2010: In Search of the Long-Term Archiving Solution, Tape Delivers Significant TCO Advantage over Disk
- 5 TB per cartridge (Sun/Oracle)
-----------------------------------------
Color in Digital Preservation, Robert Buckley, University of Rochester/NewMarket Imaging; Steven Puglia, National Archives and Records Administration; and Michael Stelmach, Library of Congress (USA)
- colour gamut
- AdobeRGB, ProPhoto RGB
- colour reproduction
Multispectral Image Archiving of Watermarks in Historical Papers, Peter Meinlschmidt, Wilhelm-Klauditz-Institut, Fraunhofer-Institute for Wood Research, and Volker ..., Braunschweig (Germany)
- reflected light
- transmitted light
- spectral imaging
Implementing a Quality Assurance Program for Monitoring Scanner Performance, Michael J. Horsley and John T. Berezich, National Archives and Records Administration (USA)
- microfilm, scan ...
- NARA 2004 Guidelines
httpsdocsgooglecomviewerurl=http3A2F2Fwwwarchivesgov2Fpreservation2Ftechnical2Fguidelinespdf
- Metamorfoze
- etc., see slides
- quantitative performance: see slides
- web-based database, SharePoint
Preservation in a Digital Age, Jay Verkler, FamilySearch (USA)
- with Tessella, for SDB ...
- data loss is intrinsic
- preservation as a service
Curation of the End-of-Term Web Archive: Classification and Metrics, Kathleen Murray, Lauren Ko, and Mark Phillips, University of North Texas (USA)
- eotcd archive: httpresearchlibraryuntedueotcdwikiMain_Page
- 16 TB of data
- (not WARCs)
- then joined together ...
DAITSS Grows Up: Migrating to a Second Generation Preservation System (Focal), Priscilla Caplan and Carol Chou, Florida Center for Library Automation (USA)
- DAITSS 1 could not be installed elsewhere
- instructions
- in the processes
- PSP
- refresh function
- disseminate: do a refresh and export the new AIP as a DIP
- withdraw: remove the AIP from storage, retaining provenance
- 80 TB
A Community Driven Micro Services Architecture Supporting Long Term Digital Preservation, Mark Evans and Bill Steel, Tessella Inc (USA); and Robert Sharpe, James Carr, Alan Gairey, and Jonathan Tilbury, Tessella plc (UK)
- with NDIIPP ... micro-services
- joined into a process; they can then be arbitrarily ...
- workflow
- bit-stream
- cloud (S3, http://aws.amazon.com/s3)
- SaaS functionality; multi-tenancy will follow soon
- ... functionality, policy etc.
Notes from the USA
- 44 TB of SIPs in the test (10 MB JP2 files)
- SDB community: founded 2008
System
Lisa LaPlant and Blake Edwards, US Government Printing Office (USA)
- FDsys, OAIS, MODS
<part>2</part>
- DMD: data model definition
httpwwwgpogovfdsyssearchpagedetailsactionst=pragueampgranuleId=amppackageId=DCPD-200900228
-----------------------------------------
Preservation Starts from the Beginning, Michael Wash, US Department of Transportation (USA)
- Autographic Kodak 1916 ...
- "following the No 3A Autographic Kodak Special of 1916, which was the first rangefinder camera. It had a Kodak Anastigmat f/6.3 lens and a Kodamatic shutter with speeds from 1/2 to 1/200 sec plus bulb and time mode" httpcamerapediawikiacomwikiNo_1A_Autographic_Kodak_Special
- Kodak Advantix ... discontinued
Colorite: A Flexible Cross-Platform Software Solution for Automatic Image Quality Analysis Using Arbitrary Targets, Henrik Johansson, National Library of Sweden (Sweden)
o the target is detected automatically ...
o TIFF, JPEG, JP2, PNG
o state-of-the-art feature-based image matching algorithm, ImageMagick
o ... can be switched off in the future to speed up the process for BATCH processing
What if the Image Quality Analysis Rates My D..., Dietmar Wueller, Image Engineering (Germany)
- Golden Thread
- UTT target
- ...-lens reflex
Establishing Resolution Requirements for Digitizing Transmissive Content: A Use Case Approach, Michael Stelmach, Library of Congress; Don Williams, Image Science Associates LLC; and Steven Puglia, National Archives and Records Administration (USA)
- based on the 10% SFR limiting resolution criterion: how much of the image information will be captured
o 1st half of the 20th century: 1200-1600 PPI
o 2nd half of the 20th century: up to 2800 PPI
Digitise More, Pay Less: Optimising the Workprocess for both Heritage Institute and Imaging Provider, Olaf Slijkhuis, Pictura Imaginis (the Netherlands)
- document, quality, parameters etc.
- image ... control
Debate with FamilySearch, 20 May 2011 -------
- ... also on Digital Preservation issues
- ... will have to be provided free of charge
- ... and synchronize
- an effort on their part to cooperate with archives and libraries
Business trip report
Project: Creation of the National Digital Library
CZ 10611000706386
Name and surname of the traveller: Jan Hutař
Workplace according to the organizational structure: ODF 81
Workplace assignment: head of division
Reason for the trip: attendance at the iPRES 2011 conference
Place, city: Singapore
Place, country: Singapore
Date (from-to): 30 October - 5 November 2011
Detailed schedule: 30-31 October: flight Prague > Dubai > Singapore
1 November: start of the conference, tutorials
2-4 November: conference
4-5 November: return flight Singapore > Dubai > Prague
Fellow travellers from the NK: Mgr. Marek Melichar (funded from project 0136)
Funding: IOP, Creation of the National Digital Library
Relation to the project: obtaining new information on digital preservation and on projects in other libraries; consultations with colleagues and companies
Goals of the trip: see relation to the project; use all outputs for planning and running the NDK project, and for future work on digital preservation in the NK/NDK
Fulfilment of the goals: fulfilled, see the detailed notes below and the proceedings on the SPS
Further details: SUMMARY AND BENEFIT FOR THE NDK PROJECT
- a noticeable rise of emulation as an approach to long-term preservation (in previous years it was migration) > the two approaches will apparently complement each other
- a shift towards the preservation of complex data, databases etc.; the NK does not address this yet
- many contributions are also applicable to the NK and NDK (web archiving and preservation at the national library of France, audits, emulation, information on the SDB system (Tessella) and on the RODA system, certification, see Rouchon, etc.)
- information on the problems and solutions of using tools such as JHOVE, PRONOM etc. in a real environment
- 2 contributions on backing up optical discs: a current problem in the NK as well; ideally follow the procedures described in the proceedings
- a clear need for international cooperation and for adherence to standards so that such cooperation is possible
- for more detail see below
Support for project publicity: N/A
Related materials
Material: conference proceedings
Storage location: SPS, folder with business trip reports
Date of submission of the report: 15 November 2011
Signature of the person submitting the report:
Date, signature:
Signature of the superior: 15 November 2011
Posted on the intranet
Received by the international department

Seamus Ross: digital curation and preservation
- preserving data sets: use of statistical databases and research data vs. preservation of textual information
- personal data are no longer just photos in a box (Flickr etc.)
- banks: a lot of personal data with future use for historians; it must be kept, and institutions do so as well
- see mckinsey.com, big data full report (PDF)
- so why do DP (slide 16): future generations expect it; for historians and researchers, so that they have some sources and references about the present for the future; an information ecosystem to enable storytelling
- the emphasis shifts from preserving textual information to preserving complex databases

A capability model for DP (Ch. Becker et al.)
- research within the SHAMAN and SCAPE projects
- SoS: systems of systems
- 3 kinds of systems (UML):
  - a DPS where DP is a functional requirement
  - SoS: a business system as a system within a system; the data then flow into the DPS
  - a DPS where DP is not a functional requirement but the system does it anyway (DP-ready system): a business system with DP functionality
- but how to get DP into enterprise systems? A model for implementing DP in any system, developed within the SHAMAN project: a capability-based reference architecture
- governance, business and technical (operations) capabilities: a basis for decisions and for assessing the current state
- capability maturity model (CMM): processes, assessment and improvement, as in SW development

Olivier Rouchon: certification and quality at CINES
- they store theses, digitised material, multimedia, documents and scientific data sets
- a data centre for the whole of France
- they have format experts (XML); 11 people
- 15 TB of data
- certification: by national law CINES is the national centre for DP of theses; they have a department, staff and money for it; for the procedure and preparations see below
- preparing for certification: testing DRAMBORA, DSA, TRAC, ISO 16363 and ISO 16919
- step: 2009 DRAMBORA audit; 2 risk reviews a year of how the risks are being addressed
- step: formalisation of business processes, 14 processes according to ISO 9001: management, operational and support processes (presented at iPRES 2010)
- 2009: external pre-audit, 2 people, 19 man-days, based on all available standards, by means of documentation review and interviews
- 2010: SIAF audit, 4 months; it is carried out by the National Archives of France for every archive that stores public data; they audit every 3 years; the report had 800 pages
- 2010: Data Seal of Approval, part of the EU framework for audit and certification of trusted repositories (an MoU among the three certification activities)
- 2011: within the APARSEN project they also did ISO 16363; they went through that audit together with DANS and UKDA
- first internal, January and April 2011 (60 man-days), then external, 12 experts (KB, BL, NASA etc.) in June 2011 (3 days)
- MoU: DSA > ISO 16363 as a second step (internal) > ISO 16363 extended
- the audit gets faster the more of them you do, i.e. if it is done regularly it is not so time-consuming
- the National Library of New Zealand underwent TRAC certification in autumn 2011

Andreas Rauber: impact of preservation actions on the repository
- what happens to the repository itself
- repository simulation: RepoSim
- intended for analysis and for testing migration: what happens when files grow, when the repository holds more format types etc.
- RepoSim: a flexible simulator, irregular patterns
- so far an internal version: Hibernate, Java, MySQL
- you can specify which formats are accepted and their description, ingest settings, hypothetical tools (mainly for migration) and rules for preservation actions (migration to which format, which version, which files, how many times; rules + filters)
- you can run a virtual migration; graphs are produced that tell how long it will take etc.
- what to migrate, how, to what and over what period: it runs virtually and we see the result
- good for IT and HW planning
- good for planning various scenarios, for comparison with the expected development, and for planning HW development and investment
- still to be added: the option to enter deletion policies, reports etc.
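
The kind of question a repository simulator answers can be illustrated with a minimal Python sketch. This is not RepoSim (described in the talk as an internal Java tool); the class names, the ingest figures and the tool throughput are assumptions made up for illustration only.

```python
from dataclasses import dataclass


@dataclass
class FormatHolding:
    """Files of one format held in a hypothetical repository."""
    name: str
    files: int
    avg_size_mb: float


@dataclass
class MigrationRule:
    """A preservation rule: migrate `source` to `target` with a hypothetical tool."""
    source: str
    target: str
    throughput_mb_per_s: float


def simulate_ingest(holdings, yearly_ingest, years):
    """Return new holdings after `years` of ingest.

    yearly_ingest maps a format name to the number of files added per year.
    """
    return [
        FormatHolding(h.name, h.files + years * yearly_ingest.get(h.name, 0), h.avg_size_mb)
        for h in holdings
    ]


def migration_hours(holdings, rule):
    """Estimate how long a virtual migration of all `rule.source` files would take."""
    total_mb = sum(h.files * h.avg_size_mb for h in holdings if h.name == rule.source)
    return total_mb / rule.throughput_mb_per_s / 3600


if __name__ == "__main__":
    # All numbers below are invented inputs, not figures from the talk.
    today = [FormatHolding("TIFF", 1_000_000, 20.0), FormatHolding("PDF", 500_000, 2.0)]
    rule = MigrationRule("TIFF", "JP2", throughput_mb_per_s=50.0)
    for years in (0, 3, 5):
        future = simulate_ingest(today, {"TIFF": 200_000, "PDF": 100_000}, years)
        print(years, "years:", round(migration_hours(future, rule), 1), "hours")
```

Running the same rule against holdings projected several years ahead shows how the migration window grows with ingest, which is the planning use the notes mention for IT and HW.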

José Barateiro: Risk assessment in DP of e-science data and processes
- DP as risk management
- ISO 31000: definition of risk management
- similar to DRAMBORA
- there are many standards for risk management
- the ISO 31000 methodology broken down into individual steps
- TIMBUS project (http://timbusproject.net); one of the partners is SAP (Germany)

Mad talks
- open-source SW for LTP: RODA is back; it is being developed within the SCAPE project; new features, plans for further development and for building a user community
- 4 posters on emulation: emulation within KEEP, emulation for library reading rooms, the OPF ecosystem registry
- TOTEM: a metadata standard for describing the technical environment for emulation

ANDS (Ross Wilkinson)
- data centres in Australia, at least 3, for different areas of life
- ANDS has existed for almost 3 years; funded by the Australian government
- a huge amount of data that will never be used or read by a human, only mined by automated processes
- research data must be stored and preserved, because it may no longer be possible to recreate them; they must be kept so that they can be reused, so that new conclusions can be drawn from them and so that scientists have them available
- this must be done in cooperation; it cannot be done by a single institution alone
- who deals with storing scientific data in the Czech Republic? The Academy of Sciences? CESNET?
- similar data centres exist in Great Britain as well

Rob Sharpe: Considerations for High Throughput Digital Preservation
A presentation by Tessella on their performance testing of ingest into SDB at FamilySearch.
- SDB has been in development since 2002, when the first customer was the National Archives UK
- new customer: the UK Parliament
- test with FamilySearch
- 20 TB ingested per day; scanned material; a workflow with antivirus, characterisation (PRONOM, JHOVE) etc.
- 1 package is about 1 GB; 20,000 packages per day
- 2 Dell PowerEdge R710 servers, total price at most 20,000 pounds
- the limiting factor turned out to be the read speed of the disks on which the data sit at the start of ingest; they therefore needed 130 parallel disks (50,000 pounds)
- storing on tape is also slow; they therefore needed 8 parallel tape writes (30,000 pounds)
- storage costs 100 pounds per TB
- 7.3 PB per year
- conclusion: writing and reading are slow; tools such as JHOVE and PRONOM are fast enough; storage costs also proved to be high

For the ingest of data from the FamilySearch project they needed to guarantee a throughput of 20 TB of data per day while keeping adequate procedures for processing the data according to the requirements of OAIS and of the client. The aim of the project was to identify the bottlenecks in ingesting large volumes of data.
Processes such as generating or checking hashes, identifying formats and extracting technical metadata usually require a fast storage system when volumes are large. In the FamilySearch project they want to ingest into SDB (content acquisition, content preparation, ingest, fixity check, content metadata integrity check, characterisation, i.e. format identification and validation and extraction of technical metadata) at most 700 MB per second.
The question is therefore how to parallelise such a massive workflow efficiently while minimising costs. According to their findings, parallelisation makes it possible to get around the performance problems of tools such as DROID and JHOVE; contrary to their expectations, overall software performance was not a problem. The bigger problems are in the HW, which has to be able to write fast enough.
That is, the bottleneck was in the HW and in moving data from place to place rather than in the performance of the digital preservation tools.
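
The figures quoted in these notes can be cross-checked with a few lines of Python. This is only a back-of-the-envelope sketch: the function names are made up here, and decimal units (1 TB = 1000 GB = 1,000,000 MB) are an assumption, since the notes do not say which units the speaker used.

```python
SECONDS_PER_DAY = 24 * 60 * 60


def sustained_rate_mb_per_s(tb_per_day):
    """Average transfer rate needed to move tb_per_day within 24 hours."""
    return tb_per_day * 1_000_000 / SECONDS_PER_DAY


def packages_per_day(tb_per_day, package_gb):
    """Number of packages of package_gb each that make up the daily volume."""
    return tb_per_day * 1000 / package_gb


def yearly_volume_pb(tb_per_day):
    """Volume accumulated over 365 days, in PB."""
    return tb_per_day * 365 / 1000


def yearly_storage_cost_gbp(tb_per_day, gbp_per_tb):
    """Cost of storing one year of ingest at a flat price per TB."""
    return tb_per_day * 365 * gbp_per_tb


if __name__ == "__main__":
    print(round(sustained_rate_mb_per_s(20), 1), "MB/s on average")
    print(packages_per_day(20, 1), "packages per day")
    print(yearly_volume_pb(20), "PB per year")
    print(yearly_storage_cost_gbp(20, 100), "pounds per year of ingest")
```

Under these assumptions 20 TB per day corresponds to roughly 231 MB/s sustained, 20,000 packages of 1 GB per day and 7.3 PB per year, which agrees with the numbers in the notes; at 100 pounds per TB a year of ingest would cost about 730,000 pounds to store. The average rate is well below the 700 MB/s given as the maximum.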
Benefit for NK
There is no need to worry about the performance of SW such as DROID or JHOVE.

Ross King: Evolving domains, problems and solutions for LT DP
- information on projects such as SCAPE
- Research on Digital Preservation within projects co-funded by the European Union in the ICT programme (http://cordis.europa.eu/fp7/ict/telearn-digicult/report-research-digital-preservation_en.pdf), by Stephan Strodl, Petar Petrov and Andreas Rauber, all Vienna University of Technology, Austria: a nice timeline of preservation projects, a white paper about the past of European DP
- funding spent on DP research is gradually growing; projects and funding alone do not solve anything
- ARCOMEM: web archiving; a socially driven web preservation model; social web analysis; archive enrichment
- ENSURE: evaluation between cost and value; automation of the preservation cycle; testbeds: healthcare, clinical trials, financial services
- SCAPE:
  - preservation planning and action workflows: how to make them scalable
  - creating an infrastructure for scalable preservation actions
  - developing a policy-based preservation planning tool with automated preservation watch
  - 3 testbeds: web archives, large-scale repositories, research data sets
- all the projects will produce prototype SW
- digital lifecycle approach
- preservation planning plays a role in all these projects, together with virtualisation
- a slide with trends in DP over recent years
- other projects: ENSURE, SCAPE, Wf4Ever (http://www.wf4ever-project.org/about)
- TIMBUS: SW is not enough; it focuses on the context of organisations; LTP is not only about objects but also about services etc.
- TOTEM

Benefit for NK
Follow projects in the area of long-term preservation of digital data. The latest EU projects such as SCAPE will speed up the development of concrete tools for the long-term preservation of digital data.

Erik Borglund: Record keeping in temporary command settings
- preserving the documentation of crisis situations arising from the work of the police etc.
- how to capture context: flipcharts, videos and notes can be kept; the context of analogue documents is not a problem, the problem lies with digital material and conversations
- the national archive should take care of this, but it accepts only paper documents or, for example, photos from the place of the meeting
- the question: is archiving the file material the same as archiving the course of the proceedings in digital form?
webarchiving session

BnF
- 200 TB of web-archived data; 15 million ARCs; they have to characterise and validate them, which is time-consuming
- they store them in a shared repository
- SPAR (the LTP system of the French national library) has a capacity of 16PB
- they use JHOVE2 for characterisation and are building a module for ARCs
- they do not want to characterise and validate the content of the ARCs, only identify formats
- PREMIS in METS would be too long, so they will record only metadata at the level of the information package (AIP) that are the same for the whole package; that is, a property is stated once and only the information about which files it applies to is added, instead of repeating that information for every file
- they created a special metadata format
- they can therefore ask the LTP system "give me all information packages that contain format XY" etc., without having to index the metadata of the contents, which would take a long time; they take the same approach for digitised books
- different DP policies and validation levels for different types of web archive data: complete harvests vs. thematic harvests
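
The package-level approach described for the BnF can be sketched in Python as follows. This is only an illustration of the idea (state a property once and list the files it applies to); the function names and the data layout are assumptions and do not reproduce the BnF metadata format.

```python
from collections import defaultdict


def summarise_package(file_formats):
    """Turn per-file metadata into a package-level summary.

    file_formats maps a file name to its identified format.
    The result maps each format to the sorted list of files that have it,
    so the format is written once instead of once per file.
    """
    grouped = defaultdict(list)
    for name, fmt in file_formats.items():
        grouped[fmt].append(name)
    return {fmt: sorted(names) for fmt, names in grouped.items()}


def packages_with_format(package_summaries, fmt):
    """Answer 'give me all information packages that contain format X'.

    package_summaries maps a package identifier to its summary;
    only the summaries are consulted, never the files themselves.
    """
    return sorted(pid for pid, summary in package_summaries.items() if fmt in summary)


if __name__ == "__main__":
    # Hypothetical package identifiers and contents.
    summaries = {
        "aip-001": summarise_package({"a.html": "text/html", "b.html": "text/html", "c.gif": "image/gif"}),
        "aip-002": summarise_package({"d.pdf": "application/pdf"}),
    }
    print(summaries["aip-001"])
    print(packages_with_format(summaries, "image/gif"))
```

The query touches one small summary per package, which is why no index over the metadata of the individual contents is needed.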

NL NZ
- 2 harvests, 20 TB altogether
- they are working out metadata: how much metadata is a lot and how much is too little
- library policy says that as much metadata as possible must be stored; that would however be a problem in terms of metadata size
- for selective web harvests they have a finished workflow (WCT); everything is catalogued

IA
- 16 billion URLs
- the oldest from 1996
- growth is 3 TB per day, 1 PB per year

Euan Cochrane, Dirk von Suchodoletz: Replicating Installed Application and Information Environments onto Emulated or Virtualized Hardware
- capturing and preserving a complete environment on emulated HW
- e.g. take the prime minister's desktop environment and store it in an archive
- display problems
- computer forensics
- an option for preserving scientific data and records
- it is all about how to replicate an HDD and run the environment stored on it in a virtual environment
- solution: they pulled the HDDs out of several old PCs > identify the HW requirements (HDD analysis > automatic estimate of requirements; it is part of every PC environment) > choose emulation/virtualisation SW (a tool registry such as TOTEM from the KEEP project) > convert the HDD into a disk image suitable for emulation > try to boot the disk image on the emulated HW > add drivers
- problems with licences, personal data protection and authenticity (for 20% of items colours change etc.)
- QEMU, SPARC processor emulator

Klaus Rechert: Remote Emulation for Migration Services in a Distributed Preservation Framework
- using emulation as a tool for migration
- migration tools for certain formats are often unavailable
- the digital object is placed in an emulated environment (a virtual machine); we then see it in the environment of the emulated system, can open it in the original or another suitable application, save it in a different format and store it again in the virtual machine

Bram Lohman: Emulation as a Business Solution: the Emulation Framework (KEEP project)
- Emulation Framework: 7 emulators, 6 platforms (x86, Amiga etc.), 23 file formats
- a solution for managing emulation tools
- setup of emulation processes
- an environment that contains emulators; if we load an application or a file into it, it should run as in the original environment
- the environment also contains a tool that shows, on the basis of PRONOM, what format a file is and what environment is needed to run it; that environment can be prepared straight away and the file run in it; it loads the SW image from the OPF application database, which is being built

Geoffrey Brown: Developing Virtual CD-ROM Collections: The Voyager Company Publications
- publications of one particular publisher on CD, interactive applications for Mac; of 200 published, only about 50 are available today
- emulation on today's systems
- HDD snapshot directly in the emulator, i.e. it takes one click and is very fast
- SheepShaver emulator

Evaluation of a large Danish migration project
- before 1998 they had no formats prescribed by law
- between 2005 and 2008 they introduced standards
- the evaluation concerns the prescribed standards and the migration to them in the national archive
- the evaluation was done for the funder
- between 2005 and 2008 they spent 30 person-years on migration; they had 10-15 people on it; 190 thousand USD was invested; total costs 26 million USD
- it is not much data; what they actually migrated was about 1777 GB
- various parts of the archive: tapes, population data, data on CD-R, registries and electronically populated data
- they could not read all the files, especially on tapes
- 5 different types of tape
- some had to be rescued at great expense
- total cost of producing the preservation standard: 10 man-years, 12 thousand USD, including a manual and implementation recommendations
- pilot: project planning and management, and verifying the information packages
- the aim of the pilot was to get a better budget and project plan
- some problematic data in old formats, such as old databases, need smart people who do repetitive, long-running work; they needed good knowledge management for this to be efficient
- migration method: they wrote requirements for the tool and a description of how manual migration should be done
- data preparation (restructuring data and registering the metadata of the IPs) and preparation of documentation for the migrated IPs
- software development: in-house
- they needed 50 person-years per 1

Conclusions
- migrating standard data is cheaper, migrating from some standard tapes is cheaper, etc.
- most of the costs (80%) fell on non-standardised data, in producing the migration software; developing a tool for migrating heterogeneous or non-standard data is the most expensive part
- what they learned: they had not analysed the old data sufficiently
- project management was loose; they lost money
- knowledge management: a good description of old data and of all their types, generations, locations etc. does not exist at our institution and we will have trouble with it; migrating old data at NK will be problematic

Angela Dappert: robust migration workflow for offline media
- what is an archival object (nice slide): a CD is not an archival object; for them it is a hand-held carrier. Better is a bit-stable object, which can have a backup etc., and beyond that an archival object, which has further metadata (logical preservation)
- a CD is not searchable, cannot easily be replicated and has a large manual overhead; the rendering technology becomes obsolete very quickly
- the Endangered Archives project: optical disks, CD-R, external HDs, tapes; 67 terabytes in total
- the offline hand-held carriers in the Endangered Archives project were very variable; they contained data with DRM, data under copyright and a number of similar problems
- the options they chose between for each data source:
  - a disk image: one file that contains everything on the carrier
  - or extraction of only some files
- how important is the carrier itself? We need some information about it; it may hold traces of deleted data that we may want to keep. They made disk images of everything possible: hybrid DVDs, audio discs that also held data etc.
- which disk image format to use? Not a single disk image format for all data; a special disk image format for each kind
- they did it with robots (disk copying robots); some large-scale disk copying robots could not be used because they are good at producing CDs but not at ripping data from CDs
- they built their own application with disk stacks and some smaller robots; they used LIFO or FIFO and in the end chose FIFO, because LIFO had problems with the CD lifter
- at KB they thought through a fairly complex workflow for how to describe it etc.
- each robot had its own PC
- they had problems with a number of things; see the presentation
- they did not find good SW for managing images, only command-line tools, and non-technical staff would make a lot of mistakes with those
- it takes a lot of people before the content gets online
- the staff must be well trained and flexible, but also able to do tedious jobs: systematic, patient

NOTE: important for NK, where the transfer of data from discs will also have to be dealt with and has already been dealt with.

KEEP project, Antonio Ciuffreda: towards integrated migration environment
- disk transfer tool: converts a disk into an image file; the image also carries further metadata about the file system, the MD5 of the file etc.
- KEEP is producing a Transfer Tool Framework
- magnetic media: disk transfer tools for floppy disks, commercial and open source
- Disk2FDI: a commercial DOS tool; a very accurate image of the floppy disk; it takes 1 hour and the result is very large, ten times larger than the floppy disk itself; tested on about 2260 disks; they tested emulation
- Catweasel: a commercial tool; it is a PCI card, runs on Linux and Win XP and has a GUI; high error rate but faster; the quality of the image file was low
- Nibtools: a free tool; G64 and D64; covers only C64; DOS, Win, Linux, but command line only; needs a Commodore disk drive and special cables; they tested a few disks and about half then did not work in the emulator
- optical media: they used 5 transfer tools, with the same CDs, DVDs and games for all of them:
  1. Alcohol 120%: commercial; can bypass DRM etc.; supports Win systems; 12 of 13 worked; 2 MB per second
  2. Daemon Tools: commercial; several image file types (ISO, MDS, MDF); supports Win; three of 13 did not work
  3. CloneCD: commercial; uses IMG or ISO; bypasses SafeDisc 3 protection; supports Win; has a GUI; 11 of 13 were OK
  4. BlindWrite: commercial; supports DVD, Blu-ray, WM, Xbox and other special discs; generates ISO and some proprietary ISO image formats; one did not work; transfer speed
  5. ImgBurn: does not read subchannel information into the image file (a film cannot be seeked etc.); it is open source; generates DVD, BIN, CUE, IMG; Win and Linux; 4 did not work; it is fast
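
The transfer tools above record an MD5 checksum together with the image file. A minimal Python sketch of that step is shown below; it is not the KEEP tool, and the function name and the shape of the returned record are assumptions chosen for illustration.

```python
import hashlib
import os


def describe_image(path, chunk_size=1024 * 1024):
    """Return basic fixity metadata for a disk image file.

    The file is read in chunks so that large images (CD, DVD)
    do not have to fit into memory.
    """
    digest = hashlib.md5()
    with open(path, "rb") as image:
        for chunk in iter(lambda: image.read(chunk_size), b""):
            digest.update(chunk)
    return {
        "file": os.path.basename(path),
        "size_bytes": os.path.getsize(path),
        "md5": digest.hexdigest(),
    }
```

Storing such a record next to each image makes it possible to verify later that the image has not changed since the transfer. MD5 is used here only because the notes mention it; any other `hashlib` algorithm could be substituted.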

Conclusions
- magnetic media: the performance of commercial and non-commercial tools does not differ; Disk2FDI is accurate but very slow; KEEP will use NibTools
- optical media: copy protection must be taken into account; the tools perform similarly; there will always be errors in the images, between 30 and 10 per cent; BlindWrite can handle game discs (Xbox etc.); KEEP will use ImgBurn because it is open source
- for complex images BlindWrite is better

Benefit for NK
Consider whether NK should actually run a project on migrating the content of CDs and DVDs to online media. The presentations give concrete experience with robotic processing and show what problems the authors had with designing the workflow, choosing the type of ISO image etc.

BnF: web archiving
They have three layers:
- Harvest definition: collection
- Harvest instance: crawling metadata
- ARC files

A collection is, for example, the selective harvest Elections 2000, then come the individual harvest instances and then the ARCs. They collect logs, config and reports. These are also stored in an ARC: a special metadata ARC for each crawling instance.

PREMIS: object, agent, event
- Objects: 1. ARC files and metadata ARCs, 2. harvest instances
- Harvest event in PREMIS: event = creation of content files
- Events: reports as extensions of the event: host report and harvest report
- Agents: administrator, SW, institutions, organisations that perform the harvest

containerMD
- http://bibnum.bnf.fr/containerMD-v1/documentation/containerMD-v1.html
- special metadata for web archive material
- http://bibnum.bnf.fr/containerMD-v1

Different SLAs for different types of material and for different data from different harvests. Shared repository: they expect various benefits; skills for different formats need not be duplicated within the institution.
Next year a JHOVE2 module for WARC should also exist.
Memento: a meta search engine

Benefit for NK
Their web archiving model could be used at NK.

Cost models: Danish national library + TU Wien
- Stephan Strodl, TU Wien: they have their own cost model, but apparently only for the cost of small-scale automated preservation actions
- the Danish national library built its own model, which should be universal and usable anywhere
- they measured the cost of submission according to the PAIMAS standard
- when calculating costs they use OAIS and PAIMAS; they map activities onto these models and estimate process costs accordingly
- costmodelfordigitalpreservation.dk

Benefit for NK
Relevant to project 0136, which dealt with options for estimating the costs of long-term storage.

Meet RODA, a Full-Fledged Digital Repository for Long-Term Preservation
- originally a project of the Portuguese national archive; we have been following it for several years. The RODA system is now supported by an independent company, which also partly develops it further. So far RODA supports only an archival metadata format (EAD), but further development should include library formats as well.
- RODA is now part of the SCAPE project, in which the system can be developed further and scaled for use in mass production
- http://redmine.keep.pt/projects/roda-public

Benefit for NK
Follow further development, possibly also for the INCAD + KNAV projects; for developing LTP for smaller institutions this could be an interesting alternative in the future.

Report on a foreign business trip

Name and surname of the participant: Mgr. Andrea Fojtů (AF)
Workplace according to the organisational structure: 1.5 Department of Long-term Preservation of Digital Data (ODODD)
Workplace, assignment: 1.5.2 Department of Digital Repository Content Management
Reason for the trip: participation in the conference Aligning National Approaches to Digital Preservation
Place, city: Tallinn
Place, country: Estonia
Dates (from-to): 22-26 May 2011
Detailed schedule:
Monday 23 May: conference registration, guided tour of the National Library; Keynote Address by Laura Campbell, Library of Congress, USA; Panel 1: Technical Alignment; Panel 2: Organizational Alignment
Tuesday 24 May: Keynote Address by Gunnar Sahlin, National Library of Sweden; Panel 3: Standards Alignment; Panel 4: Legal Alignment; Breakout Sessions for panels 3 & 4
Wednesday 25 May: Panel 5: Education Alignment; Panel 6: Economic Alignment; Breakout Sessions for panels 5 & 6; Synthesis; Closing remarks
Travelling companions from NK: PhDr. Bohdana Stoklasová (BS), Ing. Tomáš Svoboda (TS)
Funding: IOP NDK
Trip objectives: attending a conference with international participation; making contacts in the area of long-term preservation of digital documents and electronic legal deposit; gaining a more detailed insight into the long-term preservation of digital documents, especially in national libraries
Fulfilment of trip objectives (specifically): The conclusions of the conference Aligning National Approaches to Digital Preservation largely mirror the conclusions of the workshop The Future of the Past: Shaping new visions for EU research in digital preservation (report available at http://cordis.europa.eu/fp7/ict/telearn-digicult/future-of-the-past_en.pdf), for example regarding the missing economic model for the commercial sector, which would treat long-term preservation as an integral part of all its processes. Contact was established with staff of the national libraries of Estonia and Finland (staff working on web archiving and on long-term preservation in general).
Programme and further detailed information: The main aim of the conference was to align national approaches to the long-term preservation of digital documents across all areas, from technical, organisational and educational to standardisation, economic and financial.
Materials brought back: conference programme, leaflets of exhibiting companies (Tessella, Equella, Guardtime) + other materials, notes
Report submitted: 8 June 2011
Signature of the person submitting the report
Supervisor's signature
Posted on the intranet
Received by the international department
Attachment to this report: conference notes in English

Attachment: conference notes in English

Exploring What We Can Do Together: Strategic Alliance for International Collaboration, Laura Campbell
- 185 digital preservation partners in more than 25 countries (education, research, LAM)
- strategic goals: National Content Stewardship Network (national digital collection, technical architecture, public policy, outreach)
- NDIIPP Content Domain Map: a mind map of geospatial, audiovisual, image & text and web content
- then & now: cognitive surplus vs digital libraries/digital preservation
- solution framework: actively working together, special interest groups, establishing a common index, international digital collection (freely available)

PANEL 1: Technical Alignment (The role of testing), Prof. Dr. Michael Seadle (Panel Head)
- to collaborate on requiring and implementing rigorous and independent tests

DNB Contribution to the Tallinn Alignment, Sabine Schrimpf
- key theme is infrastructure
- network of hardware and software that permits operation of application SW
- question of interoperability is crucial (standards, technical specifications)
- SW elements
- components of the DP infrastructure were compared to the pallets at railroads
  o source: PARSE.Insight Roadmap 2020
- persistent ID resolvers, certification process
- kopal (ingest, koLibRI)
- nestor (German Network of Expertise)
- DP4lib: Digital Preservation as a Service; reduce dependency between components
  o redundant storage at different locations
  o koLibRI modules
- LuKII: set up as an economical LOCKSS network in Germany
- SHAMAN
- APARSEN: wants to bring coherence and cohesion to digital preservation research
  o trends in DP research projects
  o modular DP systems
  o distributed as SOA
  o elimination of technology dependencies

EDINA: the UK LOCKSS Alliance, Adam Rusbridge
- EDINA offers underlying technical support & coordination
- threats to digital stewardship
  o failure (media, HW, SW, network, format obsolescence, natural disaster, economic/organizational failure)
  o attack (insider/outsider)
  o operator error
  o source: Requirement to Digital Preservation
- projects PEPRS and PECAN help identify coverage and requirements for DP

Public testing, Michael Seadle
- traditional physical archiving relies heavily on trusted institutions
- distrust, not trust, needs to be the basis of digital archiving; testing therefore plays a key role
- goals of testing: demonstrate functionality, reveal weaknesses, provide data for planning improvements
- key issues for testing: integrity, authenticity (can the origin or genuineness be shown?), usability (can migration/emulation be demonstrated?), access, financial integrity
- DrWho (drwho1com)
- bit stream testing is the most important; authenticity and usability may be impaired
  o the type of storage media, the number of copies and the frequency of checking and replacement get us to the relevant results
  o no reliable metrics exist, however (what is an acceptable loss etc.)
- without well-documented, peer-reviewed, publicly available test results, librarians are buying archiving systems on faith
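
A very small form of bit stream testing across several copies can be sketched in Python as below. It is an illustration only and was not shown in the talk: the function name, the location labels and the choice of SHA-256 with a majority vote are all assumptions.

```python
import hashlib
from collections import Counter


def audit_copies(copies):
    """Compare replicated copies of one object by checksum.

    copies maps a storage location (hypothetical label) to the bytes read there.
    Returns (majority_digest, damaged_locations). Raises ValueError when no
    digest is shared by more than half of the copies, because then the audit
    cannot tell which copy is intact.
    """
    if not copies:
        raise ValueError("no copies to audit")
    digests = {loc: hashlib.sha256(data).hexdigest() for loc, data in copies.items()}
    majority, votes = Counter(digests.values()).most_common(1)[0]
    if votes <= len(copies) / 2:
        raise ValueError("no majority among copies; manual inspection needed")
    damaged = sorted(loc for loc, digest in digests.items() if digest != majority)
    return majority, damaged


if __name__ == "__main__":
    copies = {"site-a": b"payload", "site-b": b"payload", "site-c": b"pay1oad"}
    digest, damaged = audit_copies(copies)
    print("damaged copies:", damaged)
```

How often such an audit runs and how many copies it compares are exactly the parameters the notes name as decisive, and for which no agreed metrics exist.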

Presentation without a title, Andy Rauber
- evaluation vs testing vs benchmarking
- DP testing is evaluation rather than testing, and far from benchmarking (a few tests, but not near a definition of benchmarks)
  o existing evaluations are not repeatable
  o focus on the simple things
  o building the frameworks before having clear test scenarios
- necessary to move towards comparative benchmarking
- what is needed: a commitment that we want a culture of benchmarking and comparative evaluation; an understanding of what we want to benchmark; benchmark data + ground truth; measurement scales and measures that remain constant; knowledge bases to collect these
Organizing digital preservation on an international level Michelle Gallinger (NDIIPP)
focus on a national DP agenda
community driven, action oriented (National Digital Stewardship Alliance)
o present a distributed national digital collection for the benefit of citizens
The European Research Arena David Giaretta
technologies: GEANT, EGEE/EGI
EU research projects: TIMBUS, BLOG4EVER, SCAPE, ENSURE, APARSEN, ARCOMEM, WF4EVER
o SCIDIP-ES (2011-2014)
Alliance for Permanent Access (APA) formed as a legal entity 3 years ago
o opportunities for networking
ISO 16363: Audit and Certification of Trustworthy Digital Repositories
ISO 16919: Requirements for Bodies Providing Audit and Certification
Observation from the MetaArchive Cooperative program Martin Halbert
distributed DP programs differ from other programs
o replication of content, distribution of these replicated copies to distinct geographical locations, and a network organization to connect these replicated copies
MetaArchive established in 2003, funded by NDIIPP
o seeks to foster broader awareness of digital preservation issues
IIPC: members are all institutions that focus on WA
o 39 members: national and university libraries + other organizations (Internet Archive)
o ISO standard WARC format for web archives + Heritrix and NutchWAX
o growing membership (Africa & South America)
DAY 2
Keynote address
International and National Collaboration in the digital age Gunnar Sahlin
2012: a new law for e-deposit
Samsök (search together) in 2005 (upgrade, new system)
SwePub and long term preservation
Consortium of the Swedish research libraries for licensing e-journals and databases (ICOLC)
Open Access and e-publishing (all universities have their repositories for e-pubs)
NL aggregator for Europeana: TEL, Apres, Athena, EU screen
o common system for the preservation of digital materials
o common search portal for materials from the Swedish National Library and Swedish National Archive
Raivo Ruusalepp
standards: RAC, DSA, CIDOC (CRM), PAIMAS, ISA (DG), DDI
use of information security standards for digital preservation
information security: administrative and technical (physical = data security vs IT = communication)
company implemented security measurement with typical cyber crime scenarios
survey of security
o provision for information security in national legislation and development plans (1/2 of the respondents: ISO 27000 series, only 2 formal audits, the rest are looking into it; 1/2 of the respondents do not use standards or formal measurements to control information security)
o IT & disaster plan 65 (data recovery from the off site location tested 0)
alignment
o better use of community standards for information security and preservation
o agreement on security requirements
Standards based approach to preservation planning Matthew Woollard
ISO 27001: very expensive implementation (100 000 pounds)
Basic: Data Seal of Approval Guidelines (helps understand your business better)
Audit and Certification of Trustworthy Digital Repositories
Memorandum of Understanding to Create a European Framework
ISO 16363 external; DSA or ISO 16363 self audit
Best Practices & Standards Bram van der Werf
self assessment: trust
audits/certification: trust
ISO 30300 (draft): Record Management
PANEL 4 Legal Alignment
Legal deposit & Web Archiving Adrienne Muir
legal deposit provisions: purpose, scope, deposit mechanisms, roles & responsibilities & liability, access provisions, sanctions
implementation: definitions, scope (offline/online, freely available/pay wall), technology neutral/incremental
legal deposit vs voluntary (interim/hybrid approach, model agreements and licenses, flexibility)
other legal issues: intellectual property rights, preservation, access, unlawful material, privacy/data protection
voluntary approaches have disadvantages but may be necessary and can be useful
Breakout Session
standards bring along alignment by themselves, but only if you use them to the full, not halfway
depends on community (users): enforceable or voluntary compliance (standards)
next steps for standard alignment
o corpora as benchmarks
o export/import completeness
o educational standards
o validation tools
o accredited training courses to accredit auditors
o framework standards
DAY 3
PANEL 5 Educational Alignment
key elements of the DCC Curation Lifecycle Model
Framework for the Education Alignment (USA: grad programs for digital preservation, new models for grad programs, internship programs (diverse knowledge), workshops)
o sharing tools
o national programs
related issues to digital preservation
o nature of costs and business models
o strategies for selection & appraisal
o ground roles and responsibilities
o effectiveness and demand for services
focus of the panel: factors influencing the actual sustainability of a digital archive
2 considerations: collaboration + user demand
challenges + gaps: span national boundaries, public + private funding, education, exportation, DP certification, competition, funding gaps, policy, selection criteria, roles & responsibilities, standards
Magazzini Digitali e legal deposit in Italy
PADI: a failed project (discontinued in 2010)
Presentation without a title Neil Grindley
how much does it cost to manage information? what institutional financial strategies are required to facilitate effective preservation? what general economic frameworks are required to enable information to persist and be accessible?
JISC 2010 Infrastructure for Education and Research Programme
archival storage and preservation activities constitute a very small proportion of the overall costs: 1531 access 55 outreach acquisition ingest
o approx 333 Euros for a set of 1000 records
KRDS2 p 83: future tool development supporting automation of ingest
Sustainable Preservation in North America: ADPNet & Friends Aaron Trehub
solution: distributed digital preservation (in at least 3 copies vs LOCKSS 6 copies)
DDP + LOCKSS = PLN (Private LOCKSS Network), open SW developed at Stanford
MetaArchive, COPPUL (Canada), ADPNet www.adpn.org
Moving On When it is Time to Re-Archive Michael Selway Quantum Corporation (USA) CHW) D
jinak to nejde u HDD
Migrace dat z Jak se na to migrac
- HDD - - roky
a mohl by s
- - - - -
co s split migration (full split migrace)
- - - - potaz
FamilySearch An End-to-End Process for Scanning Characterizing Preserving and Providing Access to Very Large Collections of Vital Records Tom Creighton FamilySearch (USA) Jonathan Tilbury Tessella plc (UK) and Mark Evans Tessella Inc (USA)
- Ve FamilySearch mikrofilmy od poloviny 19 st - - - - - s -
- DPS je jejich s - -
The Audit and Certification of FDsys
- FDsys -auditem
TRAC http://www.gpo.gov/fdsys
- - TRAC hathi UNT Portico metaarchive chronopolis v
- stupnice compliancy 1- -
How Long is Long-Term Data Storage (Focal) Barry M Lunt Brigham Young University and Douglas Hansen Wayne Rust and Mark Worthington Millenniata Inc (USA)
- - FLASH technologie 10- plny elektron
odlivu elektronu a
-
a presto se objevily chyby
- Library 1 21 - Library 2 18
- 1050 years - HDD 1-7 years -
Quality Assurance of Digital Information in Long-Term Digital Preservation University of Technology (Sweden)
- significant properties - dokumentech z -
Towards Interoperable Preservation Repositories Repository Exchange Package Use Cases and Best Practices Joseph Pawletko New York University and Priscilla Caplan Florida Center for Library Automation (USA)
- TIPR
- - - TIPR
-
- RXP mets a premis semantika soubory metadat ne v metsu ale vedle v
- - succession - - - - disaster recovery a v
jako aip - migrace SW - - diversifikace - migrace dat v -
SARKK Comprehensive Digital Archive Services for Finnish Municipalities
-Savon Tietohallinto Oy (Finland)
- firma kterou v ICT long-
- - - preserv planning funkcionalitou -
-
Magnetic Tape Technology economic advantages for preservation Gary Francis Oracle USA
- Oracle presentation - - Clipper Group 2010, In Search of the Long-Term Archiving Solution: Tape Delivers Significant TCO Advantage over Disk - 5 TB per cartridge (Sun/Oracle)
-----------------------------------------
Color In Digital Preservation Robert Buckley University of RochesterNewMarket Imaging Steven Puglia National Archives and Records Administration and Michael Stelmach Library of Congress (USA)
-
atd -
o barev GAMUTem
AdobeRGB Pro FotoRGB
- a reprodukci barvy
Multispectral Image Archiving of Watermarks in Historical Papers Peter Meinlschmidt Wilhelm-Klauditz-Institut Fraunhofer-Institute for Wood Research and Volker
Braunschweig (Germany)
- Reflected light - Transmited light
-spectral imaging -
o
o o o o
Implementing a Quality Assurance Program for Monitoring Scanner Performance Michael J Horsley and John T Berezich National Archives and Records Administration (USA) DAITSS
- -
mikrofilm sken a r - NARA 2004 Guidelines
https://docs.google.com/viewer?url=http%3A%2F%2Fwww.archives.gov%2Fpreservation%2Ftechnical%2Fguidelines.pdf
- Metamorfose - Atd viz sildes - Quantitive performance slides - Web based database sharepoint zace -
Preservation in a Digital Age Jay Verkler FamilySearch (USA)
- Tessellou pro SDB budou k 1 - - - data loss is intrinsic gt - - - och
- preservation as a service
Curation of the End-of-Term Web Archive Classification and Metrics Kathleen Murray Lauren Ko and Mark Phillips University of North Texas (USA)
- - EOTCD archive http://research.library.unt.edu/eotcd/wiki/Main_Page - - for - 16 TB of data - - (not WARCs)
pak to pospojo
DAITSS Grows Up Migrating to a Second Generation Preservation System (Focal) Priscilla Caplan and Carol Chou Florida Center for Library Automation (USA)
- Daitss 1 se nedal nainstalovat jinde si -) -
instructions
- v procesech
- - PSP
- refresh funkce - disseminate- do a refresh and export new AIP as DIP - withdraw remove AIP from storage retaining provenance
-
1
- - 80TB -
A Community Driven Micro Services Architecture Supporting Long Term Digital Preservation Mark Evans and Bill Steel Tessella Inc (USA) and Robert Sharpe James Carr Alan Gairey and Jonathan Tilbury Tessella plc (UK)
s NDIIPP v micro-services
spojeno do procesu- lze pak libovol
- - - -
-
workflow - -
- bit-streamu - cloudu (S3
http://aws.amazon.com/s3) SaaS functionality, multi-tenancy coming soon
funkcionalitou policy apod
Pozn z USA
v 44TB SIP v testu (10MB JP2 soubory)
komunita SDB - - vznik 2008 - -
System
Lisa LaPlant and Blake Edwards US Government Printing Office (USA) FDsys OAIS dig MODS - z
<part>2</part>
DMD data model definition z
http://www.gpo.gov/fdsys/search/pagedetails.action?st=prague&granuleId=&packageId=DCPD-200900228
----------------------------------------- Preservation Starts from the Beginning Michael Wash US Department of Transportation (USA)
Autographic Kodak 1916 na druhou
following the No 3A Autographic Kodak Special of 1916, which was the first rangefinder camera. It had a Kodak Anastigmat f/6.3 lens and a Kodamatic shutter with speeds from 1/2 to 1/200 sec plus bulb and time mode. http://camerapedia.wikia.com/wiki/No_1A_Autographic_Kodak_Special Kodak Advantix
zastavil
Colorite: A Flexible Cross-Platform Software Solution for Automatic Image Quality Analysis Using Arbitrary Targets Henrik Johansson National Library of Sweden (Sweden)
- a - -
o Target se detekuje automa se aby tento proce
o TIFF, JPEG, JP2, PNG o o state-of-the-art feature based image matching algorithm, ImageMagick
o Alg
t o o v budoucnu vypnout aby se proces urychlil pro BATCH
proce o
Henrika Johanssona What if the Image Quality Analysis Rates My D Dietmar Wueller Image Engineering (Germany)
- Golden thread - UTT targer -
o
-lens reflex
Establishing Resolution Requirements for Digitizing Transmissive Content A Use Case Approach Michael Stelmach Library of Congress Don Williams Image Science Associates LLC and Steven Puglia National Archives and Records Administration (USA)
-
- Based on 10 SFR limiting resolution criteria how much the image information will be captured
o first half of the 20th century: 1200-1600 PPI o second half of the 20th century: up to 2800 PPI o o
Digitise More Pay Less Optimising the Workprocess for both Heritage Institute and Imaging Provider Olaf Slijkhuis Pictura Imaginis (the Netherlands)
- ument kvalita parametry atd - -
- trolu
obrazu
Discussion with FamilySearch, 20.5.2011 -------
also on digital preservation issues
will have to provide free of charge
and synchronize
an effort on their part to cooperate with archives and libraries
CO-FINANCED FROM EU STRUCTURAL FUNDS (EUROPEAN REGIONAL DEVELOPMENT FUND) THROUGH THE IOP
Business trip report
Project: Creation of the National Digital Library
CZ 10611000706386
Participant's name and surname: Jan Hutař
Workplace according to the organizational structure: ODF 81
Position: head of department
Purpose of the trip: attending the iPRES 2011 conference
Place (city): Singapore
Place (country): Singapore
Dates (from-to): 30.10.-5.11.2011
Detailed schedule: 30-31.10. flight Prague > Dubai > Singapore
1.11. start of the conference, tutorials
2.11.-4.11. conference
4-5.11. return flight Singapore > Dubai > Prague
Fellow travellers from NK: Mgr. Marek Melichar (paid from project 0136)
Funding: IOP, Creation of the National Digital Library
Relation to the project: gaining new information on digital preservation issues and on projects in other libraries; consultations with colleagues and companies
Goals of the trip: see relation to the project; use all outputs for planning and running the NDK project and for the future handling of digital preservation in NK/NDK
Fulfilment of the goals: fulfilled, see the detailed notes below and the proceedings on SPS
Further, more detailed information
SUMMARY AND BENEFIT FOR THE NDK PROJECT
a noticeable rise of emulation as an approach to long-term preservation (in previous years it was migration) > the two approaches look set to complement each other
a shift towards preserving complex data (databases etc.); NK does not address this yet
many contributions are usable for NK and NDK (web archiving and preservation at the national library of France, audits, emulation, information on the SDB system (Tessella) and on the RODA system, certification (see Rouchon), etc.)
information on problems and solutions when using tools such as JHOVE, PRONOM and others in a real environment
2 contributions on backing up optical discs, a current problem in NK as well; ideally follow the procedures described in the proceedings
a clear need for international cooperation and for adherence to standards so that such cooperation is possible
for details see below
Project publicity support: N/A
Related materials
Material / Storage location
conference proceedings / SPS, folder with business trip reports
Date of report submission: 15.11.2011
Signature of the submitter
Date / Signature
Signature of the superior: 15.11.2011
Posted on the intranet
Received by the international department
Seamus Ross: digital curation and preservation
- preserving data sets: use of statistical databases and research data vs preservation of textual information
- personal data are no longer just photos in a box (Flickr etc.)
- banks: lots of personal data, future use for historians, must be preserved; institutions do so as well
- see mckinsey.com, big data full report (pdf)
- why do DP, then (slide 16): future generations expect it; for historians and scientists, so that they have some sources; a message about the present for the future; information ecosystem to enable storytelling
- emphasis shifting from preserving textual information to preserving complex databases
A capability model for DP, Ch. Becker et al.
- research within the SHAMAN and SCAPE projects
- SoS: systems of systems
- 3 kinds of systems (UML)
- DPS as a functional requirement
- SoS: a business system, a system within a system; data then flow into the DPS
- DPS where DP is not a functional requirement but the system does it anyway (DP-ready system): a business system with DP functionality
- but how to get DP into enterprise systems: a model for implementing DP in any system; within the SHAMAN project, a capability based reference architecture
- governance, business and technical (operation) capability: a basis for decisions and for assessing the current state
- capability maturity model (CMM): process assessment and improvement, from SW development
Olivier Rouchon: certification and quality at CINES
- they store theses, digitized material, multimedia, documents, scientific data sets
- data centre for the whole of France
- they have experts on formats and XML, 11 people
- 15 TB of data
- certification: by national law CINES is the national centre for DP of theses; they have a department, people and money for it; for the procedure and preparations see below
- preparing for certification: testing DRAMBORA, DSA, TRAC, ISO 16363 and ISO 16919
- step: 2009 DRAMBORA audit; 2 risk reviews per year on how the mitigation is progressing
- step: formalization of business processes, 14 processes according to ISO 9001
- management, operational and support processes (presented at iPRES 2010)
- 2009: external pre-audit, 2 people, 19 man-days, based on all available standards, by reviewing documentation and interviews
- 2010: SIAF audit, 4 months; the National Archives of France carry it out for every archive that stores public data; audits are done every 3 years; the report had 800 pages
- 2010: Data Seal of Approval, part of the EU framework for audit and certification of trusted repositories (MoU between the three certification activities)
- 2011: within the APARSEN project they also did ISO 16363; they went through the audit together with DANS and UKDA
- first internal, January and April 2011 (60 man-days), then external, 12 experts (KB, BL, NASA etc.), in June 2011 (3 days)
MoU: DSA > ISO 16363 as the second step (internal) > ISO 16363 extended
the audit gets faster the more of them you do, i.e. if it is done regularly it is not that time-consuming
The National Library of New Zealand passed TRAC certification in autumn 2011
Andreas Rauber: the impact of preservation actions on the repository
what happens to the repository itself
repository simulation: RepoSim
for analysis and for testing migration: what happens when files grow, what if the repository holds more format types, etc.
RepoSim: a flexible simulator, irregular patterns
internal version so far: Hibernate, Java, MySQL
one can specify which formats it accepts and their description, ingest settings, hypothetical tools (mainly for migration), and rules for preservation actions (migration to which format, which versions, which files, how many times; rules + filters)
option to run a virtual migration; it produces graphs showing how long it will take, etc.
what to migrate, how, to what and for how long: it runs virtually and we see the result
good for IT and HW planning
good for planning various scenarios, comparison with the expected development, planning HW growth and investments
still to be finished: the option to enter deletion policies, reports, etc.
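The idea of a "virtual migration" can be illustrated with a very small sketch. This is not RepoSim (which was described as an internal Java/Hibernate/MySQL tool); the class names, the holdings and the tool throughput below are all hypothetical and only show how holdings per format plus a migration rule give a rough duration estimate.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class FormatHolding:
    fmt: str            # format label, e.g. "TIFF" (illustrative)
    files: int          # number of files of this format in the repository
    avg_size_mb: float  # average file size in MB


@dataclass
class MigrationRule:
    source_fmt: str
    target_fmt: str
    tool_mb_per_s: float  # assumed throughput of a hypothetical migration tool


def estimate_hours(holdings: List[FormatHolding],
                   rules: List[MigrationRule]) -> Dict[str, float]:
    """Estimate migration time in hours for every format that has a rule."""
    rule_by_fmt = {rule.source_fmt: rule for rule in rules}
    hours = {}
    for holding in holdings:
        rule = rule_by_fmt.get(holding.fmt)
        if rule is None:
            continue  # no preservation action defined for this format
        total_mb = holding.files * holding.avg_size_mb
        hours[holding.fmt] = total_mb / rule.tool_mb_per_s / 3600
    return hours


if __name__ == "__main__":
    holdings = [FormatHolding("TIFF", 1000, 36.0), FormatHolding("PDF", 500, 2.0)]
    rules = [MigrationRule("TIFF", "JP2", tool_mb_per_s=10.0)]
    print(estimate_hours(holdings, rules))
```

A real simulator would also model growth over time, irregular ingest patterns and repeated migrations, which the notes mention but this sketch leaves out.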
José Barateiro: Risk assessment in DP of e-science data and processes
DP as risk management
ISO 31000: definition of risk management
similar to DRAMBORA
there are many standards for risk management
the ISO 31000 methodology broken down into individual steps
TIMBUS project http://timbusproject.net; one of the partners is SAP (Germany)
mad talks
- open source SW for LTP: RODA is back, being developed within the SCAPE project; new features, plans for development and for building a user community
- 4 posters on emulation: emulation within KEEP, emulation for library reading rooms, OPF ecosystem, registry
- TOTEM: a metadata standard for describing the technical environment for emulation
ANDS, Ross Wilkinson
- data centres in Australia, at least 3, for various areas of life
- ANDS has existed for almost 3 years, funded by the Australian government
- a huge amount of data will never be used or read by a human, only by automated processes and data mining
- research data must be stored and preserved, because it may no longer be possible to recreate them, in such a way that they can be reused, that new conclusions can be drawn from them, and that scientists have them available
- this has to be done in cooperation; a single institution cannot do it alone
- who handles the storage of scientific data in the Czech Republic? The Academy of Sciences, CESNET
- similar data centres also exist in Great Britain
Rob Sharpe: Considerations for High Throughput Digital Preservation
Presentation by Tessella: their performance testing of ingest into SDB at FamilySearch
- SDB has been developed since 2002, when the first customer was the National Archives (UK)
- new customer: the UK Parliament
- test with FamilySearch
- 20 TB ingested per day; scanned material; workflow with antivirus, characterization (PRONOM, JHOVE), etc.
- 1 package is roughly 1 GB; 20 thousand packages per day
- 2 Dell PowerEdge R710 servers, total price at most 20,000 pounds
- the limiting factor turned out to be the read speed of the disks holding the data at the start of ingest; they therefore needed 130 parallel disks (50 thousand pounds)
- storing to tape is also slow; they therefore needed 8 parallel tape writes (30 thousand pounds)
- storage costs 100 pounds per TB
- 7.3 PB per year
- conclusion: writing and reading are slow; tools such as JHOVE and PRONOM are fast enough; storage costs also turned out to be high
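The ingest figures quoted in this talk can be sanity-checked with simple arithmetic. The sketch below assumes decimal units (1 TB = 10^12 bytes) and a 365-day year, neither of which was stated in the talk; the function names are made up for illustration.

```python
TB = 10**12                      # decimal terabyte in bytes (assumption)
SECONDS_PER_DAY = 24 * 60 * 60


def sustained_rate_mb_per_s(tb_per_day: float) -> float:
    """Average throughput in MB/s needed to move tb_per_day within 24 hours."""
    return tb_per_day * TB / SECONDS_PER_DAY / 10**6


def packages_per_day(tb_per_day: float, package_gb: float) -> float:
    """Number of packages per day for a given package size in GB."""
    return tb_per_day * 1000 / package_gb


def yearly_volume_pb(tb_per_day: float, days: int = 365) -> float:
    """Ingested volume per year in PB."""
    return tb_per_day * days / 1000


def yearly_storage_cost_gbp(tb_per_day: float, gbp_per_tb: float,
                            days: int = 365) -> float:
    """Cost of storing one year of ingest at a flat price per TB."""
    return tb_per_day * days * gbp_per_tb


if __name__ == "__main__":
    print(round(sustained_rate_mb_per_s(20)))   # about 231 MB/s on average
    print(packages_per_day(20, 1))              # 20000 packages of 1 GB
    print(yearly_volume_pb(20))                 # 7.3 PB per year
    print(yearly_storage_cost_gbp(20, 100))     # 730000 pounds per year
```

The results agree with the 20 thousand packages per day and the 7.3 PB per year in the notes. The average rate of about 231 MB/s is roughly a third of the 700 MB/s maximum mentioned for this project; the notes do not explain the difference.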
To ingest the data from the FamilySearch project they needed to ensure a throughput of 20 TB per day while keeping adequate procedures for processing the data according to the requirements of OAIS and of the customer. The aim of the project was to identify the bottlenecks when ingesting large volumes of data.
Processes such as generating or checking hashes, identifying formats and extracting technical metadata usually need, at large volumes, a fast storage system. In the FamilySearch project they want to ingest into SDB (content acquisition, content preparation, ingest: fixity check, content/metadata integrity check, characterization, i.e. format identification and validation and extraction of technical MD) at most 700 MB per second.
They addressed how to parallelize such a massive workflow efficiently while minimizing costs. According to their findings, parallelization makes it possible to get around the performance problems of tools such as DROID and JHOVE; overall, software performance was not a problem, contrary to their expectations. The bigger problems lie in the HW, which has to be able to write fast enough.
That is, the bottleneck was in the HW and in moving data from place to place rather than in the performance of the digital preservation tools.
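As an illustration of parallelizing one of the ingest steps named above (the fixity check), here is a minimal sketch using only the Python standard library. It is not how SDB does it; the function names, the choice of SHA-256 and the worker count are assumptions made for the example.

```python
import hashlib
from concurrent.futures import ThreadPoolExecutor
from pathlib import Path
from typing import Dict, List


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Stream a file through SHA-256 in 1 MiB chunks and return the hex digest."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_package(expected: Dict[Path, str], workers: int = 8) -> List[Path]:
    """Return the files whose current checksum differs from the recorded one.

    Files are hashed concurrently; with many large files the run time is
    then bounded mostly by how fast the storage can deliver the bytes.
    """
    paths = list(expected)
    with ThreadPoolExecutor(max_workers=workers) as pool:
        actual = list(pool.map(sha256_of, paths))
    return [p for p, digest in zip(paths, actual) if digest != expected[p]]
```

Calling `verify_package({Path("a.tif"): "<recorded digest>"})` returns an empty list when everything matches. Adding workers only helps until the disks are saturated, which is consistent with the observation that the bottleneck was in the HW rather than in the tools.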
Benefit for NK
No need to worry about the performance of SW such as DROID or JHOVE
Ross King: Evolving domains, problems and solutions for LT DP
- information on projects such as SCAPE
- programme: http://cordis.europa.eu/fp7/ict/telearn-digicult/report-research-digital-preservation_en.pdf (Stephan Strodl, Petar Petrov, Andreas Rauber, all Vienna University of Technology, Austria). A nice timeline of preservation projects, a whitepaper about the past of European DP
- funding spent on DP research is gradually growing. Projects and funding will not solve anything
- ARCOMEM: archiving of web archives, socially driven web preservation model
- social web analysis
- archive enrichment
- ENSURE: evaluation between cost and value, automation of the preservation cycle, testbeds: healthcare, clinical trials, financial services
- SCAPE
- preservation planning and action workflows: how to make them scalable
- creating infrastructure for scalable preservation actions
- developing policy based preservation planning tools with automatic preservation watch
- 3 testbeds: WA, large scale repositories, research data sets
- all the projects will produce prototype SW
- digital lifecycle approach
- preservation planning plays a role in all these projects, together with virtualization
- slide with trends in DP over recent years
- Research on Digital Preservation within projects co-funded by the European Union in the ICT
- Ensure
- Scape
- Wf4Ever http://www.wf4ever-project.org/about
- Timbus: SW is not enough; focuses on the context of organizations; LTP is not only about objects but also about services, etc.
- Totem
Benefit for NK
Follow projects in the area of long-term preservation of digital data. The latest EU projects such as SCAPE will speed up the development of concrete tools for long-term preservation of digital data.
Record keeping in temporary command settings, Erik Borglund
preserving documentation of crisis situations arising from the activities of the police etc.
how to capture context: flipcharts, videos and written records can be kept; with analogue documents this is not a problem, the problem is with digital material and conversations
the national archive should take care of it, but it only accepts paper documents or e.g. photos from the scene of the meeting
the question of archiving records is the same as that of archiving the course of a meeting in digital form
webarchiving session
BnF
200 TB of web-archived data
15 million ARCs; they have to characterize and validate them, which is time-consuming
they store them in a shared repository
SPAR (the LTP system of the French national library) has a capacity of 16 PB
they use JHOVE2 for characterization and are building a module for ARCs
they do not want to characterize and validate the content of ARCs, only identify formats
PREMIS in METS would be too long, so they will only record metadata at the level of the information package (AIP) that are the same for the whole package; i.e. a property is expressed once and then only the information on which files it applies to is added, instead of repeating the information for every file
they created a special metadata format
i.e. they are able to ask the LTP system "give me all information packages that contain format XY" etc.; there is no need to index the metadata of the contents, which would take a long time; they take the same approach for digitized books
different DP policies and validation levels for different types of WA data: complete harvests vs thematic harvests
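The package-level approach described for BnF (state a property once and list the files it applies to, rather than repeating it for every file) can be sketched as follows. This is only an illustration of the data structure, not the BnF metadata format; the function names, package ids and format labels are hypothetical.

```python
from collections import defaultdict
from typing import Dict, List


def summarise_formats(file_formats: Dict[str, str]) -> Dict[str, List[str]]:
    """Turn a per-file mapping (file id -> format) into a per-format
    mapping (format -> sorted file ids), so each format is stated once."""
    by_format = defaultdict(list)
    for file_id, fmt in file_formats.items():
        by_format[fmt].append(file_id)
    return {fmt: sorted(ids) for fmt, ids in by_format.items()}


def packages_containing(packages: Dict[str, Dict[str, List[str]]],
                        fmt: str) -> List[str]:
    """Answer 'give me all information packages that contain format fmt'
    by looking only at the package-level summaries."""
    return sorted(pid for pid, summary in packages.items() if fmt in summary)


if __name__ == "__main__":
    aip_1 = summarise_formats({"f1": "format-X", "f2": "format-X", "f3": "format-Y"})
    aip_2 = summarise_formats({"g1": "format-Y"})
    packages = {"aip-1": aip_1, "aip-2": aip_2}
    print(aip_1)
    print(packages_containing(packages, "format-X"))
```

The query only touches one small summary per package, which matches the point in the notes that the metadata of the contents themselves need not be indexed.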
NL NZ
2 harvests, 20 TB in total
they are dealing with metadata: how much metadata is a lot and how much is too little
the library's policy says that as much metadata as possible must be stored, which would, however, be a problem in terms of metadata size
for selective web harvests they have a finished workflow in WCT; everything is catalogued
IA
16 billion URLs
the oldest from 1996
3 TB per day; the increment is 1 PB per year
Euan Cochrane, Dirk von Suchodoletz: Replicating Installed Application and Information Environments onto Emulated or Virtualized Hardware
- capturing and preserving a complete environment on emulated HW
- e.g. take the prime minister's desktop environment and store it in the archive
- display problems
- computer forensics
- an option for preserving scientific data and records
- the whole thing is about how to replicate an HDD and run the environment stored on it in a virtual environment
- solution
- they pulled the HDDs out of several old PCs > identify HW requirements (HDD analysis > estimate of requirements, automatically; it is part of every PC environment) > choose emulation/virtualization SW (tool registry such as TOTEM from the KEEP project) > convert the HDD into a disk image suitable for emulation > try to boot the disk image on emulated HW > add drivers
- problems with licences, personal data protection, authenticity (20 things change, colours etc.)
- QEMU, SPARC processor emulator
Klaus Rechert: Remote Emulation for Migration Services in a Distributed Preservation Framework
use of emulation as a tool for migration
migration tools are often not available for certain formats
We put the digital object into an emulated environment (a virtual machine); we then see it in the environment of the emulated system, can open it in the original or another suitable application, save it in a different format and store it back in the virtual machine
Bram Lohman: Emulation as a Business Solution: the Emulation Framework, KEEP project
emulation framework: 7 emulators, 6 platforms (x86, Amiga, etc.), 23 file formats
a solution for managing emulation tools
setup of emulation processes
an environment that contains emulators; if we load an application or a file into it, it should run as in the original environment
the environment also contains a tool that shows, for a given file, which format it is and which environment is needed to run it, based on PRONOM; the environment can then be prepared right away and the file run in it; it loads the SW image from an OPF application database that is being built
Geoffrey Brown: Developing Virtual CD-ROM Collections: The Voyager Company Publications
- publications of one particular publisher on CD, interactive applications for Mac; of 200 published, only about 50 are available today
- emulation on today's systems
- HDD snapshot directly in the emulator, i.e. it takes one click and is very fast
- SheepShaver emulator
Evaluation of the Danish large migration project
Before 1998 they had no formats laid down by law
Between 2005 and 2008 they introduced standards
The evaluation concerns the adopted standards and migration to them in the national archive
They did the evaluation for the funder
Between 2005 and 2008 they spent 30 person-years on the migration, had 10-15 people on it, invested 190 thousand USD; total costs 26 million USD
It is not much data; what they actually migrated was about 1777 GB
Various parts of the archive: tapes, population data, data on CD-R, registries and electronically populated data
They could not read all the files, especially on tapes
5 different types of tapes
Some had to be rescued at great expense
Total cost of producing the preservation standard: 10 man-years, 12 thousand USD, including a manual and implementation recommendations
Pilot: project planning and management, and verifying the information packages
The goal of the pilot was to obtain a better budget and project plan
Some problematic data in old formats, such as old databases, need smart people doing repetitive, long-running work; they needed good knowledge management for it to be efficient
Migration method: they wrote requirements for the tool and a description of how manual migration should be done
Data preparation (restructure the data and register the metadata of the IP) and preparation of documentation for the migrated IPs
Software development: in-house development
They needed 50 person-years for 1
Conclusions
migration of standard data is cheaper, migration from some standard tapes is cheaper, etc.
Most of the costs (80) fell on non-standardized data, in producing the migration software. Developing a tool for migrating heterogeneous or non-standard data is the most expensive part
What they learned: they had not analysed the old data sufficiently
Project management was loose; they lost money
Knowledge management: a good description of old data and of all their types, generations, locations etc. does not exist here and we will have trouble with it; migrating old data in NK will be problematic
Angela Dappert: robust migration workflow for offline media
- What is an archival object (nice slide): a CD is not an archival object; for them it is a hand-held carrier. A bit-stable object is better: it can have a backup, etc., and leads to an archival object with further metadata (logical preservation)
- A CD is not searchable, cannot be replicated easily, carries a large manual overhead, and its rendering technology becomes obsolete very quickly
- Endangered Archives project: optical disks, CD-R, external HDs, tapes, 67 terabytes in total
- The offline hand-held carriers in the Endangered Archives project were highly variable; they held data with DRM, data under copyright, and a range of similar problems
- Options they chose between for each data source:
- Disk image: a single file that contains everything on the carrier
- Or extraction of selected files only
- How important is the carrier itself? We need some information about it; it may hold traces of deleted data, which we may want to keep. They made disk images of everything possible: hybrid DVDs, audio discs that also held data, etc.
- Which disk image format should be used? Not a single format for all data, but a disk image format specific to each special case
- They did it with robots (disk copying robots); in some cases large-scale disk copying robots could not be used, because they are good at producing CDs but not at ripping data from them
- They built their own application with disk stacks and some smaller robots; they tried LIFO and FIFO and ended up using FIFO, because LIFO had problems with the CD lifter
At the KB they thought through a fairly complex workflow, how to describe it, etc.
Each robot had its own PC
They had problems with a number of things, see the presentation
They did not find good software for image management, only command-line tools, and non-technical staff would make a lot of mistakes with those
It takes a lot of people before the content gets online
The staff must be well trained and flexible, but also able to do tedious jobs: systematic, patient
NOTE: important for the NK, where the transfer of data from discs will also have to be addressed, and already has been
KEEP project, Antonio Ciuffreda: towards an integrated migration environment
- Disk transfer tool: converts a disk into an image file. The image also carries additional metadata about the file system, the MD5 of the file, etc.
- KEEP is producing a Transfer Tool Framework
- Magnetic media: disk transfer tools for floppy disks, commercial and open source
- Disk2FDI: commercial DOS tool; produces a very accurate floppy disk image, but takes 1 hour, and the result is very large, ten times larger than the floppy disk itself. About 2260 disks tested; emulation was tested
- Catweasel: commercial tool; it is a PCI card, runs on Linux and Windows XP, has a GUI. High error rate but faster; the quality of the image file was low
- Nibtools: free tool; G64 and D64; covers only the C64; DOS, Windows, Linux, but command line only. Needs a Commodore disk drive and special cables. A few disks were tested; about half then failed to work in the emulator
- Optical media: 5 transfer tools used, all tested with the same CDs, DVDs and games
1 Alcohol 120: commercial; can bypass DRM etc.; supports Windows systems. 12 of 13 worked; 2 MB per second
2 Daemon Tools: commercial; several image file types (ISO, MDS, MDF); supports Windows. Three of 13 did not work
3 CloneCD: commercial; uses IMG or ISO; bypasses SafeDisc 3 protection; supports Windows; has a GUI. 11 of 13 were OK
4 BlindWrite: commercial; supports DVD, Blu-ray, WM, Xbox and other special discs. Generates ISO and some proprietary image formats; one image did not work; download speed
5 ImgBurn: does not read subchannel information into the image file (a film cannot be seeked, etc.); open source; generates DVD, BIN, CUE, IMG; Windows and Linux. 4 did not work; it is fast
Conclusions
For magnetic media, commercial and non-commercial tools do not differ in performance; Disk2FDI is accurate but very slow. KEEP will use NibTools
Optical media: copy protection has to be considered; the tools perform similarly; there will always be errors in the images, between 30 and 10 percent; BlindWrite handles game discs (Xbox etc.). KEEP will use ImgBurn because it is open source
For complex images BlindWrite is better
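The notes mention that the transfer tool stores metadata such as an MD5 next to the disk image. A minimal sketch of that idea in Python follows; it is an illustration only, not the KEEP tool. The function name `describe_image` and the record fields are assumptions introduced here.

```python
import hashlib
import os


def describe_image(path, chunk_size=1 << 20):
    """Return a small metadata record for a disk image file.

    Hypothetical helper for illustration: the field names are assumptions,
    not the metadata schema of any real transfer tool.
    """
    md5 = hashlib.md5()
    size = 0
    # Read in chunks so that large images do not have to fit in memory.
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            md5.update(chunk)
            size += len(chunk)
    return {
        "file": os.path.basename(path),
        "size_bytes": size,
        "md5": md5.hexdigest(),
    }
```

Recomputing the record later and comparing the `md5` value is a simple way to detect that an image has changed since transfer.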
Benefit for the NK
Consider whether it would be appropriate for the NK to actually run a project to migrate the content of CDs and DVDs to online media. The presenters share concrete experience with robotic processing and show the problems they had with designing the workflow, choosing the type of ISO image, etc.
BnF web archiving
They have three layers:
Harvest definition: collection
Harvest instance: crawling metadata
ARC files
A collection would be, for example, the selective "elections 2000" collection, followed by the individual harvest instances and then the ARCs. They collect logs, config and reports. These are also stored in ARC: a special metadata ARC for each crawling instance
PREMIS
Object, agent, event
Objects: 1 ARC files and metadata ARCs, 2 harvest instances
Harvest event in PREMIS: event = creation of content files
Events: reports as extensions of the event (host report and harvest report)
Agents: administrator, software, institutions/organizations that perform the harvest
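The object / agent / event split described above can be sketched as plain data structures. This is only an illustration of how the three entity types relate in the notes; the class and field names are assumptions and do not reproduce the PREMIS schema or the BnF implementation.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class PremisObject:
    identifier: str
    category: str  # assumed values: "arc_file", "metadata_arc", "harvest_instance"


@dataclass
class PremisAgent:
    identifier: str
    agent_type: str  # assumed values: "administrator", "software", "organization"


@dataclass
class PremisEvent:
    identifier: str
    event_type: str
    linked_objects: List[str] = field(default_factory=list)
    linked_agents: List[str] = field(default_factory=list)
    reports: List[str] = field(default_factory=list)  # e.g. host report, harvest report


def harvest_event(event_id, objects, agents, reports):
    """Build the 'creation of content files' event for one harvest."""
    return PremisEvent(
        identifier=event_id,
        event_type="creation of content files",
        linked_objects=[o.identifier for o in objects],
        linked_agents=[a.identifier for a in agents],
        reports=list(reports),
    )
```

Keeping the reports on the event rather than on the objects mirrors the note that host and harvest reports are treated as extensions of the event.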
ContainerMD
httpbibnumbnffrcontainerMD v1documentationcontainerMD v1html
dedicated metadata for material from the web archive
httpbibnumbnffrcontainerMD v1
different SLAs for different types of material and for data from different harvests; shared repository: different benefits are expected, and skills for different formats do not have to be duplicated within the institution
a JHOVE2 module for WARC should also exist next year
Memento: meta search engine
Benefit for the NK
Their web archiving model could be used at the NK
Cost models: Danish national library + TU Wien
- Stephan Strodl, TU Wien: they have their own cost model, but only small scale; automated preservation action costs, it seems
- Danish national library: built their own model, intended to be universal and usable anywhere
- They measured the cost of submission according to the PAIMAS standard
- When calculating costs they use OAIS and PAIMAS, map activities onto these models, and estimate process costs from that
- Costmodelfordigitalpreservationdk
Benefit for the NK
For project 0136: the options for estimating the cost of long-term storage were addressed there
Meet RODA, a Full-Fledged Digital Repository for Long-Term Preservation
- Originally a project of the Portuguese national archive, which we have been following for several years. RODA is now supported, and partly further developed, by an independent company. So far RODA supports only the archival metadata format (EAD), but further development should also cover library formats
- RODA is now part of the SCAPE project, where the system can be developed further and scaled for use in mass production
- httpredminekeepptprojectsroda public
Benefit for the NK
Follow further development, possibly also for the INCAD + KNAV projects; for LTP development at smaller institutions this could be an interesting alternative in the future
Foreign Business Trip Report
Name and surname of the participant: Mgr. Andrea Fojtů (AF)
Workplace according to the organizational structure: 15 Department of Long-Term Preservation of Digital Data (ODODD)
Workplace assignment: 152 Digital Repository Content Management Department
Reason for the trip: participation in the conference Aligning National Approaches to Digital Preservation
Place (city): Tallinn
Place (country): Estonia
Date (from-to): 22.-26.5.2011
Detailed schedule:
Monday 23.5.: conference registration; guided tour of the National Library; Keynote Address by Laura Campbell, Library of Congress, USA; Panel 1 Technical Alignment; Panel 2 Organizational Alignment
Tuesday 24.5.: Keynote Address by Gunnar Sahlin, National Library of Sweden; Panel 3 Standards Alignment; Panel 4 Legal Alignment; Breakout Sessions for panels 3 & 4
Wednesday 25.5.: Panel 5 Education Alignment; Panel 6 Economic Alignment; Breakout Sessions for panels 5 & 6; Synthesis; Closing remarks
Fellow travellers from the NK: PhDr. Bohdana Stoklasová (BS), Ing. Tomáš Svoboda (TS)
Funding: IOP NDK
Trip objectives: attendance at a conference with international participation; gaining contacts in the field of long-term preservation of digital documents and electronic legal deposit; a more detailed insight into the long-term preservation of digital documents, (mainly) in national libraries
Fulfilment of the trip objectives (specifically): The conclusions of the conference Aligning National Approaches to Digital Preservation largely repeat the conclusions of the workshop The Future of the Past: Shaping new visions for EU research in digital preservation (report available at httpcordiseuropaeufp7icttelearn digicultfuture of thepast_enpdf), for example regarding the missing economic model for the commercial sector, which would treat long-term preservation as an inseparable part of all its processes. Contact was established with staff of the national libraries of Estonia and Finland (staff responsible for web archiving and for long-term preservation in general)
Programme and further details: The main goal of the conference was to unify national approaches to the long-term preservation of digital documents across all areas, from technical, organizational and educational to standardization, economic and financial
Materials brought back: conference programme, leaflets of exhibiting companies (Tessella, Equella, Guardtime) + other materials, notes
Date of report submission: 8.6.2011
Signature of the submitter:
Signature of the supervisor:
Posted on the intranet:
Received by the international department:
Appendix to this report: conference notes in English
Appendix: conference notes in English
Exploring What We Can Do Together: Strategic Alliance for International Collaboration, Laura Campbell
- 185 digital preservation partners in more than 25 countries (education, research, LAM)
- strategic goals: National Content Stewardship Network (national digital collection, technical architecture, public policy, outreach)
- NDIIPP Content Domain Map: a mind map of geospatial, audiovisual, image & text, and web content
- then & now: cognitive surplus vs. digital libraries / digital preservation
- solution framework: actively working together, special interest groups, establishing a common index, international digital collection (freely available)
PANEL 1 Technical Alignment (The role of testing), Prof. Dr. Michael Seadle (Panel Head)
- to collaborate on requiring and implementing rigorous and independent tests
DNB Contribution to the Tallinn Alignment, Sabine Schrimpf
- key theme is infrastructure: a network of hardware and software that permits the operation of application software
- the question of interoperability is crucial (standards, technical specifications)
- SW elements
- components of the DP infrastructure were compared to the pallets at railroads
o source: PARSE.Insight Roadmap 2020
- persistent ID resolvers, certification process
- kopal (ingest, koLibRI)
- nestor (German Network of Expertise)
- DP4Lib: Digital Preservation as a Service, reduce dependency between components
o redundant storage at different locations
o koLibRI modules
- LUKII: set up as an economical LOCKSS network in Germany
- SHAMAN
- APARSEN: wants to bring coherence and cohesion to digital preservation research
o trends in DP research projects
o modular DP systems
o distributed as SOA
o elimination of technology dependencies
EDINA: The UK LOCKSS Alliance, Adam Rusbridge
- EDINA offers underlying technical support & coordination
- threats to digital stewardship
o failure (media, HW, SW, network, format obsolescence, natural disaster, economic/organizational failure)
o attack (insider/outsider)
o operator error
- source: Requirement to Digital Preservation
- projects PEPRS and PECAN help identify coverage and requirements for DP
Public testing, Michael Seadle
- traditional physical archiving relies heavily on trusted institutions
- distrust, not trust, needs to be the basis of digital archiving; testing therefore plays a key role
- goals of testing: demonstrate functionality, reveal weaknesses, provide data for planning improvements
- key issues for testing: integrity, authenticity (can the origin or genuineness be shown), usability (can migration/emulation be demonstrated), access, financial integrity
- DrWho (drwho1com)
- bit stream testing is the most important; authenticity and usability may be impaired
o the type of storage media, the number of copies + the frequency of checking and replacement get us to the relevant results
o no reliable metrics exist, however (what is an acceptable loss, etc.)
- without well documented, peer reviewed, publicly available test results, librarians are buying archiving systems on faith
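The point about the number of copies and the frequency of checking can be made concrete with a small fixity audit. The sketch below is a hedged illustration, not a tool mentioned at the conference: `audit_copies` is a hypothetical function that compares SHA-256 digests of replicated copies and flags the ones that disagree with a strict majority.

```python
import hashlib
from collections import Counter


def sha256_hex(data: bytes) -> str:
    return hashlib.sha256(data).hexdigest()


def audit_copies(copies):
    """Compare replicated copies of one bit stream by digest.

    `copies` maps a storage location name to the bytes read from it.
    Returns the majority digest (or None when no strict majority exists)
    and the sorted list of locations considered damaged.
    """
    if not copies:
        raise ValueError("at least one copy is required")
    digests = {loc: sha256_hex(data) for loc, data in copies.items()}
    top_digest, top_count = Counter(digests.values()).most_common(1)[0]
    if top_count <= len(copies) / 2:
        # No strict majority: none of the copies can be trusted as reference.
        return {"reference": None, "damaged": sorted(digests)}
    damaged = sorted(loc for loc, d in digests.items() if d != top_digest)
    return {"reference": top_digest, "damaged": damaged}
```

With only two copies, a single corrupted copy leaves no majority, which is one reason replication schemes tend to keep three or more.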
Presentation without a title, Andy Rauber
- evaluation vs. testing vs. benchmarking
- DP testing today is evaluation rather than testing, and far from benchmarking (a few tests, but not near a definition of benchmarks)
o existing evaluations are not repeatable
o focus on the simple things
o building the frameworks before having clear test scenarios
- necessary to move towards comparative benchmarking
- what is needed: a commitment that we want a culture of benchmarking and comparative evaluation; an understanding of what we want to benchmark; benchmark data + ground truth; measurement scales and measures that remain constant; knowledge bases to collect these
Organizing digital preservation on an international level, Michelle Gallinger (NDIIPP)
- focus on a national DP agenda
- community driven, action oriented (National Digital Stewardship Alliance)
o present a distributed national digital collection for the benefit of citizens
The European Research Arena, David Giaretta
- technologies: GEANT, EGEE/EGI
- EU research projects: TIMBUS, BLOG4EVER, SCAPE, ENSURE, APARSEN, ARCOMEM, WF4EVER
o SCIDIP-ES (2011-2014)
- Alliance for Permanent Access (APA): formed as a legal entity 3 years ago
o opportunities for networking
- ISO 16363: Audit and Certification of Trustworthy Digital Repositories
- ISO 16919: Requirements for Bodies Providing Audit and Certification
Observation from the MetaArchive Cooperative program, Martin Halbert
- distributed DP programs differ from other programs
o replication of content, distribution of the replicated copies to distinct geographical locations, and network organization to connect these replicated copies
- MetaArchive: established in 2003, funded by NDIIPP
o seeks to foster broader awareness of digital preservation issues
- IIPC: members are all institutions that focus on WA
o 39 members: national and university libraries + other organizations (Internet Archive)
o ISO standard WARC format for web archives + Heritrix and NutchWAX
o growing membership (Africa & South America)
DAY 2
Keynote address: International and National Collaboration in the digital age, Gunnar Sahlin
- 2012: a new law for e-deposit
- Samsök (search together) in 2005 (upgrade, new system)
- Swepub and long-term preservation
- consortium of the Swedish research libraries for licensing e-journals and databases (ICOLC)
- Open Access and e-publishing (all universities have their repositories for e-pubs)
- NL as aggregator for Europeana, TEL, Apres, Athena, EUscreen
o common system for the preservation of digital materials
o common search portal for materials from the Swedish National Library and the Swedish National Archive
Raivo Ruusalepp
- standards: RAC, DSA, CIDOC (CRM), PAIMAS, ISA (DG), DDI
- use of information security standards for digital preservation
- information security: administrative and technical (physical = data security vs. IT = communication)
- a company implemented security measurement with typical cyber crime scenarios
- survey of security
o provision for information security in national legislation and development plans (12 of the respondents: ISO 27000 series; only 2 formal audits, the rest are looking into it; ½ of the respondents do not use standards or formal measurements to control information security)
o IT & disaster plan: 65 (data recovery from the off-site location tested: 0)
- alignment
o better use of community standards for information security and preservation
o agreement on security requirements
Standards-based approach to preservation planning, Matthew Woollard
- ISO 27001: very expensive implementation, 100 000 pounds
- Basic: Data Seal of Approval Guidelines (helps you understand your business better)
- Audit and Certification of Trustworthy Digital Repositories
- Memorandum of Understanding to Create a European Framework
- ISO 16363 external, DSA or ISO 16363 self-audit
Best Practices & Standards, Bram van der Werf
- self-assessment trust, audits/certification trust
- ISO 30300 (draft): Record Management
PANEL 4 Legal Alignment
Legal deposit & Web Archiving, Adrienne Muir
- legal deposit provisions: purpose, scope, deposit mechanisms, roles & responsibilities & liability, access provisions, sanctions
- implementation: definitions, scope, offline/online, freely available/pay wall, technology neutral, incremental
- legal deposit vs. voluntary (interim/hybrid approach, model agreements and licenses, flexibility)
- other legal issues: intellectual property rights, preservation, access, unlawful material, privacy/data protection
- voluntary approaches have disadvantages but may be necessary and can be useful
Breakout Session
- standards bring alignment by themselves, but only if you use them to the full, not halfway
- depends on the community (users): enforceable or voluntary compliance (standards)
- next steps for standards alignment
o corpora as benchmarks
o export/import completeness
o educational standards
o validation tools
o accredited training courses to accredit auditors
o framework standards
DAY 3
PANEL 5 Educational Alignment
- key elements of the DCC Curation Lifecycle Model
- Framework for the Education Alignment (USA: grad programs for digital preservation, new models for grad programs, internship programs (diverse knowledge), workshops)
o sharing tools
o national programs
- issues related to digital preservation
o nature of costs and business models
o strategies for selection & appraisal
o ground roles and responsibilities
o effectiveness and demand for services
- focus of the panel: factors influencing the actual sustainability of a digital archive
- 2 considerations: collaboration + user demand
- challenges + gaps: span national boundaries, public + private funding, education, exportation, DP certification, competition, funding gaps, policy, selection criteria, roles & responsibilities, standards
- Magazzini Digitali: e-legal deposit in Italy
- PADI: a failed project (discontinued in 2010)
Presentation without a title, Neil Grindley
- how much does it cost to manage information; what institutional financial strategies are required to facilitate effective preservation; what general economic frameworks are required to enable information to persist and be accessible
- JISC 2010 Infrastructure for Education and Research Programme
- archival storage and preservation activities constitute a very small proportion of the overall costs: 1531 access, 55 outreach, acquisition, ingest
o approx. 333 Euros for a set of 1000 records
- KRDS2 p. 83: future tool development supporting automation of ingest
Sustainable Preservation in North America: ADPNet & Friends, Aaron Trehub
- solution: distributed digital preservation (in at least 3 copies vs. LOCKSS 6 copies)
- DDP + LOCKSS = PLN, open software developed at Stanford
- MetaArchive, COPPUL (Canada), ADPNet: www.adpn.org
The Audit and Certification of FDsys
- FDsys -auditem
TRAC httpwwwgpogovfdsys
- - TRAC hathi UNT Portico metaarchive chronopolis v
- stupnice compliancy 1- -
How Long is Long-Term Data Storage (Focal) Barry M Lunt Brigham Young University and Douglas Hansen Wayne Rust and Mark Worthington Millenniata Inc (USA)
- - FLASH technologie 10- plny elektron
odlivu elektronu a
-
a presto se objevily chyby
- Library 1 21 - Library 2 18
- 1050 years - HDD 1-7 years -
Quality Assurance of Digital Information in Long-Term Digital Preservation University of Technology (Sweden)
- significant properties - dokumentech z -
Towards Interoperable Preservation Repositories Repository Exchange Package Use Cases and Best Practices Joseph Pawletko New York University and Priscilla Caplan Florida Center for Library Automation (USA)
- TIPR
- - - TIPR
-
- RXP: METS and PREMIS semantics; metadata files not inside the METS but alongside it in
- - succession - - - - disaster recovery and in
as AIP - software migration - - diversification - data migration in -
SARKK Comprehensive Digital Archive Services for Finnish Municipalities
-Savon Tietohallinto Oy (Finland)
- firma kterou v ICT long-
- - - preserv planning funkcionalitou -
-
Magnetic Tape Technology economic advantages for preservation Gary Francis Oracle USA
- Oracle prezentace - - clipper group 2010 in search for the long-term archiving solution tape delivers significant
TCO advantage over disk - 5TB na 1 cartridge sunoracle
-----------------------------------------
Color In Digital Preservation Robert Buckley University of RochesterNewMarket Imaging Steven Puglia National Archives and Records Administration and Michael Stelmach Library of Congress (USA)
-
atd -
o barev GAMUTem
AdobeRGB Pro FotoRGB
- a reprodukci barvy
Multispectral Image Archiving of Watermarks in Historical Papers Peter Meinlschmidt Wilhelm-Klauditz-Institut Fraunhofer-Institute for Wood Research and Volker
Braunschweig (Germany)
- Reflected light - Transmited light
-spectral imaging -
o
o o o o
Implementing a Quality Assurance Program for Monitoring Scanner Performance Michael J Horsley and John T Berezich National Archives and Records Administration (USA) DAITSS
- -
mikrofilm sken a r - NARA 2004 Guidelines
httpsdocsgooglecomviewerurl=http3A2F2Fwwwarchivesgov2Fpreservation2Ftechnical2Fguidelinespdf
- Metamorfoze - etc., see slides - Quantitative performance slides - Web-based database, SharePoint zace -
Preservation in a Digital Age Jay Verkler FamilySearch (USA)
- Tessellou pro SDB budou k 1 - - - data loss is intrinsic gt - - - och
- preservation as a service
Curation of the End-of-Term Web Archive Classification and Metrics Kathleen Murray Lauren Ko and Mark Phillips University of North Texas (USA)
- - EOTCD archive httpresearchlibraryuntedueotcdwikiMain_Page - - for - 16 TB of data - - (not WARCs)
pak to pospojo
DAITSS Grows Up Migrating to a Second Generation Preservation System (Focal) Priscilla Caplan and Carol Chou Florida Center for Library Automation (USA)
- DAITSS 1 could not be installed elsewhere -) -
instructions
- v procesech
- - PSP
- refresh funkce - disseminate- do a refresh and export new AIP as DIP - withdraw remove AIP from storage retaining provenance
-
1
- - 80TB -
A Community Driven Micro Services Architecture Supporting Long Term Digital Preservation Mark Evans and Bill Steel Tessella Inc (USA) and Robert Sharpe James Carr Alan Gairey and Jonathan Tilbury Tessella plc (UK)
s NDIIPP v micro-services
spojeno do procesu- lze pak libovol
- - - -
-
workflow - -
- bit-streamu - cloudu (S3
httpawsamazoncoms3) SaaS funkcionalitu bude brzy ti tenancy
funkcionalitou policy apod
Notes from the USA
in 44 TB SIP in a test (10 MB JP2 files)
SDB community - - founded 2008 - -
System
Lisa LaPlant and Blake Edwards US Government Printing Office (USA) FDsys OAIS dig MODS - z
<part>2</part>
DMD data model definition z
httpwwwgpogovfdsyssearchpagedetailsactionst=pragueampgranuleId=amppackageId=DCPD-200900228
----------------------------------------- Preservation Starts from the Beginning Michael Wash US Department of Transportation (USA)
Autographic Kodak 1916 na druhou
following the No 3A Autographic Kodak Special of 1916 which was the first rangefinder camera It had a Kodak Anastigmat f63 lens and a Kodamatic shutter with speeds from 12 to 1200 sec plus bulb and time mode httpcamerapediawikiacomwikiNo_1A_Autographic_Kodak_Special Kodak Advantix
zastavil
Colorite A Flexible Cross-Platform Software Solution for Automatic Image QualityAnalysis Using Arbitrary Targets Henrik Johansson National Library of Sweden (Sweden)
- a - -
o Target se detekuje automa se aby tento proce
o TIFF JPEG JP2 PNG o o -of-art feature based image matching algoritm ImageMagic
o Alg
t o o v budoucnu vypnout aby se proces urychlil pro BATCH
proce o
Henrika Johanssona What if the Image Quality Analysis Rates My D Dietmar Wueller Image Engineering (Germany)
- Golden thread - UTT targer -
o
-lens reflex
Establishing Resolution Requirements for Digitizing Transmissive Content A Use Case Approach Michael Stelmach Library of Congress Don Williams Image Science Associates LLC and Steven Puglia National Archives and Records Administration (USA)
-
- Based on 10 SFR limiting resolution criteria how much the image information will be captured
o first half of the 20th century: 1200-1600 PPI o second half of the 20th century: up to 2800 PPI o o
Digitise More Pay Less Optimising the Workprocess for both Heritage Institute and Imaging Provider Olaf Slijkhuis Pictura Imaginis (the Netherlands)
- ument kvalita parametry atd - -
- trolu
obrazu
N discussion with FamilySearch, 20.5.2011 -------
also on Digital Preservation issues
will have to provide free of charge
and synchronize
an effort to cooperate with archives and libraries on their side
Business Trip Report
Project: Creation of the National Digital Library
CZ 10611000706386
Name and surname of the participant: Jan Hutař
Workplace according to the organizational structure: ODF 81
Workplace assignment: head of division
Reason for the trip: attending the iPRES 2011 conference
Place (city): Singapore
Place (country): Singapore
Date (from-to): 30.10.-5.11.2011
Detailed schedule: 30.-31.10. flight Prague > Dubai > Singapore
1.11. start of the conference, tutorials
2.11.-4.11. conference
4.-5.11. return flight Singapore > Dubai > Prague
Fellow travellers from the NK: Mgr. Marek Melichar (paid from project 0136)
Funding: IOP Creation of the National Digital Library
Relation to the project: gaining new information on digital preservation and on projects in other libraries; consultations with colleagues and companies
Trip objectives: see relation to the project; use all outputs for the planning and running of the NDK project, and for the future handling of digital preservation at the NK/NDK
Fulfilment of the trip objectives: fulfilled; see the detailed notes below and the proceedings on the SPS
Further details: SUMMARY AND BENEFIT FOR THE NDK PROJECT
- a noticeable rise of emulation as an approach to long-term preservation (in previous years it was migration) > the two approaches will apparently complement each other
- a shift towards the preservation of complex data, databases, etc.; the NK does not deal with this yet
- many contributions are also applicable to the NK and NDK (web archiving and preservation at the national library of France, audits, emulation, information on the SDB system (Tessella) and on the RODA system, certification (see Rouchon), etc.)
- information on problems and solutions when using tools such as JHOVE, PRONOM, etc. in a real environment
- 2 contributions on backing up optical discs, a current problem at the NK as well; ideally follow the procedures described in the proceedings
- a clear need for international cooperation and for adherence to standards, so that such cooperation is possible
- for details see below
Project publicity support: NA
Related materials
Material / Storage location
conference proceedings / SPS, folder with business trip reports
Date of report submission: 15.11.2011
Signature of the submitter:
Date / Signature
Signature of the supervisor: 15.11.2011
Posted on the intranet:
Received by the international department:
Seamus Ross: digital curation and preservation
- preserving data sets: use of statistical databases and research data vs. preservation of textual information
- personal data: no longer just photos in a box (Flickr etc.)
- banks: a lot of personal data, of future use to historians; it has to be kept, and institutions are doing so
- see mckinsey.com, big data full report (pdf)
- so why do DP (slide 16): future generations expect it; for historians and scientists, so that they have some sources; a legacy of the present for the future; information ecosystem to enable storytelling
- the emphasis is moving from the preservation of textual information to the preservation of complex databases
A capability model for DP, Ch. Becker et al.
- research within the SHAMAN and SCAPE projects
- SoS: systems of systems
- 3 kinds of systems (UML):
- DPS, with DP as a functional requirement
- SoS: a business system, a system within a system; the data then flows into the DPS
- a system where DP is not a functional requirement but which does it anyway (DP-ready system): a business system with DP functionality
- but how to get DP into enterprise systems: a model for implementing DP in any system, developed within the SHAMAN project (capability-based reference architecture)
- governance, business and technical (operation) capability: a basis for decisions and for assessing the current state
- capability maturity model (CMM): assessment and improvement processes, as in software development
Olivier Rouchon: certification and quality at CINES
- they store theses, digitized material, multimedia, documents, scientific data sets
- a data centre for the whole of France
- they have format experts (XML), 11 people
- 15 TB of data
- certification: by national law, CINES is the national centre for DP of theses; they have a department, people and money for it; for the procedure and preparations see below
- preparation for certification: testing with DRAMBORA, DSA, TRAC, ISO 16363 and ISO 16919
- step, 2009: DRAMBORA audit; 2 risk reviews per year, looking at how the risks are being resolved
- step: formalization of business processes, 14 processes according to ISO 9001: management, operational and support processes (presented at iPRES 2010)
- 2009: external pre-audit; 2 people, 19 man-days; based on all available standards, carried out through documentation review and interviews
- 2010: SIAF audit; 4 months; carried out by the National Archives of France for every archive that stores public data; the audit is done every 3 years; the report was 800 pages long
CO-FINANCED FROM THE EU STRUCTURAL FUNDS (EUROPEAN REGIONAL DEVELOPMENT FUND) THROUGH THE IOP
- 2010: Data Seal of Approval, part of the EU framework for audit and certification of trusted repositories (an MoU between the three certification initiatives).
- 2011: within the APARSEN project they also did ISO 16363; they went through that audit together with DANS and UKDA.
- First internal, January and April 2011 (60 man-days), then external with 12 experts (KB, BL, NASA etc.) in June 2011 (3 days).
- MoU: DSA > ISO 16363 as a second step (internal) > ISO 16363 extended.
- An audit gets faster the more of them you do, i.e. when it is done regularly it is not so time-consuming.
- The National Library of New Zealand passed TRAC certification in autumn 2011.
Andreas Rauber: The impact of preservation actions on repositories
- What happens to the repository itself.
- RepoSim, a repository simulation for analysing and testing migration: what happens when files grow, what happens when the repository holds more format types, etc.
- The RepoSim simulator is flexible and handles irregular patterns.
- So far an internal version: Hibernate, Java, MySQL.
- You can specify which formats the repository accepts and their description, ingest settings, hypothetical tools (mainly for migration), and rules for preservation actions (migration to which format and version, which files, how many times; rules + filters).
- A virtual migration can be run; it produces graphs showing how long it will take etc.
- What to migrate, how, to what and over what period all happens virtually, and we see the result.
- Good for IT and HW planning.
- Good for planning different scenarios, comparing them with the expected development, and planning HW growth and investments.
- Still to be done: the ability to enter deletion policies, reports etc.
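The idea of a "virtual migration" can be illustrated with a very small estimate. This is only a sketch under stated assumptions, not RepoSim itself (which the notes describe as a Java/Hibernate/MySQL tool): the class names `FormatHolding` and `MigrationRule`, the growth model and the throughput figure are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class FormatHolding:
    """Hypothetical description of one format held in a repository."""
    name: str
    file_count: int
    avg_size_mb: float
    yearly_growth: float  # 0.10 means the file count grows by 10% per year


@dataclass
class MigrationRule:
    """Hypothetical rule: migrate every file in `source` format to `target`."""
    source: str
    target: str
    throughput_mb_s: float  # assumed speed of an imagined migration tool


def simulate_migration(holdings, rule, years_ahead=0):
    """Estimate the duration (in hours) of a virtual migration.

    The holdings are first projected `years_ahead` years into the future,
    then only the files matching the rule's source format are counted.
    """
    total_mb = 0.0
    for holding in holdings:
        if holding.name != rule.source:
            continue  # filter: this rule does not apply to the format
        projected_files = holding.file_count * (1 + holding.yearly_growth) ** years_ahead
        total_mb += projected_files * holding.avg_size_mb
    return total_mb / rule.throughput_mb_s / 3600


if __name__ == "__main__":
    holdings = [
        FormatHolding("TIFF", 1000, 36.0, 0.10),
        FormatHolding("PDF", 500, 2.0, 0.05),
    ]
    rule = MigrationRule("TIFF", "JPEG2000", 10.0)
    for years in (0, 5):
        print(years, "years ahead:", round(simulate_migration(holdings, rule, years), 2), "hours")
```

Running the same rule against several projected years is the kind of scenario comparison the talk recommends for HW and investment planning.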
José Barateiro: Risk assessment in DP of e-science data and processes
- DP as risk management.
- ISO 31000: the definition of risk management, similar to DRAMBORA.
- There are many standards for risk management.
- The ISO 31000 methodology broken down into individual steps.
- TIMBUS project, http://timbusproject.net; one of the partners is SAP (Germany).
Mad talks
- Open source SW for LTP: RODA is back; it is being developed within the SCAPE project, with new functions, development plans and the start of a user community.
- 4 posters on emulation: emulation within KEEP, emulation for library reading rooms, the OPF ecosystem, registries.
- TOTEM: a metadata standard for describing the technical environment for emulation.
ANDS, Ross Wilkinson
- Data centres in Australia, at least 3, for different areas of life.
- ANDS has existed for almost 3 years, funded by the Australian government.
- A huge amount of data will never be used or read by a human, only mined by automated processes.
- Research data must be stored and preserved, because it may no longer be possible to recreate them, in such a way that they can be reused, new conclusions can be drawn from them, and researchers have them at hand.
- This has to be done in cooperation; it cannot be done by a single institution alone.
- Who handles the storage of scientific data in the Czech Republic? The Academy of Sciences, CESNET.
- Similar data centres exist in Great Britain as well.
Rob Sharpe: Considerations for High Throughput Digital Preservation
A presentation by the company Tessella on their performance testing of ingest into SDB at FamilySearch.
- SDB has been developed since 2002, when the first customer was the National Archives (UK).
- New customer: the UK Parliament.
- Test with FamilySearch.
- 20 TB ingested per day; scanned materials; a workflow with antivirus, characterization (PRONOM, JHOVE) etc.
- 1 package is roughly 1 GB, 20 thousand packages per day.
- 2 Dell PowerEdge R710 servers, total price at most 20,000 pounds.
- The limiting factor turned out to be the read speed of the disks on which the data sit at the start of ingest, so they needed 130 parallel disks (50 thousand pounds).
- Writing to tape was also slow, so they needed 8 parallel tape writers (30 thousand pounds).
- Storage costs 100 pounds per TB.
- 7.3 PB per year.
- Conclusion: writing and reading are slow; tools such as JHOVE and PRONOM are fast enough; storage costs also proved to be high.
For the ingest of data from the FamilySearch project they needed to guarantee a throughput of 20 TB of data per day while keeping adequate data-processing procedures as required by OAIS and by the customer. The aim of the project was to identify the bottlenecks in ingesting large volumes of data.
Processes such as generating or checking hashes, identifying formats and extracting technical metadata usually require a fast storage system when volumes are large. In the FamilySearch project they want to ingest into SDB (content acquisition, content preparation, ingest: fixity check, content metadata integrity check, characterization, i.e. format identification and validation and extraction of technical metadata) at most 700 MB per second.
The question is therefore how to parallelize such a massive workflow efficiently while minimizing cost. According to their findings, parallelization makes it possible to get around performance problems of tools such as DROID and JHOVE; contrary to their expectations, overall software performance was not a problem. The bigger problems lie in the HW, which has to be able to write fast enough.
That is, the bottleneck was in the HW and in moving data from place to place rather than in the performance of the digital preservation tools.
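The figures quoted in the talk can be checked with a few lines of arithmetic. The sketch below assumes decimal units (1 TB = 1,000,000 MB, 1 PB = 1,000 TB) and an even load over 24 hours; the function names are illustrative only. Under these assumptions 20 TB per day works out to roughly 231 MB/s sustained, which sits well below the stated 700 MB/s maximum, and to 7.3 PB per year.

```python
SECONDS_PER_DAY = 24 * 60 * 60


def sustained_rate_mb_s(tb_per_day):
    """Average rate in MB/s needed to move `tb_per_day` terabytes in 24 hours.

    Assumes decimal units: 1 TB = 1,000,000 MB.
    """
    return tb_per_day * 1_000_000 / SECONDS_PER_DAY


def yearly_volume_pb(tb_per_day, days=365):
    """Volume ingested per year in petabytes (1 PB = 1,000 TB)."""
    return tb_per_day * days / 1000


def yearly_storage_cost_gbp(tb_per_day, gbp_per_tb=100, days=365):
    """Cost of storing one year of ingest at a flat price per terabyte."""
    return tb_per_day * days * gbp_per_tb


if __name__ == "__main__":
    print("sustained rate MB/s:", round(sustained_rate_mb_s(20), 1))
    print("volume per year PB:", yearly_volume_pb(20))
    print("storage cost per year GBP:", yearly_storage_cost_gbp(20))
```

At the quoted 100 pounds per TB, one year of ingest at this rate costs considerably more to store than the servers, disks and tape writers listed above, which matches the conclusion that storage cost is significant.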
Benefit for the NK
There is no need to worry about the performance of software such as DROID or JHOVE.
Ross King: Evolving domains, problems and solutions for LT DP
- Information about projects such as SCAPE.
- Programme: http://cordis.europa.eu/fp7/ict/telearn-digicult/report-research-digital-preservation_en.pdf (Stephan Strodl, Petar Petrov and Andreas Rauber, Vienna University of Technology, Austria). A nice timeline of preservation projects; a white paper about the past of European DP.
- Funding spent on DP research is gradually growing. Projects and funding will not solve anything.
- ARCOMEM: web archiving, a socially driven web preservation model
- social web analysis
- archive enrichment
- ENSURE: evaluation between cost and value, automation of the preservation cycle; testbeds: healthcare, clinical trials, financial services
- SCAPE
- preservation planning and action workflows: how to make them scalable
- building an infrastructure for scalable preservation actions
- development of a policy-based preservation planning tool with automatic preservation watch
- 3 testbeds: web archives, large-scale repositories, research data sets
- All the projects will produce prototype SW.
- Digital lifecycle approach.
- Preservation planning plays a role in all these projects, together with virtualization.
- A slide with trends in DP over recent years.
- Research on Digital Preservation within projects co-funded by the European Union in the ICT
- ENSURE
- SCAPE
- Wf4Ever, http://www.wf4ever-project.org/about
- TIMBUS: software is not enough; it focuses on the context of organizations; LTP is not only about objects but also about services etc.
- TOTEM
Benefit for the NK
Follow projects in the area of long-term preservation of digital data. Recent EU projects such as SCAPE will speed up the development of concrete tools for long-term preservation of digital data.
Erik Borglund: Record keeping in temporary command settings
- Preservation of documentation on crisis situations arising from the work of the police etc.
- How to capture context: flipcharts, videos and minutes can be kept, but the context is the difficulty.
- Analogue documents are not a problem; the problem lies with digital material and conversations.
- The national archive should take care of this, but it accepts only paper documents or, for example, photos from the scene.
- Open question: is archiving the case files the same thing as archiving the course of the proceedings in digital form?
Web archiving session
BnF
- 200 TB of web-archived data, 15 million ARC files.
- They have to characterize and validate them, which is time-consuming.
- The data are stored in a shared repository.
- SPAR (the LTP system of the French national library) has a capacity of 16 PB.
- They use JHOVE2 for characterization and are building a module for ARC files.
- For the content of the ARC files they do not want to do characterization and validation, only format identification.
- PREMIS in METS would be too long, so they will record only metadata at the level of the information package (AIP) that are the same for the whole package; that is, a property is stated once and is followed only by the list of files it applies to, instead of repeating the information for every file.
- They created a special metadata format for this.
- They can therefore ask the LTP system "give me all information packages that contain format XY" and the like, without having to index the metadata of the contents, which would take a long time. They take the same approach for digitized books.
- Different DP policies and validation levels apply to different types of web archive data: complete harvests vs. thematic harvests.
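The package-level approach described for BnF (state a property once, then list the files it applies to) can be sketched as a simple grouping. This is an illustration only, not the BnF metadata format: the dictionary layout and the function names `group_by_property` and `packages_containing` are assumptions made for the example.

```python
from collections import defaultdict


def group_by_property(file_formats):
    """Turn {file: format} into {format: [files]}.

    Each format is recorded once per package, followed by the files it
    applies to, instead of repeating the format for every file.
    """
    grouped = defaultdict(list)
    for path, fmt in sorted(file_formats.items()):
        grouped[fmt].append(path)
    return dict(grouped)


def packages_containing(packages, fmt):
    """Return the ids of packages whose package-level metadata mention `fmt`."""
    return sorted(pid for pid, grouped in packages.items() if fmt in grouped)


if __name__ == "__main__":
    packages = {
        "aip-001": group_by_property({"a.html": "text/html", "b.html": "text/html", "c.gif": "image/gif"}),
        "aip-002": group_by_property({"d.pdf": "application/pdf"}),
    }
    print(packages["aip-001"])
    print(packages_containing(packages, "image/gif"))
```

The query only inspects the keys of each package record, which is why no index over the per-file metadata is needed.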
NL NZ
- 2 harvests, 20 TB in total.
- They are working out how much metadata is too much and how much is too little.
- Library policy says that as much metadata as possible must be stored, but that would be a problem in terms of metadata size.
- For selective web harvesting they have a finished workflow in WCT; everything is catalogued.
IA
- 16 billion URLs.
- The oldest date from 1996.
- Growth is 3 TB per day, 1 PB per year.
Euan Cochrane, Dirk von Suchodoletz: Replicating Installed Application and Information Environments onto Emulated or Virtualized Hardware
- Capturing and preserving a complete environment on emulated HW.
- For example, taking the prime minister's desktop environment and storing it in an archive.
- Problems with rendering.
- Computer forensics.
- An option for preserving scientific data and records.
- The whole thing is about how to replicate an HDD and run the environment on it in a virtual environment.
- Solution:
- they pulled the HDDs out of several old PCs > identify the HW requirements (HDD analysis > automatic estimate of requirements; this information is part of every PC environment) > choose emulation/virtualization SW (a tool registry such as TOTEM from the KEEP project) > convert the HDD into a disk image suitable for emulation > try to boot the disk image on the emulated HW > add drivers
- Problems with licences, personal data protection and authenticity (20% of things change, colours etc.).
- QEMU, SPARC processor emulator.
Klaus Rechert: Remote Emulation for Migration Services in a Distributed Preservation Framework
- Using emulation as a tool for migration.
- Migration tools are often not available for certain formats.
- The digital object is placed into an emulated environment (a virtual machine); we then see it in the environment of the emulated system, can open it in the original or another suitable application, save it in a different format and store it again in the virtual machine.
Bram Lohman: Emulation as a Business Solution: the Emulation Framework (KEEP project)
- Emulation framework: 7 emulators, 6 platforms (x86, Amiga and others), 23 file formats.
- A solution for managing emulation tools.
- Setup of emulation processes.
- An environment that contains emulators; if we load an application or a file into it, it should run as in the original environment.
- The environment also contains a tool that shows, based on PRONOM, what format a file is in and what environment is needed to run it; that environment can then be prepared directly and the file run in it.
- It loads SW images from the OPF application database, which is being built.
Geoffrey Brown: Developing Virtual CD-ROM Collections: The Voyager Company Publications
- Publications of one particular publisher on CD, interactive applications for Mac; of 200 published titles only about 50 are available today.
- Emulation on today's systems.
- An HDD snapshot directly in the emulator, i.e. it takes one click and is very fast.
- SheepShaver emulator.
Evaluation of a large Danish migration project
- Before 1998 the formats were not laid down by law.
- Between 2005 and 2008 they introduced standards.
- The evaluation concerns the standards that were set and the migration to them in the national archive.
- The evaluation was done for the funder.
- Between 2005 and 2008 they spent 30 person-years on migration; they had 10-15 people working on it; 190 thousand USD was invested; total cost 26 million USD.
- The amount of data actually migrated is not large, about 1777 GB.
- Various parts of the archive: tapes, population data, data on CD-R, registries and data filled in electronically.
- They could not read all the files, especially those on tape.
- 5 different types of tape.
- Some had to be rescued at great expense.
- Total cost of producing the preservation standard: 10 man-years, 12 thousand USD, including the manual and implementation recommendations.
- Pilot: project planning and management, and verification of the information packages.
- The aim of the pilot was to obtain a better budget and project plan.
- Some problematic data in old formats, such as old databases, need clever people doing repetitive work that takes a long time; they needed good knowledge management to make this efficient.
- Migration method: they wrote requirements for the tool and a description of how manual migration should be done.
- Data preparation (restructuring the data and registering the metadata of the IPs) and preparation of documentation for the migrated IPs.
- Software development: in-house.
- They needed 50 person-years for 1
Conclusions
- Migration of standard data is cheaper; migration from some standard tapes is cheaper, etc.
- Most of the cost (80%) went on non-standardized data, in building the migration software. Developing a tool for migrating heterogeneous or non-standard data is the most expensive part.
- What they learned: the old data had not been analysed sufficiently.
- Project management was loose; they lost money.
- Knowledge management: a good description of old data and of all their types, generations, locations etc. does not exist at our institution and we will have difficulties with it; migration of old data at the NK will be problematic.
Angela Dappert: Robust migration workflow for offline media
- What is an archival object (nice slide): a CD is not an archival object; for them it is a hand-held carrier. A bit-stable object is better, since it can have backups etc.; beyond that comes the archival object, which has further metadata (logical preservation).
- A CD is not searchable, cannot be replicated easily, has a large manual overhead, and its rendering technology becomes obsolete very quickly.
- The Endangered Archives project: optical disks, CD-R, external HDs, tapes; 67 terabytes in total.
- The offline hand-held carriers in the Endangered Archives project were very varied; they contained data with DRM, under copyright, and a number of similar problems.
- Options they chose between for each data source:
- a disk image, one file that contains everything on the carrier;
- or extraction of only some files.
- How important is the carrier itself? We need some information about it; it may hold traces of deleted data, which we may want to keep. They made disk images of everything possible: hybrid DVDs, audio discs that also held data, etc.
- Which disk image format should be used? Not one disk image format for all data, but a special disk image format for each kind.
- They did it with disk copying robots; large-scale disk copying robots sometimes could not be used, because they are good at producing CDs but not at ripping data from CDs.
- They built their own application with disk stacks and some smaller robots; they tried LIFO and FIFO and in the end used FIFO, because LIFO had problems with the CD lifter.
- At the KB they thought through a fairly complex workflow, how to describe it, etc.
- Each robot had its own PC.
- They had problems with a number of things (see the presentation).
- They did not find good software for managing the images, only command-line tools, with which non-technical staff would make a lot of mistakes.
- It takes a lot of people before the content gets online.
- Staff must be well trained and flexible, but also able to do tedious jobs; systematic and patient.
- NOTE: important for the NK, where transferring data from disks will also have to be dealt with, and already has been.
KEEP project, Antonio Ciuffreda: Towards an integrated migration environment
- Disk transfer tool: converts a disk into an image file. It also includes further metadata about the file system, the MD5 of the file, etc.
- KEEP is producing a Transfer Tool Framework.
- Magnetic media: disk transfer tools for floppy disks, commercial and open source.
- Disk2FDI: a commercial DOS tool; a very accurate image of the floppy disk, but it takes 1 hour and the result is very large, ten times larger than the floppy disk itself. About 2260 disks tested; emulation was tested.
- Catweasel: a commercial tool, a PCI card; runs on Linux and Windows XP, has a GUI. High error rate but faster; the quality of the image file was low.
- NibTools: a free tool, G64 and D64, covers only the C64; DOS, Windows, Linux, but command line only. It needs a Commodore disk drive and special cables. A few disks were tested; about half then did not work in the emulator.
- Optical media: they used 5 transfer tools, all on the same CDs, DVDs and games:
1. Alcohol 120%: commercial, can bypass DRM etc., supports Windows systems. 12 of 13 worked; 2 MB per second.
2. DAEMON Tools: commercial, several types of image files (ISO, MDS, MDF), supports Windows; 3 of 13 did not work.
3. CloneCD: commercial, uses IMG or ISO, bypasses SafeDisc 3 protection, supports Windows, has a GUI; 11 of 13 were OK.
4. BlindWrite: commercial, supports DVD, Blu-ray, WM, Xbox and other special discs. Generates ISO and some proprietary image formats; one did not work.
5. ImgBurn: does not read subchannel information into the image file (a film cannot be seeked, etc.); it is open source, generates DVD, BIN, CUE, IMG; Windows and Linux; 4 did not work; it is fast.
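The disk transfer step at the top of this list (disk to image file, with an MD5 checksum recorded alongside) can be sketched as follows. This is a minimal illustration, not the KEEP transfer tool: the function name `transfer_to_image` and the returned dictionary are assumptions, and it copies any readable file or block device byte for byte without handling copy protection, read errors or subchannel data.

```python
import hashlib


def transfer_to_image(source_path, image_path, chunk_size=1024 * 1024):
    """Copy a source (file or block device) into an image file.

    The MD5 digest is computed while copying, so the data are read only once.
    Returns a small metadata record describing the resulting image.
    """
    md5 = hashlib.md5()
    size = 0
    with open(source_path, "rb") as src, open(image_path, "wb") as dst:
        while True:
            chunk = src.read(chunk_size)
            if not chunk:
                break  # end of source reached
            dst.write(chunk)
            md5.update(chunk)
            size += len(chunk)
    return {"image": image_path, "size_bytes": size, "md5": md5.hexdigest()}
```

A record like the one returned here could be stored next to the image so that later fixity checks can recompute the digest and compare.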
Conclusions
For magnetic media, the performance of commercial and non-commercial tools does not differ; Disk2FDI is accurate but very slow. KEEP will use NibTools.
Optical media: copy protection has to be considered; the tools perform similarly; there will always be errors in the images, between 30 and 10 percent; BlindWrite can handle game discs, Xbox etc. KEEP will use ImgBurn because it is open source.
For complex images BlindWrite is better.
Benefit for the NK
Consider whether the NK should in fact run a project to migrate the content of CDs and DVDs to online media. The speakers presented concrete experience with robotic processing and showed the problems they had with designing the workflow, choosing the type of ISO image, etc.
BnF: Web archiving
They have three layers:
- Harvest definition (collection)
- Harvest instance (crawling metadata)
- ARC files
A collection is, for example, a selective one (the 2000 elections), followed by the individual harvest instances and then the ARC files. They collect logs, configuration and reports, and store these in ARC as well: a special metadata ARC for each crawling instance.
PREMIS
- Object, agent, event.
- Objects: 1. ARC files and metadata ARCs; 2. harvest instances.
- The harvest event in PREMIS: the event "creation of content files".
- Events: reports as extensions of the event (host report and harvest report).
- Agents: administrators, software, institutions and organizations that perform the harvest.
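The object/agent/event mapping in these notes can be pictured with a few small data classes. This is only a sketch of the relationships as noted here; the class and field names are hypothetical and do not follow the actual PREMIS schema or the BnF implementation.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Agent:
    name: str
    kind: str  # e.g. "administrator", "software", "institution"


@dataclass
class PremisObject:
    identifier: str
    kind: str  # "arc_file", "metadata_arc" or "harvest_instance"


@dataclass
class Event:
    event_type: str
    agents: List[Agent]
    objects: List[PremisObject]
    reports: List[str] = field(default_factory=list)  # e.g. host report, harvest report


def harvest_event(instance_id, arc_ids, agents, reports=()):
    """Build one 'creation of content files' event for a harvest instance."""
    objects = [PremisObject(instance_id, "harvest_instance")]
    objects += [PremisObject(arc_id, "arc_file") for arc_id in arc_ids]
    return Event("creation of content files", list(agents), objects, list(reports))


if __name__ == "__main__":
    event = harvest_event(
        "harvest-instance-1",
        ["part-0001.arc", "part-0002.arc"],
        [Agent("crawler", "software")],
        ["host report", "harvest report"],
    )
    print(event.event_type, len(event.objects), event.reports)
```

Attaching the reports to the event rather than to each ARC file mirrors the note that reports are treated as extensions of the event.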
containerMD
http://bibnum.bnf.fr/containerMD-v1/documentation/containerMD-v1.html
- Special metadata for material from the web archive.
http://bibnum.bnf.fr/containerMD-v1
- Different SLAs for different types of material, for different data from different harvests; a shared repository; different benefits are expected; skills for different formats do not have to be duplicated within the institution.
- Next year there should also be a JHOVE2 module for WARC.
Memento: meta search
Benefit for the NK
Their web archiving model could be used at the NK.
Cost models: Danish national library + TU Wien
- Stephan Strodl, TU Vienna: they have their own cost model, but apparently only for small-scale automated preservation action costs.
- The Danish national library built its own model, which should be universal and usable anywhere.
- They measured the cost of submission according to the PAIMAS standard.
- When calculating costs they use OAIS and PAIMAS: they map activities onto these models and estimate process costs accordingly.
- costmodelfordigitalpreservation.dk
Benefit for the NK
Relevant to project 0136, which dealt with ways of estimating the cost of long-term storage.
Meet RODA, a Full-Fledged Digital Repository for Long-Term Preservation
- Originally a project of the Portuguese national archive; we have been following it for several years. RODA is now supported, and partly also further developed, by an independent company. So far RODA supports only an archival metadata format (EAD), but further development should include library formats as well.
- RODA is now part of the SCAPE project, where it will be possible to develop the system further and scale it for use in mass production.
- http://redmine.keep.pt/projects/roda-public
Benefit for the NK
Follow further development, possibly also for the INCAD + KNAV projects; for LTP development at smaller institutions this could be an interesting alternative in the future.
Report on a business trip abroad
Name of the participant: Mgr. Andrea Fojtů (AF)
Department according to the organizational structure: 1.5 Department of Long-Term Preservation of Digital Data (ODODD)
Unit: 1.5.2 Department of Digital Repository Content Management
Reason for the trip: Participation in the conference Aligning National Approaches to Digital Preservation
Place (city): Tallinn
Place (country): Estonia
Dates (from-to): 22-26 May 2011
Detailed schedule:
Monday 23 May: conference registration, guided tour of the National Library, Keynote Address by Laura Campbell (Library of Congress, USA), Panel 1: Technical Alignment, Panel 2: Organizational Alignment
Tuesday 24 May: Keynote Address by Gunnar Sahlin (National Library of Sweden), Panel 3: Standards Alignment, Panel 4: Legal Alignment, Breakout Sessions for panels 3 & 4
Wednesday 25 May: Panel 5: Education Alignment, Panel 6: Economic Alignment, Breakout Sessions for panels 5 & 6, Synthesis, Closing remarks
Fellow travellers from the NK: PhDr. Bohdana Stoklasová (BS), Ing. Tomáš Svoboda (TS)
Funding: IOP NDK
Aims of the trip: Attendance at a conference with international participation; making contacts in the area of long-term preservation of digital documents and electronic legal deposit; a more detailed insight into the issues of long-term preservation of digital documents, (especially) in national libraries.
Fulfilment of the aims (specifically): The conclusions of the conference Aligning National Approaches to Digital Preservation largely mirror the conclusions of the workshop The Future of the Past: Shaping new visions for EU research in digital preservation (report available at http://cordis.europa.eu/fp7/ict/telearn-digicult/future-of-the-past_en.pdf), for example regarding the missing economic model for the commercial sector, which would treat long-term preservation as an integral part of all its processes. Contact was established with staff of the national libraries of Estonia and Finland (staff working on web archiving and on long-term preservation in general).
Programme and further details: The main aim of the conference was to align national approaches to long-term preservation of digital documents across all areas, from technical, organizational and educational to standardization, economic and financial.
Materials brought back: conference programme, leaflets of exhibiting companies (Tessella, Equella, Guardtime) + further materials, notes
Date of submission of the report: 8 June 2011
Signature of the submitter:
Signature of the superior:
Posted on the intranet:
Received by the international department:
Attachment to this report: Conference notes in English
Appendix: Conference notes in English
Exploring What We Can Do Together: Strategic Alliance for International Collaboration (Laura Campbell)
- 185 digital preservation partners in more than 25 countries (education, research, LAM)
- strategic goals: National Content Stewardship Network (national digital collection, technical architecture, public policy, outreach)
- NDIIPP Content Domain Map: a mind map of geospatial, audiovisual, image & text, and web content
- then & now: cognitive surplus vs digital libraries / digital preservation
- solution framework: actively working together, special interest groups, establishing a common index, international digital collection (freely available)
PANEL 1: Technical Alignment (The role of testing), Prof. Dr. Michael Seadle (Panel Head)
- to collaborate on requiring and implementing rigorous and independent tests
DNB Contribution to the Tallinn Alignment (Sabine Schrimpf)
- key theme is infrastructure
- a network of hardware and software that permits the operation of application software
- the question of interoperability is crucial (standards, technical specifications)
- SW elements
- components of the DP infrastructure were compared to pallets on the railways
  o source: PARSE.Insight Roadmap 2020
- persistent ID resolvers, certification process
- kopal (ingest: koLibRI)
- nestor (German Network of Expertise)
- DP4lib: Digital Preservation as a Service; reduce dependency between components
  o redundant storage at different locations
  o koLibRI modules
- LuKII: set up as an economical LOCKSS network in Germany
- SHAMAN
- APARSEN: wants to bring coherence and cohesion to digital preservation research
  o trends in DP research projects
  o modular DP systems
  o distributed as SOA
  o elimination of technology dependencies
EDINA: The UK LOCKSS Alliance (Adam Rusbridge)
- EDINA offers underlying technical support & coordination
- threats to digital stewardship
  o failure (media, HW, SW, network, format obsolescence, natural disaster, economic/organizational failure)
  o attack (insider/outsider)
  o operator error
- source: Requirement to Digital Preservation
- the projects PEPRS and PECAN help identify coverage and requirements for DP
Public testing (Michael Seadle)
- traditional physical archiving relies heavily on trusted institutions
- distrust, not trust, needs to be the basis of digital archiving; testing therefore plays a key role
- goals of testing: demonstrate functionality, reveal weaknesses, provide data for planning improvements
- key issues for testing: integrity, authenticity (can the origin or genuineness be shown?), usability (can migration/emulation be demonstrated?), access, financial integrity
- DrWho (drwho1com)
- bit stream testing is the most important; authenticity and usability may be impaired
  o the type of storage media, the number of copies + the frequency of checking and replacement get us to the relevant results
  o no reliable metrics exist, however (what is an acceptable loss etc.)
- without well documented, peer reviewed, publicly available test results, librarians are buying archiving systems on faith
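Bit stream testing across several copies can be pictured as a periodic checksum comparison. The sketch below is an assumption-laden illustration, not a method presented in the talk: it takes the copies as in-memory bytes keyed by a hypothetical location name, uses SHA-256, and treats the most common digest as the reference (with an even split, the digest seen first wins, so a real audit would need a trusted stored digest instead).

```python
import hashlib
from collections import Counter


def audit_copies(copies):
    """Compare replicated copies of one object by checksum.

    `copies` maps a location name to the bytes held there. Returns the
    majority digest and the sorted list of locations that disagree with it.
    """
    digests = {loc: hashlib.sha256(data).hexdigest() for loc, data in copies.items()}
    majority, _count = Counter(digests.values()).most_common(1)[0]
    damaged = sorted(loc for loc, digest in digests.items() if digest != majority)
    return majority, damaged


if __name__ == "__main__":
    copies = {
        "site-a": b"archived content",
        "site-b": b"archived content",
        "site-c": b"archived c0ntent",  # simulated bit rot
    }
    digest, damaged = audit_copies(copies)
    print(digest[:12], damaged)
```

How many copies to keep and how often to run such a check are exactly the parameters the talk says still lack agreed metrics.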
Presentation without a title (Andy Rauber)
- evaluation vs testing vs benchmarking
- in DP: evaluation rather than testing, and far from benchmarking (a few tests, but nowhere near a definition of benchmarks)
  o existing evaluations are not repeatable
  o focus on the simple things
  o building the frameworks before having clear test scenarios
- necessary to move towards comparative benchmarking
- what is needed: a commitment to a culture of benchmarking and comparative evaluation; an understanding of what we want to benchmark; benchmark data + ground truth; measurement scales and measures that remain constant; knowledge bases to collect these
Organizing digital preservation on an international level (Michelle Gallinger, NDIIPP)
- focus on a national DP agenda
- community driven, action oriented (National Digital Stewardship Alliance)
  o present a distributed national digital collection for the benefit of citizens
The European Research Arena (David Giaretta)
- technologies: GEANT, EGEE/EGI
- EU research projects: TIMBUS, BLOG4EVER, SCAPE, ENSURE, APARSEN, ARCOMEM, WF4EVER
  o SCIDIP-ES (2011-2014)
- Alliance for Permanent Access (APA) formed as a legal entity 3 years ago
  o opportunities for networking
- ISO 16363: Audit and Certification of Trustworthy Digital Repositories
- ISO 16919: Requirements for Bodies Providing Audit and Certification
Observation from the MetaArchive Cooperative program (Martin Halbert)
- distributed DP programs differ from other programs
  o replication of content, distribution of these replicated copies to distinct geographical locations, and a network organization to connect these replicated copies
- MetaArchive established in 2003, funded by NDIIPP
  o seeks to foster broader awareness of digital preservation issues
- IIPC members are all institutions that focus on WA
  o 39 members: national and university libraries + other organizations (Internet Archive)
  o ISO standard WARC format for web archives + Heritrix and NutchWAX
  o growing membership (Africa & South America)
DAY 2
Keynote address: International and National Collaboration in the Digital Age (Gunnar Sahlin)
- 2012: a new law for e-deposit
- Samsök (search together) in 2005 (upgrade, new system)
- Swepub and long-term preservation
- Consortium of the Swedish research libraries for licensing e-journals and databases (ICOLC)
- Open Access and e-publishing (all universities have their repositories for e-pubs)
- NL as aggregator for Europeana: TEL, Apres, Athena, EUscreen
  o common system for the preservation of digital materials
  o common search portal for materials from the Swedish National Library and the Swedish National Archive
Raivo Ruusalepp
- standards: RAC, DSA, CIDOC (CRM), PAIMAS, ISAD(G), DDI
- use of information security standards for digital preservation
- information security: administrative and technical (physical = data security vs IT = communication)
- a company implemented security measurement with typical cyber crime scenarios
- survey of security
  o provision for information security in national legislation and development plans (12 of the respondents; ISO 27000 series: only 2 formal audits, the rest are looking into it; ½ of the respondents do not use standards or formal measurements to control information security)
  o IT & disaster plan: 65 (data recovery from the off-site location tested: 0)
- alignment
  o better use of community standards for information security and preservation
  o agreement on security requirements
Standards-based approach to preservation planning (Matthew Woollard)
ISO 27001: very expensive implementation, 100 000 pounds
Basic: Data Seal of Approval Guidelines (helps you understand your business better)
Audit and Certification of Trustworthy Digital Repositories
Memorandum of Understanding to Create a European Framework
ISO 16363 external; DSA or ISO 16363 self-audit
Best Practices & Standards (Bram van der Werf)
self-assessment: trust
audits/certification: trust
ISO 30300 (draft) Record Management
PANEL 4 Legal Alignment
Legal deposit & Web Archiving (Adrienne Muir)
legal deposit provisions: purpose, scope, deposit mechanisms, roles & responsibilities & liability, access provisions, sanctions
implementation: definitions, scope (offline/online, freely available/pay wall), technology neutral, incremental
legal deposit vs voluntary (interim/hybrid approach, model agreements and licenses, flexibility)
other legal issues: intellectual property rights, preservation, access, unlawful material, privacy/data protection
voluntary approaches have disadvantages but may be necessary and can be useful
Breakout Session
standards bring along alignment by themselves, but only if you use them to the full, not halfway
depends on community (users): enforceable or voluntary compliance (standards)
next steps for standard alignment
o corpora as benchmarks
o export/import completeness
o educational standards
o validation tools
o accredited training courses to accredit auditors
o framework standards
DAY 3
PANEL 5 Educational Alignment
key elements of the DCC Curation Lifecycle Model
Framework for the Education Alignment (USA: grad programs for digital preservation, new models for grad programs, internship programs (diverse knowledge), workshops)
o sharing tools
o national programs
related issues to digital preservation
o nature of costs and business models
o strategies for selection & appraisal
o ground roles and responsibilities
o effectiveness and demand for services
focus of the panel: factors influencing the actual sustainability of a digital archive
2 considerations: collaboration + user demand
challenges + gaps: span national boundaries, public + private funding, education, exportation, DP certification, competition, funding gaps, policy, selection criteria, roles & responsibilities, standards
Magazzini Digitali and legal deposit in Italy
PADI: a failed project (discontinued in 2010)
Presentation without a title Neil Grindley
how much does it cost to manage information? what institutional financial strategies are required to facilitate effective preservation? what general economic frameworks are required to enable information to persist and be accessible?
JISC 2010 Infrastructure for Education and Research Programme
archival storage and preservation activities constitute a very small proportion of the overall costs: 1531 access 55 outreach acquisition ingest
o approx 333 Euros for a set of 1000 records
KRDS2 p 83: future tool development supporting automation of ingest
Sustainable Preservation in North America ADPNet amp Friends Aaron Trehub
solution: distributed digital preservation (in at least 3 copies vs LOCKSS 6 copies)
DDP + LOCKSS = PLN; open SW developed at Stanford
MetaArchive, COPPUL (Canada), ADPNet www.adpn.org
- TIPR
- RXP: METS and PREMIS semantics; metadata files not inside the METS but alongside it in
- succession
- disaster recovery and in
as AIP
- SW migration
- diversification
- data migration in
SARKK Comprehensive Digital Archive Services for Finnish Municipalities
-Savon Tietohallinto Oy (Finland)
- a company which in ICT long-
- preservation planning functionality
Magnetic Tape Technology economic advantages for preservation Gary Francis Oracle USA
- Oracle presentation
- Clipper Group 2010: In Search of the Long-Term Archiving Solution, Tape Delivers Significant TCO Advantage over Disk
- 5 TB per cartridge (Sun/Oracle)
-----------------------------------------
Color in Digital Preservation (Robert Buckley, University of Rochester/NewMarket Imaging; Steven Puglia, National Archives and Records Administration; and Michael Stelmach, Library of Congress (USA))
- etc.
o of colors by the GAMUT
AdobeRGB, ProPhoto RGB
- and color reproduction
Multispectral Image Archiving of Watermarks in Historical Papers (Peter Meinlschmidt, Wilhelm-Klauditz-Institut, Fraunhofer-Institute for Wood Research, and Volker
Braunschweig (Germany))
- Reflected light
- Transmitted light
- multispectral imaging
Implementing a Quality Assurance Program for Monitoring Scanner Performance Michael J Horsley and John T Berezich National Archives and Records Administration (USA) DAITSS
- microfilm, scan and r
- NARA 2004 Guidelines
https://docs.google.com/viewer?url=http%3A%2F%2Fwww.archives.gov%2Fpreservation%2Ftechnical%2Fguidelines.pdf
- Metamorfoze
- etc., see slides
- Quantitative performance slides
- Web-based database, SharePoint
Preservation in a Digital Age Jay Verkler FamilySearch (USA)
- by Tessella for SDB, will be at 1
- data loss is intrinsic >
- preservation as a service
Curation of the End-of-Term Web Archive Classification and Metrics Kathleen Murray Lauren Ko and Mark Phillips University of North Texas (USA)
- EOTCD archive http://research.library.unt.edu/eotcd/wiki/Main_Page
- 16 TB of data
- (not WARCs)
then joined it together
DAITSS Grows Up Migrating to a Second Generation Preservation System (Focal) Priscilla Caplan and Carol Chou Florida Center for Library Automation (USA)
- DAITSS 1 could not be installed anywhere else :-)
instructions
- in processes
- PSP
- refresh function
- disseminate: do a refresh and export new AIP as DIP
- withdraw: remove AIP from storage, retaining provenance
- 80 TB
A Community Driven Micro Services Architecture Supporting Long Term Digital Preservation Mark Evans and Bill Steel Tessella Inc (USA) and Robert Sharpe James Carr Alan Gairey and Jonathan Tilbury Tessella plc (UK)
with NDIIPP in micro-services
joined into a process; can then arbitrarily
workflow
- bit-stream
- cloud (S3 http://aws.amazon.com/s3), SaaS functionality, multi-tenancy coming soon
functionality, policy etc.
Notes from the USA
44 TB SIP in testing (10 MB JP2 files)
SDB community
- founded 2008
System
Lisa LaPlant and Blake Edwards, US Government Printing Office (USA)
FDsys, OAIS, dig MODS - from
<part>2</part>
DMD data model definition from
http://www.gpo.gov/fdsys/search/pagedetails.action?st=prague&granuleId=&packageId=DCPD-200900228
-----------------------------------------
Preservation Starts from the Beginning (Michael Wash, US Department of Transportation (USA))
Autographic Kodak 1916 squared
following the No 3A Autographic Kodak Special of 1916, which was the first rangefinder camera. It had a Kodak Anastigmat f/6.3 lens and a Kodamatic shutter with speeds from 1/2 to 1/200 sec plus bulb and time mode. http://camerapedia.wikia.com/wiki/No_1A_Autographic_Kodak_Special Kodak Advantix
stopped
Colorite: A Flexible Cross-Platform Software Solution for Automatic Image Quality Analysis Using Arbitrary Targets (Henrik Johansson, National Library of Sweden (Sweden))
o The target is detected automatically so that this process
o TIFF, JPEG, JP2, PNG
o state-of-the-art feature-based image matching algorithm, ImageMagick
o Alg
o can be switched off in the future to speed up the process for BATCH processing
of Henrik Johansson
What if the Image Quality Analysis Rates My D (Dietmar Wueller, Image Engineering (Germany))
- Golden thread
- UTT target
-lens reflex
Establishing Resolution Requirements for Digitizing Transmissive Content A Use Case Approach Michael Stelmach Library of Congress Don Williams Image Science Associates LLC and Steven Puglia National Archives and Records Administration (USA)
- Based on the 10% SFR limiting resolution criterion: how much of the image information will be captured
o first half of the 20th century: 1200-1600 PPI
o second half of the 20th century: up to 2800 PPI
Digitise More Pay Less Optimising the Workprocess for both Heritage Institute and Imaging Provider Olaf Slijkhuis Pictura Imaginis (the Netherlands)
- ument, quality, parameters, etc.
- control of the image
Discussion with FamilySearch, 20.5.2011 -------
also on Digital Preservation issues
will have to provide free of charge
and synchronize
an effort on their side to cooperate with archives and libraries
CO-FINANCED BY THE EU STRUCTURAL FUNDS (EUROPEAN REGIONAL DEVELOPMENT FUND) THROUGH THE IOP
Business trip report
Project: Creation of the National Digital Library
CZ 10611000706386
Name and surname of the traveller: Jan Hutař
Workplace according to the organizational structure: ODF 81
Position: head of department
Purpose of the trip: attendance at the iPRES 2011 conference
Place (city): Singapore
Place (country): Singapore
Date (from-to): 30.10. - 5.11.2011
Detailed schedule:
30.-31.10. flight Prague > Dubai > Singapore
1.11. start of the conference, tutorials
2.11. - 4.11. conference
4.-5.11. return flight Singapore > Dubai > Prague
Fellow travellers from NK: Mgr. Marek Melichar (funded from project 0136)
Funding: IOP, Creation of the National Digital Library
Relation to the project: gaining new information on digital preservation and on projects in other libraries; consultations with colleagues and companies
Trip objectives: see relation to the project; use all outputs for the planning and running of the NDK project and for future handling of digital preservation in NK/NDK
Fulfilment of trip objectives: fulfilled; see the detailed notes below and the proceedings at SPS
Further detailed information: SUMMARY AND BENEFIT TO THE NDK PROJECT
- noticeable rise of emulation as an approach to long-term preservation (in past years it was migration) > the two approaches look set to complement each other
- shift towards preservation of complex data (databases etc.); NK does not deal with this yet
- many contributions are usable for NK and NDK as well (web archiving and preservation at the national library of France, audits, emulation, information on the SDB system (Tessella) and on the RODA system, certification, see Rouchon, etc.)
- information on problems and solutions when using tools such as JHOVE, PRONOM and others in a real environment
- 2 contributions on backing up optical discs, a current problem at NK too; ideally follow the procedures described in the proceedings
- clear need for international cooperation and for adherence to standards so that such cooperation is possible
- for details see below
Support for project publicity: N/A
Related materials
Material / Storage location
conference proceedings / SPS, folder with business trip reports
Date report submitted: 15.11.2011
Signature of the person submitting the report
Date, signature
Supervisor's signature: 15.11.2011
Posted on the intranet
Received by the international department
Seamus Ross: digital curation and preservation
- preserving data sets: use of statistical databases and research data vs preservation of textual information
- personal data are no longer just photos in a box (Flickr etc.)
- banks: lots of personal data, future use by historians, must be preserved; institutions do this too
- see mckinsey.com, big data full report (PDF)
- why do DP then (slide 16): future generations expect it; for historians and scientists, so that they have some sources; a record of the present for the future; an information ecosystem to enable storytelling
- emphasis shifting from preservation of textual information to preservation of complex databases
A capability model for DP (Ch. Becker et al.)
- research within the SHAMAN and SCAPE projects
- SoS: systems of systems
- 3 kinds of systems (UML)
- DPS as a functional requirement
- SoS: a business system, a system within a system; data then flow into the DPS
- DPS where DP is not a functional requirement but the system does it anyway (DP-ready system): a business system with DP functionality
- but how to get DP into enterprise systems: a model for implementing DP in any system, within the SHAMAN project; capability-based reference architecture
- governance, business and technical (operation) capability: a basis for decisions and for assessing the current state
- capability maturity model (CMM): processes for assessment and improvement, as in SW development
Olivier Rouchon: certification and quality at CINES
- they store theses, digitized items, multimedia, documents, scientific data sets
- data centre for the whole of France
- they have experts on formats and XML; 11 people
- 15 TB of data
- certification: national law; CINES is the national centre for DP of theses; they have a department, people and money for it; for the procedure and preparations see below
- preparation for certification: testing DRAMBORA, DSA, TRAC, ISO 16363 and ISO 16919
- step: 2009 DRAMBORA audit; 2 risk reviews per year on how their mitigation is progressing
- step: formalization of business processes; 14 processes according to ISO 9001
- management, operational and support processes (presented at iPRES 2010)
- 2009 external pre-audit: 2 people, 19 man-days, based on all available standards, by means of documentation review and interviews
- 2010 SIAF audit: 4 months; carried out by the National Archives of France for every archive that stores public data; they audit every 3 years; the report had 800 pages
- 2010 Data Seal of Approval: part of the EU framework for audit and certification of trusted repositories (MoU between the three certification initiatives)
- 2011: within the APARSEN project they also did ISO 16363; they went through that audit together with DANS and UKDA
- first internal, January and April 2011 (60 man-days), then external, 12 experts (KB, BL, NASA etc.) in June 2011 (3 days)
MoU: DSA > ISO 16363 as the second step (internal) > ISO 16363 extended
the audit gets faster the more of them you do, i.e. if done regularly it is not so time-consuming
The National Library of New Zealand went through TRAC certification in autumn 2011
Andreas Rauber: impact of preservation actions on the repository
what happens to the repository itself
simulation of a repository: RepoSim
for analysis and testing of migration: what happens when files grow in size, what if the repository holds more format types, etc.
RepoSim: a flexible simulator, irregular patterns
internal version so far; Hibernate, Java, MySQL
one can specify which formats it accepts, their description, ingest settings, hypothetical tools (mainly for migration), and rules for preservation actions (migration to which format, which version, which files, how many times; rules + filters)
option to run a virtual migration; it produces graphs that show how long it will take etc.
what, how, to what and for how long to migrate: it runs virtually and we see the result
good for IT and HW planning
good for planning various scenarios, comparison with the expected development, planning HW expansion and investments
still to be finished: the option to enter deletion policies, reports, etc.
José Barateiro: Risk assessment in DP of e-science data and processes
DP as risk management
ISO 31000: definition of risk management
similar to DRAMBORA
there are many standards for risk management
elaboration of the ISO 31000 methodology into individual steps
TIMBUS project http://timbusproject.net; one of the partners is SAP (Germany)
mad talks
- open source SW for LTP: RODA is back, being developed within the SCAPE project; new features, development plans and the start of a user community
- 4 posters on emulation: emulation within KEEP, emulation for library reading rooms, OPF ecosystem, registry
- TOTEM: a metadata standard for describing the technical environment for emulation
ANDS (Ross Wilkinson)
- data centres in Australia, at least 3 for different areas of life
- ANDS has existed for almost 3 years; money from the Australian government
- huge amounts of data will never be used or read by a human, only mined by automated processes
- research data must be stored and preserved, because it may no longer be possible to recreate them; so that they can be reused, new conclusions can be drawn from them and scientists have them available
- must be done in cooperation; a single institution cannot do it alone
- who deals with storing scientific data in the Czech Republic? Academy of Sciences, CESNET
- similar data centres exist in Great Britain too
Rob Sharpe: Considerations for High Throughput Digital Preservation
Presentation by Tessella: their performance testing of ingest into SDB at FamilySearch
- SDB has been developed since 2002, when the first customer was the National Archives UK
- new customer: UK Parliament
- test with FamilySearch
- 20 TB ingest per day; scanned materials; workflow with antivirus, characterization (PRONOM, JHOVE), etc.
- 1 package is roughly 1 GB; 20 thousand packages per day
- 2 Dell PowerEdge R710 servers, total price max 20,000 pounds
- it turned out that the limiting factor is the read speed of the disks on which the data sit at the start of ingest; they therefore needed 130 parallel disks (50 thousand pounds)
- storing to tape is also slow; they therefore needed 8 parallel tape writes (30 thousand pounds)
- storage costs 100 pounds per TB
- 7.3 PB per year
- conclusion: writing and reading are slow; tools such as JHOVE and PRONOM are fast enough; storage costs also turned out to be high
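The figures in the list above can be cross-checked with some simple arithmetic. The following is a minimal sketch, assuming decimal units (1 TB = 1000 GB), which the notes do not state; the function names are illustrative only and come from neither Tessella nor FamilySearch.

```python
# Figures quoted in the notes above (FamilySearch ingest test).
TB_PER_DAY = 20          # target ingest volume per day
PACKAGE_GB = 1           # approximate size of one package
COST_GBP_PER_TB = 100    # quoted storage cost

# Assumption (not stated in the notes): decimal units,
# 1 TB = 1000 GB = 1,000,000 MB.
SECONDS_PER_DAY = 24 * 60 * 60


def sustained_mb_per_s(tb_per_day: float) -> float:
    """Average transfer rate needed to move tb_per_day within 24 hours."""
    return tb_per_day * 1_000_000 / SECONDS_PER_DAY


def packages_per_day(tb_per_day: float, package_gb: float) -> float:
    """Number of packages of package_gb that make up the daily volume."""
    return tb_per_day * 1000 / package_gb


def yearly_volume_pb(tb_per_day: float, days: int = 365) -> float:
    """Volume accumulated over a year, in PB."""
    return tb_per_day * days / 1000


def yearly_storage_cost_gbp(tb_per_day: float, cost_per_tb: float,
                            days: int = 365) -> float:
    """Cost of storing one year of ingest at a flat price per TB."""
    return tb_per_day * days * cost_per_tb


if __name__ == "__main__":
    print(f"{sustained_mb_per_s(TB_PER_DAY):.0f} MB/s sustained")
    print(f"{packages_per_day(TB_PER_DAY, PACKAGE_GB):.0f} packages per day")
    print(f"{yearly_volume_pb(TB_PER_DAY):.1f} PB per year")
    print(f"{yearly_storage_cost_gbp(TB_PER_DAY, COST_GBP_PER_TB):.0f} GBP per year")
```

Under these assumptions, 20 TB per day corresponds to 20 thousand packages per day and 7.3 PB per year, which matches the notes, and to an average rate of roughly 230 MB per second, below the maximum of 700 MB per second quoted in the notes.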
For the ingest of data from the FamilySearch project they needed to ensure a throughput of 20 TB of data per day while keeping adequate procedures for processing the data according to the requirements of OAIS and of the client. The aim of the project was to identify the bottlenecks in ingesting large volumes of data.
Processes such as generating hashes or checking them, identifying formats and extracting technical metadata usually need a fast storage system when volumes are large. In the FamilySearch project they want to ingest into SDB (content acquisition, content preparation, ingest, fixity check, content/metadata integrity check, characterization, i.e. identification and validation of formats and extraction of technical MD) at most 700 MB per second.
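As an illustration of the hash generation and checking step mentioned above, here is a minimal sketch using Python's standard `hashlib`. It is not the SDB implementation; the function names, the chunk size and the default algorithm are assumptions made for the example. Reading in chunks keeps memory use flat, so the cost is dominated by how fast the storage can deliver the bytes.

```python
import hashlib


def file_digest(path: str, algorithm: str = "sha256",
                chunk_size: int = 1024 * 1024) -> str:
    """Return the hex digest of a file, reading it in fixed-size chunks."""
    digest = hashlib.new(algorithm)
    with open(path, "rb") as handle:
        # iter() with a sentinel stops when read() returns an empty bytes object
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def fixity_ok(path: str, expected_hex: str, algorithm: str = "sha256") -> bool:
    """Compare a freshly computed digest with the one recorded at ingest."""
    return file_digest(path, algorithm) == expected_hex.lower()
```

A fixity check in this sense simply recomputes the digest later and compares it with the stored value, for example `fixity_ok("package/file.tif", recorded_digest)`, where both the path and `recorded_digest` are placeholders.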
In other words: how to parallelize such a massive workflow efficiently while minimizing costs. According to their findings, parallelization makes it possible to work around performance problems of tools such as DROID and JHOVE; contrary to their expectations, overall software performance was not a problem. The bigger problems are in the HW, which has to be able to write fast enough.
I.e. the bottleneck was in the HW and in moving data from place to place rather than in the performance of the digital preservation tools.
Benefit for NK
No need to be afraid of the performance of SW such as DROID or JHOVE.
Ross King: Evolving domains, problems and solutions for LT DP
- information on projects such as SCAPE
- programme http://cordis.europa.eu/fp7/ict/telearn-digicult/report-research-digital-preservation_en.pdf (Stephan Strodl, Vienna University of Technology, Austria; Petar Petrov, Vienna University of Technology, Austria; Andreas Rauber, Vienna University of Technology, Austria). A nice timeline of preservation projects; whitepaper about the past of European DP
- Funding spent on DP research is gradually growing. Projects and funding do not solve anything.
- ARCOMEM: web archiving, socially driven web preservation model
- social web analysis
- archive enrichment
- ENSURE: evaluation between cost and value, automation of the preservation cycle; testbeds: healthcare, clinical trials, financial services
- SCAPE
- preservation planning and action workflows: how to make them scalable
- creating an infrastructure for scalable preservation actions
- development of a policy-based preservation planning tool with automatic preservation watch
- 3 testbeds: WA, large-scale repositories, research data sets
- all the projects will produce prototype SW
- digital lifecycle approach
- preservation planning plays a role in all these projects, together with virtualization
- slide with trends in DP over recent years
- Research on Digital Preservation within projects co-funded by the European Union in the ICT
- Ensure
- Scape
- Wf4Ever http://www.wf4ever-project.org/about
- Timbus: SW is not enough; it focuses on the context of organizations; LTP is not only about objects but also about services etc.
- Totem
Benefit for NK
Follow projects in the area of long-term preservation of digital data. The latest EU projects such as SCAPE will speed up the development of concrete tools for long-term preservation of digital data.
Record keeping in temporary command settings (Erik Borglund)
preservation of documentation on crisis situations arising from the activity of the police etc.
how to capture context: flipcharts, videos and written records can be kept; the context of analogue documents is not a problem, the problem is with digital items and conversations
the national archive should take care of this, but it only takes paper documents or, for example, photos from the meeting site
question: archiving file material is the same as archiving the course of a meeting in digital form
webarchiving session
BnF
200 TB of web-archived data
15 million ARCs
they have to characterize and validate them, which is time-consuming
they store them in a shared repository
SPAR (the LTP system of the French national library) has a capacity of 16 PB
they use JHOVE2 for characterization and are building a module for ARCs
they do not want to characterize and validate the content of the ARCs, only identify formats
PREMIS in METS would be too long, so they will record only metadata at the level of the information package (AIP) that are the same for the whole package; i.e. a property is expressed once and then only the information about which files it applies to is added, instead of repeating that information for every file
they created a special metadata format
i.e. they are able to ask the LTP system "give me all information packages that contain format XY" etc.; there is no need to index the metadata of the contents, which would take a long time; they take the same approach for digitized books
different DP policies and levels of validation for different types of WA data: complete harvests vs thematic harvests
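The package-level approach described above (state a property once and list the files it applies to, rather than repeating it for every file) can be sketched as a simple grouping. This is only an illustration under stated assumptions: the data layout, the function names and the sample values are hypothetical and do not reproduce BnF's actual metadata format.

```python
from collections import defaultdict


def summarize_package(file_metadata):
    """Group per-file properties so each (property, value) is stated once.

    file_metadata maps a file name to a dict of property -> value.
    The result maps (property, value) to the sorted list of files it applies to.
    """
    grouped = defaultdict(list)
    for file_name, properties in file_metadata.items():
        for prop, value in properties.items():
            grouped[(prop, value)].append(file_name)
    return {key: sorted(files) for key, files in grouped.items()}


def packages_with(summaries, prop, value):
    """Return ids of packages whose summary mentions the given property value."""
    return sorted(pid for pid, summary in summaries.items()
                  if (prop, value) in summary)


# Hypothetical sample data, for illustration only.
FILES = {
    "page1.html": {"format": "text/html"},
    "page2.html": {"format": "text/html"},
    "logo.png": {"format": "image/png"},
}

if __name__ == "__main__":
    print(summarize_package(FILES))
```

With summaries like these, a query such as "all packages that contain format XY" only needs to look at the package-level records, which is the property the notes highlight.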
NL NZ
2 harvests, 20 TB in total
they are dealing with metadata: how much metadata is a lot and how much is too little
library policy says that as much metadata as possible must be stored; that would however be a problem in terms of the size of the metadata
for selective web harvesting they have a finished workflow, WCT; everything is catalogued
IA
16 billion URLs
the oldest from 1996
growth is 3 TB per day, 1 PB per year
Euan Cochrane, Dirk von Suchodoletz: Replicating Installed Application and Information Environments onto Emulated or Virtualized Hardware
- capturing and preserving a complete environment on emulated HW
- e.g. take the desktop environment of the prime minister and store it in an archive
- problems with rendering
- computer forensics
- an option for preserving scientific data and records
- it is all about how to replicate an HDD and run the environment on it in a virtual environment
- solution
- they pulled the HDDs out of several old PCs > identify the HW requirements (analysis of the HDD > requirements estimated automatically; it is part of every PC environment) > choose emulation/virtualization SW (tool registry such as TOTEM from the KEEP project) > convert the HDD into a disk image suitable for emulation > try to boot the disk image on emulated HW > add drivers
- problems with licences, personal data protection, authenticity (20% of things change, colors etc.)
- QEMU, SPARC processor emulator
Klaus Rechert: Remote Emulation for Migration Services in a Distributed Preservation Framework
use of emulation as a tool for migration
often no tools are available for migrating certain formats
We put the digital object into the emulated environment (virtual machine); we then see it in the environment of the emulated system, can open it in the original or another suitable application, save it in a different format and store it in the virtual machine again
Bram Lohman: Emulation as a Business Solution: the Emulation Framework (KEEP project)
emulation framework: 7 emulators, 6 platforms (x86, Amiga, etc.), 23 file formats
a solution for managing emulation tools
setup of emulation processes
an environment that contains emulators; if we load an application or file into it, it should run as in the original environment
the environment also contains a tool that shows what format a file is and what environment is needed to run it, based on PRONOM; the environment can then be prepared right away and the file run in it; it loads the SW image from the OPF application database, which is being built
Geoffrey Brown: Developing Virtual CD-ROM Collections: The Voyager Company Publications
- publications of one particular publisher on CD, interactive applications for Mac; of the 200 published, only about 50 are available today
- emulation on today's systems
- HDD snapshot directly in the emulator, i.e. it takes one click and is very fast
- SheepShaver emulator
Evaluation of a large Danish migration project
Before 1998 they had no formats set by law
Between 2005 and 2008 they introduced standards
The evaluation concerns the set standards and migration to them in the national archive
The evaluation was done for the funder
Between 2005 and 2008 they spent 30 person-years on migration; they had 10-15 people on it; 190 thousand USD was invested; total costs 26 million USD
It is not much data: what they actually migrated was about 1777 GB
Various parts of the archive: tapes, population data, data on CD-R, registries and data electronically
They could not read all the files, especially on tapes
5 different types of tape
Some had to be rescued at great expense
Total cost of producing the preservation standard: 10 man-years, 12 thousand USD, including the manual and implementation recommendations
Pilot: project planning and management, and verifying the information packages
The aim of the pilot was to obtain a better budget and project plan
Some problematic data in old formats, such as old databases, need clever people who do repetitive, long-running work; they needed good knowledge management to make it efficient
Method of migration: they wrote requirements for a tool and a description of how manual migration should be done
Data preparation (restructure data and register metadata of IP) and preparation of documentation for the migrated IPs
Software development: in-house development
They needed 50 person-years for 1
Conclusions
migration of standard data is cheaper, migration from some standard tapes is cheaper, etc.
Most of the costs (80%) went on non-standardized data, in producing the migration software. Developing a tool for migrating heterogeneous or non-standard data is the most expensive part
What they learned: the old data had not been analysed sufficiently
Project management was loose; they lost money
Knowledge management: a good description of old data and all their types, generations, locations etc. does not exist at our institution and we will have difficulties with it; migration of old data at NK will be problematic
Angela Dappert: robust migration workflow for offline media
- What is an archival object (nice slide): a CD is not an archival object; for them it is a hand-held carrier; better is a bit-stable object, which can have a backup etc., leading to an archival object that has further metadata and logical preservation
- A CD is not searchable, cannot be easily replicated and has a large manual overhead; rendering technology becomes obsolete very quickly
- Endangered Archives project: optical disks, CD-R, external HDs, tapes; 67 terabytes in total
- OFFLINE hand-held carriers in the Endangered Archives project were very variable; they contained data with DRM, under copyright, and a range of such problems
- Options they chose between for each data source
- Disk image: a single file that contains everything on the carrier
- Or extraction of only some files
- How important is the carrier itself? We need to have some information about it; there may be traces of deleted data and we may want to have them. They made disk images of everything possible: hybrid DVDs, audio discs that also held data, etc.
- Which disk image should they use? Not a single disk image format for all data; a special disk image format for each
- They did it with robots (disk copying robots); sometimes large-scale disk copying robots could not be used: they are good at producing CDs but not at ripping data from CDs
- They built their own application with disk stacks and some smaller robots; these use LIFO or FIFO; in the end they used FIFO, as LIFO had problems with the CD lifter
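The difference between the two stack policies mentioned above is only the order in which discs leave the input stack. A minimal sketch of that ordering, assuming discs are loaded in a known sequence; it models nothing about the actual robot hardware, and the function name and disc labels are hypothetical.

```python
from collections import deque


def process_order(discs, policy):
    """Return the order in which discs are taken from the input stack.

    FIFO takes the disc that was loaded first, LIFO the one loaded last.
    """
    if policy not in ("FIFO", "LIFO"):
        raise ValueError(f"unknown policy: {policy}")
    stack = deque(discs)
    order = []
    while stack:
        # popleft() removes the oldest entry, pop() the newest one
        order.append(stack.popleft() if policy == "FIFO" else stack.pop())
    return order


if __name__ == "__main__":
    loaded = ["disc-1", "disc-2", "disc-3"]  # hypothetical labels
    print(process_order(loaded, "FIFO"))
    print(process_order(loaded, "LIFO"))
```

With FIFO the output order matches the loading order, which makes it easier to keep a handwritten or scanned loading list aligned with the resulting image files.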
At KB they thought through a fairly complex workflow, how to describe it, etc.
Each robot had its own PC
They had problems with a number of things, see the presentation
They did not find good SW for managing images, only command-line tools, and non-technical staff would make a lot of mistakes with those
It takes a lot of people before the content gets online
They must be well trained and flexible, but also able to do tedious jobs: systematic, patient
NOTE: important for NK, where the transfer of data from discs will also have to be dealt with, and has already been dealt with
KEEP project, Antonio Ciuffreda: towards an integrated migration environment
- Disk transfer tool: converts a disk into an image file. It also contains further metadata about the file system, an MD5 of the file, etc.
- KEEP is building a Transfer Tool Framework
- Magnetic media: disk transfer tools for floppy disks, commercial and open source
- Disk2FDI: commercial DOS tool; makes a very accurate image of a floppy disk; it takes 1 hour and the result is very large, ten times bigger than the floppy disk itself. Tested on about 2260 disks; they tested emulation
- Catweasel: commercial tool, a PCI card; runs on Linux and Win XP, has a GUI. High error rate but faster; the quality of the image file was low
- Nibtools: free tool; G64 and D64, covers only C64; DOS, Win, Linux, but command line only. Needs a Commodore disk drive and special cables. They tested a few disks; about half of them then did not work in the emulator
- Optical media: they used 5 transfer tools, all on the same CDs, DVDs and games
1 Alcohol 120%: commercial; can bypass DRM etc.; supports Win systems. 12 of 13 worked; 2 MB per second
2 Daemon Tools: commercial; several types of image file (ISO, MDS, MDF); supports Win; three of 13 did not work
3 CloneCD: commercial; uses IMG or ISO; bypasses SafeDisc 3 protection; supports Win; has a GUI; 11 of 13 were OK
4 BlindWrite: commercial; supports DVD, Blu-ray, WM, Xbox and other special discs. Generates ISO and some proprietary ISO image formats; one did not work; download speed
5 ImgBurn: does not read subchannel information into the image file (it is not possible to seek in a film etc.); open source; generates DVD, BIN, CUE, IMG; Win and Linux; 4 did not work; it is fast
Conclusions
For magnetic media, commercial and non-commercial tools do not differ in performance; Disk2FDI is accurate but very slow. KEEP will use NibTools.
Optical media: copy protection has to be taken into account. The tools perform similarly; there will always be errors in the images, between 30 and 10 percent. BlindWrite can handle game discs (Xbox etc.). KEEP will use ImgBurn because it is open source.
For complex images BlindWrite is better.
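The notes above mention that the transfer tool stores an MD5 of the image file next to the image. A minimal Python sketch of how such a checksum could be computed and later re-checked; the function names and the chunk size are illustrative assumptions and are not taken from the KEEP framework.

```python
import hashlib


def md5_of_image(path, chunk_size=1024 * 1024):
    """Return the hex MD5 digest of a disk image, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read fixed-size chunks so that large images do not fill memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_image(path, expected_md5):
    """Return True when the image still matches the stored checksum."""
    return md5_of_image(path) == expected_md5.lower()
```

Storing the digest at transfer time and calling `verify_image` after every copy is one simple way to notice a damaged image before the original disc is put away.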
Benefit for the NK
Consider whether the NK should actually run a project to migrate the content of CDs and DVDs to online media. The presenters report concrete experience with robotic processing and show the problems they had with designing the workflow, choosing the type of ISO image, etc.
BnF web archiving
They have three layers:
Harvest definition: collection
Harvest instance: crawling metadata
ARC files
A collection would be, for example, the selective "elections 2000" collection, followed by the individual harvest instances and then the ARCs. They collect logs, configuration and reports. These are also stored in ARC, as special metadata ARCs for each crawling instance.
PREMIS
Object, agent, event
Objects: 1. ARC files and metadata ARCs; 2. harvest instances
Harvest event in PREMIS: event "creation of content files"
Events: reports as extensions of the event (host report and harvest report)
Agents: administrator, software, institutions and organizations that perform the harvest
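The object, event and agent mapping noted above can be sketched as plain data structures. This is only an illustration of the relationships in these notes, not the BnF implementation and not the PREMIS schema itself; all class names, field names and identifiers are hypothetical.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Agent:
    identifier: str
    kind: str  # e.g. "administrator", "software", "organization"


@dataclass
class PreservedObject:
    identifier: str
    kind: str  # "arc-file", "metadata-arc" or "harvest-instance"


@dataclass
class Event:
    event_type: str
    objects: List[PreservedObject] = field(default_factory=list)
    agents: List[Agent] = field(default_factory=list)
    reports: List[str] = field(default_factory=list)  # event extensions


def harvest_event(instance_id, arc_ids, crawler_name):
    """Build a 'creation of content files' event for one harvest instance."""
    objects = [PreservedObject(instance_id, "harvest-instance")]
    objects += [PreservedObject(arc_id, "arc-file") for arc_id in arc_ids]
    return Event(
        event_type="creation of content files",
        objects=objects,
        agents=[Agent(crawler_name, "software")],
        reports=["host report", "harvest report"],
    )
```

For example, `harvest_event("harvest-001", ["a.arc", "b.arc"], "crawler")` yields one event that links the harvest instance, its two ARC files, the crawling software and the two reports.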
containerMD
http://bibnum.bnf.fr/containerMD-v1/documentation/containerMD-v1.html
Special metadata for items from the web archive
http://bibnum.bnf.fr/containerMD-v1
Different SLAs for different types of material and for data from different harvests; shared repository; different benefits are expected; skills for different formats need not be duplicated within the institution.
Next year there should also be a JHOVE2 module for WARC.
Memento: meta search
Benefit for the NK
Their web archiving model could be used at the NK.
Cost models: Danish National Library + TU Wien
- Stephan Strodl, TU Wien: they have their own cost model, but apparently only for the cost of small-scale automated preservation actions.
- The Danish National Library built its own model, which is meant to be universal and usable anywhere.
- They measured the cost of submission according to the PAIMAS standard.
- When calculating costs they use OAIS and PAIMAS: they map activities onto these models and then estimate the cost of the processes accordingly.
- costmodelfordigitalpreservation.dk
Benefit for the NK
Relevant to project 0136, which dealt with options for estimating the costs of long-term storage.
Meet RODA, a Full-Fledged Digital Repository for Long-Term Preservation
- Originally a project of the Portuguese National Archives, which we have been following for several years. RODA is now supported, and partly also further developed, by an independent company. So far RODA supports only the archival metadata format (EAD), but further development should include library formats as well.
- RODA is now part of the SCAPE project, where the system can be developed further and scaled for use in mass production.
- http://redmine.keep.pt/projects/roda-public
Benefit for the NK
Follow further development; possibly also relevant for the INCAD + KNAV projects on LTP development. For smaller institutions this could be an interesting alternative in the future.
Report on a foreign business trip
Name and surname of the participant: Mgr. Andrea Fojtů (AF)
Workplace according to the organizational structure: 1.5 Department of Long-Term Preservation of Digital Data (ODODD)
Workplace assignment: 1.5.2 Digital Repository Content Management Department
Reason for the trip: participation in the conference Aligning National Approaches to Digital Preservation
Place (city): Tallinn
Place (country): Estonia
Date (from-to): 22-26 May 2011
Detailed schedule:
Monday 23 May: registration for the conference; guided tour of the National Library; Keynote Address by Laura Campbell, Library of Congress (USA); Panel 1: Technical Alignment; Panel 2: Organizational Alignment
Tuesday 24 May: Keynote Address by Gunnar Sahlin, National Library of Sweden; Panel 3: Standards Alignment; Panel 4: Legal Alignment; Breakout Sessions for panels 3 & 4
Wednesday 25 May: Panel 5: Education Alignment; Panel 6: Economic Alignment; Breakout Sessions for panels 5 & 6; Synthesis; Closing remarks
Fellow travellers from the NK: PhDr. Bohdana Stoklasová (BS), Ing. Tomáš Svoboda (TS)
Funding: IOP NDK
Trip objectives: attendance at a conference with international participation; obtaining contacts in the area of long-term preservation of digital documents and electronic legal deposit; a more detailed insight into the issues of long-term preservation of digital documents, (especially) in national libraries
Fulfilment of the trip objectives (specifically): The conclusions of the conference Aligning National Approaches to Digital Preservation largely mirror the conclusions of the workshop The Future of the Past: Shaping new visions for EU research in digital preservation (report available at http://cordis.europa.eu/fp7/ict/telearn-digicult/future-of-the-past_en.pdf), for example regarding the missing economic model for the commercial sector, one that would treat long-term preservation as an integral part of all its processes. Contact was established with staff of the national libraries of Estonia and Finland (staff responsible for web archiving and long-term preservation in general).
Programme and further details: The main goal of the conference was to align national approaches to the long-term preservation of digital documents across all areas, from technical, organizational and educational to standardization, economic and financial.
Materials brought back: conference programme, leaflets of exhibiting companies (Tessella, Equella, Guardtime) + other materials, notes
Date of submission of the report: 8 June 2011
Signature of the submitter:
Signature of the supervisor:
Posted on the intranet:
Received by the International Department:
Attachment to this report: Notes from the conference, in English
Attachment: Notes from the conference, in English
Exploring What We Can Do Together: Strategic Alliance for International Collaboration (Laura Campbell)
- 185 digital preservation partners in more than 25 countries (education, research, LAM)
- strategic goals: National Content Stewardship Network (national digital collection, technical architecture, public policy, outreach)
- NDIIPP Content Domain Map: a mind map of geospatial, audiovisual, image & text, and web content
- then & now: cognitive surplus vs digital libraries/digital preservation
- solution framework: actively working together, special interest groups, establishing a common index, international digital collection (freely available)
PANEL 1: Technical Alignment (The role of testing), Prof. Dr. Michael Seadle (Panel Head)
- to collaborate on requiring and implementing rigorous and independent tests
DNB Contribution to the Tallinn Alignment (Sabine Schrimpf)
- key theme is infrastructure
- a network of hardware and software that permits the operation of application SW
- the question of interoperability is crucial (standards, technical specifications)
- SW elements
- components of the DP infrastructure were compared to the pallets at railroads
o Source: PARSE.Insight Roadmap 2020
- PersistentID resolvers, certification process
- kopal (ingest, koLibRI)
- nestor (German Network of Expertise)
- DP4Lib: Digital Preservation as a Service; reduce dependency between components
o redundant storage at different locations
o koLibRI modules
- LUKII: set up as an economical LOCKSS network in Germany
- SHAMAN
- APARSEN: wants to bring coherence and cohesion to digital preservation research
o trends in DP research projects
o modular DP systems
o distributed as SOA
o elimination of technology dependencies
EDINA: The UK LOCKSS Alliance (Adam Rusbridge)
- EDINA offers underlying technical support & coordination
- threats to digital stewardship
o failure (media, HW, SW, network, format obsolescence, natural disaster, economic/organizational failure)
o attack (insider/outsider)
o operator error
- source: Requirement to Digital Preservation
- projects PEPRS and PECAN help identify coverage and requirements for DP
Public testing (Michael Seadle)
- traditional physical archiving relies heavily on trusted institutions
- distrust, not trust, needs to be the basis of digital archiving; testing therefore plays a key role
- goals of testing: demonstrate functionality, reveal weaknesses, provide data for planning improvements
- key issues for testing: integrity; authenticity (can the origin or genuineness be shown?); usability (can migration/emulation be demonstrated?); access; financial integrity
- DrWho (drwho1.com)
- bit stream testing is the most important; authenticity and usability may be impaired
o the type of storage media, the number of copies + the frequency of checking and replacement get us to the relevant results
o no reliable metrics exist, however (what is an acceptable loss, etc.)
- without well-documented, peer-reviewed, publicly available test results, librarians are buying archiving systems on faith
Presentation without a title (Andy Rauber)
- evaluation vs testing vs benchmarking
- DP testing: evaluation rather than testing, far from benchmarking (a few tests, but nowhere near a definition of benchmarks)
o existing evaluations are not repeatable
o focus on the simple things
o building the frameworks before having clear test scenarios
- necessary to move towards comparative benchmarking
- what is needed: a commitment that we want a culture of benchmarking and comparative evaluation; an understanding of what we want to benchmark; benchmark data + ground truth; measurement scales and measures that remain constant; knowledge bases to collect these
Organizing digital preservation on an international level (Michelle Gallinger, NDIIPP)
- focus on a national DP agenda
- community driven, action oriented (National Digital Stewardship Alliance)
o present a distributed national digital collection for the benefit of citizens
The European Research Arena (David Giaretta)
- technologies: GEANT, EGEE/EGI
- EU research projects: TIMBUS, BLOG4EVER, SCAPE, ENSURE, APARSEN, ARCOMEM, WF4EVER
o SCIDIP-ES (2011-2014)
- Alliance for Permanent Access (APA): formed as a legal entity 3 years ago
o opportunities for networking
- ISO 16363: Audit and Certification of Trustworthy Digital Repositories
- ISO 16919: Requirements for Bodies Providing Audit and Certification
Observation from the MetaArchive Cooperative program (Martin Halbert)
- distributed DP programs are different from other programs
o replication of content, distribution of these replicated copies to distinct geographical locations, and a network organization to connect these replicated copies
- MetaArchive: established in 2003, funded by NDIIPP
o seeks to foster broader awareness of digital preservation issues
- IIPC: the members are all institutions that focus on WA
o 39 members: national and university libraries + other organizations (Internet Archive)
o ISO standard WARC format for web archives + Heritrix and NutchWAX
o growing membership (Africa & South America)
DAY 2
Keynote address: International and National Collaboration in the Digital Age (Gunnar Sahlin)
- 2012: a new law for e-deposit
- Samsök (search together) in 2005 (upgrade, new system)
- Swepub and long-term preservation
- Consortium of the Swedish research libraries for licensing e-journals and databases (ICOLC)
- Open Access and e-publishing (all universities have their repositories for e-pubs)
- NL as aggregator for Europeana: TEL, Apres, Athena, EUscreen
o common system for the preservation of digital materials
o common search portal for materials from the Swedish National Library and the Swedish National Archive
Raivo Ruusalepp
- standards: RAC, DSA, CIDOC (CRM), PAIMAS, ISAD(G), DDI
- use of information security standards for digital preservation
- information security: administrative and technical (physical = data security vs IT = communication)
- company-implemented security measurement with typical cyber crime scenarios
- survey of security
o provision for information security in national legislation and development plans (12 of the respondents: ISO 27000 series; only 2 formal audits, the rest are looking into it; ½ of the respondents do not use standards or formal measurements to control information security)
o IT & disaster plan: 65 (data recovery from the off-site location tested: 0)
- alignment
o better use of community standards for information security and preservation
o agreement on security requirements
Standards-based approach to preservation planning (Matthew Woollard)
- ISO 27001: very expensive implementation, 100 000 pounds
- Basic: Data Seal of Approval Guidelines (helps understand your business better)
- Audit and Certification of Trustworthy Digital Repositories
- Memorandum of Understanding to Create a European Framework
- ISO 16363 external; DSA or ISO 16363 self-audit
Best Practices & Standards (Bram van der Werf)
- self-assessment trust; audits/certification trust
- ISO 30300 (draft): Record Management
PANEL 4: Legal Alignment
Legal deposit & Web Archiving (Adrienne Muir)
- legal deposit provisions: purpose, scope, deposit mechanisms, roles & responsibilities & liability, access provisions, sanctions
- implementation: definitions, scope (offline/online, freely available/pay wall), technology neutral, incremental
- legal deposit vs voluntary (interim/hybrid approach, model agreements and licenses, flexibility)
- other legal issues: intellectual property rights, preservation, access, unlawful material, privacy/data protection
- voluntary approaches have disadvantages but may be necessary and can be useful
Breakout Session
- standards bring along alignment by themselves, but only if you use them to the full, not halfway
- depends on the community (users): enforceable or voluntary compliance (standards)
- next steps for standards alignment
o corpora as benchmarks
o export/import completeness
o educational standards
o validation tools
o accredited training courses to accredit auditors
o framework standards
DAY 3
PANEL 5: Educational Alignment
- key elements of the DCC Curation Lifecycle Model
- Framework for the Education Alignment (USA: grad programs for digital preservation, new models for grad programs, internship programs (diverse knowledge), workshops)
o sharing tools
o national programs
- related issues to digital preservation
o nature of costs and business models
o strategies for selection & appraisal
o ground roles and responsibilities
o effectiveness and demand for services
- focus of the panel: factors influencing the actual sustainability of a digital archive
- 2 considerations: collaboration + user demand
- challenges + gaps: span national boundaries, public + private funding, education, exportation, DP certification, competition, funding gaps, policy, selection criteria, roles & responsibilities, standards
- Magazzini Digitali: e-legal deposit in Italy
- PADI: a failed project (discontinued in 2010)
Presentation without a title (Neil Grindley)
- how much does it cost to manage information? what institutional financial strategies are required to facilitate effective preservation? what general economic frameworks are required to enable information to persist and be accessible?
- JISC 2010: Infrastructure for Education and Research Programme
- archival storage and preservation activities constitute a very small proportion of the overall costs: 15%; access 31%; outreach, acquisition and ingest 55%
o approx. 333 Euros for a set of 1000 records
- KRDS2, p. 83: future tool development supporting automation of ingest
Sustainable Preservation in North America: ADPNet & Friends (Aaron Trehub)
- solution: distributed digital preservation (in at least 3 copies vs LOCKSS 6 copies)
- DDP + LOCKSS = PLN; open SW developed at Stanford
- MetaArchive, COPPUL (Canada), ADPNet (www.adpn.org)
Color in Digital Preservation. Robert Buckley, University of Rochester/NewMarket Imaging; Steven Puglia, National Archives and Records Administration; and Michael Stelmach, Library of Congress (USA)
o ... of colours by the gamut: AdobeRGB, ProPhoto RGB
- ... and colour reproduction
Multispectral Image Archiving of Watermarks in Historical Papers. Peter Meinlschmidt, Wilhelm-Klauditz-Institut, Fraunhofer-Institute for Wood Research, and Volker ... Braunschweig (Germany)
- Reflected light
- Transmitted light
- ...-spectral imaging
Implementing a Quality Assurance Program for Monitoring Scanner Performance. Michael J. Horsley and John T. Berezich, National Archives and Records Administration (USA) DAITSS
- microfilm, scan ...
- NARA 2004 Guidelines
https://docs.google.com/viewer?url=http%3A%2F%2Fwww.archives.gov%2Fpreservation%2Ftechnical%2Fguidelines.pdf
- Metamorfoze
- etc., see slides
- Quantitative performance: see slides
- Web-based database, SharePoint ...
Preservation in a Digital Age. Jay Verkler, FamilySearch (USA)
- ... with Tessella, for SDB ...
- data loss is intrinsic
- preservation as a service
Curation of the End-of-Term Web Archive: Classification and Metrics. Kathleen Murray, Lauren Ko and Mark Phillips, University of North Texas (USA)
- EOTCD archive: http://research.library.unt.edu/eotcd/wiki/Main_Page
- 16 TB of data
- ... (not WARCs)
- ... then joined together
DAITSS Grows Up: Migrating to a Second Generation Preservation System (Focal). Priscilla Caplan and Carol Chou, Florida Center for Library Automation (USA)
- DAITSS 1 could not be installed anywhere else :-)
- ... instructions
- ... in the processes
- PSP
- refresh function
- disseminate: do a refresh and export the new AIP as a DIP
- withdraw: remove the AIP from storage, retaining provenance
- 80 TB
A Community Driven Micro Services Architecture Supporting Long Term Digital Preservation. Mark Evans and Bill Steel, Tessella Inc (USA), and Robert Sharpe, James Carr, Alan Gairey and Jonathan Tilbury, Tessella plc (UK)
- ... with NDIIPP ... in micro-services
- ... connected into a process; these can then be arbitrarily ...
- workflow ...
- ... of the bit-stream
- ... cloud (S3, http://aws.amazon.com/s3); SaaS functionality; there will soon be ... multi-tenancy ... functionality, policy, etc.
Note from the USA
- ... 44 TB SIP in testing (10 MB JP2 files)
- SDB community: established 2008
... System. Lisa LaPlant and Blake Edwards, US Government Printing Office (USA)
- FDsys, OAIS, dig..., MODS
<part>2</part>
- DMD: data model definition
http://www.gpo.gov/fdsys/search/pagedetails.action?st=prague&granuleId=&packageId=DCPD-200900228
Preservation Starts from the Beginning. Michael Wash, US Department of Transportation (USA)
- Autographic Kodak, 1916 ... on the other ...
"following the No. 3A Autographic Kodak Special of 1916, which was the first rangefinder camera. It had a Kodak Anastigmat f/6.3 lens and a Kodamatic shutter with speeds from 1/2 to 1/200 sec. plus bulb and time mode." http://camerapedia.wikia.com/wiki/No._1A_Autographic_Kodak_Special
- Kodak Advantix ... stopped
Colorite: A Flexible Cross-Platform Software Solution for Automatic Image Quality Analysis Using Arbitrary Targets. Henrik Johansson, National Library of Sweden (Sweden)
o The target is detected automatically ... so that this process ...
o TIFF, JPEG, JP2, PNG
o state-of-the-art feature-based image matching algorithm; ImageMagick
o ... can be switched off in the future to speed up the process for BATCH processing
What if the Image Quality Analysis Rates My D... Dietmar Wueller, Image Engineering (Germany)
- ... of Henrik Johansson
- Golden Thread
- UTT target
- ...-lens reflex
Establishing Resolution Requirements for Digitizing Transmissive Content: A Use Case Approach. Michael Stelmach, Library of Congress; Don Williams, Image Science Associates LLC; and Steven Puglia, National Archives and Records Administration (USA)
- Based on the 10% SFR limiting resolution criterion: how much of the image information will be captured
o first half of the 20th century: 1200-1600 PPI
o second half of the 20th century: up to 2800 PPI
Digitise More, Pay Less: Optimising the Workprocess for both Heritage Institute and Imaging Provider. Olaf Slijkhuis, Pictura Imaginis (the Netherlands)
- ... document, quality, parameters, etc.
- ... control ... of the image
... discussion with FamilySearch, 20 May 2011
- ... also on Digital Preservation issues
- ... will have to provide free of charge
- ... and synchronize
- an effort to cooperate with archives and libraries ... on their part
Business trip report
Project: Creation of the National Digital Library
CZ.1.06/1.1.00/07.06386
Name and surname of the participant: Jan Hutař
Workplace according to the organizational structure: ODF 8.1
Position: head of division
Reason for the trip: attending the iPRES 2011 conference
Place (city): Singapore
Place (country): Singapore
Date (from-to): 30 October - 5 November 2011
Detailed schedule: 30-31 October: flight Prague > Dubai > Singapore
1 November: start of the conference, tutorials
2-4 November: conference
4-5 November: return flight Singapore > Dubai > Prague
Fellow travellers from the NK: Mgr. Marek Melichar (funded from project 0136)
Funding: IOP, Creation of the National Digital Library
Relation to the project: obtaining new information on digital preservation issues and on projects in other libraries; consultations with colleagues and companies
Trip objectives: see relation to the project; use all outputs for planning and running the NDK project; use them for the future handling of digital preservation at the NK/NDK
Fulfilment of the trip objectives: fulfilled; see the detailed notes below and the proceedings on the SPS
Further details: SUMMARY AND BENEFIT FOR THE NDK PROJECT
- a noticeable rise of emulation as an approach to long-term preservation (in previous years it was migration) > it seems the two approaches will complement each other
- a shift towards the preservation of complex data, databases, etc.; the NK does not deal with this yet
- many contributions are applicable to the NK and NDK as well (web archiving and preservation at the national library of France, audits, emulation, information on the SDB system (Tessella) and on the RODA system, certification (see Rouchon), etc.)
- information on problems and solutions when using tools such as JHOVE, PRONOM and others in a real environment
- 2 contributions on backing up optical discs, a current problem at the NK too; ideally follow the procedures described in the proceedings
- a clear need for international cooperation and for adherence to standards, so that such cooperation is possible
- for details see below
Support for project publicity: N/A
Related materials
Material: conference proceedings. Storage location: SPS, folder with business trip reports
Date of submission of the report: 15 November 2011
Signature of the submitter:
Date, signature:
Signature of the supervisor: 15 November 2011
Posted on the intranet:
Received by the International Department:
Seamus Ross: digital curation and preservation
- preserving data sets: use of statistical databases and research data vs preservation of textual information
- personal data: these are no longer just photos in a box (Flickr etc.)
- banks: a lot of personal data, useful to historians in the future; it has to be kept, and institutions are doing so as well
- see mckinsey.com, big data, full report (PDF)
- why do DP at all (slide 16): future generations expect it; so that historians and researchers have some sources; a legacy about the present for the future; an information ecosystem to enable storytelling
- emphasis shifting from the preservation of textual information to the preservation of complex databases
A capability model for DP (Ch. Becker et al.)
- research within the SHAMAN and SCAPE projects
- SoS: systems of systems
- 3 kinds of systems (UML)
- DPS, with DP as a functional requirement
- SoS: a business system, a system within a system; the data then flow into the DPS
- a system where DP is not a functional requirement but which does it anyway (DP-ready system): a business system with DP functionality
- but how to get DP into enterprise systems: a model for implementing DP in any system; within the SHAMAN project, a capability-based reference architecture
- governance, business and technical (operation) capability: a basis for decision-making and for assessing the current state
- capability maturity model (CMM): processes, assessment and improvement, as in SW development
Olivier Rouchon: certification and quality at CINES
- they store theses, digitized items, multimedia, documents, scientific data sets
- a data centre for the whole of France
- they have experts on formats and XML; 11 people
- 15 TB of data
- certification: under national law, CINES is the national centre for DP of theses; they have a department, people and money for it; for the procedure and preparation see below
- preparation for certification: testing DRAMBORA, DSA, TRAC, ISO 16363 and ISO 16919
- step: 2009, DRAMBORA audit; 2 risk reviews per year to check how the risks are being handled
- step: formalization of business processes; 14 processes according to ISO 9001
- management, operational and support processes (presented at iPRES 2010)
- 2009: external pre-audit, 2 people, 19 man-days, based on all available standards, carried out through a review of documentation and interviews
- 2010: SIAF audit, 4 months; carried out by the National Archives of France for every archive that stores public data; they audit every 3 years; the report had 800 pages
- 2010: Data Seal of Approval, part of the EU framework for audit and certification of trusted repositories (MoU between the three certification activities)
- 2011: within the APARSEN project they also did ISO 16363; they went through that audit together with DANS and UKDA
- first internal, January and April 2011 (60 man-days), then external, 12 experts (KB, BL, NASA etc.), in June 2011 (3 days)
MoU: DSA > ISO 16363 as the second step (internal) > ISO 16363 extended. An audit gets faster the more of them you do, i.e. if it is done regularly, it is not so time-consuming.
The National Library of New Zealand passed TRAC certification in autumn 2011.
Andreas Rauber: impact of preservation actions on repositories
What happens to the repository itself; repository simulation: RepoSim.
Built for analysing and testing migration: what happens when files grow, what happens when the repository holds more types of formats, etc.
RepoSim: a simulator, flexible, supports irregular patterns; so far an internal version; Hibernate, Java, MySQL.
One can specify which formats the repository accepts, their description, the ingest settings, hypothetical tools (mainly for migration) and the rules for preservation actions (migration to which format, which version, which files, how many times; rules + filters).
A virtual migration can be run: it produces graphs that tell how long it will take, etc. What to migrate, how, to what and over what period is all run virtually, and we see the result.
Good for IT and HW planning.
Good for planning different scenarios, for comparison with the expected development, and for planning HW upgrades and investments.
Still to be added: the option to specify deletion policies, reports, etc.
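The idea of a virtual migration can be illustrated with a toy calculation. The sketch below is not RepoSim (an internal tool whose interfaces were not presented); every class name, field and number in it is a hypothetical assumption, and the model only divides data volume by an assumed tool throughput.

```python
from dataclasses import dataclass


@dataclass
class FormatHolding:
    name: str
    file_count: int
    avg_size_mb: float


@dataclass
class MigrationRule:
    source_format: str
    target_format: str
    throughput_mb_per_s: float  # assumed speed of a hypothetical tool


def virtual_migration_hours(holdings, rules):
    """Estimate hours needed per rule without touching any real file."""
    by_name = {holding.name: holding for holding in holdings}
    estimate = {}
    for rule in rules:
        holding = by_name.get(rule.source_format)
        if holding is None:
            continue  # the rule matches nothing held in the repository
        total_mb = holding.file_count * holding.avg_size_mb
        seconds = total_mb / rule.throughput_mb_per_s
        estimate[(rule.source_format, rule.target_format)] = seconds / 3600
    return estimate
```

With a made-up holding of 3600 TIFF files of 10 MB each and a hypothetical tool converting 10 MB per second, the estimate for the TIFF to JP2 rule is 1.0 hour. Changing the file counts or sizes shows how growth in the repository changes the planning figures.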
José Barateiro: Risk assessment in DP of e-science data and processes
DP as risk management.
ISO 31000: a definition of risk management, similar to DRAMBORA.
There are many standards for risk management.
The ISO 31000 methodology was broken down into individual steps.
TIMBUS project, http://timbusproject.net; one of the partners is SAP (Germany).
Mad talks
- open source SW for LTP: RODA is back and is being developed within the SCAPE project; new features, plans for further development and for building a user community
- 4 posters on emulation: emulation within KEEP, emulation for library reading rooms, OPF, ecosystem, registry
- TOTEM: a metadata standard for describing the technical environment for emulation
ANDS (Ross Wilkinson)
- data centres in Australia, at least 3, for different areas of life
- ANDS has existed for almost 3 years; funded by the Australian government
- a huge amount of the data will never be used or read by a human, only by automated processes and data mining
- research data have to be stored and preserved, because it may no longer be possible to create them again; they must remain reusable, so that new conclusions can be drawn from them and researchers have them at their disposal
- this has to be done in cooperation; a single institution cannot do it on its own
- who deals with the storage of scientific data in the Czech Republic? The Academy of Sciences? CESNET?
- similar data centres also exist in the United Kingdom
Rob Sharpe: Considerations for High Throughput Digital Preservation
A presentation by Tessella on their performance testing of ingest into SDB at FamilySearch.
- SDB has been developed since 2002, when the first customer was the National Archive UK
- new customer: the UK Parliament
- test with FamilySearch
- 20 TB ingested per day; scanned materials; workflow with antivirus, characterization (PRONOM, JHOVE), etc.
- 1 package is roughly 1 GB; 20 thousand packages per day
- 2 Dell PowerEdge R710 servers, total price at most 20,000 pounds
- the limiting factor turned out to be the read speed of the disks on which the data are stored at the start of ingest; they therefore needed 130 parallel disks (50 thousand pounds)
- storing to tape is slow too; they therefore needed 8 parallel tape writes (30 thousand pounds)
- storage costs 100 pounds per TB
- 7.3 PB per year
- conclusion: writing and reading are slow; tools such as JHOVE and PRONOM are fast enough; storage costs also turned out to be high
For the ingest of data from the FamilySearch project they needed a throughput of 20 TB of data per day while keeping adequate data-processing procedures as required by OAIS and by the client. The project was about identifying the bottlenecks of ingesting large volumes of data.
Processes such as generating or checking hashes, identifying formats and extracting technical metadata usually require a fast storage system when volumes are large. In the FamilySearch project they want to ingest into SDB (content acquisition, content preparation, ingest, fixity check, content/metadata integrity check, characterization, i.e. format identification and validation plus extraction of technical metadata) at most 700 MB per second.
They addressed how to parallelize such a massive workflow efficiently while minimizing costs. According to their findings, parallelization makes it possible to work around performance problems of tools such as DROID and JHOVE; contrary to their expectations, overall software performance was not a problem. The bigger problems are in the hardware, which has to be able to write fast enough.
That is, the bottleneck was in the hardware and in moving data from place to place rather than in the performance of the digital preservation tools.
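The figures quoted in these notes can be cross-checked with a little arithmetic. The following is only an illustrative sketch: it assumes decimal units (1 TB = 10^12 bytes) and a 365-day year, neither of which is stated in the talk, and the function names are made up for this example.

```python
# Back-of-the-envelope check of the ingest figures quoted in the notes.
# Assumptions (not from the talk): decimal units and a 365-day year.
TB = 10**12
MB = 10**6
SECONDS_PER_DAY = 24 * 60 * 60


def sustained_rate_mb_s(tb_per_day: float) -> float:
    """Average rate in MB/s needed to move tb_per_day within 24 hours."""
    return tb_per_day * TB / SECONDS_PER_DAY / MB


def yearly_volume_pb(tb_per_day: float, days: int = 365) -> float:
    """Volume accumulated over one year, in PB."""
    return tb_per_day * days / 1000


def yearly_storage_cost_gbp(tb_per_day: float, gbp_per_tb: float, days: int = 365) -> float:
    """Cost of storing one year of ingest at a flat price per TB."""
    return tb_per_day * days * gbp_per_tb


def average_package_gb(tb_per_day: float, packages_per_day: int) -> float:
    """Average package size in GB."""
    return tb_per_day * 1000 / packages_per_day


if __name__ == "__main__":
    print(f"sustained rate: {sustained_rate_mb_s(20):.0f} MB/s")
    print(f"yearly volume:  {yearly_volume_pb(20):.1f} PB")
    print(f"yearly storage: {yearly_storage_cost_gbp(20, 100):,.0f} GBP")
    print(f"avg package:    {average_package_gb(20, 20_000):.1f} GB")
```

Under these assumptions 20 TB per day corresponds to a sustained average of roughly 231 MB/s, about a third of the 700 MB/s maximum mentioned above, and to 7.3 PB and about 730,000 GBP of storage per year at 100 GBP per TB.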
Benefit for the NK
Do not worry about the performance of software such as DROID or JHOVE.
CO-FINANCED FROM EU STRUCTURAL FUNDS (EUROPEAN REGIONAL DEVELOPMENT FUND) THROUGH THE IOP
Ross King: Evolving domains, problems and solutions for LT DP
- information about projects such as SCAPE
- programme: httpcordiseuropaeufp7icttelearn digicultreport research digital preservation_enpdf (Stephan Strodl, Petar Petrov and Andreas Rauber, all Vienna University of Technology, Austria). A nice timeline of preservation projects, a whitepaper about the past of European DP
- Funding for DP research is gradually growing. Projects and funding alone will not solve anything
- ARCOMEM: web archiving, socially driven web preservation model
- social web analysis
- archive enrichment
- ENSURE: evaluation between cost and value, automation of the preservation cycle, testbeds: healthcare, clinical trials, financial services
- SCAPE
- preservation planning and action workflows, how to make them scalable
- creating an infrastructure for scalable preservation actions
- developing policy-based preservation planning tools with automatic preservation watch
- 3 testbeds: web archives, large-scale repositories, research data sets
- all of the projects will produce prototype software
- digital lifecycle approach
- preservation planning plays a role in all of these projects, together with virtualization
- slide with trends in DP over recent years
- Research on Digital Preservation within projects co-funded by the European Union in the ICT
- Ensure
- Scape
- Wf4Ever httpwwwwf4ever projectorgabout
- Timbus: software is not enough, the project focuses on the context of organizations; LTP is not only about objects but also about services, etc.
- Totem
Benefit for the NK
Follow projects in the area of long-term preservation of digital data. The latest EU projects such as SCAPE will speed up the development of concrete tools for long-term preservation of digital data.
Record keeping in temporary command settings, Erik Borglund
- preservation of documentation of crisis situations arising from the activities of the police and similar bodies
- how to capture context
- flipcharts, videos and minutes can be kept; analog documents are not a problem, the problem lies with digital items and conversations
- the national archive should take care of this, but it accepts only paper documents or, for example, photos from the meeting place
- archiving the file material raises the same question as archiving the course of a meeting in digital form
Web archiving session
BnF
- 200 TB of web-archived data, 15 million ARCs
- they have to characterize and validate them, which is time-consuming
- they store them in a shared repository
- SPAR (the LTP system of the French national library) has a capacity of 16PB
- they use JHOVE2 for characterization and are building a module for ARCs
- they do not want to characterize and validate the content of the ARCs, only identify formats
- PREMIS in METS would be too long, so they will record only metadata at the level of the information package (AIP) that is the same for the whole package; that is, a property is stated once and then only the list of files it applies to is added, instead of repeating the information for every file
- they created a special metadata format
- they can therefore ask the LTP system "give me all information packages that contain format XY" and similar; there is no need to index the metadata of the contents, which would take a long time; they take the same approach for digitized books
- different DP policies and validation levels for different types of web archive data: complete harvests vs. thematic harvests
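The idea of stating a property once per package and listing the files it applies to can be illustrated as follows. This is a minimal sketch with made-up names and a plain dictionary layout; it is not BnF's actual metadata format, which the notes do not describe in detail.

```python
from collections import defaultdict


def group_files_by_property(file_formats: dict[str, str]) -> dict[str, list[str]]:
    """Turn per-file format statements into one statement per format.

    Input:  {"a.html": "text/html", "b.html": "text/html", ...}
    Output: {"text/html": ["a.html", "b.html"], ...}
    """
    grouped: dict[str, list[str]] = defaultdict(list)
    for name, fmt in sorted(file_formats.items()):
        grouped[fmt].append(name)
    return dict(grouped)


def packages_containing(packages: dict[str, dict[str, list[str]]], fmt: str) -> list[str]:
    """Answer 'give me all packages that contain format X' from the
    package-level summaries only, without a per-file index."""
    return sorted(pid for pid, summary in packages.items() if fmt in summary)


if __name__ == "__main__":
    # Hypothetical package identifiers and contents.
    packages = {
        "aip-001": group_files_by_property(
            {"a.html": "text/html", "b.html": "text/html", "c.png": "image/png"}
        ),
        "aip-002": group_files_by_property({"d.pdf": "application/pdf"}),
    }
    print(packages["aip-001"])
    print(packages_containing(packages, "image/png"))
```

The package summary grows with the number of distinct property values rather than with the number of files, which is the point made in the notes about PREMIS in METS becoming too long.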
NL NZ
- 2 harvests, 20 TB in total
- they are dealing with metadata: how much metadata is a lot and how much is too little
- library policy says that as much metadata as possible must be stored, which would be a problem in terms of metadata size
- for selective web harvesting they have a finished workflow in WCT; everything is cataloged
IA
- 16 billion URLs
- the oldest from 1996
- growth is 3 TB per day, 1 PB per year
Euan Cochrane, Dirk von Suchodoletz: Replicating Installed Application and Information Environments onto Emulated or Virtualized Hardware
- capturing and preserving a complete environment on emulated hardware
- e.g. take the desktop environment of the prime minister and store it in the archive
- problems with rendering
- computer forensics
- an option for preserving scientific data and records
- the whole thing is about how to replicate an HDD and run the environment stored on it in a virtual environment
- solution
- they pulled the HDDs out of several old PCs > identify the hardware requirements (HDD analysis > automatic estimate of the requirements, since it is part of every PC environment) > choose the emulation/virtualization software (tool registry such as TOTEM from the KEEP project) > convert the HDD into a disk image suitable for emulation > try to boot the disk image on emulated hardware > add drivers
- problems with licences, personal data protection and authenticity (for 20 % of items the colors change, etc.)
- QEMU, SPARC processor emulator
Klaus Rechert: Remote Emulation for Migration Services in a Distributed Preservation Framework
- use of emulation as a tool for migration
- migration tools for certain formats are often not available
- we put the digital object into an emulated environment (virtual machine), where it becomes visible inside the emulated system; we can open it in the original or another suitable application, save it in a different format and store it back in the virtual machine
Bram Lohman: Emulation as a Business Solution: the Emulation Framework (KEEP project)
- emulation framework: 7 emulators, 6 platforms (x86, Amiga, etc.), 23 file formats
- a solution for managing emulation tools
- setup of emulation processes
- an environment that contains emulators; if we load an application or a file into it, it should run as in the original environment
- the environment also contains a tool that shows, for a given file, what format it is and which environment is needed to run it, based on PRONOM; that environment can be prepared directly and the file run in it
- it loads the software image from an application database that OPF is building
Geoffrey Brown: Developing Virtual CD-ROM Collections: The Voyager Company Publications
- publications of one particular publisher on CD, interactive applications for the Mac; of the 200 published, only about 50 are available today
- emulation on today's systems
- HDD snapshot directly in the emulator, i.e. it takes one click and is very fast
- SheepShaver emulator
Evaluation of the Danish large migration project
Before 1998 they had no formats prescribed by law.
Between 2005 and 2008 they introduced standards.
The evaluation concerns the prescribed standards and the migration to them in the national archive.
They did the evaluation for the body that funded the project.
Between 2005 and 2008 they spent 30 person-years on the migration, with 10-15 people working on it; 190 thousand USD was invested, total costs were 26 million USD.
It is not much data; what they actually migrated was about 1777 GB.
Various parts of the archive: tapes, population data, data on CD-R, registries and electronically populated data.
They could not read all the files, especially those on tapes.
5 different types of tape.
Some had to be rescued at great expense.
Total cost of producing the preservation standard: 10 man-years, 12 thousand USD, including the manual and implementation recommendations.
Pilot: project planning and management, and verification of the information packages.
The aim of the pilot was to obtain a better budget and project plan.
Some problematic data in old formats, such as old databases, need clever people who do repetitive work that takes a long time; they needed good knowledge management to make this efficient.
Migration method: they wrote requirements for the tool and a description of how manual migration should be done.
Data preparation (restructure the data and register the metadata of the IPs) and preparation of documentation for the migrated IPs.
Software development: in-house development.
They needed 50 person-years for 1
Conclusions
Migration of standard data is cheaper, migration from some standard tapes is cheaper, etc.
Most of the costs (80 %) fell on non-standardized data, in producing the migration software. Developing a tool for migrating heterogeneous or non-standard data is the most expensive part.
What they learned: the old data had not been analyzed sufficiently.
Project management was loose; they lost money.
Knowledge management: a good description of the old data and of all their types, generations, locations, etc. does not exist at our institution and we will have trouble with that; migration of old data at the NK will be problematic.
Angela Dappert: robust migration workflow for offline media
- What is an archival object: a nice slide; a CD is not an archival object, for them it is a hand-held carrier; better is a bit-stable object, which can have a backup etc., and from there the archival object, which has additional metadata (logical preservation)
- A CD is not searchable, cannot be replicated easily and has a large manual overhead; the rendering technology becomes obsolete very quickly
- Endangered Archives project: optical disks, CD-R, external HDs, tapes, 67 terabytes in total
- The offline hand-held carriers in the Endangered Archives project were highly variable; they contained DRM-protected and copyrighted data and came with a number of such problems
- Options they chose between for each data source:
- Disk image: a single file that contains everything on the carrier
- Or extraction of only some of the files
- How important is the carrier itself? We need some information about it; it may contain traces of deleted data that we may want to keep. They made disk images of everything possible: hybrid DVDs, audio discs that also held data, etc.
- Which disk image format should they use? Not a single disk image format for all data; a special disk image format for each type
- They did it with disk copying robots; in some cases large-scale disk copying robots could not be used, because they are good at producing CDs but not at ripping data from CDs
- They built their own application with disk stacks and some smaller robots; they tried LIFO and FIFO and ended up using FIFO, because LIFO had problems with the CD picker
At the KB they thought through a fairly complex workflow, how to describe it, etc.
Each robot had its own PC.
They had problems with a number of things; see the presentation.
They did not find good software for image management, only command-line tools, with which non-technical staff would make a lot of mistakes.
It takes a lot of people before the content gets online.
The staff must be well trained and flexible, but also able to do tedious jobs: systematic, patient.
NOTE: important for the NK, where the transfer of data from discs will also have to be addressed, and has already been addressed.
KEEP project, Antonio Ciuffreda: towards integrated migration environment
- Disk transfer tool: converts a disk into an image file. It also carries further metadata about the file system, the MD5 of the file, etc.
- KEEP is producing a Transfer Tool Framework
- Magnetic media: disk transfer tools for floppy disks, commercial and open source
- Disk2FDI: commercial DOS tool, produces a very accurate image of a floppy disk; it takes 1 hour and the result is very large, ten times larger than the floppy disk itself. Tested on about 2260 disks; they tested emulation
- Catweasel: commercial tool, a PCI card, runs on Linux and Win XP, has a GUI. High error rate but faster; the quality of the image file was low
- Nibtools: free tool, G64 and D64, covers only the C64; DOS, Windows, Linux, but command line only. Needs a Commodore disk drive and special cables. They tested a few disks; about half of them then did not work in the emulator
- Optical media: they used 5 transfer tools, with the same CDs, DVDs and games for all of them
1. Alcohol 120: commercial, can bypass DRM etc., supports Windows systems. 12 of 13 worked, 2 MB per second
2. Daemon Tools: commercial, several image file types (ISO, MDS, MDF), supports Windows; three of 13 did not work
3. CloneCD: commercial, uses IMG or ISO, bypasses SafeDisc 3 protection, supports Windows, has a GUI; 11 of 13 were OK
4. BlindWrite: commercial, supports DVD, Blu-ray, WM, Xbox and other special discs. Generates ISO and some proprietary ISO image formats; one did not work. Transfer speed:
5. ImgBurn: does not read subchannel information into the image file (a film cannot be seeked, etc.); open source; generates DVD, BIN, CUE, IMG; Windows and Linux; 4 did not work; it is fast
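What a disk transfer tool does at its core (copy the carrier into an image file and record a checksum alongside it) can be sketched in a few lines. This is only an illustration, not the KEEP Transfer Tool Framework: the function and field names are hypothetical, the source here is an ordinary file standing in for a device, and real tools additionally handle read errors, copy protection and file-system metadata.

```python
import hashlib
import os
import tempfile


def transfer_to_image(source_path: str, image_path: str, chunk_size: int = 1024 * 1024) -> dict:
    """Copy source_path to image_path in chunks and return basic metadata.

    The MD5 is computed while copying, so the data is read only once.
    """
    md5 = hashlib.md5()
    size = 0
    with open(source_path, "rb") as src, open(image_path, "wb") as dst:
        while True:
            chunk = src.read(chunk_size)
            if not chunk:
                break
            dst.write(chunk)
            md5.update(chunk)
            size += len(chunk)
    return {
        "image_file": os.path.basename(image_path),
        "size_bytes": size,
        "md5": md5.hexdigest(),
    }


def demo() -> dict:
    """Run the transfer on a small temporary file standing in for a carrier."""
    with tempfile.TemporaryDirectory() as tmp:
        source = os.path.join(tmp, "carrier.bin")
        image = os.path.join(tmp, "carrier.img")
        with open(source, "wb") as f:
            f.write(b"\x00\x01\x02" * 1000)
        return transfer_to_image(source, image, chunk_size=512)


if __name__ == "__main__":
    print(demo())
```

Storing the checksum with the image makes it possible to verify later copies of the image without going back to the original carrier.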
Conclusions
For magnetic media: the performance of commercial and non-commercial tools does not differ; Disk2FDI is accurate but very slow. KEEP will use NibTools.
Optical: think about copy protection; the tools perform similarly; there will always be errors in the images, between 30 and 10 percent; BlindWrite can handle game discs, Xbox, etc. KEEP will use ImgBurn because it is open source.
For complex images BlindWrite is better.
Benefit for the NK
Consider whether it would be appropriate for the NK to actually run a project to migrate the content of CDs and DVDs to online media. The presenters share concrete experience with robotic processing and show what problems they had with devising the workflow, choosing the type of ISO image, etc.
BnF web archiving
They have three layers:
Harvest definition: collection
Harvest instance: crawling metadata
ARC files
A collection will be, for example, the selective "elections 2000" collection, then the individual harvest instances and then the ARCs.
They collect logs, config and reports.
These are also stored in ARC, as special ARC metadata for each crawling instance.
PREMIS
Object, agent, event
Objects:
1. ARC files and metadata ARCs
2. harvest instances
Harvest event in PREMIS: event "creation of content files"
Events: reports as extensions of the event, host report and harvest report
Agents: administrator, software, institutions, organizations that perform the harvest
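The object, agent and event roles described above can be pictured with a few plain data classes. This is a loose sketch for orientation only: the class and field names are hypothetical, it does not follow the PREMIS schema or BnF's actual records, and the identifiers in the example are invented.

```python
from dataclasses import dataclass, field


@dataclass
class Agent:
    name: str
    role: str  # e.g. "software", "administrator", "organization"


@dataclass
class PreservationObject:
    identifier: str
    category: str  # e.g. "ARC file", "metadata ARC", "harvest instance"


@dataclass
class Event:
    event_type: str
    objects: list
    agents: list
    # Reports attached to the event as extensions (contents omitted here).
    reports: dict = field(default_factory=dict)


def describe_harvest(instance_id: str, arc_ids: list, crawler_name: str) -> Event:
    """Build one harvest event linking a harvest instance, its ARC files
    and the software agent that performed the crawl."""
    instance = PreservationObject(instance_id, "harvest instance")
    arcs = [PreservationObject(arc_id, "ARC file") for arc_id in arc_ids]
    crawler = Agent(crawler_name, "software")
    return Event(
        event_type="creation of content files",
        objects=[instance, *arcs],
        agents=[crawler],
        reports={"host report": None, "harvest report": None},
    )


if __name__ == "__main__":
    event = describe_harvest("elections-2000/instance-1", ["a.arc", "b.arc"], "crawler")
    print(event.event_type, [o.category for o in event.objects])
```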
ContainerMD
httpbibnumbnffrcontainerMD v1documentationcontainerMD v1html
special metadata for items from the web archive
httpbibnumbnffrcontainerMD v1
Different SLAs for different types of material and for different data from different harvests; shared repository; they expect different benefits; skills for different formats do not need to be duplicated within the institution.
Next year there should also be a JHOVE2 module for WARC.
Memento: meta search engine
Benefit for the NK
Their web archiving model could be used at the NK.
Cost models: Danish national library + TU Wien
- Stephan Strodl, TU Vienna: they have their own cost model, but only for small scale (automated preservation action cost)
- Danish national library: they built their own model, which should be universal and usable anywhere
- They measured the cost of submission according to the PAIMAS standard
- When calculating costs they use OAIS and PAIMAS, map activities onto these models and then estimate the cost of the processes accordingly
- Costmodelfordigitalpreservationdk
Benefit for the NK
Relevant to project 0136, which addressed the options for estimating the cost of long-term storage.
Meet RODA: a Full-Fledged Digital Repository for Long Term Preservation
- Originally a project of the Portuguese national archive; we have been following it for several years. The RODA system is now supported by an independent company, which also partly continues its development. So far RODA supports only the archival metadata format (EAD), but further development should include library formats as well
- RODA is now part of the SCAPE project, where it will be possible to develop the system further and scale it for use in mass production
- httpredminekeepptprojectsroda public
Benefit for the NK
Follow further development; possibly relevant for the INCAD + KNAV projects as well; for LTP development in smaller institutions this could be an interesting alternative in the future.
Report from a foreign business trip
Name of trip participant: Mgr. Andrea Fojtů (AF)
Workplace (by organizational structure): 1.5 Department of Long-term Preservation of Digital Data (ODODD)
Workplace (assignment): 1.5.2 Department of Digital Repository Content Management
Purpose of trip: participation in the conference Aligning National Approaches to Digital Preservation
Place (city): Tallinn
Place (country): Estonia
Date (from-to): 22.-26.5.2011
Detailed schedule:
Monday 23.5.: conference registration, guided tour of the National Library, Keynote Address by Laura Campbell (Library of Congress, USA), Panel 1 Technical Alignment, Panel 2 Organizational Alignment
Tuesday 24.5.: Keynote Address by Gunnar Sahlin (National Library of Sweden), Panel 3 Standards Alignment, Panel 4 Legal Alignment, Breakout Sessions for panels 3 & 4
Wednesday 25.5.: Panel 5 Education Alignment, Panel 6 Economic Alignment, Breakout Sessions for panels 5 & 6, Synthesis, Closing remarks
Fellow travellers from the NK: PhDr. Bohdana Stoklasová (BS), Ing. Tomáš Svoboda (TS)
Funding: IOP NDK
Goals of the trip: attendance at a conference with international participation; gaining contacts in the field of long-term preservation of digital documents and electronic legal deposit; a more detailed insight into the issues of long-term preservation of digital documents, especially in national libraries
Fulfilment of the goals (specifically): The conclusions of the conference Aligning National Approaches to Digital Preservation largely mirror the conclusions of the workshop The Future of the Past: Shaping new visions for EU research in digital preservation (report available at httpcordiseuropaeufp7icttelearn digicultfuture of thepast_enpdf), for example regarding the missing economic model for the commercial sector, which would treat long-term preservation as an inseparable part of all its processes. Contact was established with staff of the national libraries of Estonia and Finland (staff working on web archiving and on long-term preservation in general).
Programme and further details: The main aim of the conference was to align national approaches to long-term preservation of digital documents across all areas, from technical, organizational and educational to standardization, economic and financial.
Materials brought back: conference programme, leaflets of exhibiting companies (Tessella, Equella, Guardtime) + further materials, notes
Date of report submission: 8.6.2011
Signature of the submitter:
Signature of the superior:
Posted on the intranet:
Received by the international department:
Attachment to this report: Conference notes in English
Attachment: Conference notes in English
Exploring What We Can Do Together Strategic Alliance for International Collaboration Laura Campbell
185 digital preservation partners in more than 25 countries (education, research, LAM)
strategic goals: National Content Stewardship Network (national digital collection, technical architecture, public policy, outreach)
NDIIPP Content Domain Map: a mind map of geospatial, audiovisual, image & text and web content
then & now: cognitive surplus vs. digital libraries
digital preservation
solution framework: actively working together, special interest groups, establishing a common index, international digital collection (freely available)
PANEL 1Technical Alignment (The role of testing) Prof Dr Michael Seadle (Panel Head)
to collaborate on requiring and implementing rigorous and independent tests
DNB Contribution to the Tallinn Alignment Sabine Schrimpf
key theme is infrastructure
network of hardware and software that permits operation of application software
question of interoperability is crucial (standards, technical specifications)
software elements
components of the DP infrastructure were compared to the pallets at railroads
o Source: PARSE.Insight Roadmap 2020
persistent ID resolvers, certification process
kopal (ingest, koLibRI)
nestor (German Network of Expertise)
DP4Lib: Digital Preservation as a Service, reduce dependency between components
o redundant storage at different locations
o koLibRI modules
LUKII: set up as an economical LOCKSS network in Germany
SHAMAN
APARSEN: wants to bring coherence and cohesion to digital preservation research
o trends in DP research projects
o modular DP systems
o distributed as SOA
o elimination of technology dependencies
EDINA The UK LOCKSS Alliance Adam Rusbridge

EDINA offers underlying technical support & coordination
threats to digital stewardship
o failure (media, HW, SW, network, format obsolescence, natural disaster, economic/organizational failure)
o attack (insider/outsider)
o operator error
source: Requirement to Digital Preservation
projects PEPRS and PECAN help identify coverage and requirements for DP
Public testing Michael Seadle
traditional physical archiving relies heavily on trusted institutions
distrust, not trust, needs to be the basis of digital archiving; testing therefore plays a key role
goals of testing: demonstrate functionality, reveal weaknesses, provide data for planning improvements
key issues for testing: integrity, authenticity (can the origin or genuineness be shown), usability (can migration/emulation be demonstrated), access, financial integrity
DrWho (drwho1com)
bit stream testing is the most important; authenticity and usability may be impaired
o the type of storage media, the number of copies + frequencies of checking and replacement get us to the relevant results
o no reliable metrics exist, however (what is an acceptable loss, etc.)
without well documented, peer reviewed, publicly available test results, librarians are buying archiving systems on faith
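Bit-stream testing across several copies can be pictured as a periodic comparison of checksums. The sketch below is an assumption-laden illustration rather than any system discussed at the conference: the replicas are plain byte strings keyed by made-up site names, SHA-256 is an arbitrary choice, and a simple majority decides which copies count as damaged.

```python
import hashlib
from collections import Counter


def digest(data: bytes) -> str:
    """SHA-256 hex digest of one replica."""
    return hashlib.sha256(data).hexdigest()


def audit_replicas(replicas: dict[str, bytes]) -> dict:
    """Compare replica digests and flag copies that disagree with the majority.

    'agreement' is True only when more than half of the copies share a digest;
    otherwise the audit cannot say which copy is the good one.
    """
    digests = {site: digest(data) for site, data in replicas.items()}
    majority_digest, votes = Counter(digests.values()).most_common(1)[0]
    damaged = sorted(site for site, d in digests.items() if d != majority_digest)
    return {
        "majority_digest": majority_digest,
        "votes": votes,
        "damaged": damaged,
        "agreement": votes > len(replicas) / 2,
    }


if __name__ == "__main__":
    # Hypothetical sites; the third copy has suffered a one-byte change.
    copies = {
        "site-a": b"archived page",
        "site-b": b"archived page",
        "site-c": b"archived pagf",
    }
    print(audit_replicas(copies))
```

How many copies to keep and how often to run such a check are exactly the parameters the notes say lack reliable metrics.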
Presentation without a title Andy Rauber
evaluation vs. testing vs. benchmarking
DP testing is evaluation rather than testing, and far from benchmarking (few tests, but not near a definition of benchmarks)
o existing evaluations are not repeatable
o focus on the simple things
o building the frameworks before having clear test scenarios
necessary to move towards comparative benchmarking
what is needed: commitment to a culture of benchmarking and comparative evaluation; understanding of what we want to benchmark; benchmark data + ground truth; measurement scales and measures that remain constant; knowledge bases to collect these
Organizing digital preservation on an international level Michelle Gallinger (NDIIPP)
focus on a national DP agenda
community driven, action oriented (National Digital Stewardship Alliance)
o present a distributed national digital collection for the benefit of citizens
The European Research Arena David Giaretta
technologies: GEANT, EGEE/EGI
EU research projects: TIMBUS, BLOG4EVER, SCAPE, ENSURE, APARSEN, ARCOMEM, WF4EVER
o SCIDIP-ES (2011-2014)
Alliance for Permanent Access (APA) formed as a legal entity 3 years ago
o opportunities for networking
ISO 16363 Audit and Certification of Trustworthy Digital Repositories
ISO 16919 Requirements for Bodies Providing Audit and Certification
Observation from the MetaArchive Cooperative program Martin Halbert
distributed DP programs differ from other programs
o replication of content, distribution of these replicated copies to distinct geographical locations, and network organization to connect these replicated copies
MetaArchive established in 2003, funded by NDIIPP
o seeks to foster broader awareness of digital preservation issues
IIPC members are all institutions that focus on WA
o 39 members: national and university libraries + other organizations (Internet Archive)
o ISO standard WARC format for web archives + Heritrix and NutchWAX
o growing membership (Africa & South America)
DAY 2
Keynote address
International and National Collaboration in the digital age Gunnar Sahlin

2012: a new law for e-deposit
Samsök (search together) in 2005 (upgrade, new system)
Swepub and long-term preservation
Consortium of the Swedish research libraries for licensing e-journals and databases (ICOLC)
Open Access and e-publishing (all universities have their repositories for e-pubs)
NL aggregator for Europeana: TEL, Apres, Athena, EU screen
o common system for the preservation of digital materials
o common search portal for materials from the Swedish National Library and Swedish National Archive
Raivo Ruusalepp
standards: RAC, DSA, CIDOC (CRM), PAIMAS, ISAD(G), DDI
use of information security standards for digital preservation
information security: administrative and technical (physical = data security vs. IT = communication)
company implemented security measurement with typical cyber crime scenarios
survey of security
o provision for information security in national legislation and development plans (12 of the respondents: ISO 27000 series; only 2 formal audits; the rest are looking into it; ½ of the respondents do not use standards or formal measurements to control information security)
o IT & disaster plan: 65 (data recovery from the off-site location tested: 0)
alignment
o better use of community standards for information security and preservation
o agreement on security requirements
Standards based approach to preservation planning Matthew Woollard

ISO 27001: very expensive implementation, 100 000 pounds
Basic: Data Seal of Approval Guidelines (helps understand your business better)
Audit and Certification of Trustworthy Digital Repositories
Memorandum of Understanding to Create a European Framework
ISO 16363 external, DSA or ISO 16363 self-audit
Best Practices amp Standards Bram van der Werf
self-assessment trust
audits/certification trust
ISO 30300 (draft) Record Management
PANEL 4 Legal Alignment
Legal deposit & Web Archiving Adrienne Muir
legal deposit provisions: purpose, scope, deposit mechanisms, roles & responsibilities & liability, access provisions, sanctions
implementation: definitions, scope, offline/online, freely available/paywall, technology neutral/incremental
legal deposit vs. voluntary (interim/hybrid approach, model agreements and licenses, flexibility)
other legal issues: intellectual property rights, preservation, access, unlawful material, privacy/data protection
voluntary approaches have disadvantages but may be necessary and can be useful
Breakout Session
standards bring along alignment by themselves, but only if you use them to the full, not halfway
depends on community (users): enforceable or voluntary compliance (standards)
next steps for standards alignment
o corpora as benchmarks
o export/import completeness
o educational standards
o validation tools
o accredited training courses to accredit auditors
o framework standards
DAY 3
PANEL 5 Educational Alignment

key elements of the DCC Curation Lifecycle Model
Framework for the Education Alignment (USA: grad programs for digital preservation, new models for grad programs, internship programs (diverse knowledge), workshops)
o sharing tools
o national programs
related issues to digital preservation
o nature of costs and business models
o strategies for selection & appraisal
o ground roles and responsibilities
o effectiveness and demand for services
focus of the panel: factors influencing the actual sustainability of a digital archive
2 considerations: collaboration + user demand
challenges + gaps: span national boundaries, public + private funding, education, exportation, DP certification, competition, funding gaps, policy, selection criteria, roles & responsibilities, standards
Magazzini Digitali: e-legal deposit in Italy
PADI: a failed project (discontinued in 2010)
Presentation without a title Neil Grindley
how much does it cost to manage information? what institutional financial strategies are required to facilitate effective preservation? what general economic frameworks are required to enable information to persist and be accessible?
JISC 2010 Infrastructure for Education and Research Programme
archival storage and preservation activities constitute a very small proportion of the overall costs: 1531 access 55 outreach acquisition ingest
o approx. 333 Euros for a set of 1000 records
KRDS2 p. 83: future tool development supporting automation of ingest
Sustainable Preservation in North America ADPNet amp Friends Aaron Trehub
solution: distributed digital preservation (in at least 3 copies vs. LOCKSS 6 copies)
DDP + LOCKSS = PLN, open software developed at Stanford
MetaArchive, COPPUL (Canada), ADPNet wwwadpnorg
Preservation in a Digital Age Jay Verkler FamilySearch (USA)
- Tessellou pro SDB budou k 1 - - - data loss is intrinsic gt - - - och
- preservation as a service
Curation of the End-of-Term Web Archive Classification and Metrics Kathleen Murray Lauren Ko and Mark Phillips University of North Texas (USA)
- - eotcd archiv httpresearchlibraryuntedueotcdwikiMain_Page - - pro - 16TB dat - - (ne Warcy)
pak to pospojo
DAITSS Grows Up Migrating to a Second Generation Preservation System (Focal) Priscilla Caplan and Carol Chou Florida Center for Library Automation (USA)
- Daitss 1 se nedal nainstalovat jinde si -) -
instructions
- v procesech
- - PSP
- refresh funkce - disseminate- do a refresh and export new AIP as DIP - withdraw remove AIP from storage retaining provenance
-
1
- - 80TB -
A Community Driven Micro Services Architecture Supporting Long Term Digital Preservation Mark Evans and Bill Steel Tessella Inc (USA) and Robert Sharpe James Carr Alan Gairey and Jonathan Tilbury Tessella plc (UK)
s NDIIPP v micro-services
spojeno do procesu- lze pak libovol
- - - -
-
workflow - -
- bit-streamu - cloudu (S3
httpawsamazoncoms3) SaaS funkcionalitu bude brzy ti tenancy
funkcionalitou policy apod
Pozn z USA
v 44TB SIP v testu (10MB JP2 soubory)
komunita SDB - - vznik 2008 - -
System
Lisa LaPlant and Blake Edwards US Government Printing Office (USA) FDsys OAIS dig MODS - z
<part>2</part>
DMD data model definition z
httpwwwgpogovfdsyssearchpagedetailsactionst=pragueampgranuleId=amppackageId=DCPD-200900228
----------------------------------------- Preservation Starts from the Beginning Michael Wash US Department of Transportation (USA)
Autographic Kodak 1916 na druhou
following the No 3A Autographic Kodak Special of 1916 which was the first rangefinder camera It had a Kodak Anastigmat f63 lens and a Kodamatic shutter with speeds from 12 to 1200 sec plus bulb and time mode httpcamerapediawikiacomwikiNo_1A_Autographic_Kodak_Special Kodak Advantix
zastavil
Colorite A Flexible Cross-Platform Software Solution for Automatic Image QualityAnalysis Using Arbitrary Targets Henrik Johansson National Library of Sweden (Sweden)
- a - -
o Target se detekuje automa se aby tento proce
o TIFF JPEG JP2 PNG o o -of-art feature based image matching algoritm ImageMagic
o Alg
t o o v budoucnu vypnout aby se proces urychlil pro BATCH
proce o
of Henrik Johansson
What if the Image Quality Analysis Rates My D Dietmar Wueller, Image Engineering (Germany)
- Golden Thread
- UTT target
-lens reflex
Establishing Resolution Requirements for Digitizing Transmissive Content: A Use Case Approach. Michael Stelmach, Library of Congress; Don Williams, Image Science Associates LLC; and Steven Puglia, National Archives and Records Administration (USA)
- Based on the 10% SFR limiting-resolution criterion: how much of the image information will be captured
o first half of the 20th century: 1200-1600 PPI
o second half of the 20th century: up to 2800 PPI
Digitise More, Pay Less: Optimising the Workprocess for both Heritage Institute and Imaging Provider. Olaf Slijkhuis, Pictura Imaginis (the Netherlands)
- document, quality, parameters, etc.
- image checking
Discussion with FamilySearch, 20.5.2011
also on digital preservation issues
will have to provide free of charge
and synchronize
an effort on their side to cooperate with archives and libraries
CO-FINANCED BY THE EU STRUCTURAL FUNDS (EUROPEAN REGIONAL DEVELOPMENT FUND) THROUGH THE INTEGRATED OPERATIONAL PROGRAMME (IOP)
Business trip report
Project: Creation of the National Digital Library
CZ.1.06/1.1.00/07.06386
Name of trip participant: Jan Hutař
Department per organizational structure: ODF 81
Position: head of department
Purpose of trip: attending the iPRES 2011 conference
Place (city): Singapore
Place (country): Singapore
Date (from-to): 30.10.-5.11.2011
Detailed schedule:
30.-31.10. flight Prague > Dubai > Singapore
1.11. start of conference, tutorials
2.11.-4.11. conference
4.-5.11. return flight Singapore > Dubai > Prague
Fellow travelers from NK: Mgr. Marek Melichar (funded from project 0136)
Funding: IOP, Creation of the National Digital Library
Relation to the project: gaining new information on digital preservation and on projects in other libraries; consultations with colleagues and companies
Goals of the trip: see relation to the project; use all outputs for planning and running the NDK project and for future handling of digital preservation at NK/NDK
Fulfilment of trip goals: fulfilled; see the detailed notes below and the proceedings at SPS
Further details: SUMMARY AND CONTRIBUTION TO THE NDK PROJECT
- a noticeable rise of long-term preservation solutions based on emulation (in previous years: migration); the two approaches seem likely to complement each other
- a shift toward preserving complex data, databases, etc.; NK does not address this yet
- many contributions are usable for NK and NDK (web archiving and preservation at the National Library of France, audits, emulation, information on the SDB system (Tessella) and on the RODA system, certification (see Rouchon), etc.)
- information on problems and solutions when using tools such as JHOVE, PRONOM, etc. in a real environment
- 2 contributions on backing up optical discs, a current problem at NK too; ideally follow the procedures described in the proceedings
- a clear need for international cooperation and for adherence to standards so that such cooperation is possible
- for details see below
Project publicity support: N/A
Related materials
Material: conference proceedings
Storage location: SPS, folder with business trip reports
Date of report submission: 15.11.2011
Signature of the submitter:
Date, signature:
Signature of supervisor: 15.11.2011
Uploaded to the intranet:
Received by the international department:
Seamus Ross: digital curation and preservation
- preserving data sets: use of statistical databases and research data vs. preservation of textual information
- personal data are no longer just photos in a box (Flickr, etc.)
- banks: lots of personal data that historians will use in the future; it needs to be kept, and institutions already do so
- see mckinsey.com, big data, full report (PDF)
- so why do DP (slide 16): future generations expect it; for historians and researchers, so that they have some sources; a legacy about the present for the future; an information ecosystem to enable storytelling
- the emphasis shifts from preserving textual information to preserving complex databases
A capability model for DP, Ch. Becker et al.
- research within the SHAMAN and SCAPE projects
- SoS: systems of systems
- 3 kinds of systems (UML)
- a DPS where DP is a functional requirement
- SoS: a business system, a system within a system; the data then flow into the DPS
- a DPS where DP is not a functional requirement but which does it anyway (DP-ready system): a business system with DP functionality
- but how to get DP into enterprise systems: a model for implementing DP in any system; within the SHAMAN project, a capability-based reference architecture
- governance, business, and technical (operations) capabilities: a basis for decisions and for assessing the current state
- capability maturity model (CMM): assessment and improvement processes, as in SW development
Olivier Rouchon: certification and quality at CINES
- they store theses, digitized items, multimedia, documents, scientific data sets
- data centre for the whole of France
- they have experts on formats (XML); 11 people
- 15TB of data
- certification: by national law, CINES is the national centre for DP of theses; they have a department, people, and money for it; procedure and preparation see below
- preparing for certification: testing DRAMBORA, DSA, TRAC, ISO 16363 and ISO 16919
- step: 2009 DRAMBORA audit; 2 risk reviews a year on how their resolution is progressing
- step: formalization of business processes, 14 processes per ISO 9001
- management, operational, and support processes (presented at iPRES 2010)
- 2009 external pre-audit: 2 people, 19 man-days, based on all available standards, by means of documentation checks and interviews
- 2010 SIAF audit, 4 months; carried out by the National Archives of France for every archive that stores public data; they audit every 3 years; the report had 800 pages
- 2010 Data Seal of Approval, part of the EU framework for audit and certification of trusted repositories (MoU between the three certification activities)
- 2011 within the APARSEN project they also did ISO 16363; DANS and UKDA went through that audit as well
- first internal, in January and April 2011 (60 man-days), then external by 12 experts (KB, BL, NASA, etc.) in June 2011 (3 days)
MoU: DSA > ISO 16363 as the second step (internal) > ISO 16363 extended
the audit gets faster the more of them you do, i.e. if it is done regularly it is not so time-consuming
The National Library of New Zealand passed TRAC certification in autumn 2011
Andreas Rauber: the impact of preservation actions on the repository
- what happens to the repository itself
- repository simulation: RepoSim
- for analysis and for testing migration: what happens when files grow, what if the repository holds more format types, etc.
- RepoSim: a flexible simulator, irregular patterns
- so far an internal version (Hibernate, Java, MySQL)
- one can specify which formats it accepts, their description, ingest settings, hypothetical tools (mainly for migration), and rules for preservation actions (migration to which format, which versions, which files, how many times; rules + filters)
- a virtual migration can be run; it produces graphs that show how long it will take, etc.
- what to migrate, how, to what, and over what period: it runs virtually and we see the result
- good for IT and HW planning
- good for planning various scenarios, comparing with the expected development, and planning HW development and investments
- still to be finished: the ability to enter deletion policies, reports, etc.
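The idea of a "virtual migration" can be illustrated with a very small sketch. This is not RepoSim's code or API; the class names, the throughput figure, and the holdings below are assumptions made up for illustration. It only shows how holdings plus migration rules can yield an estimate of how long a migration would take.

```python
from dataclasses import dataclass


@dataclass
class FormatHolding:
    """Hypothetical description of what the repository holds in one format."""
    name: str
    file_count: int
    avg_size_mb: float


@dataclass
class MigrationRule:
    """Hypothetical rule: migrate one format to another with an assumed tool speed."""
    source_format: str
    target_format: str
    throughput_mb_per_s: float


def simulate_migration(holdings, rules):
    """Return estimated migration time in hours per source format.

    Formats without a matching rule are left out of the report.
    """
    rule_by_source = {rule.source_format: rule for rule in rules}
    report = {}
    for holding in holdings:
        rule = rule_by_source.get(holding.name)
        if rule is None:
            continue
        total_mb = holding.file_count * holding.avg_size_mb
        report[holding.name] = total_mb / rule.throughput_mb_per_s / 3600.0
    return report


holdings = [
    FormatHolding("TIFF", file_count=1000, avg_size_mb=36.0),
    FormatHolding("PDF", file_count=500, avg_size_mb=2.0),
]
rules = [MigrationRule("TIFF", "JP2", throughput_mb_per_s=10.0)]
print(simulate_migration(holdings, rules))
```

A real simulator would also model growth over time, filters, and repeated migrations, as the notes above describe; the sketch covers only the single-pass duration estimate.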
José Barateiro: Risk assessment in DP of e-science data and processes
- DP as risk management
- ISO 31000: definition of risk management
- similar to DRAMBORA
- there are many standards for risk management
- the ISO 31000 methodology elaborated into individual steps
- TIMBUS project (http://timbusproject.net); one of the partners is SAP (Germany)
mad talks
- open-source SW for LTP: RODA is back; it is being developed within the SCAPE project; new features, plans for development and for building a user community
- 4 posters on emulation: emulation within KEEP, emulation for reading rooms in libraries, OPF ecosystem, registry
- TOTEM: a metadata standard for describing the technical environment for emulation
ANDS, Ross Wilkinson
- data centres in Australia: at least 3, for different areas of life
- ANDS has existed for almost 3 years; funded by the Australian government
- a huge amount of data will never be used or read by a human, only by automated processes and data mining
- research data must be stored and preserved because it may no longer be possible to recreate them; they must be kept so that they can be reused, new conclusions can be drawn from them, and scientists have them available
- this must be done in cooperation, not by a single institution alone
- who handles the storage of scientific data in the Czech Republic? The Academy of Sciences, CESNET
- similar data centres exist in Great Britain too
Rob Sharpe: Considerations for High Throughput Digital Preservation
Presentation by Tessella: their performance testing of ingest into SDB at FamilySearch
- SDB has been in development since 2002, when the first customer was The National Archives (UK)
- new customer: UK Parliament
- test with FamilySearch
- 20 TB ingest per day; scanned materials; workflow with antivirus, characterization (PRONOM, JHOVE), etc.
- 1 package is roughly 1 GB; 20 thousand packages per day
- 2 servers, Dell PowerEdge R710, total price max. 20,000 pounds
- it turned out that the limiting factor is the read speed of the disks on which the data are stored at the start of ingest; they therefore needed 130 parallel disks (50 thousand pounds)
- storing to tape is also slow; they therefore needed 8 parallel tape writes (30 thousand pounds)
- storage costs 100 pounds per TB
- 7.3 PB per year
- conclusion: writing and reading are slow; tools such as JHOVE and PRONOM are fast enough; storage costs also turned out to be high

To ingest data from the FamilySearch project they needed to ensure a throughput of 20 TB of data per day while keeping adequate procedures for processing the data according to the requirements of OAIS and of the customer. The project was about identifying the bottlenecks of ingesting large volumes of data.

Processes such as generating or checking hashes, identifying formats, and extracting technical metadata usually require a large and, at high volumes, fast storage system. In the FamilySearch project they want to ingest into SDB (content acquisition, content preparation, ingest, fixity check, content metadata integrity check, characterization, i.e. format identification and validation and extraction of technical metadata) at most 700 MB per second.

So the question is how to parallelize such a massive workflow efficiently while minimizing costs. According to their findings, parallelization makes it possible to get around performance problems of tools such as DROID and JHOVE; overall, contrary to their expectations, software performance was not a problem. The bigger problems are in the HW, which must be able to write fast enough.

I.e. the bottleneck was in the HW and in moving data from place to place rather than in the performance of the digital preservation tools.
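The figures quoted in these notes can be cross-checked with simple arithmetic. The sketch below assumes decimal units (1 TB = 1000 GB = 1,000,000 MB) and continuous ingest around the clock; the function names are made up for illustration. It shows that 20 TB per day corresponds to an average of roughly 230 MB per second, 20 thousand 1 GB packages per day, and 7.3 PB per year.

```python
SECONDS_PER_DAY = 24 * 60 * 60


def daily_tb_to_mb_per_s(tb_per_day):
    """Average sustained rate in MB/s for a given daily volume in TB."""
    return tb_per_day * 1_000_000 / SECONDS_PER_DAY


def packages_per_day(tb_per_day, package_gb):
    """Number of packages of a given size (GB) that make up the daily volume."""
    return tb_per_day * 1000 / package_gb


def yearly_volume_pb(tb_per_day, days=365):
    """Yearly volume in PB for a given daily volume in TB."""
    return tb_per_day * days / 1000


def yearly_storage_cost(tb_per_day, cost_per_tb, days=365):
    """Cost of storing one year of ingest at a flat price per TB."""
    return tb_per_day * days * cost_per_tb


print(round(daily_tb_to_mb_per_s(20), 1))   # average MB/s
print(packages_per_day(20, 1))              # packages per day
print(yearly_volume_pb(20))                 # PB per year
print(yearly_storage_cost(20, 100))         # pounds per year at 100 pounds per TB
```

The average rate is well below the 700 MB per second quoted as the maximum, which fits the observation that peak disk and tape speed, not tool performance, was the constraint.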
Benefit for NK
No need to worry about the performance of SW such as DROID or JHOVE
Ross King: Evolving domains, problems and solutions for LT DP
- info about projects such as SCAPE
- programme: http://cordis.europa.eu/fp7/ict/telearn-digicult/report-research-digital-preservation_en.pdf (Stephan Strodl, Petar Petrov, Andreas Rauber, Vienna University of Technology, Austria): a nice timeline for preservation projects, a whitepaper about the past of European DP
- funding spent on DP research is gradually growing; projects and funding will not solve anything
- ARCOMEM: archiving of web archives, socially driven web preservation model; social web analysis; archive enrichment
- ENSURE: evaluation between cost and value, automation of the preservation cycle; testbeds: healthcare, clinical trials, financial services
- SCAPE
- preservation planning and action workflows: how to make them scalable
- building infrastructure for scalable preservation actions
- development of policy-based preservation planning tools with automatic preservation watch
- 3 testbeds: web archives, large-scale repositories, research data sets
- all projects will produce prototype SW
- digital lifecycle approach
- preservation planning plays a role in all these projects, together with virtualization
- slide with trends in DP over recent years
- Research on Digital Preservation within projects co-funded by the European Union in the ICT
- ENSURE
- SCAPE
- Wf4Ever: http://www.wf4ever-project.org/about
- TIMBUS: SW is not enough; it focuses on the context of organizations; LTP is not only about objects but also about services, etc.
- TOTEM

Benefit for NK

Follow projects in the field of long-term preservation of digital data. Recent EU projects such as SCAPE will speed up the development of concrete tools for long-term preservation of digital data.

Record keeping in temporary command settings, Erik Borglund
- preserving documentation of crisis situations arising from the activities of the police, etc.
- how to capture context: flipcharts, videos, and notes can be kept; the context of analogue documents is not a problem, the problem is with digital items and conversations
- the national archive should take care of this, but it only accepts paper documents or e.g. photos from the scene
- the question: is archiving records material the same as archiving the course of proceedings in digital form?
Web archiving session

BnF
- 200 TB of web-archived data
- 15 million ARCs; they must be characterized and validated, which is time-consuming
- stored in a shared repository
- SPAR (the LTP system of the National Library of France) has a capacity of 16PB
- they use JHOVE2 for characterization and are building a module for ARCs
- they do not want to characterize and validate the content of the ARCs, only identify formats
- PREMIS in METS would be too long, so they will record only metadata at the level of the information package (AIP) that are the same for the whole package; i.e. a property is expressed once and then only the information about which files it applies to is added, instead of repeating that information for every file
- they created a special metadata format
- i.e. they can ask the LTP system "give me all information packages that contain format XY", etc.; there is no need to index the metadata of the contents, which would take a long time; they use the same approach for digitized books
- different DP policies and validation levels for different types of web archive data: complete harvests vs. thematic harvests
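The package-level approach described for BnF (state a property once, then list the files it applies to) can be sketched as a simple regrouping. This is only an illustration of the idea, not the actual containerMD schema or the SPAR query interface; the function names and sample data are assumptions.

```python
from collections import defaultdict


def group_by_property(file_formats):
    """Turn {file name: format} into {format: [file names]}.

    Each format is then stated once per package instead of once per file.
    """
    grouped = defaultdict(list)
    for file_name, fmt in sorted(file_formats.items()):
        grouped[fmt].append(file_name)
    return dict(grouped)


def packages_with_format(package_summaries, fmt):
    """Return the ids of packages whose summary mentions the given format."""
    return sorted(
        package_id
        for package_id, summary in package_summaries.items()
        if fmt in summary
    )


summaries = {
    "aip-001": group_by_property(
        {"a.html": "text/html", "b.html": "text/html", "c.png": "image/png"}
    ),
    "aip-002": group_by_property({"d.pdf": "application/pdf"}),
}
print(summaries["aip-001"])
print(packages_with_format(summaries, "image/png"))
```

Answering "which packages contain format XY" then only needs the per-package summaries, not an index over every contained file.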
NL NZ
- 2 harvests, 20 TB in total
- they are dealing with metadata: how much metadata is a lot and how much is too little
- library policy says that as much metadata as possible must be stored, but that would be a problem in terms of metadata size
- for selective web harvesting they have a finished workflow, WCT; everything is catalogued

IA
- 16 billion URLs
- the oldest from 1996
- growth is 3 TB per day, 1 PB per year
Euan Cochrane, Dirk von Suchodoletz: Replicating Installed Application and Information Environments onto Emulated or Virtualized Hardware
- capturing and preserving an entire environment on emulated HW
- e.g. take the desktop environment of the prime minister and store it in the archive
- display problems
- computer forensics
- an option for preserving scientific data and records
- it is all about how to replicate an HDD and run the environment stored on it in a virtual environment
- solution: they pulled the HDDs out of several old PCs > identify the HW requirements (HDD analysis > automatic estimate of requirements; this is part of every PC environment) > choose emulation/virtualization SW (tool registry such as TOTEM from the KEEP project) > convert the HDD into a disk image suitable for emulation > try to boot the disk image on emulated HW > add drivers
- problems with licences, personal data protection, authenticity (20% of things change, colours, etc.)
- QEMU, SPARC processor emulator
Klaus Rechert: Remote Emulation for Migration Services in a Distributed Preservation Framework
- using emulation as a tool for migration
- often no tools are available for migrating certain formats
- we insert the digital object into an emulated environment (a virtual machine), then see it within the emulated system, can open it in the original or another suitable application, save it in a different format, and store it back in the virtual machine

Bram Lohman: Emulation as a Business Solution: the Emulation Framework (KEEP project)
- Emulation Framework: 7 emulators, 6 platforms (x86, Amiga, etc.), 23 file formats
- a solution for managing emulation tools
- setup of emulation processes
- an environment that contains emulators; if we load an application or a file into it, it should run as in the original environment
- the environment also contains a tool that shows, for a file, what format it is and what environment is needed to run it, based on PRONOM; the environment can be prepared right away and the file run in it
- it loads the SW image from an application database that OPF is building
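The lookup described above (identify the format, then find the environment needed to render it) can be sketched as a small table lookup. This is not the KEEP Emulation Framework API. The registry entries, the image names, and the pairing of identifiers with environments are invented for illustration; the keys merely follow the style of PRONOM identifiers.

```python
from typing import NamedTuple, Optional


class Environment(NamedTuple):
    """Hypothetical description of a rendering environment."""
    platform: str
    emulator: str
    software_image: str


# Made-up registry: format identifier -> environment able to render it.
REGISTRY = {
    "fmt/40": Environment("x86", "QEMU", "office-suite.img"),
    "x-fmt/999": Environment("Amiga", "UAE", "amiga-workbench.img"),
}


def environment_for(puid: str) -> Optional[Environment]:
    """Return the registered environment for a format identifier, if any."""
    return REGISTRY.get(puid)


def plan_rendering(file_name: str, puid: str) -> str:
    """Describe how a file would be rendered, or report a missing environment."""
    env = environment_for(puid)
    if env is None:
        return f"{file_name}: no known environment for {puid}"
    return (
        f"{file_name}: run on {env.platform} using {env.emulator} "
        f"with {env.software_image}"
    )


print(plan_rendering("report.doc", "fmt/40"))
print(plan_rendering("unknown.bin", "fmt/0"))
```

In a real framework the identifier would come from a format identification tool and the software image from an application database, as the notes describe.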
Geoffrey Brown: Developing Virtual CD-ROM Collections: The Voyager Company Publications
- publications of one specific publisher on CD, interactive applications for Mac; of 200 published, only about 50 are available now
- emulation on today's systems
- HDD snapshot directly in the emulator, i.e. it takes one click and is very fast
- SheepShaver emulator

Evaluation of a large Danish migration project
- before 1998 they had no formats mandated by law
- between 2005 and 2008 they introduced standards
- the evaluation concerns the set standards and migration to them in the national archive
- the evaluation was done for the funder
- between 2005 and 2008 they spent 30 person-years on migration; they had 10-15 people on it; 190 thousand USD invested; total costs 26 million USD
- it is not much data: what they actually migrated was about 1777 GB
- different parts of the archive: tapes, population data, data on CD-R, registries, and electronic data
- they could not read all files, especially on tapes
- 5 different types of tape
- some had to be rescued at great expense
Total cost of producing the preservation standard: 10 man-years, 12 thousand USD, including the manual and implementation recommendations

Pilot: planning and management of the project, and verifying the information packages

The goal of the pilot was to get a better budget and plan for the project

Some problematic data in old formats, such as old databases, need smart people who do repetitive work for a long time; they needed good knowledge management for this to be efficient

Migration method: they wrote requirements for the tool and a description of how manual migration should be done

Data preparation (restructuring data and registering metadata of the IPs) and preparing documentation of the migrated IPs

Software development: in-house development

They needed 50 person-years for 1

Conclusions

Migration of standard data is cheaper; migration from some standard tapes is cheaper, etc.

Most of the costs (80%) fell on non-standardized data, in producing the migration software. Developing a tool for migrating heterogeneous or non-standard data is the most expensive part.

What they learned: the old data had not been analysed sufficiently

Project management was loose; they lost money

Knowledge management: a good description of old data and all their types, generations, locations, etc. does not exist at NK and we will have trouble with that; migration of old data at NK will be problematic
Angela Dappert: robust migration workflow for offline media
- What is an archival object (nice slide): a CD is not an archival object, for them it is a hand-held carrier; better is a bit-stable object, which can have a backup, etc., leading to the archival object, which has further metadata: logical preservation
- a CD is not searchable, cannot be easily replicated, has a large manual overhead, and its rendering technology becomes obsolete very quickly
- Endangered Archives project: optical discs, CD-R, external HDs, tapes; 67 terabytes in total
- the offline hand-held carriers in the Endangered Archives project were very variable; they contained data with DRM, under copyright, and a number of such problems
- options they chose between for each data source:
- disk image: a single file that contains everything on the carrier
- or extraction of only some files
- how important is the carrier itself? We need some information about it; it may hold traces of deleted data and we may want to have them. They made disk images of everything possible: hybrid DVDs, audio discs that also held data, etc.
- which disk image should they use? Not a single disk image format for all data; a specific disk image format for each
- they did it with robots, disk copying robots; sometimes large-scale disk copying robots could not be used: they are good at producing CDs but not at ripping data from CDs
- they built their own application with disc stacks and some smaller robots; they used LIFO or FIFO; in the end they used FIFO, because LIFO had problems with the CD picker
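The difference between the two stack disciplines mentioned here is only the order in which discs leave the stack. A minimal sketch, with made-up disc labels and a hypothetical function name, not taken from the presented application:

```python
from collections import deque


def processing_order(stack, discipline):
    """Return the order in which discs are taken from a stack.

    FIFO takes the disc that was loaded first; LIFO takes the one loaded last.
    """
    if discipline not in ("FIFO", "LIFO"):
        raise ValueError("discipline must be 'FIFO' or 'LIFO'")
    queue = deque(stack)
    order = []
    while queue:
        order.append(queue.popleft() if discipline == "FIFO" else queue.pop())
    return order


loaded = ["cd1", "cd2", "cd3"]
print(processing_order(loaded, "FIFO"))
print(processing_order(loaded, "LIFO"))
```

With FIFO the output order matches the loading order, which also makes it easier to match finished images back to a loading list.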
At KB they thought through a fairly complex workflow, how to describe it, etc.

Each robot had its own PC

They had problems with a number of things, see the presentation

They did not find good SW for managing images, only command-line tools, but non-technical staff would make a lot of mistakes with them

It takes a lot of people before the content gets online

The staff must be well trained and flexible, but also able to do tedious jobs: systematic, patient

NOTE: important for NK, where the transfer of data from discs will also have to be handled and has already been handled before
KEEP project, Antonio Ciuffreda: towards integrated migration environment
- Disk transfer tool: converts a disk into an image file. The result also contains additional metadata about the file system, the MD5 of the file, etc.
- KEEP is producing a Transfer Tool Framework
- Magnetic media: disk transfer tools for floppy disks, commercial and open source
- Disk2FDI: commercial DOS tool; very accurate floppy disk image; it takes 1 hour and the result is very large, ten times larger than the floppy disk itself. Tested on about 2260 disks; they tested emulation
- Catweasel: commercial tool, a PCI card; runs on Linux and Win XP, has a GUI. High error rate but faster; image file quality was low
- Nibtools: free tool; G64 and D64, covers only C64; DOS, Win, Linux, but command line only. Needs a Commodore disk drive and special cables. They tested a few disks; about half of them then did not work in the emulator
- Optical media: they used 5 transfer tools, all on the same CDs, DVDs, and games
1. Alcohol 120%: commercial; can bypass DRM, etc.; supports Win systems. Of 13, 12 worked; 2 MB per second
2. Daemon Tools: commercial; several image file types (ISO, MDS, MDF); supports Win; three of 13 did not work
3. CloneCD: commercial; uses IMG or ISO; bypasses SafeDisc 3 protection; supports Win, has a GUI; 11 of 13 were OK
4. BlindWrite: commercial; supports DVD, Blu-ray, WM, Xbox, and other special discs. Generates ISO and some proprietary ISO image formats; one did not work; download speed
5. ImgBurn: does not read subchannel information into the image file (a film cannot be seeked, etc.); it is open source; generates DVD, BIN, CUE, IMG; Win and Linux; 4 did not work; it is fast
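The notes mention that a disk transfer produces an image file together with metadata such as the file system and an MD5 checksum. A minimal sketch of recording such metadata for an existing image file follows; the function names and the metadata fields are assumptions for illustration and do not reflect the KEEP Transfer Tool Framework itself.

```python
import hashlib
import os


def md5_of_file(path, chunk_size=1024 * 1024):
    """Compute the MD5 checksum of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as image:
        for chunk in iter(lambda: image.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def describe_image(path, file_system):
    """Collect basic metadata about a disk image file.

    The file system name is supplied by the caller; this sketch does not
    detect it from the image.
    """
    return {
        "image": os.path.basename(path),
        "size_bytes": os.path.getsize(path),
        "file_system": file_system,
        "md5": md5_of_file(path),
    }
```

Usage would be along the lines of `describe_image("disk001.img", "FAT12")`, where both the file name and the file system label are hypothetical. Storing the checksum next to the image allows a later fixity check of the transferred data.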
Conclusions

For magnetic media, commercial and non-commercial tools do not differ in performance; Disk2FDI is accurate but very slow. KEEP will use NibTools.

Optical: keep copy protection in mind; the tools have similar performance; there will always be errors in these images, between 30 and 10 percent; BlindWrite can handle game discs (Xbox, etc.). KEEP will use ImgBurn because it is open source.

For complex images BlindWrite is better.

Benefit for NK

Consider whether NK should actually run a project to migrate the content of CDs and DVDs to online media. The speakers present concrete experience with robotic processing and show what problems they had with designing the workflow, choosing the type of ISO image, etc.
BnF web archiving
They have three layers:

Harvest definition: collection

Harvest instance: crawling metadata
ARC files
A collection will be, for example, the selective "elections 2000" collection, then come the individual harvest instances, and then the ARCs. They collect logs, config, and reports. These are also stored in ARC: special metadata ARCs for each crawling instance.

PREMIS

Object, agent, event

Objects: 1. ARC files and metadata ARCs; 2. harvest instances

Harvest event in PREMIS: event = creation of content files

Events: reports as extensions of the event: host report and harvest report

Agents: administrator, SW, institutions, organizations that perform the harvest

ContainerMD

http://bibnum.bnf.fr/containerMD-v1/documentation/containerMD-v1.html

special metadata for items from the web archive

http://bibnum.bnf.fr/containerMD-v1

different SLAs for different types of material and for different data from different harvests; from the shared repository they expect various benefits; skills for different formats need not be duplicated within the institution

next year there should also be a JHOVE2 module for WARC
Memento: meta-search

Benefit for NK

Their web archiving model could be used at NK
Cost models: Danish national library + TU Wien
- Stephan Strodl, TU Wien: they have their own cost model, but only for small-scale automated preservation action cost, it seems
- Danish national library: they built their own model, which should be universal and usable anywhere
- they measured the cost of submission according to the PAIMAS standard
- when calculating costs they use OAIS and PAIMAS, map activities onto these models, and estimate process prices accordingly
- costmodelfordigitalpreservation.dk

Benefit for NK

Relevant to project 0136, which dealt with the options for estimating the costs of long-term storage
Meet RODA, a Full-Fledged Digital Repository for Long-Term Preservation
- originally a project of the Portuguese National Archive; we have been following it for several years. The RODA system is now supported by an independent company, which also partly develops it further. So far RODA supports only an archival metadata format (EAD), but further development should include library formats too
- RODA is now part of the SCAPE project, where the system can be further developed and scaled for use in massive production
- http://redmine.keep.pt/projects/roda-public

Benefit for NK

Follow further development; possibly also relevant for the INCAD + KNAV projects; for developing LTP for smaller institutions this could be an interesting alternative in the future
Report from a foreign business trip

Name of trip participant: Mgr. Andrea Fojtů (AF)
Department per organizational structure: 15 Department of Long-term Preservation of Digital Data (ODODD)
Position: 152 Department of Digital Repository Content Management
Purpose of trip: participation in the conference Aligning National Approaches to Digital Preservation
Place (city): Tallinn
Place (country): Estonia
Date (from-to): 22.-26.5.2011
Detailed schedule:
Monday 23.5.: conference registration, guided tour of the National Library; Keynote Address by Laura Campbell, Library of Congress, USA; Panel 1: Technical Alignment; Panel 2: Organizational Alignment
Tuesday 24.5.: Keynote Address by Gunnar Sahlin, National Library of Sweden; Panel 3: Standards Alignment; Panel 4: Legal Alignment; Breakout Sessions for panels 3 & 4
Wednesday 25.5.: Panel 5: Education Alignment; Panel 6: Economic Alignment; Breakout Sessions for panels 5 & 6; Synthesis; Closing remarks
Fellow travelers from NK: PhDr. Bohdana Stoklasová (BS), Ing. Tomáš Svoboda (TS)
Funding: IOP NDK
Goals of the trip: attending a conference with international participation; gaining contacts in the area of long-term preservation of digital documents and electronic legal deposit; deeper insight into the issues of long-term preservation of digital documents, (especially) in national libraries
Fulfilment of trip goals (specifically): The conclusions of the conference Aligning National Approaches to Digital Preservation largely mirror the conclusions of the workshop The Future of the Past: Shaping new visions for EU research in digital preservation (report available at http://cordis.europa.eu/fp7/ict/telearn-digicult/future-of-the-past_en.pdf), e.g. regarding the missing economic model for the commercial sector, which would treat long-term preservation as an integral part of all its processes. Contact was established with staff of the national libraries of Estonia and Finland (staff working on web archiving and on long-term preservation in general).
Programme and further details: The main goal of the conference was to align national approaches to long-term preservation of digital documents across all areas, from technical, organizational, educational, and standardization to economic and financial.
Materials brought back: conference programme, leaflets of exhibiting companies (Tessella, Equella, Guardtime) + other materials, notes
Date of report submission: 8.6.2011
Signature of the submitter:
Signature of supervisor:
Uploaded to the intranet:
Received by the international department:
Appendix to this report: conference notes in English

Appendix: Conference notes in English
Exploring What We Can Do Together: Strategic Alliance for International Collaboration, Laura Campbell
- 185 digital preservation partners in more than 25 countries (education, research, LAM)
- strategic goals: National Content Stewardship Network (national digital collection, technical architecture, public policy, outreach)
- NDIIPP Content Domain Map: a mind map of geospatial, audiovisual, image & text, and web content
- then & now: cognitive surplus vs digital libraries/digital preservation
- solution framework: actively working together, special interest groups, establishing a common index, international digital collection (freely available)
PANEL 1: Technical Alignment (The role of testing), Prof. Dr. Michael Seadle (Panel Head)
- to collaborate on requiring and implementing rigorous and independent tests
DNB Contribution to the Tallinn Alignment, Sabine Schrimpf
- key theme is infrastructure: a network of hardware and software that permits the operation of application software
- the question of interoperability is crucial (standards, technical specifications)
- SW elements/components of the DP infrastructure were compared to pallets on railroads
o Source: PARSE.Insight Roadmap 2020
- persistent ID resolvers, certification process
- kopal (ingest, koLibRI)
- nestor (German Network of Expertise)
- DP4lib (Digital Preservation as a Service): reduce dependency between components
o redundant storage at different locations
o koLibRI modules
- LuKII: set up as an economical LOCKSS network in Germany
- SHAMAN
- APARSEN: wants to bring coherence and cohesion to digital preservation research
o trends in DP research projects
o modular DP systems
o distributed as SOA
o elimination of technology dependencies
EDINA: The UK LOCKSS Alliance, Adam Rusbridge
- EDINA offers underlying technical support & coordination
- threats to digital stewardship
o failure (media, HW, SW, network, format obsolescence, natural disaster, economic/organizational failure)
o attack (insider/outsider)
o operator error
- source: Requirement to Digital Preservation
- projects PEPRS and PECAN help identify coverage and requirements for DP
Public testing, Michael Seadle
- traditional physical archiving relies heavily on trusted institutions
- distrust, not trust, needs to be the basis of digital archiving; testing therefore plays a key role
- goals of testing: demonstrate functionality, reveal weaknesses, provide data for planning improvements
- key issues for testing: integrity, authenticity (can the origin or genuineness be shown?), usability (can migration/emulation be demonstrated?), access, financial integrity
- DrWho (drwho1com)
- bit stream testing is the most important; authenticity and usability may be impaired
o the type of storage media, the number of copies + the frequency of checking and replacement get us to the relevant results
o no reliable metrics exist, however (what is an acceptable loss etc.)
- without well documented, peer reviewed, publicly available test results, librarians are buying archiving systems on faith
Presentation without a title, Andy Rauber
- evaluation vs testing vs benchmarking
- DP is at the stage of evaluation rather than testing, and far from benchmarking (a few tests, but nowhere near a definition of benchmarks)
o existing evaluations are not repeatable
o focus on the simple things
o building the frameworks before having clear test scenarios
- necessary to move towards comparative benchmarking
- what is needed: a commitment to a culture of benchmarking and comparative evaluation; an understanding of what we want to benchmark; benchmark data + ground truth; measurement scales and measures that remain constant; knowledge bases to collect these
Organizing digital preservation on an international level, Michelle Gallinger (NDIIPP)
- focus on a national DP agenda
- community driven, action oriented (National Digital Stewardship Alliance)
o present a distributed national digital collection for the benefit of citizens
The European Research Arena, David Giaretta
- technologies: GEANT, EGEE/EGI
- EU research projects: TIMBUS, BLOG4EVER, SCAPE, ENSURE, APARSEN, ARCOMEM, WF4EVER
o SCIDIP-ES (2011-2014)
- Alliance for Permanent Access (APA): formed as a legal entity 3 years ago
o opportunities for networking
- ISO 16363: Audit and Certification of Trustworthy Digital Repositories
- ISO 16919: Requirements for Bodies Providing Audit and Certification
Observation from the MetaArchive Cooperative program, Martin Halbert
- distributed DP programs differ from other programs
o replication of content, distribution of these replicated copies to distinct geographical locations, and a network organization to connect these replicated copies
- MetaArchive: established in 2003, funded by NDIIPP
o seeks to foster broader awareness of digital preservation issues
- IIPC: members are all institutions that focus on WA
o 39 members: national and university libraries + other organizations (Internet Archive)
o ISO standard WARC format for web archives + Heritrix and NutchWAX
o growing membership (Africa & South America)
DAY 2
Keynote address: International and National Collaboration in the digital age, Gunnar Sahlin
- 2012: a new law for e-deposit
- Samsök (search together) in 2005 (upgrade, new system)
- Swepub and long-term preservation
- Consortium of the Swedish research libraries for licensing e-journals and databases (ICOLC)
- Open Access and e-publishing (all universities have their repositories for e-pubs)
- NL aggregator for the Europeana: TEL, Apres, Athena, EUscreen
o common system for the preservation of digital materials
o common search portal for materials from the Swedish National Library and the Swedish National Archive
Raivo Ruusalepp
- standards: RAC, DSA, CIDOC (CRM), PAIMAS, ISA (DG), DDI
- use of information security standards for digital preservation
- information security: administrative and technical (physical = data security vs IT = communication)
- company-implemented security measurement with typical cyber crime scenarios
- survey of security
o provision for information security in national legislation and development plans (12 of the respondents: ISO 27000 series; only 2 formal audits, the rest are looking into it; ½ of the respondents do not use standards or formal measurements to control information security)
o IT & disaster plan: 65 (data recovery from the off-site location tested: 0)
- alignment
o better use of community standards for information security and preservation
o agreement on security requirements
Standards based approach to preservation planning, Matthew Woollard
- ISO 27001: very expensive implementation, 100 000 pounds
- Basic: Data Seal of Approval Guidelines (helps you understand your business better)
- Audit and Certification of Trustworthy Digital Repositories
- Memorandum of Understanding to Create a European Framework
- ISO 16363 external; DSA or ISO 16363 self-audit
Best Practices & Standards, Bram van der Werf
- self-assessment: trust; audits/certification: trust
- ISO 30300 (draft): Record Management
PANEL 4: Legal Alignment
Legal deposit & Web Archiving, Adrienne Muir
- legal deposit provisions: purpose, scope, deposit mechanisms, roles & responsibilities & liability, access provisions, sanctions
- implementation: definitions, scope (offline/online, freely available/paywall), technology neutral, incremental
- legal deposit vs voluntary (interim/hybrid approach, model agreements and licenses, flexibility)
- other legal issues: intellectual property rights (preservation, access), unlawful material, privacy/data protection
- voluntary approaches have disadvantages but may be necessary and can be useful
Breakout Session
- standards bring alignment by themselves, but only if you use them to the full, not halfway
- depends on the community (users): enforceable or voluntary compliance (standards)
- next steps for standards alignment
o corpora as benchmarks
o export/import completeness
o educational standards
o validation tools
o accredited training courses to accredit auditors
o framework standards
DAY 3
PANEL 5: Educational Alignment
- key elements of the DCC Curation Lifecycle Model
- Framework for the Education Alignment (USA: grad programs for digital preservation, new models for grad programs, internship programs (diverse knowledge), workshops)
o sharing tools
o national programs
- related issues to digital preservation
o nature of costs and business models
o strategies for selection & appraisal
o ground roles and responsibilities
o effectiveness and demand for services
- focus of the panel: factors influencing the actual sustainability of a digital archive
- 2 considerations: collaboration + user demand
- challenges + gaps: span national boundaries, public + private funding, education, exportation, DP certification, competition, funding gaps, policy, selection criteria, roles & responsibilities, standards
- Magazzini Digitali: e-legal deposit in Italy
- PADI: a failed project (discontinued in 2010)
Presentation without a title, Neil Grindley
- how much does it cost to manage information? what institutional financial strategies are required to facilitate effective preservation? what general economic frameworks are required to enable information to persist and be accessible?
- JISC 2010: Infrastructure for Education and Research Programme
- archival storage and preservation activities constitute a very small proportion of the overall costs: 15, against 31 for access and 55 for outreach, acquisition and ingest
o approx. 333 Euros for a set of 1000 records
- KRDS2 p. 83: future tool development supporting automation of ingest
Sustainable Preservation in North America: ADPNet & Friends, Aaron Trehub
- solution: distributed digital preservation (in at least 3 copies vs LOCKSS 6 copies)
- DDP + LOCKSS = PLN; open SW developed at Stanford
- MetaArchive, COPPUL (Canada), ADPNet (www.adpn.org)
- - 80TB -
A Community Driven Micro Services Architecture Supporting Long Term Digital Preservation Mark Evans and Bill Steel Tessella Inc (USA) and Robert Sharpe James Carr Alan Gairey and Jonathan Tilbury Tessella plc (UK)
s NDIIPP v micro-services
spojeno do procesu- lze pak libovol
- - - -
-
workflow - -
- bit-streamu - cloudu (S3
httpawsamazoncoms3) SaaS funkcionalitu bude brzy ti tenancy
funkcionalitou policy apod
Pozn z USA
v 44TB SIP v testu (10MB JP2 soubory)
komunita SDB - - vznik 2008 - -
System
Lisa LaPlant and Blake Edwards US Government Printing Office (USA) FDsys OAIS dig MODS - z
<part>2</part>
DMD data model definition z
httpwwwgpogovfdsyssearchpagedetailsactionst=pragueampgranuleId=amppackageId=DCPD-200900228
----------------------------------------- Preservation Starts from the Beginning Michael Wash US Department of Transportation (USA)
Autographic Kodak 1916 na druhou
following the No. 3A Autographic Kodak Special of 1916, which was the first rangefinder camera. It had a Kodak Anastigmat f/6.3 lens and a Kodamatic shutter with speeds from 1/2 to 1/200 sec plus bulb and time mode. httpcamerapediawikiacomwikiNo_1A_Autographic_Kodak_Special Kodak Advantix
zastavil
Colorite A Flexible Cross-Platform Software Solution for Automatic Image QualityAnalysis Using Arbitrary Targets Henrik Johansson National Library of Sweden (Sweden)
- a - -
o Target se detekuje automa se aby tento proce
o TIFF JPEG JP2 PNG o o -of-art feature based image matching algoritm ImageMagic
o Alg
t o o v budoucnu vypnout aby se proces urychlil pro BATCH
proce o
Henrika Johanssona What if the Image Quality Analysis Rates My D Dietmar Wueller Image Engineering (Germany)
- Golden thread - UTT targer -
o
-lens reflex
Establishing Resolution Requirements for Digitizing Transmissive Content A Use Case Approach Michael Stelmach Library of Congress Don Williams Image Science Associates LLC and Steven Puglia National Archives and Records Administration (USA)
-
- Based on 10 SFR limiting resolution criteria how much the image information will be captured
o first half of the 20th century: 1200-1600 PPI
o second half of the 20th century: up to 2800 PPI
Digitise More Pay Less Optimising the Workprocess for both Heritage Institute and Imaging Provider Olaf Slijkhuis Pictura Imaginis (the Netherlands)
- ument kvalita parametry atd - -
- trolu
obrazu
Debate with FamilySearch, 20.5.2011 -------
also on the topic of Digital Preservation
will have to provide free of charge
and synchronize
effort to cooperate with archives and libraries on their part
CO-FINANCED FROM THE EU STRUCTURAL FUNDS (EUROPEAN REGIONAL DEVELOPMENT FUND) THROUGH THE IOP
Business trip report
Project: Creation of the National Digital Library
CZ 10611000706386
Name and surname of the trip participant: Jan Hutař
Workplace according to the organizational structure: ODF 81
Position: head of department
Reason for the trip: attendance at the iPRES 2011 conference
Place, city: Singapore
Place, country: Singapore
Date (from-to): 30.10.-5.11.2011
Detailed schedule:
30.-31.10. flight Prague > Dubai > Singapore
1.11. start of the conference, tutorials
2.11.-4.11. conference
4.-5.11. return flight Singapore > Dubai > Prague
Fellow travellers from NK: Mgr. Marek Melichar (funded from project 0136)
Funding: IOP, Creation of the National Digital Library
Relation to the project: obtaining new information about digital preservation and about projects in other libraries; consultations with colleagues and companies
Trip objectives: see relation to the project; use all outputs for the planning and running of the NDK project and for future handling of digital preservation in NK/NDK
Fulfilment of trip objectives: fulfilled, see the detailed notes below and the proceedings on SPS
Further details: SUMMARY AND BENEFIT FOR THE NDK PROJECT
- a noticeable rise of emulation as an approach to long-term preservation (in previous years it was migration) > the two approaches will apparently complement each other
- a shift towards the preservation of complex data, databases etc.; NK does not address this yet
- many papers are applicable to NK and NDK as well (web archiving and preservation at the National Library of France, audits, emulation, information about the SDB system (Tessella) and about the RODA system, certification, see Rouchon, etc.)
- information about problems and solutions when using tools such as JHOVE, PRONOM and others in a real environment
- 2 papers on backing up optical discs, a current problem in NK as well; ideally follow the procedures described in the proceedings
- a clear need for international cooperation and for adherence to standards so that such cooperation is possible
- for details see below
Support of project publicity: NA
Related materials
Material: Place of storage
conference proceedings: SPS, folder with business trip reports
Date of report submission: 15.11.2011
Signature of the person submitting the report
Date, signature
Signature of the superior: 15.11.2011
Posted on the intranet
Accepted by the international department
Seamus Ross: digital curation and preservation
- preserving data sets: use of statistical databases and research data vs preservation of textual information
- personal data: no longer just photos in a box (Flickr etc.)
- banks: lots of personal data with future use for historians; it must be kept, and institutions are doing so
- see mckinsey.com, big data, full report (pdf)
- so why do DP (slide 16): future generations expect it; so that historians and scientists have some sources; a legacy about the present for the future; information ecosystem to enable storytelling
- emphasis shifting from the preservation of textual information to the preservation of complex databases
A capability model for DP, Ch. Becker et al.
- research within the SHAMAN and SCAPE projects
- SoS: systems of systems
- 3 kinds of systems (UML)
- DPS with DP as a functional requirement
- SoS: business system, a system within a system; the data then flow into the DPS
- DPS where DP is not a functional requirement but the system does it anyway (DP-ready system): a business system with DP functionality
- but how to get DP into enterprise systems: a model for implementing DP in any system, within the SHAMAN project: capability based reference architecture
- governance, business and technical (operation) capability: a basis for decisions and for assessing the current state
- capability maturity model (CMM): processes of assessment and improvement, as in SW development
Olivier Rouchon: certification and quality at CINES
- they store theses, digitized items, multimedia, documents, scientific data sets
- a data centre for the whole of France
- they have format experts (XML), 11 people
- 15 TB of data
- certification: national law; CINES is the national centre for DP of theses; they have a department, people and money for it; procedure and preparations see below
- preparation for certification: testing DRAMBORA, DSA, TRAC, ISO 16363 and ISO 16919
- step: 2009 DRAMBORA audit, 2 risk reviews per year, tracking how the risks are being resolved
- step: formalization of business processes, 14 processes according to ISO 9001
- management, operational and support processes (presented at iPRES 2010)
- 2009: external pre-audit, 2 people, 19 man-days, based on all available standards, carried out by checking documentation and by interviews
- 2010: SIAF audit, 4 months; the National Archives of France carry it out for every archive that stores public data; they audit every 3 years; the report had 800 pages
- 2010: Data Seal of Approval, part of the EU framework for audit and certification of trusted repositories (MoU between three certification initiatives)
- 2011: within the APARSEN project they also did ISO 16363; they went through that audit together with DANS and UKDA
- first internal, January and April 2011 (60 man-days), then external, 12 experts (KB, BL, NASA etc.) in June 2011 (3 days)
- MoU: DSA > ISO 16363 as the second step (internal) > ISO 16363 extended
- an audit gets faster the more of them you do, i.e. if it is done regularly it is not so time-consuming
- the National Library of New Zealand passed TRAC certification in autumn 2011
Andreas Rauber: impact of preservation actions on repositories
- what happens to the repository itself
- repository simulation: RepoSim
- meant for analysis and for testing migration: what happens when files grow in size, what if the repository holds more types of formats, etc.
- RepoSim: a flexible simulator, irregular patterns
- so far an internal version; Hibernate, Java, MySQL
- you can specify which formats the repository accepts and their description, ingest, settings of hypothetical tools (mainly for migration), and rules for preservation actions (migration to which format, which versions, which files, how many times; rules + filters)
- a virtual migration can be run; it produces charts that show how long it will take etc.
- what to migrate, how, to what and over what period: it all runs virtually and we see the result
- good for IT and HW planning
- good for planning various scenarios, comparing them with the expected development, and planning HW growth and investments
- still to be added: the option to define deletion policies, reports etc.
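To make the idea of a "virtual migration" concrete, here is a minimal sketch in Python. It is not RepoSim (which the notes describe as an internal Java/Hibernate/MySQL tool); the class `MigrationRule`, the function `simulate`, and all numbers (size factor, tool throughput) are hypothetical values chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class MigrationRule:
    """A hypothetical preservation rule: migrate one format into another."""
    source_format: str
    target_format: str
    size_factor: float        # assumed output size relative to input size
    throughput_mb_s: float    # assumed speed of the hypothetical migration tool
    keep_original: bool = True


def simulate(holdings_mb, rule):
    """Apply the rule virtually.

    holdings_mb maps a format name to the total megabytes held in that format.
    Returns the projected holdings and the estimated duration in hours.
    """
    affected = holdings_mb.get(rule.source_format, 0.0)
    projected = dict(holdings_mb)
    if not rule.keep_original:
        # Originals are dropped after migration in this scenario.
        projected.pop(rule.source_format, None)
    projected[rule.target_format] = (
        projected.get(rule.target_format, 0.0) + affected * rule.size_factor
    )
    hours = affected / rule.throughput_mb_s / 3600
    return projected, hours


# Illustrative scenario: 3.6 TB of TIFF migrated to JP2 at 100 MB/s.
holdings = {"TIFF": 3_600_000.0, "PDF": 1_000.0}
rule = MigrationRule("TIFF", "JP2", size_factor=0.5, throughput_mb_s=100.0)
projected, hours = simulate(holdings, rule)
print(projected, hours)
```

Running several such rules with different assumptions gives the kind of comparison of scenarios for HW and investment planning that the notes mention.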
José Barateiro: Risk assessment in DP of e-science data and processes
- DP as risk management
- ISO 31000: definition of risk management, similar to DRAMBORA
- there are many standards for risk management
- the ISO 31000 methodology broken down into individual steps
- TIMBUS project, http://timbusproject.net; one of the partners is SAP (Germany)
Mad talks
- open source SW for LTP: RODA is back; it is being developed within the SCAPE project; new functions, plans for development and for building a user community
- 4 posters on emulation: emulation within KEEP, emulation for library reading rooms, OPF ecosystem, registry
- TOTEM: a metadata standard for describing the technical environment for emulation
ANDS, Ross Wilkinson
- data centres in Australia, at least 3 for different areas of life
- ANDS has existed for almost 3 years; funding from the Australian government
- a huge amount of data will never be used or read by a human, only mined by automated processes
- research data must be stored and preserved because it may no longer be possible to recreate them; they must be kept so that they can be reused, so that new conclusions can be drawn from them, and so that scientists have them available
- this has to be done in cooperation; a single institution cannot do it alone
- who deals with the storage of scientific data in the Czech Republic? The Academy of Sciences? CESNET?
- similar data centres also exist in Great Britain
Rob Sharpe: Considerations for High Throughput Digital Preservation
A presentation by Tessella on their performance testing of ingest into SDB at FamilySearch.
- SDB has been developed since 2002, when the first customer was the National Archives (UK)
- new customer: the UK Parliament
- test with FamilySearch
- 20 TB ingested per day; scanned materials; workflow with antivirus, characterization (PRONOM, JHOVE) etc.
- 1 package is roughly 1 GB; 20 thousand packages per day
- 2 Dell PowerEdge R710 servers, total price at most 20,000 pounds
- the limiting factor turned out to be the read speed of the disks on which the data are stored at the start of ingest; they therefore needed 130 parallel disks (50 thousand pounds)
- storing to tape was also slow; they therefore needed 8 parallel tape writes (30 thousand pounds)
- storage costs 100 pounds per TB
- 7.3 PB per year
- conclusion: writing and reading are slow; tools such as JHOVE and PRONOM are fast enough; storage also turned out to be a high cost
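The figures in the list above can be cross-checked with simple arithmetic. The following Python sketch assumes decimal units (1 TB = 1,000 GB = 1,000,000 MB) and a single stored copy; the function name `ingest_figures` and its parameters are illustrative, not part of the presentation.

```python
SECONDS_PER_DAY = 24 * 60 * 60


def ingest_figures(tb_per_day=20, package_gb=1, price_per_tb=100):
    """Derive sustained rate, package count, yearly volume and storage cost.

    Assumes decimal units and one stored copy; price_per_tb is in pounds.
    """
    return {
        # Average rate needed to keep up with the daily volume.
        "mb_per_second": tb_per_day * 1_000_000 / SECONDS_PER_DAY,
        # Number of packages per day at the stated package size.
        "packages_per_day": tb_per_day * 1000 / package_gb,
        # Volume accumulated over a 365-day year, in petabytes.
        "pb_per_year": tb_per_day * 365 / 1000,
        # Cost of storing one year of ingest at the stated price per TB.
        "storage_cost_per_year": tb_per_day * 365 * price_per_tb,
    }


for name, value in ingest_figures().items():
    print(f"{name}: {value:,.1f}")
```

With the defaults this reproduces the 20 thousand packages per day and the 7.3 PB per year quoted in the notes, and shows that 20 TB per day corresponds to an average of roughly 230 MB/s.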
For the ingest of data from the FamilySearch project they needed to guarantee a throughput of 20 TB of data per day while keeping adequate data-processing procedures as required by OAIS and by the customer. The aim of the project was to identify the bottlenecks of ingesting large volumes of data.
Processes such as generating or verifying hashes, identifying formats and extracting technical metadata usually require a fast storage system when volumes are large. In the FamilySearch project they want to ingest into SDB (content acquisition, content preparation, ingest, fixity check, content/metadata integrity check, characterization, i.e. format identification and validation and extraction of technical metadata) at up to 700 MB per second.
They looked at how to parallelize such a massive workflow efficiently while minimizing costs. According to their findings, parallelization makes it possible to work around performance problems of tools such as DROID and JHOVE; contrary to their expectations, overall software performance was not a problem. The bigger problems are in the HW, which has to be able to write fast enough.
That is, the bottleneck was in the HW and in moving data from place to place rather than in the performance of the digital preservation tools.
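As an illustration of how a fixity check can be parallelized, here is a minimal Python sketch. It is not Tessella's SDB workflow; the function names, the choice of SHA-256 and the use of a thread pool are assumptions made for the example. Threads are a reasonable fit here because the work is dominated by disk reads, and Python's `hashlib` releases the global interpreter lock while hashing larger buffers.

```python
import hashlib
from concurrent.futures import ThreadPoolExecutor
from pathlib import Path


def sha256_of(path, chunk_size=1024 * 1024):
    """Return the SHA-256 hex digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def check_fixity(manifest, workers=8):
    """Verify files against expected digests in parallel.

    manifest maps a file path to its expected SHA-256 hex digest.
    Returns the list of paths whose current digest does not match.
    """
    paths = list(manifest)
    with ThreadPoolExecutor(max_workers=workers) as pool:
        actual = list(pool.map(sha256_of, paths))
    return [path for path, digest in zip(paths, actual) if digest != manifest[path]]
```

A call such as `check_fixity({Path("package/file1.tif"): "<expected digest>"})` (the path is a placeholder) returns an empty list when every file matches. As the notes point out, adding workers only helps until the underlying storage becomes the limit.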
Benefit for NK
No need to fear the performance of SW such as DROID or JHOVE.
Ross King: Evolving domains, problems and solutions for LT DP
- information about projects such as SCAPE
- programme: httpcordiseuropaeufp7icttelearn digicultreport research digital preservation_enpdf (Stephan Strodl, Petar Petrov, Andreas Rauber; Vienna University of Technology, Austria): a nice timeline of preservation projects, a white paper about the past of European DP
- funding spent on DP research is gradually growing; projects and funding alone will not solve anything
- ARCOMEM: archiving of web archives, socially driven web preservation model
- social web analysis
- archive enrichment
- ENSURE: evaluation between cost and value, automation of the preservation cycle; testbeds: healthcare, clinical trials, financial services
- SCAPE
- preservation planning and action workflows and how to make them scalable
- building an infrastructure for scalable preservation actions
- development of a policy based preservation planning tool with automatic preservation watch
- 3 testbeds: WA, large scale repositories, research data sets
- all projects will produce prototype SW
- digital lifecycle approach
- preservation planning plays a role in all of these projects, together with virtualization
- slide with trends in DP over recent years
- Research on Digital Preservation within projects co-funded by the European Union in the ICT
- ENSURE
- SCAPE
- Wf4Ever, httpwwwwf4ever projectorgabout
- TIMBUS: SW alone is not enough; the project focuses on the context of organizations; LTP is not only about objects but also about services etc.
- TOTEM
Benefit for NK
Follow projects in the area of long-term preservation of digital data. Recent EU projects such as SCAPE will speed up the development of concrete tools for long-term preservation of digital data.
Record keeping in temporary command settings, Erik Borglund
- preservation of documentation on crisis situations arising from the activities of the police etc.
- how to capture the context: flipcharts, videos and minutes can be kept, but what about the context
- analogue documents are not a problem; the problem lies with digital items and conversations
- the national archive should take care of this, but it accepts only paper documents or, for example, photos from the meeting place
- question: is archiving file material the same as archiving the course of a meeting in digital form
Web archiving session
BnF
- 200 TB of web-archived data
- 15 million ARC files; they have to be characterized and validated, which is time-consuming
- stored in a shared repository
- SPAR (the LTP system of the National Library of France) has a capacity of 16 PB
- they use JHOVE2 for characterization and are building a module for ARC files
- they do not want to characterize and validate the content of the ARC files, only to identify formats
- PREMIS in METS would be too long, so they will record only metadata at the level of the information package (AIP) that are the same for the whole package; that is, a property is stated once and then only the information about which files it applies to is added, instead of repeating the information for every file
- they created a special metadata format
- so they can ask the LTP system "give me all information packages that contain format XY" etc.; there is no need to index the metadata of the contents, which would take a long time; they use the same approach for digitized books
- different DP policies and levels of validation for different types of WA data: complete harvests vs thematic harvests
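The idea of stating a property once per package and listing the files it applies to can be sketched as a simple grouping. The Python below is only an illustration of the principle; BnF's actual metadata format is not described in the notes, and the names `group_by_property` and `packages_containing` as well as the sample data are hypothetical.

```python
from collections import defaultdict


def group_by_property(file_properties):
    """Turn per-file metadata into package-level statements.

    file_properties maps a file name to a dict of property -> value.
    Returns a dict mapping (property, value) to the sorted list of files
    that share it, so each distinct value is recorded only once.
    """
    grouped = defaultdict(list)
    for name, properties in file_properties.items():
        for key, value in properties.items():
            grouped[(key, value)].append(name)
    return {statement: sorted(names) for statement, names in grouped.items()}


def packages_containing(packages, key, value):
    """Return the ids of packages holding at least one file with key == value."""
    return sorted(
        package_id
        for package_id, grouped in packages.items()
        if (key, value) in grouped
    )


# Hypothetical package contents.
aip_1 = group_by_property({
    "index.html": {"format": "text/html"},
    "about.html": {"format": "text/html"},
    "logo.png": {"format": "image/png"},
})
aip_2 = group_by_property({"photo.png": {"format": "image/png"}})
print(packages_containing({"AIP-1": aip_1, "AIP-2": aip_2}, "format", "text/html"))
```

A query of the kind quoted in the notes ("all packages that contain format XY") then only needs the package-level statements, not an index of every file's metadata.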
NL NZ
- 2 harvests, 20 TB in total
- they are dealing with metadata: how much metadata is a lot and how much is too little
- library policy says that as much metadata as possible must be stored, but that would be a problem in terms of metadata size
- for selective web harvesting they have a finished WCT workflow; everything is catalogued
IA
- 16 billion URLs
- the oldest from 1996
- growth is 3 TB per day, 1 PB per year
Euan Cochrane, Dirk von Suchodoletz: Replicating Installed Application and Information Environments onto Emulated or Virtualized Hardware
- capturing and preserving a complete environment on emulated HW
- e.g. take the desktop environment of the prime minister and store it in an archive
- display problems
- computer forensics
- an option for the preservation of scientific data and records
- the whole thing is about how to replicate an HDD and run the environment on it in a virtual environment
- solution
- they pulled HDDs out of several old PCs > identify the HW requirements (HDD analysis > automatic estimate of the requirements; this information is part of every PC environment) > select emulation/virtualization SW (tool registry such as TOTEM from the KEEP project) > convert the HDD into a disk image suitable for emulation > try to boot the disk image on the emulated HW > add drivers
- problems with licences, personal data protection and authenticity (20 things change, colours etc.)
- QEMU, SPARC processor emulator
Klaus Rechert: Remote Emulation for Migration Services in a Distributed Preservation Framework
- use of emulation as a tool for migration
- migration tools are often not available for certain formats
- the digital object is put into the emulated environment (virtual machine), where we see it within the emulated system; we can open it in the original or another suitable application, save it in a different format and store it again in the virtual machine
Bram Lohman: Emulation as a Business Solution: the Emulation Framework (KEEP project)
- emulation framework: 7 emulators, 6 platforms (x86, Amiga and others), 23 file formats
- a solution for managing emulation tools
- setup of emulation processes
- an environment that contains emulators; if we load an application or a file into it, it should run as in the original environment
- the environment also contains a tool that, based on PRONOM, shows what format a file is and what environment is needed to run it; that environment can be prepared right away and the file run in it
- it loads the SW image from the OPF application database, which is being built
Geoffrey Brown: Developing Virtual CD-ROM Collections: The Voyager Company Publications
- publications of one particular publisher on CD, interactive applications for Mac; of 200 published titles only about 50 are available now
- emulation on today's systems
- HDD snapshot directly in the emulator, i.e. it takes one click and is very fast
- SheepShaver emulator
Evaluation of a large Danish migration project
- before 1998 they had no formats set by law
- between 2005 and 2008 they introduced standards
- the evaluation covers the defined standards and the migration into them at the national archive
- the evaluation was done for the body that financed it
- between 2005 and 2008 they spent 30 person-years on the migration; they had 10-15 people on it; 190 thousand USD was invested; total costs 2.6 million USD
- in reality they did not migrate much data, about 1777 GB
- different parts of the archive: tapes, population data, data on CD-R, registries and electronically filed data
- they could not read all files, especially on tapes
- 5 different types of tapes
- some had to be rescued at great expense
Total cost of producing the preservation standard: 10 man-years, 12 thousand USD, including the manual and implementation recommendations.
Pilot: project planning and management, and verification of the information packages.
The aim of the pilot was to obtain a better budget and project plan.
Some problematic data in old formats, such as old databases, need smart people who do repetitive, long-running work; they needed good knowledge management to make this efficient.
Migration method: they wrote requirements for a tool and a description of how manual migration should be done.
Data preparation (restructuring data and registering the metadata of the IPs) and preparation of documentation for the migrated IPs.
Software development: in-house development.
They needed 50 person-years for 1
Conclusions
Migration of standard data is cheaper, migration from some standard tapes is cheaper, etc.
Most of the costs (80) fell on non-standardized data, in producing the migration software. Developing a tool for migrating heterogeneous or non-standard data is the most expensive part.
What they learned: they had not analysed the old data sufficiently.
Project management was loose; they lost money.
Knowledge management: a good description of old data and of all their types, generations, locations, etc. does not exist at our institution and will cause us difficulties; migration of old data at NK will be problematic.
Angela Dappert: robust migration workflow for offline media
- What is an archival object (nice slide): a CD is not an archival object; for them it is a hand-held carrier
- better is a bit-stable object, which can have a backup etc., leading to an archival object that carries further metadata (logical preservation)
- A CD is not searchable, cannot be replicated easily, has a large manual overhead, and its rendering technology becomes obsolete very quickly
- Endangered Archives project: optical disks, CD-R, external HD, tapes, 67 terabytes in total
- OFFLINE hand-held carriers in the Endangered Archives project were very variable; they contained data with DRM, under copyright, and a number of similar problems
- Options they chose between for each data source:
- Disk image: a single file containing everything that is on the carrier
- Or extraction of only some files
- How important is the carrier itself? We need some information about it; it may hold traces of deleted data, and we may want to keep them. They made disk images of everything possible: hybrid DVDs, audio discs that also held data, etc.
- Which disk image format should they use? Not one disk image format for all data; each special disk gets its own disk image format
- They did it with robots (disk copying robots); large-scale disk copying robots sometimes could not be used: they are good at producing CDs but not at ripping data from CDs
- They built their own application with disk stacks and some smaller robots; they tried LIFO and FIFO and in the end used FIFO, because LIFO had problems with the CD lifter
At KB they thought through a fairly complex workflow, how to describe it, etc.
Each robot had its own PC.
They had problems with a number of things; see the presentation.
They did not find good SW for managing the images, only command-line tools, with which non-technical staff would make a lot of errors.
It takes a lot of people before the content gets online.
Staff must be well trained and flexible, but also able to do tedious jobs: systematic, patient.
NOTE: important for NK, where the transfer of data from discs will also have to be addressed, and has already been addressed.
KEEP project, Antonio Ciuffreda: towards an integrated migration environment
- Disk transfer tool: converts a disk into an image file. It also records further metadata about the file system, the MD5 of the file, etc.
- KEEP is producing a Transfer Tool Framework
- Magnetic media: disk transfer tools for floppy disks, commercial and open source
- Disk2FDI: commercial DOS tool; very accurate floppy disk image; it takes 1 hour and the result is very large, ten times larger than the floppy disk itself. Tested on about 2260 disks; emulation was tested
- Catweasel: commercial tool; it is a PCI card, runs on Linux and Win XP, has a GUI. High error rate but faster; image file quality was low
- Nibtools: free tool; G64 and D64; covers only the C64; DOS, Win, Linux, but command line only. Needs a Commodore disk drive and special cables. They tested a few disks; about half of them then did not work in the emulator
- Optical media: they used 5 transfer tools, all on the same CDs, DVDs and games
1 Alcohol 120: commercial, can bypass DRM etc., supports Win systems. Of 13, 12 worked; 2 MB per second
2 Daemon Tools: commercial, several image file types (ISO, MDS, MDF), supports Win; three of 13 did not work
3 CloneCD: commercial, uses IMG or ISO, bypasses SafeDisc 3 protection, supports Win, has a GUI; 11 of 13 were OK
4 Blindwrite: commercial, supports DVD, Blu-ray, WM, Xbox and other special discs. Generates ISO and some proprietary formats; one of the ISO images did not work; download speed
5 ImgBurn: does not read subchannel information into the image file (a film cannot be seeked, etc.); it is open source; generates DVD, BIN, CUE, IMG; Win and Linux; 4 did not work; it is fast
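The kind of record a disk transfer tool keeps next to an image file (file-system information plus an MD5 of the image) might look like the sketch below. It is an assumption-laden illustration, not the KEEP Transfer Tool Framework: the function name, the field names and the JSON layout are invented here.

```python
import hashlib
import io
import json
from typing import BinaryIO, Dict, Union


def describe_image(stream: BinaryIO, image_name: str, file_system: str,
                   chunk_size: int = 1 << 20) -> Dict[str, Union[str, int]]:
    """Read a disk image in chunks and return a small metadata record.

    The record layout (image_file, size_bytes, md5, file_system) is a
    hypothetical example, not a format defined by any of the tools above.
    """
    md5 = hashlib.md5()
    size = 0
    while True:
        chunk = stream.read(chunk_size)
        if not chunk:
            break
        md5.update(chunk)
        size += len(chunk)
    return {
        "image_file": image_name,
        "size_bytes": size,
        "md5": md5.hexdigest(),
        "file_system": file_system,
    }


if __name__ == "__main__":
    # In-memory stand-in for an image file produced by a transfer tool.
    fake_image = io.BytesIO(b"\x00" * 4096)
    record = describe_image(fake_image, "disk0001.iso", "ISO 9660")
    print(json.dumps(record, indent=2))
```

Reading in chunks keeps memory use flat even for DVD-sized images, and the stored MD5 lets a later fixity check confirm that the image has not changed since transfer.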
Conclusions
For magnetic media, commercial and non-commercial tools do not differ in performance; Disk2FDI is accurate but very slow. KEEP will use NibTools.
Optical media: copy protection must be considered; the tools perform similarly; there will always be errors in the images, between 30 and 10 percent; BlindWrite can handle game discs (Xbox etc.). KEEP will use ImgBurn because it is open source.
For complex images Blindwrite is better.
Benefit for NK
Consider whether NK should indeed run a project to migrate the content of CDs and DVDs to online media. The presenters describe concrete experience with robotic processing and show what problems they had in devising the workflow, choosing the type of ISO image, etc.
BNF: web archiving
They have three layers:
Harvest definition: collection
Harvest instance: crawling metadata
ARC files
A collection is, for example, selective, elections 2000; then come the individual harvest instances and then the ARCs.
They collect the logs, config and report.
These are also stored in ARC: a special metadata ARC for each crawling instance.
PREMIS
Object, agent, event
Objects: 1. ARC files and metadata ARCs; 2. harvest instances
Harvest event in PREMIS: event = creation of content files
Events: reports as extensions of the event (host report and harvest report)
Agents: administrator, SW, institutions and organizations that perform the harvest
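The object / agent / event mapping noted above can be pictured with a few plain data classes. This is a loose sketch of the structure only; the class and field names are made up for illustration and do not reproduce the PREMIS schema or BNF's actual implementation.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class Agent:
    name: str
    kind: str  # e.g. "administrator", "software", "institution"


@dataclass
class ArchiveObject:
    identifier: str
    kind: str  # "arc_file", "metadata_arc" or "harvest_instance"


@dataclass
class HarvestEvent:
    event_type: str
    objects: List[ArchiveObject]
    agents: List[Agent]
    # Reports attached as extensions of the event (host report, harvest report).
    reports: Dict[str, str] = field(default_factory=dict)


def build_harvest_record(instance_id: str, arc_files: List[str],
                         metadata_arc: str, agents: List[Agent]) -> HarvestEvent:
    """Assemble one harvest event linking its objects and agents."""
    objects = [ArchiveObject(instance_id, "harvest_instance")]
    objects += [ArchiveObject(name, "arc_file") for name in arc_files]
    objects.append(ArchiveObject(metadata_arc, "metadata_arc"))
    return HarvestEvent(
        event_type="creation of content files",
        objects=objects,
        agents=agents,
        reports={"host_report": "", "harvest_report": ""},
    )
```

One event per harvest instance keeps the crawl logs, configuration and reports tied to exactly the ARC files that the crawl produced.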
ContainerMD
httpbibnumbnffrcontainerMD v1documentationcontainerMD v1html
special metadata for material from the web archive
httpbibnumbnffrcontainerMD v1
different SLAs for different types of material and for data from different harvests; shared repository: they expect various benefits, e.g. skills for different formats need not be duplicated within the institution
next year there should also be a JHOVE2 module for WARC
Memento: meta search engine
Benefit for NK
Their web archiving model could be used at NK.
Cost models: Danish national library + TU Wien
- Stephan Strodl, TU Vienna: they have their own cost model, but apparently only for small-scale automated preservation action costs
- Danish national library: they built their own model, which is meant to be universal and usable anywhere
- They measured the cost of submission according to the PAIMAS standard
- When calculating costs they use OAIS and PAIMAS, map activities onto these models and then estimate the cost of the processes
- Costmodelfordigitalpreservationdk
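The approach of mapping activities onto a reference model and then costing them can be illustrated with a small sketch. The OAIS functional entity names are taken from the OAIS reference model; everything else (the activity list, hours and hourly rate) is a hypothetical input, and the function is not the Danish model itself.

```python
from collections import defaultdict
from typing import Dict, Iterable, Tuple

# Functional entities of the OAIS reference model.
OAIS_ENTITIES = {
    "Ingest",
    "Archival Storage",
    "Data Management",
    "Administration",
    "Preservation Planning",
    "Access",
}


def estimate_costs(activities: Iterable[Tuple[str, str, float]],
                   hourly_rate: float) -> Dict[str, float]:
    """Sum labour cost per OAIS functional entity.

    Each activity is (name, OAIS entity, estimated hours); the hours and
    the hourly rate are assumptions supplied by the caller.
    """
    totals: Dict[str, float] = defaultdict(float)
    for name, entity, hours in activities:
        if entity not in OAIS_ENTITIES:
            raise ValueError(f"activity {name!r} maps to unknown entity {entity!r}")
        totals[entity] += hours * hourly_rate
    return dict(totals)
```

For example, `estimate_costs([("validate SIP", "Ingest", 10), ("write to tape", "Archival Storage", 2)], 40.0)` yields 400.0 for Ingest and 80.0 for Archival Storage; the value of the mapping is that every cost ends up attributed to a named function of the archive.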
Benefit for NK
Relevant to project 0136: the session dealt with ways of estimating the costs of long-term storage.
Meet RODA, a Full-Fledged Digital Repository for Long-Term Preservation
- Originally a project of the Portuguese national archive, which we have been following for several years. RODA is now supported, and partly developed further, by an independent company. So far RODA supports only an archival metadata format (EAD), but further development should also include library formats
- RODA is now part of the SCAPE project, within which the system can be developed further and scaled for use in mass production
- httpredminekeepptprojectsroda public
Benefit for NK
Follow further development, possibly also for the INCAD + KNAV projects developing LTP; for smaller institutions this could become an interesting alternative.
Report on a foreign business trip
Name and surname of the trip participant: Mgr. Andrea Fojtů (AF)
Workplace according to the organizational structure: 15 Department of Long-term Preservation of Digital Data (ODODD)
Workplace, position: 152 Digital Repository Content Management Department
Reason for the trip: participation in the conference Aligning National Approaches to Digital Preservation
Place, city: Tallinn
Place, country: Estonia
Date (from-to): 22-26 May 2011
Detailed schedule:
Monday 23 May: conference registration; guided tour of the National Library; Keynote Address by Laura Campbell, Library of Congress, USA; Panel 1: Technical Alignment; Panel 2: Organizational Alignment
Tuesday 24 May: Keynote Address by Gunnar Sahlin, National Library of Sweden; Panel 3: Standards Alignment; Panel 4: Legal Alignment; Breakout Sessions for panels 3 & 4
Wednesday 25 May: Panel 5: Education Alignment; Panel 6: Economic Alignment; Breakout Sessions for panels 5 & 6; Synthesis; Closing remarks
Fellow travellers from NK: PhDr. Bohdana Stoklasová (BS), Ing. Tomáš Svoboda (TS)
Funding: IOP NDK
Goals of the trip: attendance at a conference with international participation; gaining contacts in the field of long-term preservation of digital documents and electronic legal deposit; a more detailed insight into the issues of long-term preservation of digital documents, especially in national libraries
Fulfilment of the goals (specifically): The conclusions of the conference Aligning National Approaches to Digital Preservation largely repeat the conclusions of the workshop The Future of the Past: Shaping new visions for EU research in digital preservation (report available at httpcordiseuropaeufp7icttelearn digicultfuture of thepast_enpdf), for example regarding the missing economic model for the commercial sector, which would treat long-term preservation as an integral part of all its processes. Contact was established with staff of the national libraries of Estonia and Finland (staff responsible for web archiving and for long-term preservation in general).
Programme and further details: The main goal of the conference was to align national approaches to long-term preservation of digital documents across all areas: technical, organizational, educational, standardization, economic and financial.
Materials brought back: conference programme, leaflets of exhibiting companies (Tessella, Equella, Guardtime) + other materials, notes
Date of submission of the report: 8 June 2011
Signature of the submitter:
Signature of the superior:
Posted on the intranet:
Received by the international department:
Attachment to this report: Conference notes in English
Attachment: Conference notes in English
Exploring What We Can Do Together: Strategic Alliance for International Collaboration, Laura Campbell
- 185 digital preservation partners in more than 25 countries (education, research, LAM)
- strategic goals: National Content Stewardship Network (national digital collection, technical architecture, public policy, outreach)
- NDIIPP Content Domain Map: a mind map of geospatial, audiovisual, image & text and web content
- then & now: cognitive surplus vs digital libraries / digital preservation
- solution framework: actively working together, special interest groups, establishing a common index, international digital collection (freely available)
PANEL 1: Technical Alignment (The role of testing), Prof. Dr. Michael Seadle (Panel Head)
- to collaborate on requiring and implementing rigorous and independent tests
DNB Contribution to the Tallinn Alignment, Sabine Schrimpf
- key theme is infrastructure
- network of hardware and software that permits the operation of application SW
- the question of interoperability is crucial (standards, technical specifications)
- SW elements
- components of the DP infrastructure were compared to pallets on railroads
  o source: PARSE.Insight Roadmap 2020
- persistent ID resolvers, certification process
- kopal (ingest, koLibRI)
- nestor (German Network of Expertise)
- DP4Lib: Digital Preservation as a Service; reduce dependency between components
  o redundant storage at different locations
  o koLibRI modules
- LUKII: set up as an economical LOCKSS network in Germany
- SHAMAN
- APARSEN: wants to bring coherence and cohesion to digital preservation research
  o trends in DP research projects
  o modular DP systems
  o distributed as SOA
  o elimination of technology dependencies
EDINA: The UK LOCKSS Alliance, Adam Rusbridge
- EDINA offers underlying technical support & coordination
- threats to digital stewardship
  o failure (media, HW, SW, network, format obsolescence, natural disaster, economic/organizational failure)
  o attack (insider/outsider)
  o operator error
- source: Requirement to Digital Preservation
- projects PEPRS and PECAN help identify coverage and requirements for DP
Public testing, Michael Seadle
- traditional physical archiving relies heavily on trusted institutions
- distrust, not trust, needs to be the basis of digital archiving; testing therefore plays a key role
- goals of testing: demonstrate functionality, reveal weaknesses, provide data for planning improvements
- key issues for testing: integrity, authenticity (can the origin or genuineness be shown?), usability (can migration/emulation be demonstrated?), access, financial integrity
- DrWho (drwho1com)
- bit stream testing is the most important; authenticity and usability may be impaired
  o the type of storage media, the number of copies + the frequencies of checking and replacement get us to the relevant results
  o no reliable metrics exist, however (what is an acceptable loss, etc.)
- without well-documented, peer-reviewed, publicly available test results, librarians are buying archiving systems on faith
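Bit-stream testing across several copies, as mentioned in the notes above, comes down to comparing checksums of the replicas on a regular schedule. The sketch below is one simple way to do that, assuming the copies are available as byte strings keyed by location; the majority-vote rule and the names used are choices made for this illustration, not something prescribed in the talk.

```python
import hashlib
from collections import Counter
from typing import Dict, List, Union


def audit_copies(copies: Dict[str, bytes]) -> Dict[str, Union[str, int, List[str]]]:
    """Compare SHA-256 digests of all copies and flag the outliers.

    The digest shared by most copies is treated as the reference. With an
    even split there is no real majority, so such a result needs manual review.
    """
    digests = {location: hashlib.sha256(data).hexdigest()
               for location, data in copies.items()}
    reference, agreeing = Counter(digests.values()).most_common(1)[0]
    damaged = sorted(loc for loc, digest in digests.items() if digest != reference)
    return {"reference_digest": reference, "agreeing": agreeing, "damaged": damaged}
```

With three copies where one has a flipped byte, `audit_copies` reports two agreeing copies and names the third as damaged; how often such an audit runs is exactly the "frequency of checking" parameter the notes refer to.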
Presentation without a title, Andy Rauber
- evaluation vs testing vs benchmarking
- DP: evaluation rather than testing, and far from benchmarking (a few tests, but nowhere near a definition of benchmarks)
  o existing evaluations are not repeatable
  o focus on the simple things
  o building the frameworks before having clear test scenarios
- necessary to move towards comparative benchmarking
- what is needed: a commitment that we want a culture of benchmarking and comparative evaluation; an understanding of what we want to benchmark; benchmark data + ground truth; measurement scales and measures that remain constant; knowledge bases to collect these
Organizing digital preservation on an international level, Michelle Gallinger (NDIIPP)
- focus on a national DP agenda
- community driven, action oriented (National Digital Stewardship Alliance)
  o present a distributed national digital collection for the benefit of citizens
The European Research Arena, David Giaretta
- technologies: GEANT, EGEE/EGI
- EU research projects: TIMBUS, BLOG4EVER, SCAPE, ENSURE, APARSEN, ARCOMEM, WF4EVER
  o SCIDIP-ES (2011-2014)
- Alliance for Permanent Access (APA) formed as a legal entity 3 years ago
  o opportunities for networking
- ISO 16363: Audit and Certification of Trustworthy Digital Repositories
- ISO 16919: Requirements for Bodies Providing Audit and Certification
Observation from the MetaArchive Cooperative program, Martin Halbert
- distributed DP programs differ from other programs
  o replication of content, distribution of the replicated copies to distinct geographical locations, and a network organization to connect these replicated copies
- MetaArchive established in 2003, funded by NDIIPP
  o seeks to foster broader awareness of digital preservation issues
- IIPC members are all institutions that focus on WA
  o 39 members: national and university libraries + other organizations (Internet Archive)
  o ISO standard WARC format for web archives + Heritrix and NutchWAX
  o growing membership (Africa & South America)
DAY 2
Keynote address: International and National Collaboration in the Digital Age, Gunnar Sahlin
- 2012: a new law for e-deposit
- Samsök (search together) in 2005 (upgrade, new system)
- Swepub and long-term preservation
- Consortium of the Swedish research libraries for licensing e-journals and databases (ICOLC)
- Open Access and e-publishing (all universities have their repositories for e-pubs)
- NL as aggregator for Europeana: TEL, Apres, Athena, EU Screen
  o common system for the preservation of digital materials
  o common search portal for materials from the Swedish National Library and the Swedish National Archive
Raivo Ruusalepp
- standards: RAC, DSA, CIDOC (CRM), PAIMAS, ISA (DG), DDI
- use of information security standards for digital preservation
- information security: administrative and technical (physical = data security vs IT = communication)
- a company implemented security measurement with typical cyber crime scenarios
- survey of security
  o provision for information security in national legislation and development plans (12 of the respondents: ISO 27000 series; only 2 formal audits, the rest are looking into it; ½ of the respondents do not use standards or formal measurements to control information security)
  o IT & disaster plan: 65 (data recovery from the off-site location tested: 0)
- alignment
  o better use of community standards for information security and preservation
  o agreement on security requirements
Standards based approach to preservation planning, Matthew Woollard
- ISO 27001: very expensive implementation, 100 000 pounds
- Basic: Data Seal of Approval Guidelines (helps you understand your business better)
- Audit and Certification of Trustworthy Digital Repositories
- Memorandum of Understanding to Create a European Framework
- ISO 16363 external; DSA or ISO 16363 self-audit
Best Practices & Standards, Bram van der Werf
- self-assessment: trust; audits/certification: trust
- ISO 30300 (draft): Record Management
PANEL 4: Legal Alignment
Legal deposit & Web Archiving, Adrienne Muir
- legal deposit provisions: purpose, scope, deposit mechanisms, roles & responsibilities & liability, access provisions, sanctions
- implementation: definitions, scope (offline/online, freely available/pay wall), technology neutral, incremental
- legal deposit vs voluntary (interim/hybrid approach, model agreements and licenses, flexibility)
- other legal issues: intellectual property rights, preservation, access, unlawful material, privacy/data protection
- voluntary approaches have disadvantages but may be necessary and can be useful
Breakout Session
- standards bring along alignment by themselves, but only if you use them to the full, not halfway
- depends on community (users): enforceable or voluntary compliance (standards)
- next steps for standards alignment
  o corpora as benchmarks
  o export/import completeness
  o educational standards
  o validation tools
  o accredited training courses to accredit auditors
  o framework standards
DAY 3
PANEL 5: Educational Alignment
- key elements of the DCC Curation Lifecycle Model
- Framework for the Education Alignment (USA: grad programs for digital preservation, new models for grad programs, internship programs (diverse knowledge), workshops)
  o sharing tools
  o national programs
- related issues to digital preservation
  o nature of costs and business models
  o strategies for selection & appraisal
  o ground roles and responsibilities
  o effectiveness and demand for services
- focus of the panel: factors influencing the actual sustainability of a digital archive
- 2 considerations: collaboration + user demand
- challenges + gaps: span national boundaries, public + private funding, education, exportation, DP certification, competition, funding gaps, policy, selection criteria, roles & responsibilities, standards
- Magazzini Digitali: e-legal deposit in Italy
- PADI: a failed project (discontinued in 2010)
Presentation without a title, Neil Grindley
- how much does it cost to manage information? what institutional financial strategies are required to facilitate effective preservation? what general economic frameworks are required to enable information to persist and be accessible?
- JISC 2010 Infrastructure for Education and Research Programme
- archival storage and preservation activities constitute a very small proportion of the overall costs: 1531 access, 55 outreach, acquisition, ingest
  o approx. 333 Euros for a set of 1000 records
- KRDS2 p. 83: future tool development supporting automation of ingest
Sustainable Preservation in North America: ADPNet & Friends, Aaron Trehub
- solution: distributed digital preservation (in at least 3 copies vs LOCKSS 6 copies)
- DDP + LOCKSS = PLN; open SW developed at Stanford
- MetaArchive, COPPUL (Canada), ADPNet (wwwadpnorg)
Lisa LaPlant and Blake Edwards US Government Printing Office (USA) FDsys OAIS dig MODS - z
<part>2</part>
DMD data model definition z
httpwwwgpogovfdsyssearchpagedetailsactionst=pragueampgranuleId=amppackageId=DCPD-200900228
----------------------------------------- Preservation Starts from the Beginning Michael Wash US Department of Transportation (USA)
Autographic Kodak 1916 na druhou
following the No. 3A Autographic Kodak Special of 1916, which was the first rangefinder camera. It had a Kodak Anastigmat f/6.3 lens and a Kodamatic shutter with speeds from 1/2 to 1/200 sec, plus bulb and time mode. httpcamerapediawikiacomwikiNo_1A_Autographic_Kodak_Special Kodak Advantix
zastavil
Colorite A Flexible Cross-Platform Software Solution for Automatic Image QualityAnalysis Using Arbitrary Targets Henrik Johansson National Library of Sweden (Sweden)
- a - -
o The target is detected automa se so that this proce
o TIFF, JPEG, JP2, PNG o o state-of-the-art feature-based image matching algorithm, ImageMagick
o Alg
t o o in the future switch off so that the process is sped up for BATCH
proce o
Henrika Johanssona What if the Image Quality Analysis Rates My D Dietmar Wueller Image Engineering (Germany)
- Golden Thread - UTT target -
o
-lens reflex
Establishing Resolution Requirements for Digitizing Transmissive Content A Use Case Approach Michael Stelmach Library of Congress Don Williams Image Science Associates LLC and Steven Puglia National Archives and Records Administration (USA)
-
- Based on 10 SFR limiting resolution criteria how much the image information will be captured
o 1st half of the 20th century: 1200-1600 PPI o 2nd half of the 20th century: up to 2800 PPI o o
Digitise More Pay Less Optimising the Workprocess for both Heritage Institute and Imaging Provider Olaf Slijkhuis Pictura Imaginis (the Netherlands)
- ument kvalita parametry atd - -
- trolu
obrazu
N debate with FamilySearch, 20 May 2011 -------
also on the issue of Digital Preservation
will have to provide free of charge
and synchronize
an effort to cooperate archives and libraries on their part
Business trip report
Project: Creation of the National Digital Library
CZ 10611000706386
Name and surname of the trip participant: Jan Hutař
Workplace according to the organizational structure: ODF 81
Position: head of department
Reason for the trip: attending the iPRES 2011 conference
Place, city: Singapore
Place, country: Singapore
Date (from-to): 30 October - 5 November 2011
Detailed schedule: 30-31 October: flight Prague > Dubai > Singapore
1 November: start of the conference, tutorials
2-4 November: conference
4-5 November: return flight Singapore > Dubai > Prague
Fellow travellers from NK: Mgr. Marek Melichar (paid from project 0136)
Funding: IOP, Creation of the National Digital Library
Relation to the project: gaining new information on digital preservation and on projects in other libraries; consultations with colleagues and companies
Goals of the trip: see the relation to the project; use all outputs for the planning and operation of the NDK project and for future handling of digital preservation at NK/NDK
Fulfilment of the goals: fulfilled; see the detailed notes below and the proceedings on SPS
Further details: SUMMARY AND BENEFIT FOR THE NDK PROJECT
a noticeable rise of emulation as an approach to long-term preservation (in previous years it was migration) > the two approaches will apparently complement each other
a shift towards the protection of complex data, databases, etc.; NK does not deal with this yet
many contributions are applicable to NK and NDK (web archiving and preservation at the National Library of France, audits, emulation, information about the SDB system (Tessella) and the RODA system, certification (see Rouchon), etc.)
information on the problems and solutions of using tools such as JHOVE, PRONOM, etc. in a real environment
2 contributions on backing up optical discs, a current problem at NK as well; ideally follow the procedures described in the proceedings
a clear need for international cooperation and for adherence to standards so that such cooperation is possible
for more detail see below
Support of project publicity: N/A
Related materials
Material: Storage location
conference proceedings: SPS, folder with business trip reports
Date of submission of the report: 15 November 2011
Signature of the submitter:
Date, signature:
Signature of the superior: 15 November 2011
Posted on the intranet:
Received by the international department:
Seamus Ross: digital curation and preservation
- preserving data sets: use of statistical databases and research data vs. protection of textual information
- personal data: no longer just photos in a box (Flickr etc.)
- banks: a lot of personal data with future use for historians; it must be preserved, and institutions do so as well
- see mckinseycom big data full report pdf
- why do DP at all (slide 16): future generations expect it; so that historians and scientists have some sources; a legacy about the present for the future; an information ecosystem to enable storytelling
- emphasis moves from protecting textual information to protecting complex databases
A capability model for DP, Ch. Becker et al.
- research within the SHAMAN and SCAPE projects
- SoS: systems of systems
- 3 kinds of systems (UML):
- DPS as a functional requirement
- SoS: a business system, a system within a system; data then flow into the DPS
- DPS where DP is not a functional requirement but the system does it anyway (DP-ready system): a business system with DP functionality
- but how to get DP into enterprise systems? A model for implementing DP in any system, within the SHAMAN project: capability-based reference architecture
- governance, business and technical (operation) capability: a basis for decisions and for assessing the current state
- capability maturity model (CMM): processes of assessment and improvement, as in SW development
Olivier Rouchon: certification and quality at CINES
- they store theses, digitized material, multimedia, documents, scientific data sets
- data centre for the whole of France
- they have format experts (XML), 11 people
- 15 TB of data
- certification: national law; CINES is the national centre for DP of theses; they have a department, people and money for it; procedure and preparation see below
- preparation for certification: testing DRAMBORA, DSA, TRAC, ISO 16363 and ISO 16919
- step: 2009 DRAMBORA audit; 2 risk reviews per year and how their resolution is progressing
- step: formalization of business processes, 14 processes according to ISO 9001
- management, operational and support processes (presented at iPRES 2010)
- 2009 external pre-audit: 2 people, 19 man-days, based on all available standards, by means of documentation checks and interviews
- 2010 SIAF audit: 4 months; performed by the National Archives of France for every archive that stores public data; they audit every 3 years; the report had 800 pages
- 2010 Data Seal of Approval: part of the EU framework for audit and certification of trusted repositories (MoU between the three certification activities)
- 2011: within the APARSEN project they also did ISO 16363; they went through this audit together with DANS and UKDA
- first internal, January and April 2011 (60 man-days), then external, 12 experts (KB, BL, NASA, etc.) in June 2011 (3 days)
MoU: DSA > ISO 16363 as the second step (internal) > ISO 16363 extended
An audit gets faster the more of them you do, i.e. if it is done regularly it is not so time-consuming.
The National Library of New Zealand passed TRAC certification in autumn 2011.
Andreas Rauber: impact of preservation actions on repositories
- what happens to the repository itself
- repository simulation: RepoSim
- for analysing and testing migration: what happens when files grow in size, what if we have more format types in the repository, etc.
- RepoSim: a flexible simulator, irregular patterns
- so far an internal version: Hibernate, Java, MySQL
- it is possible to specify which formats the repository accepts, their description, ingest settings, hypothetical tools (mainly for migration), and rules for preservation actions (migration to which format, which version, which files, how many times; rules + filters)
- a virtual migration can be run; it produces graphs showing how long it will take, etc.
- what to migrate, how, to what and over what period: it runs virtually and we see the result
- good for IT and HW planning
- good for planning different scenarios, comparison with the expected development, planning of HW development and investments
- they still have to add the option to enter deletion policies, reports, etc.
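The idea of a virtual migration, i.e. estimating how long a migration would take as the repository grows, can be sketched in a few lines. This is a toy model written for these notes, not RepoSim: linear yearly growth, a single migration in one chosen year and a fixed tool throughput are all simplifying assumptions, and the parameter names are invented.

```python
from typing import List, Tuple


def simulate_migration(initial_files: int, yearly_growth: int, migrate_year: int,
                       files_per_hour: float, years: int) -> List[Tuple[int, int, float]]:
    """Return (year, files in repository, migration hours) for each year.

    Assumptions: the repository grows by a constant number of files per
    year, and in `migrate_year` every file present is migrated once by a
    hypothetical tool with a fixed throughput.
    """
    if files_per_hour <= 0:
        raise ValueError("files_per_hour must be positive")
    timeline = []
    files = initial_files
    for year in range(1, years + 1):
        files += yearly_growth
        hours = files / files_per_hour if year == migrate_year else 0.0
        timeline.append((year, files, hours))
    return timeline
```

Running the function for several candidate years shows how postponing a migration increases its duration, which is the kind of what-if comparison useful for hardware and investment planning.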
José Barateiro: Risk assessment in DP of e-science data and processes
- DP as risk management
- ISO 31000: definition of risk management
- similar to DRAMBORA
- there are many standards for risk management
- the ISO 31000 methodology elaborated into individual steps
- TIMBUS project (httptimbusprojectnet); one of the partners is SAP (Germany)
Mad talks
- open-source SW for LTP: RODA is back and is being developed within the SCAPE project; new functions, plans for further development and for building a user community
- 4 posters on emulation: emulation within KEEP, emulation for reading rooms in libraries, the OPF ecosystem, registries
- TOTEM: a metadata standard for describing the technical environment for emulation
ANDS Ross Wilkinson- datovaacute centra v Austraacutelii min 3 pro r zneacute oblasti ivota- ANDS existuje skoro 3 roky peniacuteze od aus vlaacutedy- obrovskeacute mnostviacute dat nikdy nebudou vyuita tena lov kem jen automatickeacute procesy
vyt eniacute- nutnost uklaacutedat a ochra ovat research data protoe u nemusiacute byacutet moneacute je znovu vytvo it
tak aby je lo znovu pouiacutet aby bylo moneacute z nich vyvodit noveacute zaacutev ry aby je m li k dispoziciv dci
- nutno d lat ve spolupraacuteci nelze pouze z titulu jedneacute instituce- kdo eiacute uloeniacute v deckyacutech dat v R Akademie v d CESNET- podobnaacute datovaacute centra jsou I ve Velkeacute Britaacutenii
Rob Sharpe: Considerations for High Throughput Digital Preservation
A presentation by Tessella on their performance testing of ingest into SDB at FamilySearch.
- SDB has been in development since 2002, when the first customer was The National Archives (UK)
- new customer: the UK Parliament
- test with FamilySearch: ingest of 20 TB per day of scanned material; workflow with antivirus, characterization (PRONOM, JHOVE), etc.
- 1 package is roughly 1 GB, i.e. 20,000 packages per day
- 2 Dell PowerEdge R710 servers, at most 20,000 pounds together
- the limiting factor turned out to be the read speed of the disks that hold the data at the start of ingest, so they needed 130 parallel disks (50,000 pounds)
- writing to tape is also slow, so they needed 8 parallel tape writes (30,000 pounds)
- storage costs 100 pounds per TB; 7.3 PB per year
- conclusion: writing and reading are slow; tools such as JHOVE and PRONOM are fast enough; storage costs also proved to be high
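The figures in these notes can be cross-checked with a few lines of arithmetic. The sketch below only restates numbers from the talk (20 TB per day, packages of about 1 GB, 100 pounds per TB) and assumes decimal units (1 TB = 10^12 bytes) and a 365-day year; the variable names are illustrative.

```python
# Back-of-the-envelope check of the ingest figures quoted in the notes.
TB = 10**12                     # decimal terabyte, an assumption
MB = 10**6
SECONDS_PER_DAY = 24 * 60 * 60

daily_ingest_tb = 20            # from the talk
package_size_gb = 1             # "roughly 1 GB" per package
storage_cost_gbp_per_tb = 100   # from the talk

# Average rate needed to move 20 TB in 24 hours.
sustained_mb_per_s = daily_ingest_tb * TB / SECONDS_PER_DAY / MB

# Number of packages per day and data volume per year.
packages_per_day = daily_ingest_tb * 1000 // package_size_gb
yearly_volume_tb = daily_ingest_tb * 365

# Cost of storing one year of new data at the quoted price per TB.
yearly_storage_cost_gbp = yearly_volume_tb * storage_cost_gbp_per_tb

print(f"sustained rate: {sustained_mb_per_s:.1f} MB/s")
print(f"packages per day: {packages_per_day}")
print(f"yearly volume: {yearly_volume_tb / 1000:.1f} PB")
print(f"yearly storage cost: {yearly_storage_cost_gbp} GBP")
```

The average rate comes out well below the 700 MB per second peak mentioned for the workflow, which is consistent with a pipeline that reads and writes the same data several times.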
For the ingest of data from the FamilySearch project they needed a throughput of 20 TB of data per day while keeping adequate data-processing procedures as required by OAIS and by the client. The project was about identifying the bottlenecks in ingesting large volumes of data.
Processes such as generating hashes or checking them, identifying formats and extracting technical metadata usually require a large storage system that stays fast at high volumes. In the FamilySearch project they want to ingest into SDB (content acquisition, content preparation, ingest: fixity check, content metadata integrity check, characterization, i.e. identification and validation of formats and extraction of technical metadata) at up to 700 MB per second.
The question is how to parallelize such a massive workflow efficiently while keeping costs down. According to their findings, parallelization gets around the performance problems of tools such as DROID and JHOVE; contrary to their expectations, overall software performance was not a problem. The bigger problems are in the hardware, which has to be able to write fast enough.
In other words, the bottleneck was in the hardware and in moving data from place to place rather than in the performance of the digital preservation tools.
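A minimal sketch of the kind of parallel fixity work described above, assuming files on local disk and SHA-256 as the hash. It is not Tessella's implementation; the function names, chunk size and worker count are illustrative. Each file is read in chunks, so throughput is bounded by how fast the storage can deliver data, which matches the finding that the hardware, not the tools, is the bottleneck.

```python
import hashlib
from concurrent.futures import ThreadPoolExecutor

CHUNK_SIZE = 1024 * 1024  # read 1 MiB at a time so large files never sit fully in memory


def file_digest(path, algorithm="sha256"):
    """Return the hex digest of one file, read in fixed-size chunks."""
    digest = hashlib.new(algorithm)
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(CHUNK_SIZE), b""):
            digest.update(chunk)
    return digest.hexdigest()


def digest_many(paths, workers=4):
    """Hash several files concurrently and return {path: digest}."""
    paths = [str(p) for p in paths]
    with ThreadPoolExecutor(max_workers=workers) as pool:
        digests = list(pool.map(file_digest, paths))
    return dict(zip(paths, digests))


def failed_fixity(expected, workers=4):
    """Return the paths whose current digest differs from the recorded one."""
    actual = digest_many(expected.keys(), workers)
    return sorted(path for path, digest in expected.items() if actual[path] != digest)
```

Threads are used here because the work is dominated by file reads; a process pool would be the alternative if the hashing itself became CPU-bound.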
Benefit for NK
There is no need to worry about the performance of software such as DROID or JHOVE.
Ross King: Evolving domains, problems and solutions for LT DP
- information about projects such as SCAPE
- report: Research on Digital Preservation within projects co-funded by the European Union in the ICT programme, http://cordis.europa.eu/fp7/ict/telearn-digicult/report-research-digital-preservation_en.pdf; Stephan Strodl, Petar Petrov and Andreas Rauber, Vienna University of Technology, Austria. A nice timeline of preservation projects; a white paper about the past of European DP.
- funding spent on DP research is gradually growing; projects and funding alone will not solve anything
- ARCOMEM: web archiving, socially driven web preservation model
  - social web analysis
  - archive enrichment
- ENSURE: evaluation between cost and value, automation of the preservation cycle; testbeds: healthcare, clinical trials, financial services
- SCAPE
  - preservation planning and action workflows, and how to make them scalable
  - building an infrastructure for scalable preservation actions
  - development of a policy-based preservation planning tool with automatic preservation watch
  - 3 testbeds: web archives, large-scale repositories, research data sets
- all the projects will produce prototype software
- digital lifecycle approach
- preservation planning plays a role in all these projects, together with virtualization
- a slide with trends in DP over recent years
- projects covered: Ensure, Scape, Wf4Ever (http://www.wf4ever-project.org/about), Timbus (software alone is not enough; focuses on the context of organizations; LTP is not only about objects but also about services, etc.), Totem
Benefit for NK
Follow the projects in the field of long-term preservation of digital data. The latest EU projects such as SCAPE will speed up the development of concrete tools for long-term preservation of digital data.
Record keeping in temporary command settings, Erik Borglund
- preservation of documentation of crisis situations arising from the work of the police and similar bodies
- how to capture the context: flipcharts, videos and written records can be kept; the context of analogue documents is not a problem, the problem is with digital items and conversations
- the national archive should take care of this, but it accepts only paper documents or, for example, photos from the scene
- open question: is archiving records material the same as archiving the course of the proceedings in digital form?
Web archiving session

BnF
- 200 TB of web-archived data, 15 million ARC files
- they have to characterize and validate them, which is time-consuming
- they store them in a shared repository
- SPAR (the LTP system of the French national library) has a capacity of 16 PB
- they use JHOVE2 for characterization and are building a module for ARC files
- they do not want to characterize and validate the content of the ARC files, only identify formats
- PREMIS in METS would be too long, so they will record only metadata at the level of the information package (AIP) that are the same for the whole package; that is, a property is stated once and then only the list of files it applies to is added, instead of repeating the information for every file
- they created a special metadata format
- they can therefore ask the LTP system "give me all information packages that contain format XY" and the like, without having to index the metadata of the contents, which would take a long time; they use the same approach for digitized books
- different DP policies and validation levels for different types of web archive data: complete harvests vs. thematic harvests
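A minimal sketch of the package-level idea described above: each format is stated once per package together with the list of files it applies to, which also makes the "give me all packages containing format XY" query cheap. BnF's actual metadata format is not reproduced here; the package identifiers, file names and format labels are invented for illustration.

```python
from collections import defaultdict


def summarize_package(file_formats):
    """Turn {file name: format} into {format: [file names]}.

    Each format is recorded once per package instead of once per file.
    """
    by_format = defaultdict(list)
    for name, fmt in file_formats.items():
        by_format[fmt].append(name)
    return {fmt: sorted(names) for fmt, names in by_format.items()}


def packages_containing(package_summaries, fmt):
    """Return identifiers of packages whose summary mentions the given format."""
    return sorted(pid for pid, summary in package_summaries.items() if fmt in summary)


# Hypothetical example data.
summaries = {
    "aip-001": summarize_package({"a.html": "text/html", "b.html": "text/html", "c.png": "image/png"}),
    "aip-002": summarize_package({"d.pdf": "application/pdf"}),
}
print(packages_containing(summaries, "image/png"))
```

The query only looks at package summaries, never at per-file records, which is the reason the content metadata does not need to be indexed.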
NLNZ
- 2 harvests, 20 TB in total
- they are working on metadata: how much metadata is a lot and how much is too little
- library policy says that as much metadata as possible must be stored, but that would be a problem in terms of metadata size
- for selective web harvesting they have a finished workflow (WCT); everything is catalogued

IA
- 16 billion URLs
- the oldest from 1996
- growth is 3 TB per day, 1 PB per year
Euan Cochrane, Dirk von Suchodoletz: Replicating Installed Application and Information Environments onto Emulated or Virtualized Hardware
- capturing and preserving a whole environment on emulated hardware
- for example, taking the prime minister's desktop environment and storing it in the archive
- problems with rendering
- computer forensics
- an option for preserving scientific data and records
- it is all about how to replicate an HDD and run the environment on it in a virtual environment
- solution: they pulled the HDDs out of several old PCs > identify the hardware requirements (HDD analysis > automatic estimate of requirements; this information is part of every PC environment) > choose the emulation/virtualization software (a tool registry such as TOTEM from the KEEP project) > convert the HDD into a disk image suitable for emulation > try to boot the disk image on the emulated hardware > add drivers
- problems with licences, personal data protection and authenticity (20% of things change, colours etc.)
- QEMU, SPARC processor emulator
Klaus Rechert: Remote Emulation for Migration Services in a Distributed Preservation Framework
- emulation used as a tool for migration
- tools for migrating certain formats are often not available
- the digital object is placed into the emulated environment (a virtual machine), where it is visible inside the emulated system; it can be opened in the original or another suitable application, saved in a different format and stored back into the virtual machine
Bram Lohman: Emulation as a Business Solution: the Emulation Framework (KEEP project)
- emulation framework: 7 emulators, 6 platforms (x86, Amiga, etc.), 23 file formats
- a solution for managing emulation tools and setting up emulation processes
- an environment that contains emulators; if an application or file is loaded into it, it should run as in its original environment
- the environment also contains a tool that, based on PRONOM, shows what format a file is and what environment is needed to run it; the environment can then be prepared directly and the file run in it
- it loads the software image from the OPF application database, which is being built
Geoffrey Brown: Developing Virtual CD-ROM Collections: The Voyager Company Publications
- publications of one particular publisher on CD, interactive applications for Mac; of the 200 titles published, only about 50 are available today
- emulation on today's systems
- an HDD snapshot directly in the emulator, i.e. it takes one click and is very fast
- SheepShaver emulator
Evaluation of a large Danish migration project
- before 1998 they had no formats laid down by law
- between 2005 and 2008 they introduced standards
- the evaluation covers the defined standards and the migration into them at the national archive
- the evaluation was done for the funder
- between 2005 and 2008 they spent 30 person-years on migration, with 10-15 people working on it; they invested 190 thousand USD; total costs 26 million USD
- it is not much data: what they actually migrated was about 1777 GB
- various parts of the archive: tapes, population data, data on CD-R, registries and data filled in electronically
- they could not read all the files, especially those on tape
- 5 different types of tape
- some had to be rescued at great expense
Total cost of producing the preservation standard: 10 man-years, 12 thousand USD, including the manual and implementation recommendations.

Pilot: project planning and management, and verification of the information packages.

The aim of the pilot was to obtain a better budget and project plan.

Some problematic data in old formats, such as old databases, need smart people doing repetitive work over a long time; they needed good knowledge management to make this efficient.

Migration method: they wrote the requirements for a tool and a description of how manual migration should be done.

Data preparation (restructuring the data and registering the metadata of the IPs) and preparation of the documentation of the migrated IPs.

Software development: in-house development.

They needed 50 person-years per 1

Conclusions

Migrating standard data is cheaper, migrating from some standard tapes is cheaper, etc.

Most of the costs (80%) went on non-standardized data, in producing the migration software. Developing a tool for migrating heterogeneous or non-standard data is the most expensive part.

What they learned: the old data had not been analysed sufficiently.

Project management was loose and they lost money.

Knowledge management: a good description of the old data and of all their types, generations, locations, etc. does not exist at NK, and this will cause us difficulties; migration of old data at NK will be problematic.
Angela Dappert: robust migration workflow for offline media
- What is an archival object? A nice slide: a CD is not an archival object; for them it is a hand-held carrier. Better is a bit-stable object, which can have a backup etc., and beyond that an archival object, which has additional metadata (logical preservation).
- A CD is not searchable, cannot be replicated easily and has a large manual overhead; the rendering technology becomes obsolete very quickly.
- Endangered Archives project: optical disks, CD-R, external HDs, tapes; 67 terabytes in total.
- The offline hand-held carriers in the Endangered Archives project were very varied; they contained data with DRM, data under copyright and a range of similar problems.
- The options they chose between for each data source:
  - a disk image: one file that contains everything on the carrier
  - or extraction of only some of the files
- How important is the carrier itself? We need some information about it; it may hold traces of deleted data that we might want to keep. They made disk images of everything possible: hybrid DVDs, audio discs that also contained data, etc.
- Which disk image format should be used? Not a single disk image format for all data; each kind gets its own disk image format.
- They did it with disk copying robots; large-scale disk copying robots sometimes could not be used, since they are good at producing CDs but not at ripping data from CDs.
- They built their own application with disk stacks and some smaller robots; they tried LIFO and FIFO and in the end used FIFO, because LIFO had problems with the CD picker.

At KB they thought through a fairly complex workflow, how to describe it, etc.

Each robot had its own PC.

They had problems with a number of things; see the presentation.

They did not find good software for image management, only command-line tools, and non-technical staff would make a lot of mistakes with those.

It takes a lot of people before the content gets online.

The staff must be well trained and flexible, but also able to do tedious jobs: systematic and patient.

NOTE: important for NK, where the transfer of data from disks will also have to be dealt with, and has already been dealt with.
KEEP project, Antonio Ciuffreda: towards integrated migration environment

- Disk transfer tool: converts a disk into an image file. The result also contains further metadata about the file system, the MD5 of the file, etc.
- KEEP is building a Transfer Tool Framework.
- Magnetic media: disk transfer tools for floppy disks, commercial and open source
  - Disk2FDI: commercial DOS tool; very accurate floppy disk image; it takes 1 hour and the result is very large, ten times larger than the floppy disk itself. Tested on about 2260 disks; they tested emulation.
  - Catweasel: commercial tool; a PCI card that runs on Linux and Windows XP and has a GUI. High error rate but faster; the quality of the image file was low.
  - Nibtools: free tool; G64 and D64; covers only the C64; DOS, Windows, Linux, but command line only. Needs a Commodore disk drive and special cables. They tested a few disks; about half did not work in the emulator afterwards.
- Optical media: they used 5 transfer tools, all on the same CDs, DVDs and games
  1. Alcohol 120%: commercial; can get around DRM etc.; supports Windows systems. 12 of 13 worked; 2 MB per second.
  2. Daemon Tools: commercial; several image file types (ISO, MDS, MDF); supports Windows; three of 13 did not work.
  3. CloneCD: commercial; uses IMG or ISO; gets around SafeDisc 3 protection; supports Windows; has a GUI; 11 of 13 were OK.
  4. BlindWrite: commercial; supports DVD, Blu-ray, WM, Xbox and other special discs. Generates ISO and some proprietary ISO image formats; one did not work.
  5. ImgBurn: does not read subchannel information into the image file (so it is not possible to seek within a film, etc.); open source; generates DVD, BIN, CUE, IMG; Windows and Linux; 4 did not work; it is fast.
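A minimal sketch of what a disk transfer tool of the kind described above does at its core: copy a medium block by block into an image file and record an MD5 checksum next to it. It is not KEEP's tool; the function name, the JSON sidecar layout and the block size are assumptions for illustration. Real floppy disks and copy-protected optical media need the specialised hardware or software tested above, and `hashlib.md5` may be unavailable on systems restricted to FIPS-approved algorithms.

```python
import hashlib
import json
import os

BLOCK_SIZE = 64 * 1024  # bytes copied per read; an arbitrary choice


def transfer_to_image(source_path, image_path):
    """Copy source_path (a file or readable device node) into image_path.

    Writes a small JSON sidecar with the image size and MD5 and returns it as a dict.
    """
    md5 = hashlib.md5()
    size = 0
    with open(source_path, "rb") as src, open(image_path, "wb") as dst:
        while True:
            block = src.read(BLOCK_SIZE)
            if not block:
                break
            dst.write(block)
            md5.update(block)
            size += len(block)

    record = {
        "source": source_path,
        "image": os.path.basename(image_path),
        "size_bytes": size,
        "md5": md5.hexdigest(),
    }
    with open(image_path + ".json", "w", encoding="utf-8") as out:
        json.dump(record, out, indent=2)
    return record
```

Keeping the checksum in a sidecar file means the image can later be verified without touching the original carrier again.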
Conclusions

For magnetic media, commercial and non-commercial tools do not differ in performance; Disk2FDI is accurate but very slow. KEEP will use NibTools.

Optical media: copy protection has to be taken into account; the tools perform similarly; there will always be errors in the images, between 30 and 10 percent; BlindWrite can handle game discs, Xbox, etc. KEEP will use ImgBurn because it is open source.

For complex images BlindWrite is better.

Benefit for NK

Consider whether NK should actually run a project to migrate the content of CDs and DVDs to online media. The talks present concrete experience with robotic processing and show the problems encountered in designing the workflow, choosing the type of ISO image, etc.
BnF web archiving
They have three layers:

Harvest definition: collection

Harvest instance: crawling metadata
ARC files

A collection is, for example, the selective "elections 2000" collection, then the individual harvest instances, and then the ARC files. They collect logs, configuration and reports. These are also stored in ARC files: special metadata ARCs for each crawling instance.

PREMIS

Object, agent, event

Objects: 1. ARC files and metadata ARCs, 2. harvest instances

Harvest event in PREMIS: the event "creation of content files"

Events: reports as an extension of the event (host report and harvest report)

Agents: administrator, software, institutions, organizations that perform the harvest
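The object/agent/event split above can be sketched as plain data structures. This is an illustration only, not BnF's schema or the PREMIS XML binding; the class names, field names and identifiers are hypothetical.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Agent:
    identifier: str
    agent_type: str  # e.g. "administrator", "software", "organization"


@dataclass
class ArchivedObject:
    identifier: str
    category: str  # "ARC file", "metadata ARC" or "harvest instance"


@dataclass
class Event:
    identifier: str
    event_type: str
    agents: List[Agent] = field(default_factory=list)
    objects: List[ArchivedObject] = field(default_factory=list)
    reports: List[str] = field(default_factory=list)  # event extensions


def record_harvest(instance_id, arc_names, agents):
    """Build the 'creation of content files' event for one harvest instance."""
    objects = [ArchivedObject(instance_id, "harvest instance")]
    objects += [ArchivedObject(name, "ARC file") for name in arc_names]
    return Event(
        identifier=f"{instance_id}-creation",
        event_type="creation of content files",
        agents=list(agents),
        objects=objects,
        reports=["host report", "harvest report"],
    )
```

A call such as `record_harvest("harvest-001", ["a.arc"], [Agent("crawler-1", "software")])` links one harvest instance, its ARC files and the agents that performed the harvest through a single event.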
ContainerMD

http://bibnum.bnf.fr/containerMD-v1/documentation/containerMD-v1.html

Special metadata for items from the web archive

http://bibnum.bnf.fr/containerMD-v1

Different SLAs for different types of material and for data from different harvests; from the shared repository they expect various benefits; skills for different formats do not have to be duplicated within the institution.

Next year there should also be a JHOVE2 module for WARC.
Memento: a meta search engine

Benefit for NK

Their web archiving model could be used at NK.
Cost models: Danish national library + TU Wien
- Stephan Strodl, TU Wien: they have their own cost model, but apparently only for small-scale, automated preservation action costs
- Danish national library: they built their own model, which is meant to be universal and usable anywhere
- they measured the cost of submission according to the PAIMAS standard
- when calculating costs they use OAIS and PAIMAS: they map activities onto these models and estimate the process costs accordingly
- costmodelfordigitalpreservation.dk

Benefit for NK

For project 0136: the session dealt with ways of estimating the costs of long-term storage.
Meet RODA, a Full-Fledged Digital Repository for Long-Term Preservation

- Originally a project of the Portuguese national archive; we have been following it for several years. The RODA system is now supported by an independent company, which also partly continues its development. So far RODA supports only an archival metadata format (EAD), but further development should include library formats as well.
- RODA is now part of the SCAPE project, where the system can be developed further and scaled for use in mass production.
- http://redmine.keep.pt/projects/roda-public

Benefit for NK

Follow further development, possibly also for the INCAD + KNAV projects; for LTP development in smaller institutions this could be an interesting alternative in the future.
Report from a foreign business trip

Name and surname of the participant: Mgr. Andrea Fojtů (AF)
Workplace according to the organizational structure: 1.5 Department of Long-Term Preservation of Digital Data (ODODD)
Workplace, position: 1.5.2 Department of Digital Repository Content Management
Reason for the trip: participation in the conference Aligning National Approaches to Digital Preservation
Place, city: Tallinn
Place, country: Estonia
Date (from-to): 22.-26.5.2011
Detailed schedule:
Monday 23.5: registration for the conference, guided tour of the National Library; Keynote Address by Laura Campbell, Library of Congress, USA; Panel 1: Technical Alignment; Panel 2: Organizational Alignment
Tuesday 24.5: Keynote Address by Gunnar Sahlin, National Library of Sweden; Panel 3: Standards Alignment; Panel 4: Legal Alignment; Breakout Sessions for panels 3 & 4
Wednesday 25.5: Panel 5: Education Alignment; Panel 6: Economic Alignment; Breakout Sessions for panels 5 & 6; Synthesis; Closing remarks
Fellow travellers from NK: PhDr. Bohdana Stoklasová (BS), Ing. Tomáš Svoboda (TS)
Funding: IOP NDK
Goals of the trip: attendance at a conference with international participation; making contacts in the area of long-term preservation of digital documents and electronic legal deposit; a more detailed insight into the long-term preservation of digital documents, especially in national libraries
Fulfilment of the goals (specifically): The conclusions of the conference Aligning National Approaches to Digital Preservation largely repeat the conclusions of the workshop The Future of the Past: Shaping new visions for EU research in digital preservation (report available at http://cordis.europa.eu/fp7/ict/telearn-digicult/future-of-the-past_en.pdf), for example regarding the missing economic model for the commercial sector, which would treat long-term preservation as an integral part of all its processes. Contact was established with staff of the national libraries of Estonia and Finland (staff working on web archiving and on long-term preservation in general).
Programme and further details: The main aim of the conference was to align national approaches to long-term preservation of digital documents across all areas, from technical, organizational and educational to standardization, economic and financial.
Materials brought back: conference programme, leaflets of the exhibiting companies (Tessella, Equella, Guardtime) + other materials, notes
Date of submission of the report: 8.6.2011
Signature of the submitter:

Signature of the superior:

Posted on the intranet:
Received by the international department:

Attachment to this report: Notes from the conference, in English

Attachment: Notes from the conference, in English
Exploring What We Can Do Together: Strategic Alliance for International Collaboration, Laura Campbell

- 185 digital preservation partners in more than 25 countries (education, research, LAM)
- strategic goals: National Content Stewardship Network (national digital collection, technical architecture, public policy, outreach)
- NDIIPP Content Domain Map: a mind map of geospatial, audiovisual, image & text and web content
- then & now: cognitive surplus vs digital libraries/digital preservation
- solution framework: actively working together, special interest groups, establishing a common index, international digital collection (freely available)

PANEL 1: Technical Alignment (The role of testing), Prof. Dr. Michael Seadle (Panel Head)

- to collaborate on requiring and implementing rigorous and independent tests
DNB Contribution to the Tallinn Alignment, Sabine Schrimpf

- key theme is infrastructure
- a network of hardware and software that permits the operation of applications
- the question of interoperability is crucial (standards, technical specifications)
- SW elements
- components of the DP infrastructure were compared to pallets on railroads
  o source: PARSE.Insight Roadmap 2020
- persistent ID resolvers, certification process
- kopal (ingest, koLibRI)
- nestor (German Network of Expertise)
- DP4Lib: Digital Preservation as a Service; reduce dependency between components
  o redundant storage at different locations
  o koLibRI modules
- LUKII: set up as an economical LOCKSS network in Germany
- SHAMAN
- APARSEN: wants to bring coherence and cohesion to digital preservation research
  o trends in DP research projects
  o modular DP systems
  o distributed as SOA
  o elimination of technology dependencies
EDINA: The UK LOCKSS Alliance, Adam Rusbridge

- EDINA offers underlying technical support & coordination
- threats to digital stewardship
  o failure (media, HW, SW, network, format obsolescence, natural disaster, economic/organizational failure)
  o attack (insider/outsider)
  o operator error
- source: Requirement to Digital Preservation
- projects PEPRS and PECAN help identify coverage and requirements for DP
Public testing, Michael Seadle

- traditional physical archiving relies heavily on trusted institutions
- distrust, not trust, needs to be the basis of digital archiving; testing therefore plays a key role
- goals of testing: demonstrate functionality, reveal weaknesses, provide data for planning improvements
- key issues for testing: integrity, authenticity (can the origin or genuineness be shown?), usability (can migration/emulation be demonstrated?), access, financial integrity
- DrWho (drwho1com)
- bit stream testing is the most important; authenticity and usability may be impaired
  o the type of storage media, the number of copies + frequencies of checking and replacement get us to the relevant results
  o no reliable metrics exist, however (what is an acceptable loss, etc.)
- without well documented, peer reviewed, publicly available test results, librarians are buying archiving systems on faith
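The remark about the number of copies and the frequency of checking can be illustrated with a small sketch of one audit round over replicated copies: digests reported by each storage location are compared, and a copy that disagrees with the strict majority is flagged for replacement. This is an assumption-laden illustration, not a method presented in the talk; the location names and the majority rule are hypothetical.

```python
from collections import Counter


def audit_copies(digests_by_location):
    """Compare digests of the same object held at several locations.

    Returns (agreed_digest, damaged_locations). If no digest is held by a
    strict majority of locations, agreed_digest is None and every location
    is reported, because the audit cannot tell which copy is good.
    """
    if not digests_by_location:
        raise ValueError("at least one copy is required")
    counts = Counter(digests_by_location.values())
    top_digest, top_votes = counts.most_common(1)[0]
    if top_votes <= len(digests_by_location) / 2:
        return None, sorted(digests_by_location)
    damaged = sorted(
        location
        for location, digest in digests_by_location.items()
        if digest != top_digest
    )
    return top_digest, damaged
```

With only two copies a mismatch cannot be resolved, which is one concrete reason why the number of copies matters as much as how often they are checked.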
Presentation without a title, Andy Rauber

- evaluation vs testing vs benchmarking
- DP today: evaluation rather than testing, and far from benchmarking (a few tests, but nowhere near a definition of benchmarks)
  o existing evaluations are not repeatable
  o focus on the simple things
  o building the frameworks before having clear test scenarios
- necessary to move towards comparative benchmarking
- what is needed: a commitment that we want a culture of benchmarking and comparative evaluation; an understanding of what we want to benchmark; benchmark data + ground truth; measurement scales and measures that remain constant; knowledge bases to collect these
Organizing digital preservation on an international level, Michelle Gallinger (NDIIPP)

- focus on a national DP agenda
- community driven, action oriented (National Digital Stewardship Alliance)
  o present a distributed national digital collection for the benefit of citizens
The European Research Arena, David Giaretta

- technologies: GEANT, EGEE/EGI
- EU research projects: TIMBUS, BLOG4EVER, SCAPE, ENSURE, APARSEN, ARCOMEM, WF4EVER
  o SCIDIP-ES (2011-2014)
- Alliance for Permanent Access (APA): formed as a legal entity 3 years ago
  o opportunities for networking
- ISO 16363: Audit and Certification of Trustworthy Digital Repositories
- ISO 16919: Requirements for Bodies Providing Audit and Certification
Observation from the MetaArchive Cooperative program, Martin Halbert

- distributed DP programs differ from other programs
  o replication of content, distribution of these replicated copies to distinct geographical locations, and a network organization to connect these replicated copies
- MetaArchive: established in 2003, funded by NDIIPP
  o seeks to foster broader awareness of digital preservation issues
- IIPC: members are all institutions that focus on WA
  o 39 members: national and university libraries + other organizations (Internet Archive)
  o ISO standard WARC format for web archives + Heritrix and NutchWAX
  o growing membership (Africa & South America)
DAY 2

Keynote address: International and National Collaboration in the digital age, Gunnar Sahlin

- 2012: a new law for e-deposit
- Samsök (search together) in 2005 (upgrade, new system)
- Swepub and long-term preservation
- consortium of the Swedish research libraries for licensing e-journals and databases (ICOLC)
- Open Access and e-publishing (all universities have their repositories for e-pubs)
- NL as aggregator for Europeana: TEL, Apres, Athena, EU Screen
  o common system for the preservation of digital materials
  o common search portal for materials from the Swedish National Library and the Swedish National Archive
Raivo Ruusalepp

- standards: RAC, DSA, CIDOC (CRM), PAIMAS, ISA (DG), DDI
- use of information security standards for digital preservation
- information security: administrative and technical (physical = data security vs IT = communication)
- a company implemented security measurement with typical cyber crime scenarios
- survey of security
  o provision for information security in national legislation and development plans (12 of the respondents: ISO 27000 series; only 2 formal audits, the rest are looking into it; ½ of the respondents do not use standards or formal measurements to control information security)
  o IT & disaster plan: 65 (data recovery from the off-site location tested: 0)
- alignment
  o better use of community standards for information security and preservation
  o agreement on security requirements
Standards based approach to preservation planning, Matthew Woollard

- ISO 27001: very expensive implementation, 100 000 pounds
- Basic: Data Seal of Approval Guidelines (helps you understand your business better)
- Audit and Certification of Trustworthy Digital Repositories
- Memorandum of Understanding to Create a European Framework
- ISO 16363: external; DSA or ISO 16363: self audit

Best Practices & Standards, Bram van der Werf

- self assessment: trust; audits/certification: trust
- ISO 30300 (draft): Record Management
PANEL 4: Legal Alignment

Legal deposit & Web Archiving, Adrienne Muir

- legal deposit provisions: purpose, scope, deposit mechanisms, roles & responsibilities & liability, access provisions, sanctions
- implementation: definitions, scope (offline/online, freely available/pay wall), technology neutral, incremental
- legal deposit vs voluntary (interim/hybrid approach, model agreements and licenses, flexibility)
- other legal issues: intellectual property rights, preservation, access, unlawful material, privacy/data protection
- voluntary approaches have disadvantages but may be necessary and can be useful
Breakout Session

- standards bring along alignment by themselves, but only if you use them to the full, not halfway
- depends on the community (users); enforceable or voluntary compliance (standards)
- next steps for standards alignment
  o corpora as benchmarks
  o export/import completeness
  o educational standards
  o validation tools
  o accredited training courses to accredit auditors
  o framework standards
DAY 3

PANEL 5: Educational Alignment

- key elements of the DCC Curation Lifecycle Model
- Framework for the Education Alignment (USA: grad programs for digital preservation, new models for grad programs, internship programs (diverse knowledge), workshops)
  o sharing tools
  o national programs
- related issues to digital preservation
  o nature of costs and business models
  o strategies for selection & appraisal
  o ground roles and responsibilities
  o effectiveness and demand for services
- focus of the panel: factors influencing the actual sustainability of a digital archive
- 2 considerations: collaboration + user demand
- challenges + gaps: span national boundaries, public + private funding, education, exportation, DP certification, competition, funding gaps, policy, selection criteria, roles & responsibilities, standards
- Magazzini Digitali: e-legal deposit in Italy
- PADI: a failed project (discontinued in 2010)
Presentation without a title, Neil Grindley

- how much does it cost to manage information? what institutional financial strategies are required to facilitate effective preservation? what general economic frameworks are required to enable information to persist and be accessible?
- JISC 2010: Infrastructure for Education and Research Programme
- archival storage and preservation activities constitute a very small proportion of the overall costs: 15%, against 31% for access and 55% for outreach, acquisition and ingest
  o approx. 333 Euros for a set of 1000 records
- KRDS2, p. 83: future tool development supporting automation of ingest
Sustainable Preservation in North America: ADPNet & Friends, Aaron Trehub

- solution: distributed digital preservation (in at least 3 copies vs LOCKSS 6 copies)
- DDP + LOCKSS = PLN; open SW developed at Stanford
- MetaArchive, COPPUL (Canada), ADPNet: www.adpn.org
Colorite A Flexible Cross-Platform Software Solution for Automatic Image QualityAnalysis Using Arbitrary Targets Henrik Johansson National Library of Sweden (Sweden)
o The target is detected automatically
o TIFF, JPEG, JP2, PNG
o state-of-the-art feature-based image matching algorithm; ImageMagick
o in the future this could be switched off to speed up the process for batch processing
Henrika Johanssona What if the Image Quality Analysis Rates My D Dietmar Wueller Image Engineering (Germany)
- Golden thread - UTT target -
o
-lens reflex
Establishing Resolution Requirements for Digitizing Transmissive Content A Use Case Approach Michael Stelmach Library of Congress Don Williams Image Science Associates LLC and Steven Puglia National Archives and Records Administration (USA)
-
- Based on 10 SFR limiting resolution criteria how much the image information will be captured
o 1st half of the 20th century: 1200-1600 PPI
o 2nd half of the 20th century: up to 2800 PPI
Digitise More Pay Less Optimising the Workprocess for both Heritage Institute and Imaging Provider Olaf Slijkhuis Pictura Imaginis (the Netherlands)
- ument kvalita parametry atd - -
- trolu
obrazu
discussion with FamilySearch, 20.5.2011
also on digital preservation issues
will have to provide free of charge
and synchronize
an effort on their part to cooperate with archives and libraries
CO-FINANCED BY THE EU STRUCTURAL FUNDS (EUROPEAN REGIONAL DEVELOPMENT FUND) THROUGH THE IOP
Business trip report
Project: Creation of the National Digital Library
CZ 10611000706386
Name of trip participant: Jan Hutař
Department per organizational structure: ODF 81
Position: head of department
Purpose of trip: attending the iPRES 2011 conference
Place (city): Singapore
Place (country): Singapore
Date (from-to): 30.10.-5.11.2011
Detailed schedule: 30.-31.10. flight Prague > Dubai > Singapore
1.11. start of the conference, tutorials
2.11.-4.11. conference
4.-5.11. return flight Singapore > Dubai > Prague
Fellow travellers from NK: Mgr. Marek Melichar (funded from project 0136)
Funding: IOP, Creation of the National Digital Library
Relation to the project: obtaining new information on digital preservation and on projects in other libraries; consultations with colleagues and companies
Trip goals: see relation to the project; use all outputs for planning and running the NDK project and for future handling of digital preservation at NK/NDK
Fulfilment of trip goals: fulfilled; see the detailed notes below and the proceedings at SPS
Further details: SUMMARY AND CONTRIBUTION TO THE NDK PROJECT
a noticeable rise of emulation as an approach to long-term preservation (in previous years it was migration) > the two approaches will apparently complement each other
a shift towards preserving complex data (databases etc.); NK does not deal with this yet
many papers are applicable to NK and NDK as well (web archiving and preservation at the National Library of France, audits, emulation, information on the SDB system (Tessella) and on the RODA system, certification, see Rouchon, etc.)
information on problems and solutions in real-world use of tools such as JHOVE, PRONOM and others
2 papers on backing up optical discs, a current problem at NK as well; ideally follow the procedures described in the proceedings
a clear need for international cooperation and for adherence to standards so that such cooperation is possible
for details see below
Project publicity support: N/A
Related materials
Material / Storage location
conference proceedings / SPS, folder with business trip reports
Report submitted on: 15.11.2011
Signature of the submitter
Date, signature
Signature of the supervisor: 15.11.2011
Posted on the intranet
Received by the international department
Seamus Ross: digital curation and preservation
- preserving data sets: use of statistical databases and research data vs preservation of textual information
- personal data: no longer just photos in a box (Flickr etc.)
- banks: lots of personal data with future use for historians, so it must be preserved; institutions do this as well
- see mckinseycom big data full report pdf
- why do DP at all (slide 16): future generations expect it; for historians and scientists, so that they have some sources; a reference about the present for the future; information ecosystem to enable storytelling
- a shift of emphasis from preserving textual information to preserving complex databases
A capability model for DP, Ch Becker et al
- research within the SHAMAN and SCAPE projects
- SoS: systems of systems
- 3 kinds of systems (UML)
- DPS as a functional requirement
- SoS: business system, a system within a system; data then flow into the DPS
- DPS where DP is not a functional requirement but the system does it anyway (DP-ready system): a business system with DP functionality
- but how to get DP into enterprise systems: a model for implementing DP in any system, within the SHAMAN project; a capability-based reference architecture
- governance, business and technical (operation) capability: a basis for decisions and for assessing the current state
- capability maturity model (CMM): process assessment and improvement, as in SW development
Olivier Rouchon: certification and quality at CINES
- they store theses, digitized items, multimedia, documents, scientific data sets
- a data centre for the whole of France
- they have experts on formats and XML; 11 people
- 15TB of data
- certification: by national law CINES is the national centre for DP of theses; they have a department, people and money for it; procedure and preparations see below
- preparation for certification: testing DRAMBORA, DSA, TRAC, ISO 16363 and ISO 16919
- step: 2009 DRAMBORA audit, 2 risk reviews per year, checking how the risks are being resolved
- step: formalization of business processes, 14 processes according to ISO 9001
- management, operational and support processes (presented at iPRES 2010)
- 2009 external pre-audit: 2 people, 19 man-days, based on all available standards, done by reviewing documentation and by interviews
- 2010 SIAF audit: 4 months; carried out by the National Archives of France for every archive that stores public data; they audit every 3 years; the report had 800 pages
- 2010 Data Seal of Approval: part of the EU framework for audit and certification of trusted repositories (MoU between the three certification activities)
- 2011 within the APARSEN project they also did ISO 16363; they went through that audit together with DANS and UKDA
- first internal, January and April 2011 (60 man-days), then external, 12 experts (KB, BL, NASA etc.) in June 2011 (3 days)
MoU: DSA > ISO 16363 as a second step (internal) > ISO 16363 extended
an audit gets faster the more of them you do, i.e. if it is done regularly it is not so time-consuming
The National Library of New Zealand went through TRAC certification in autumn 2011
Andreas Rauber: impact of preservation actions on the repository
what happens to the repository itself
repository simulation: RepoSim
for analysis and for testing migration: what happens when files grow, what if the repository holds more format types, etc.
RepoSim: a flexible simulator, irregular patterns
so far an internal version: Hibernate, Java, MySQL
one can specify which formats it accepts, their description, ingest settings, hypothetical tools (mainly for migration), and rules for preservation actions (migration into which format, which version, which files, how many times; rules + filters)
a virtual migration can be run; it produces graphs that tell how long it will take, etc.
what, how, into what and over what period to migrate: it runs virtually and we see the result
good for IT and HW planning
good for planning different scenarios, comparing with the expected development, planning HW upgrades and investments
still to be added: the ability to specify deletion policies, reports, etc.
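The idea of a "virtual migration" can be illustrated with a toy calculation. The sketch below is not RepoSim and does not reflect its internal model; the class `FormatHolding`, the functions `migration_hours` and `grow`, and all numbers are assumptions made up for illustration. It only shows how a rule (which format is migrated to which) combined with a holdings profile and an assumed throughput gives a duration estimate that can be recomputed as the repository grows.

```python
from dataclasses import dataclass


@dataclass
class FormatHolding:
    """Hypothetical description of what the repository holds in one format."""
    name: str
    file_count: int
    avg_size_mb: float


def migration_hours(holdings, rules, throughput_mb_s):
    """Estimate the hours needed to migrate every holding matched by a rule.

    rules maps a source format name to a target format name; only the
    source side is used for the time estimate.
    """
    total_mb = sum(h.file_count * h.avg_size_mb
                   for h in holdings if h.name in rules)
    return total_mb / throughput_mb_s / 3600


def grow(holdings, yearly_growth):
    """Return the holdings after one year of growth in file counts."""
    return [FormatHolding(h.name,
                          round(h.file_count * (1 + yearly_growth)),
                          h.avg_size_mb)
            for h in holdings]


if __name__ == "__main__":
    # All figures below are invented for the example.
    holdings = [FormatHolding("TIFF", 100_000, 20.0),
                FormatHolding("PDF", 50_000, 2.0)]
    rules = {"TIFF": "JP2"}
    for year in range(3):
        hours = migration_hours(holdings, rules, throughput_mb_s=50.0)
        print(f"year {year}: about {hours:.1f} h to migrate TIFF")
        holdings = grow(holdings, yearly_growth=0.10)
```

A real simulator would also model tool failure rates, irregular ingest patterns and storage limits; this sketch deliberately leaves those out.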
José Barateiro: Risk assessment in DP of e-science data and processes
DP as risk management
ISO 31000: definition of risk management
similar to DRAMBORA
there are many standards for risk management
the ISO 31000 methodology elaborated into individual steps
TIMBUS project httptimbusprojectnet; one of the partners is SAP (Germany)
mad talks
- open source SW for LTP: RODA is back and is being developed within the SCAPE project; new functions, plans for further development and for building a user community
- 4 posters on emulation: emulation within KEEP, emulation for library reading rooms, OPF ecosystem, registry
- TOTEM: a metadata standard for describing the technical environment for emulation
ANDS, Ross Wilkinson
- data centres in Australia, at least 3, for different areas of life
- ANDS has existed for almost 3 years, funded by the Australian government
- a huge amount of data will never be used or read by a human, only by automated data-mining processes
- research data must be stored and preserved because it may no longer be possible to recreate them; stored so that they can be reused, so that new conclusions can be drawn from them, and so that scientists have them available
- this has to be done in cooperation, it cannot be done by a single institution alone
- who deals with storing scientific data in the Czech Republic? Academy of Sciences, CESNET
- similar data centres exist in Great Britain too
Rob Sharpe: Considerations for High Throughput Digital Preservation
Presentation by Tessella: their performance testing of ingest into SDB at FamilySearch
- SDB has been developed since 2002, when the first customer was the National Archives UK
- new customer: UK Parliament
- test with FamilySearch
- 20TB ingest per day, scanned materials, workflow with antivirus, characterization (PRONOM, JHOVE) etc.
- 1 package is about 1GB, 20 thousand packages per day
- 2 Dell PowerEdge R710 servers, total price max 20,000 pounds
- the limiting factor turned out to be the read speed of the disks on which the data are stored at the start of ingest; they therefore needed 130 parallel disks (50 thousand pounds)
- storing to tape is also slow; they therefore needed 8 parallel tape writes (30 thousand pounds)
- storage costs 100 pounds per TB
- 7.3 PB per year
- conclusion: writing and reading are slow, tools such as JHOVE and PRONOM are fast enough; storage costs also turned out to be high
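The figures quoted in the talk can be cross-checked with simple arithmetic. The sketch below uses only the numbers from the notes (20TB per day, packages of about 1GB, 100 pounds per TB) and assumes decimal units (1 TB = 1000 GB); the function names are made up for this example.

```python
SECONDS_PER_DAY = 86_400


def mb_per_second(tb_per_day):
    """Sustained rate in MB/s needed to ingest tb_per_day (decimal units)."""
    return tb_per_day * 10**12 / SECONDS_PER_DAY / 10**6


def packages_per_day(tb_per_day, gb_per_package):
    """Number of packages per day for a given package size."""
    return tb_per_day * 1000 // gb_per_package


def pb_per_year(tb_per_day):
    """Yearly volume in PB at a constant daily ingest."""
    return tb_per_day * 365 / 1000


def yearly_storage_cost_gbp(tb_per_day, gbp_per_tb):
    """Cost of storing one year of ingest at a flat price per TB."""
    return tb_per_day * 365 * gbp_per_tb


if __name__ == "__main__":
    print("sustained rate, MB/s:", round(mb_per_second(20), 2))
    print("packages per day:", packages_per_day(20, 1))
    print("PB per year:", pb_per_year(20))
    print("storage cost per year, GBP:", yearly_storage_cost_gbp(20, 100))
```

The daily target corresponds to an average of roughly 230 MB/s, which fits under the 700MB per second peak mentioned in the notes; the yearly storage cost is a derived figure, not one stated in the talk.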
To ingest data from the FamilySearch project they needed to ensure a throughput of 20TB of data per day while keeping adequate data-processing procedures as required by OAIS and by the customer. The project was about identifying the bottlenecks of ingesting large volumes of data.
Processes such as generating or checking hashes, identifying formats and extracting technical metadata usually require, at large volumes, a large and fast storage system. In the FamilySearch project they want to ingest into SDB (content acquisition, content preparation, ingest, fixity check, content metadata integrity check, characterization, i.e. format identification and validation and extraction of technical MD) at most 700MB per second.
The question is how to parallelize such a massive workflow efficiently while minimizing costs. According to their findings, parallelization makes it possible to get around performance problems of tools such as DROID and JHOVE; overall software performance was, contrary to their expectations, not a problem. The bigger problems are in the HW, which must be able to write fast enough.
I.e. the bottleneck was in the HW and in moving data from place to place rather than in the performance of the digital preservation tools.
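A minimal sketch of a parallelized fixity check follows. It is not Tessella's SDB workflow; the function names `sha256_of` and `check_fixity`, the choice of SHA-256 and the worker count are assumptions for illustration. Files are read in chunks so that large packages do not have to fit in memory, and a thread pool keeps several reads in flight, which is where the notes locate the bottleneck.

```python
import hashlib
from concurrent.futures import ThreadPoolExecutor


def sha256_of(path, chunk_size=1024 * 1024):
    """Return the SHA-256 hex digest of a file, read in fixed-size chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def check_fixity(expected, workers=8):
    """Compare stored digests with freshly computed ones.

    expected maps a file path to the digest recorded earlier.
    Returns the list of paths whose current digest differs.
    """
    paths = list(expected)
    with ThreadPoolExecutor(max_workers=workers) as pool:
        actual = list(pool.map(sha256_of, paths))
    return [path for path, digest in zip(paths, actual)
            if digest != expected[path]]
```

Usage would be along the lines of `check_fixity({"/data/pkg1.zip": "ab12..."})`, with the path and digest taken from whatever manifest the repository keeps. With slow disks or tapes, adding workers only helps until the storage itself is saturated.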
Benefit for NK
No need to worry about the performance of SW such as DROID or JHOVE.
Ross King: Evolving domains, problems and solutions for LT DP
- information on projects such as SCAPE etc.
- programme httpcordiseuropaeufp7icttelearn digicultreport research digital preservation_enpdf
Stephan Strodl (Vienna University of Technology, Austria), Petar Petrov (Vienna University of Technology, Austria), Andreas Rauber (Vienna University of Technology, Austria)
A nice timeline for preservation projects, a whitepaper about the past of European DP
- Funding spent on DP research is gradually growing. Projects and funding alone will not solve anything.
- ARCOMEM: archiving of web archives, socially driven web preservation model
- social web analysis
- archive enrichment
- ENSURE: evaluation between cost and value, automation of the preservation cycle, testbeds: healthcare, clinical trials, financial services
- SCAPE
- preservation planning and action workflows: how to make them scalable
- building an infrastructure for scalable preservation actions
- development of a policy-based preservation planning tool with automatic preservation watch
- 3 testbeds: WA, large-scale repositories, research data sets
- all the projects will produce prototype SW
- digital lifecycle approach
- preservation planning plays a role in all these projects, together with virtualization
- slide with trends in DP over recent years
- Research on Digital Preservation within projects co-funded by the European Union in the ICT
- Ensure
- Scape
- Wf4Ever httpwwwwf4ever projectorgabout
- Timbus: SW is not enough; it focuses on the context of organizations; LTP is not only about objects but also about services etc.
- Totem
Benefit for NK
Follow projects in the area of long-term preservation of digital data. The latest EU projects such as SCAPE will speed up the development of concrete tools for long-term preservation of digital data.
Record keeping in temporary command settings, Erik Borglund
preservation of documentation of crisis situations arising from the work of the police etc.
how to capture context: flipcharts, videos and minutes can be kept; the context of analogue documents is not a problem, the problem is with digital items and conversations
the national archive should take care of this, but it only accepts paper documents or e.g. photos from the meeting place
the question of archiving file material is the same as archiving the course of a meeting in digital form
webarchiving session
BnF
200TB of web-archived data
15 million ARC files; they have to characterize and validate them, which is time-consuming
they store them in a shared repository
SPAR (the LTP system of the French national library) has a capacity of 16PB
they use JHOVE2 for characterization and are building a module for ARC files
they do not want to characterize and validate the content of the ARC files, only identify formats
PREMIS in METS would be too long, so they will record only metadata at the level of the information package (AIP) that are the same for the whole package; i.e. a property is expressed once and then only the information about which files it applies to is added, instead of repeating that information for every file
they created a special metadata format
i.e. they can ask the LTP system "give me all information packages that contain format XY" etc.; the metadata of the contents need not be indexed, which would take long; they take the same approach for digitized books
different DP policies and validation levels for different types of WA data: complete harvests vs thematic harvests
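The package-level approach described above (state a property once and list the files it applies to) can be sketched as follows. This is not the BnF metadata format; the function names and the dictionary layout are assumptions chosen only to show the principle and the kind of query it supports.

```python
from collections import defaultdict


def summarize_package(files):
    """Group the files of one package by format.

    files is a list of (filename, format_name) pairs. The result states
    each format once and lists the files it applies to, instead of
    repeating the format on every file record.
    """
    by_format = defaultdict(list)
    for filename, format_name in files:
        by_format[format_name].append(filename)
    return dict(by_format)


def packages_containing(packages, format_name):
    """Return the ids of packages whose summary mentions format_name."""
    return sorted(package_id
                  for package_id, summary in packages.items()
                  if format_name in summary)


if __name__ == "__main__":
    # Package ids, file names and formats are invented for the example.
    packages = {
        "aip-001": summarize_package([("a.html", "HTML"),
                                      ("b.gif", "GIF"),
                                      ("c.html", "HTML")]),
        "aip-002": summarize_package([("d.pdf", "PDF")]),
    }
    print(packages["aip-001"])
    print(packages_containing(packages, "HTML"))
```

The query only touches the per-package summaries, which is the point made in the notes: the metadata of every individual content file does not have to be indexed.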
NL NZ
2 harvests, 20 TB in total
they are dealing with metadata: how much metadata is a lot and how much is too little
library policy says that as much metadata as possible must be stored, but that would be a problem in terms of metadata size
for selective web harvests they have a finished workflow in WCT; everything is catalogued
IA
16 billion URLs
oldest from 1996
growth is 3TB per day, 1PB per year
Euan Cochrane, Dirk von Suchodoletz: Replicating Installed Application and Information Environments onto Emulated or Virtualized Hardware
- capturing and preserving a whole environment on emulated HW
- e.g. take the desktop environment of the prime minister and store it in the archive
- rendering problems
- computer forensics
- an option for preserving scientific data and records
- it is all about how to replicate an HDD and run the environment stored on it in a virtual environment
- solution
- they pulled HDDs out of several old PCs > identify HW requirements (HDD analysis > automatic estimate of requirements; this information is part of every PC environment) > select emulation/virtualization SW (tool registry such as TOTEM from the KEEP project) > convert the HDD into a disk image suitable for emulation > try to boot the disk image on emulated HW > add drivers
- problems with licences, personal data protection, authenticity (20 items change, colours etc.)
- QEMU, SPARC processor emulator
Klaus Rechert: Remote Emulation for Migration Services in a Distributed Preservation Framework
use of emulation as a tool for migration
tools for migrating certain formats are often not available
We put the digital object into an emulated environment (virtual machine); we then see it in the environment of the emulated system, can open it in the original or another suitable application, save it in a different format and store it back in the virtual machine
Bram Lohman: Emulation as a Business Solution: the Emulation Framework, KEEP project
emulation framework: 7 emulators, 6 platforms (x86, Amiga etc.), 23 file formats
a solution for managing emulation tools
setup of emulation processes
an environment that contains emulators; if we load an application or a file into it, it should run as in the original environment
the environment also contains a tool that shows, for a file, what format it is and which environment is needed to run it, based on PRONOM; the environment can be prepared directly and the file run in it; it loads the SW image from the application database that OPF is building
Geoffrey Brown: Developing Virtual CD-ROM Collections: The Voyager Company Publications
- publications of one particular publisher on CD, interactive applications for Mac; of 200 published, only about 50 are available now
- emulation on today's systems
- HDD snapshot directly in the emulator, i.e. it takes one click and is very fast
- SheepShaver emulator
Evaluation of Danish large migration project
Before 1998 they had no formats prescribed by law
Between 2005 and 2008 they introduced standards
The evaluation concerns the prescribed standards and migration to them in the national archive
The evaluation was done for the funder
Between 2005 and 2008 they spent 30 person-years on migration, had 10-15 people on it, invested 190 thousand USD, total cost 26 million USD
In reality it is not much data; what they migrated was about 1777GB
Different parts of the archive: tapes, population data, data on CD-R, registries and electronically populated data
They could not read all files, especially on tapes
5 different types of tapes
Some had to be rescued at great cost
Total cost of producing the preservation standard: 10 man-years, 12 thousand USD, including the manual and implementation recommendations
Pilot: planning and management of the project, and verifying the information packages
The aim of the pilot was to get a better budget and project plan
Some problematic data in old formats, such as old databases etc., need smart people doing repetitive, long-lasting work; they needed good knowledge management to make it efficient
Migration method: they wrote requirements for a tool and a description of how manual migration should be done
Data preparation (restructuring data and registering IP metadata) and preparation of documentation for the migrated IPs
Software development: in-house development
They needed 50 person-years for 1
Conclusions
migration of standard data is cheaper, migration from some standard tapes is cheaper, etc.
Most of the costs (80 %) fell on non-standardized data, in producing migration software. Developing a tool for migrating heterogeneous or non-standard data is the most expensive part
What they learned: they had not analysed the old data sufficiently
Project management was loose; they lost money
Knowledge management: a good description of old data and of all their types, generations, locations etc. does not exist at our institution and we will have difficulties with that; migration of old data at NK will be problematic
Angela Dappert: robust migration workflow for offline media
- What is an archival object (nice slide): a CD is not an archival object, for them it is a hand-held carrier; better is a bit-stable object, which can have a backup etc., and on top of it an archival object, which has additional metadata (logical preservation)
- A CD is not searchable, cannot be replicated easily, has a large manual overhead, and rendering technology becomes obsolete very quickly
- Endangered Archives project: optical disks, CD-R, external HD, tapes, 67 terabytes in total
- OFFLINE hand-held carriers in the Endangered Archives project were very variable; they contained data with DRM, under copyright, and a number of such problems
- Options they chose between for each data source:
- Disk image: a single file that contains everything on the carrier
- Or extraction of only some files
- How important is the carrier itself? We need some information about it; it may hold traces of deleted data and we may want to have them. They made disk images of everything possible: hybrid DVDs, audio discs that also contained data, etc.
- Which disk image should they use? Not a single disk image format for all data; a special disk image format for each
- They did it with robots, disk copying robots; some large-scale disk copying robots could not be used: they are good at producing CDs but not at ripping data from CDs
- They built their own application with disk stacks and some smaller robots; they used LIFO or FIFO; in the end they used FIFO, as LIFO had problems with the CD lifter
At KB they devised a fairly complex workflow for how to describe it etc.
Each robot had its own PC
They had problems with a number of things, see presentation
They did not find good SW for image management, only command-line tools, but non-technical staff would make a lot of mistakes with them
It takes a lot of people before the content gets online
Staff must be well trained and flexible, but also able to do tedious jobs: systematic, patient
NOTE: important for NK, where conversion of data from discs will also have to be handled and has already been handled
KEEP project, Antonio Ciuffreda: towards integrated migration environment
- Disk transfer tool: converts a disk into an image file. It also contains additional metadata about the file system, the MD5 of the file, etc.
- KEEP is producing a Transfer Tool Framework
- Magnetic media: disk transfer tools for floppy disks, commercial and open source
- Disk2FDI: commercial DOS tool, very accurate floppy disk image; it takes 1 hour and the result is very large, ten times larger than the floppy disk itself. Tested about 2260 disks; they tested emulation
- Catweasel: commercial tool, a PCI card; runs on Linux and Win XP, has a GUI. High error rate but faster; image file quality was low
- Nibtools: free tool, G64 and D64, covers only C64; DOS, Win, Linux, but command line only. Needs a Commodore disk drive and special cables. They tested a few disks; about half did not then work in the emulator
- Optical media: they used 5 transfer tools, with the same CDs, DVDs and games for all
1 Alcohol 120: commercial, can bypass DRM etc.; supports Win systems. 12 of 13 worked, 2MB per second
2 Daemon Tools: commercial, several types of image files (ISO, MDS, MDF); supports Win; three of 13 did not work
3 CloneCD: commercial, uses IMG or ISO, bypasses SafeDisc 3 protection; supports Win, has a GUI; 11 of 13 were OK
4 BlindWrite: commercial, supports DVD, Blu-ray, WM, Xbox and other special discs. Generates ISO and some proprietary ISO image formats; one did not work; transfer speed
5 ImgBurn: does not read subchannel information into the image file (a film cannot be seeked etc.); it is open source, generates DVD, BIN, CUE, IMG; Win and Linux; 4 did not work; it is fast
Conclusions
For magnetic media, commercial and non-commercial tools do not differ in performance; Disk2FDI is accurate but very slow. KEEP will use NibTools
Optical: think about copy protection; the tools have similar performance; there will always be errors in the images, between 30 and 10 per cent; BlindWrite can handle game discs, Xbox etc. KEEP will use ImgBurn because it is open source
For complex images BlindWrite is better
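What a disk transfer tool does at its simplest (copy the carrier into one image file and record an MD5 alongside) can be sketched as below. This is not the KEEP Transfer Tool Framework and does not imitate any of the tools listed above; the function name `copy_to_image` and the returned field names are assumptions. It handles neither copy protection nor read errors, which the notes identify as the hard part.

```python
import hashlib


def copy_to_image(source_path, image_path, chunk_size=1024 * 1024):
    """Copy a readable source (file or device node) into an image file.

    The MD5 is computed while copying so the source is read only once.
    Returns a small metadata record describing the image.
    """
    md5 = hashlib.md5()
    size = 0
    with open(source_path, "rb") as source, open(image_path, "wb") as image:
        while True:
            chunk = source.read(chunk_size)
            if not chunk:
                break
            image.write(chunk)
            md5.update(chunk)
            size += len(chunk)
    return {"image": image_path, "size_bytes": size, "md5": md5.hexdigest()}
```

On a Linux system the source could be a device node such as an optical drive, but that path depends on the machine and is not something the notes specify.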
Benefit for NK
Consider whether it would be appropriate for NK to actually run a project migrating the content of CDs and DVDs to online media. Here they present concrete experience with robotic processing and show what problems they had with designing the workflow, choosing the type of ISO image, etc.
BnF web archiving
They have three layers
Harvest definition: collection
Harvest instance: crawling metadata
ARC files
A collection will be, for example, the selective "elections 2000" harvest, then the individual harvest instances and then the ARC files
They collect logs, config and reports
These are also stored in ARC, a special metadata ARC for each crawling instance
PREMIS
Object, agent, event
Objects: 1 ARC files and metadata ARCs, 2 harvest instances
Harvest event in PREMIS: event "creation of content files"
Events: reports as extensions of the event, host report and harvest report
Agents: administrator, SW, institutions, organizations that perform the harvest
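The object, agent and event roles listed above can be pictured with a few plain data classes. This is a simplified illustration, not the PREMIS schema and not the BnF implementation; every class name, field name and value below is hypothetical.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class PremisObject:
    """Something that is preserved: an ARC file, a metadata ARC, a harvest instance."""
    identifier: str
    kind: str


@dataclass
class Agent:
    """Who or what acted: an administrator, software, an institution."""
    name: str
    kind: str


@dataclass
class Event:
    """An action linking objects and agents, with reports attached as extensions."""
    event_type: str
    objects: List[PremisObject]
    agents: List[Agent]
    reports: Dict[str, str] = field(default_factory=dict)


if __name__ == "__main__":
    # Identifiers and report names are invented for the example.
    arc = PremisObject("example-0001.arc", "arc_file")
    crawler = Agent("crawler software", "software")
    harvest = Event("creation of content files", [arc], [crawler],
                    reports={"host_report": "hosts.txt",
                             "harvest_report": "harvest.txt"})
    print(harvest.event_type, [o.identifier for o in harvest.objects])
```

Keeping the reports on the event rather than on each object mirrors the note that the host report and harvest report are recorded as extensions of the harvest event.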
ContainerMD
httpbibnumbnffrcontainerMD v1documentationcontainerMD v1html
special metadata for items from the web archive
httpbibnumbnffrcontainerMD v1
different SLAs for different types of material and for different data from different harvests; shared repository: they expect various benefits; skills for different formats need not be duplicated within the institution
next year there should also be a JHOVE2 module for WARC
Memento: meta search engine
Benefit for NK
Their web archiving model could be used at NK
Cost models: Danish national library + TU Wien
- Stephan Strodl, TU Vienna: they have their own cost model, but apparently only for the cost of small-scale automated preservation actions
- Danish national library: they built their own model, which should be universal and usable anywhere
- They measured the cost of submission according to the PAIMAS standard
- When calculating cost they use OAIS and PAIMAS: they map activities onto these models and estimate process costs accordingly
- Costmodelfordigitalpreservationdk
Benefit for NK
For project 0136: options for estimating the costs of long-term storage were addressed there
Meet RODA, a Full-Fledged Digital Repository for Long-Term Preservation
- Originally a project of the Portuguese national archive; we have been following it for several years. RODA is now supported by an independent company, which also partly develops it further. So far RODA supports only an archival metadata format (EAD), but further development should include library formats as well
- RODA is now part of the SCAPE project, where it will be possible to develop the system further and scale it for use in mass production
- httpredminekeepptprojectsroda public
Benefit for NK
Follow further development; possibly also relevant for the INCAD + KNAV projects; for developing LTP for smaller institutions this could be an interesting alternative in the future
Zpraacuteva ze zahrani niacute sluebniacute cesty
Jmeacuteno a p iacutejmeniacute uacute astniacuteka cesty Mgr Andrea Fojt (AF)
Pracovit dle organiza niacute struktury 15 Odd leniacute dlouhodobeacute ochrany digitaacutelniacutech dat (ODODD)Pracovit za azeniacute 152 Odd leniacute spraacutevy obsahu digitaacutelniacuteho repozitaacute eD vod cesty Uacute ast na konferenci Aligning National Approches to Digital
preservationMiacutesto m sto TallinnMiacutesto zem EstonskoDatum (od do) 22 265 2011Podrobnyacute asovyacute harmonogram Pond liacute 235 registrace na konferenci komentovanaacute prohliacutedka
Naacuterodniacute knihovny Keynote Address by Laura Campbell Kongresovaacute knihovna USAPanel 1 Technical AlignmentPanel 2 Organizational Alignment
Uacuteteryacute 245 Keynote Address by Gunnar Sahlin Naacuterodniacuteknihovna veacutedskaPanel 3 Standards AlignmentPanel 4 Legal AlignmentBreakout Sessions for panels 3 amp 4
St eda 255Panel 5 Education AlignmentPanel 6 Economic AlignmentBreakout Sessions for panels 5amp6SynthesisClosing remarks
Spolucestujiacuteciacute z NK PhDr Bohdana Stoklasovaacute (BS)Ing Tomaacute Svoboda (TS)
Finan niacute zajit niacute IOP NDKCiacutele cesty P iacutetomnost na konferenci s mezinaacuterodniacute uacute astiacute ziacuteskaacuteniacute kontakt
pro oblast dlouhodobeacute ochrany digitaacutelniacuteho dokument apovinneacuteho elektronickeacuteho vyacutetisku podrobn jiacute vhled toproblematiky dlouhodobeacute ochrany digitaacutelniacutech dokument(zejmeacutena) v naacuterodniacutech knihovnaacutech
Pln niacute ciacutel cesty (konkreacutetn ) Zaacutev ry konference Aligning National Approaches to DigitalPreservation vesm s kopiacuterujiacute zaacutev ry wokshopu The Future ofthe Past Shaping new visions for EU research in digitalpreservation (zpraacuteva dostupnaacute nahttpcordiseuropaeufp7icttelearn digicultfuture of thepast_enpdf) nap v p iacutepadeacute chyb jiacuteciacute ekonomickeacuteho modelu prokomer niacute sfeacuteru kteraacute by dlouhodobou ochranu vniacutemala jakoneodd litelnou sou aacutest vech svyacutech proces Byl navaacutezaacuten kontakt s pracovniacuteky naacuterodniacutech knihoven Estonska aFinska (pracovniacuteciacute pro archivaci webu a dlouhodobou ochranuobecn )
Program a daliacute podrobn jiacute informace Hlavniacutem ciacutelem konference bylo sjednotit naacuterodniacute postupy v oblastidlouhodobeacute ochrany digitaacutelniacutech dokument nap iacute vemioblastmi od technickyacutech organiza niacutech vzd laacutevaciacutech a postandardiza niacute ekonomickeacute a finan niacute
P ivezeneacute materiaacutely konferen niacute program letaacuteky vystavujiacuteciacutech firem (Tessella EquellaGuardtime) + daliacute materiaacutely zaacutepisky
Date report submitted: 8 June 2011
Signature of the submitter
Signature of the supervisor
Posted on the intranet
Received by the international department
Appendix to this report: conference notes in English
Appendix: Conference notes in English
Exploring What We Can Do Together: Strategic Alliance for International Collaboration (Laura Campbell)
- 185 digital preservation partners in more than 25 countries (education, research, LAM)
- strategic goals: National Content Stewardship Network (national digital collection, technical architecture, public policy, outreach)
- NDIIPP Content Domain Map: a mind map of geospatial, audiovisual, image & text, and web content
- then & now: cognitive surplus vs. digital libraries/digital preservation
- solution framework: actively working together, special interest groups, establishing a common index, international digital collection (freely available)
PANEL 1: Technical Alignment (The role of testing), Prof. Dr. Michael Seadle (Panel Head)
to collaborate on requiring and implementing rigorous and independent tests
DNB Contribution to the Tallinn Alignment (Sabine Schrimpf)
- key theme is infrastructure
- network of hardware and software that permits the operation of software applications
- question of interoperability is crucial (standards, technical specifications)
- SW elements
- components of the DP infrastructure were compared to the pallets used on railroads
  o Source: PARSE.Insight Roadmap 2020
- persistent ID resolvers, certification process
- kopal (ingest: koLibRI)
- nestor (German Network of Expertise)
- DP4Lib: Digital Preservation as a Service; reduce dependency between components
  o redundant storage at different locations
  o koLibRI modules
- LuKII: set up as an economical LOCKSS network in Germany
- SHAMAN
- APARSEN: wants to bring coherence and cohesion to digital preservation research
  o trends in DP research projects
  o modular DP systems
  o distributed as SOA
  o elimination of technology dependencies
EDINA: The UK LOCKSS Alliance (Adam Rusbridge)
- EDINA offers underlying technical support & coordination
- threats to digital stewardship
  o failure (media, HW, SW, network, format obsolescence, natural disaster, economic/organizational failure)
  o attack (insider/outsider)
  o operator error
- source: Requirement to Digital Preservation
- projects PEPRS and PECAN help identify coverage and requirements for DP
Public testing (Michael Seadle)
- traditional physical archiving relies heavily on trusted institutions
- distrust, not trust, needs to be the basis of digital archiving; testing therefore plays a key role
- goals of testing: demonstrate functionality, reveal weaknesses, provide data for planning improvements
- key issues for testing: integrity, authenticity (can the origin or genuineness be shown), usability (can migration/emulation be demonstrated), access, financial integrity
- DrWho (drwho1com)
- bit stream testing is the most important; authenticity and usability may be impaired
  o the type of storage media, the number of copies, and the frequency of checking and replacement get us to the relevant results
  o no reliable metrics exist, however (what is an acceptable loss, etc.)
- without well documented, peer reviewed, publicly available test results, librarians are buying archiving systems on faith
Presentation without a title (Andy Rauber)
- evaluation vs. testing vs. benchmarking
- DP today: evaluation rather than testing, and far from benchmarking (a few tests, but nowhere near a definition of benchmarks)
  o existing evaluations are not repeatable
  o focus on the simple things
  o building the frameworks before having clear test scenarios
- necessary to move towards comparative benchmarking
- what is needed: a commitment that we want a culture of benchmarking and comparative evaluation; an understanding of what we want to benchmark; benchmark data + ground truth; measurement scales and measures that remain constant; knowledge bases to collect these
Organizing digital preservation on an international level (Michelle Gallinger, NDIIPP)
- focus on a national DP agenda
- community driven, action oriented (National Digital Stewardship Alliance)
  o present a distributed national digital collection for the benefit of citizens
The European Research Arena (David Giaretta)
- technologies: GEANT, EGEE/EGI
- EU research projects: TIMBUS, BLOG4EVER, SCAPE, ENSURE, APARSEN, ARCOMEM, WF4EVER
  o SCIDIP-ES (2011-2014)
- Alliance for Permanent Access (APA) formed as a legal entity 3 years ago
  o opportunities for networking
- ISO 16363: Audit and Certification of Trustworthy Digital Repositories
- ISO 16919: Requirements for Bodies Providing Audit and Certification
Observation from the MetaArchive Cooperative program (Martin Halbert)
- distributed DP programs differ from other programs
  o replication of content, distribution of these replicated copies to distinct geographical locations, and a network organization to connect these replicated copies
- MetaArchive established in 2003, funded by NDIIPP
  o seeks to foster broader awareness of digital preservation issues
- IIPC members are all institutions that focus on WA
  o 39 members: national and university libraries + other organizations (Internet Archive)
  o ISO standard WARC format for web archives + Heritrix and NutchWAX
  o growing membership (Africa & South America)
DAY 2

Keynote address: International and National Collaboration in the digital age (Gunnar Sahlin)
- 2012: a new law for e-deposit
- Samsök (search together) in 2005 (upgrade, new system)
- Swepub and long-term preservation
- Consortium of the Swedish research libraries for licensing e-journals and databases (ICOLC)
- Open Access and e-publishing (all universities have their repositories for e-pubs)
- NL: aggregator for Europeana; TEL, Apres, Athena, EUscreen
  o common system for the preservation of digital materials
  o common search portal for materials from the Swedish National Library and the Swedish National Archive
Raivo Ruusalepp
- standards: RAC, DSA, CIDOC (CRM), PAIMAS, ISA (DG), DDI
- use of information security standards for digital preservation
- information security: administrative and technical (physical = data security vs. IT = communication)
- company implemented security measurement with typical cyber crime scenarios
- survey of security
  o provision for information security in national legislation and development plans (12 of the respondents: ISO 27000 series; only 2 formal audits, the rest are looking into it; ½ of the respondents do not use standards or formal measurements to control information security)
  o IT & disaster plan: 65 (data recovery from the off-site location tested: 0)
- alignment
  o better use of community standards for information security and preservation
  o agreement on security requirements
Standards based approach to preservation planning (Matthew Woollard)
- ISO 27001: very expensive implementation, 100 000 pounds
- Basic: Data Seal of Approval Guidelines (helps you understand your business better)
- Audit and Certification of Trustworthy Digital Repositories
- Memorandum of Understanding to Create a European Framework
- ISO 16363 external; DSA or ISO 16363 self audit

Best Practices & Standards (Bram van der Werf)
- self assessment: trust; audits/certification: trust
- ISO 30300 (draft): Record Management
PANEL 4: Legal Alignment

Legal deposit & Web Archiving (Adrienne Muir)
- legal deposit provisions: purpose, scope, deposit mechanisms, roles & responsibilities & liability, access provisions, sanctions
- implementation: definitions, scope, offline/online, freely available/pay wall, technology-neutral, incremental
- legal deposit vs. voluntary (interim/hybrid approach, model agreements and licenses, flexibility)
- other legal issues: intellectual property rights, preservation, access, unlawful material, privacy/data protection
- voluntary approaches have disadvantages but may be necessary and can be useful

Breakout Session
- standards bring alignment by themselves, but only if you use them to the full, not halfway
- depends on community (users); enforceable or voluntary compliance (standards)
- next steps for standards alignment
  o corpora as benchmarks
  o export/import completeness
  o educational standards
  o validation tools
  o accredited training courses to accredit auditors
  o framework standards
DAY 3

PANEL 5: Educational Alignment
- key elements of the DCC Curation Lifecycle Model
- Framework for the Education Alignment (USA: grad programs for digital preservation, new models for grad programs, internship programs (diverse knowledge), workshops)
  o sharing tools
  o national programs
- related issues to digital preservation
  o nature of costs and business models
  o strategies for selection & appraisal
  o ground roles and responsibilities
  o effectiveness and demand for services
- focus of the panel: factors influencing the actual sustainability of a digital archive
- 2 considerations: collaboration + user demand
- challenges + gaps: span national boundaries, public + private funding, education, exportation, DP certification, competition, funding gaps, policy, selection criteria, roles & responsibilities, standards
- Magazzini Digitali: e-legal deposit in Italy
- PADI: a failed project (discontinued in 2010)
Presentation without a title (Neil Grindley)
- how much does it cost to manage information; what institutional financial strategies are required to facilitate effective preservation; what general economic frameworks are required to enable information to persist and be accessible
- JISC 2010: Infrastructure for Education and Research Programme
- archival storage and preservation activities constitute a very small proportion of the overall costs: 15%; access 31%; outreach, acquisition, ingest 55%
  o approx. 333 Euros for a set of 1000 records
- KRDS2 p. 83: future tool development, supporting automation of ingest
Sustainable Preservation in North America: ADPNet & Friends (Aaron Trehub)
- solution: distributed digital preservation (in at least 3 copies vs. LOCKSS 6 copies)
- DDP + LOCKSS = PLN; open SW developed at Stanford
- MetaArchive, COPPUL (Canada), ADPNet: www.adpn.org
- [...] quality, parameters, etc.
- [...] control
  of the image
Discussion with FamilySearch, 20 May 2011
[...] on digital preservation issues
[...] will have to provide free of charge
[...] and synchronize
an effort on their part to cooperate with archives and libraries
CO-FINANCED FROM THE EU STRUCTURAL FUNDS (EUROPEAN REGIONAL DEVELOPMENT FUND) THROUGH THE IOP

Business trip report
Project: Creation of the National Digital Library
CZ 10611000706386
Name and surname of the trip participant: Jan Hutař
Workplace per organizational structure: ODF 81
Position: head of department
Reason for the trip: attending the iPRES 2011 conference
Place (city): Singapore
Place (country): Singapore
Dates (from-to): 30 October to 5 November 2011
Detailed schedule:
30-31 October: flight Prague > Dubai > Singapore
1 November: start of the conference, tutorials
2-4 November: conference
4-5 November: return flight Singapore > Dubai > Prague
Fellow travellers from the NK: Mgr. Marek Melichar (paid from project 0136)
Funding: IOP, Creation of the National Digital Library
Relation to the project: gaining new information on digital preservation and on projects in other libraries; consultations with colleagues and companies
Trip objectives: see relation to the project; use all outputs for planning and running the NDK project, and for future handling of digital preservation in the NK/NDK
Fulfilment of the trip objectives: fulfilled; see the detailed notes below and the proceedings on SPS
Further details: SUMMARY AND BENEFIT TO THE NDK PROJECT
- a noticeable rise of emulation as an approach to long-term preservation (in previous years it was migration) > the two approaches look set to complement each other
- a shift towards preserving complex data, databases and the like; the NK does not address this yet
- many papers are applicable to the NK and the NDK (web archiving and preservation at the National Library of France, audits, emulation, information about the SDB system (Tessella) and the RODA system, certification (see Rouchon), etc.)
- information about problems and solutions when using tools such as JHOVE, PRONOM and others in a production environment
- 2 papers on backing up optical discs, a current problem at the NK as well; ideally follow the procedures described in the proceedings
- a clear need for international cooperation and for adherence to standards so that such cooperation is possible
- for details see below
Support for project publicity: N/A
Related materials
Material | Storage location
conference proceedings | SPS, folder with business trip reports
Date report submitted: 15 November 2011
Signature of the submitter
Date, signature
Signature of the supervisor: 15 November 2011
Posted on the intranet
Received by the international department

Seamus Ross: digital curation and preservation
- preserving data sets: use of statistical databases and research data vs. preservation of textual information
- personal data: no longer just photos in a box (Flickr etc.)
- banks: a lot of personal data, of future use to historians, so it has to be kept; institutions do this too
- see mckinsey.com, big data, full report (pdf)
- so why do DP (slide 16): future generations expect it; so that historians and scientists have some sources; a legacy of the present for the future; an information ecosystem to enable storytelling
- emphasis shifts from preserving textual information to preserving complex databases
A capability model for DP (Ch. Becker et al.)
- research within the SHAMAN and SCAPE projects
- SoS: systems of systems
- 3 kinds of systems (UML)
- DPS, where DP is a functional requirement
- SoS: a business system, a system within a system; the data then flows into the DPS
- a DPS where DP is not a functional requirement but which does it anyway (DP-ready system): a business system with DP functionality
- but how to get DP into enterprise systems: a model for implementing DP in any system; within the SHAMAN project, a capability based reference architecture
- governance, business and technical (operation) capability: a basis for decisions and for assessing the current state
- capability maturity model (CMM): processes for assessment and improvement, as in software development
Olivier Rouchon: certification and quality at CINES
- they store theses, digitized material, multimedia, documents, scientific data sets
- a data centre for the whole of France
- they have experts on formats and XML; 11 people
- 15TB of data
- certification: under national law CINES is the national centre for DP of theses; they have a department, people and money for it; procedure and preparations below
- preparation for certification: testing DRAMBORA, DSA, TRAC, ISO 16363 and ISO 16919
- step: 2009 DRAMBORA audit; risks are reviewed twice a year to check how they are being addressed
- step: formalization of business processes, 14 processes according to ISO 9001
- management, operational and support processes (presented at iPRES 2010)
- 2009: external pre-audit, 2 people, 19 man-days, based on all available standards, by means of documentation review and interviews
- 2010: SIAF audit, 4 months; the National Archives of France carry it out for every archive that stores public data; the audit is repeated every 3 years; the report had 800 pages
- 2010: Data Seal of Approval, part of the EU framework for audit and certification of trusted repositories (MoU between the three certification initiatives)
- 2011: within the APARSEN project they also did ISO 16363; they went through that audit together with DANS and UKDA
- first internal, January and April 2011 (60 man-days), then external, 12 experts (KB, BL, NASA etc.) in June 2011 (3 days)
MoU: DSA > ISO 16363 as the second step (internal) > ISO 16363 extended
An audit gets faster the more of them you do, i.e. if it is done regularly it is not so time-consuming.
The National Library of New Zealand passed TRAC certification in autumn 2011.
Andreas Rauber: the impact of preservation actions on repositories
- what happens to the repository itself
- repository simulation: RepoSim
- built to analyse and test migration: what happens when files grow in size, what happens when the repository holds more types of formats, etc.
- RepoSim simulator: flexible, irregular patterns
- so far an internal version: Hibernate, Java, MySQL
- it is possible to specify which formats the repository accepts and their description, ingest settings, hypothetical tools (mainly for migration), and rules for preservation actions (migration to which format, which version, which files, how many times; rules + filters)
- a virtual migration can be run; it produces graphs that show how long it will take, etc.
- what to migrate, how, to what and for how long: it runs virtually and we see the result
- good for IT and HW planning
- good for planning different scenarios, comparing them with the expected development, and planning HW growth and investment
- still to be done: the option to define deletion policies, reports, etc.
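The idea of a "virtual migration" can be illustrated with a toy model. The sketch below is not RepoSim and makes no claim about how RepoSim works internally; the names `MigrationRule` and `simulate`, the yearly time step, and the fixed migration throughput are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass
class MigrationRule:
    """Hypothetical rule: migrate everything in one format in a given year."""
    source_format: str
    target_format: str
    start_year: int


def simulate(initial_tb, yearly_growth_tb, rules, throughput_tb_per_day, years):
    """Toy repository simulation.

    initial_tb and yearly_growth_tb map format names to terabytes.
    Returns one record per simulated year with the holdings per format
    and the number of days the migrations of that year would take.
    """
    holdings = dict(initial_tb)
    report = []
    for year in range(years):
        # Ingest: every format grows by its assumed yearly volume.
        for fmt, growth in yearly_growth_tb.items():
            holdings[fmt] = holdings.get(fmt, 0.0) + growth
        # Preservation actions: apply the rules scheduled for this year.
        migration_days = 0.0
        for rule in rules:
            volume = holdings.get(rule.source_format, 0.0)
            if rule.start_year == year and volume > 0:
                del holdings[rule.source_format]
                holdings[rule.target_format] = (
                    holdings.get(rule.target_format, 0.0) + volume
                )
                migration_days += volume / throughput_tb_per_day
        report.append(
            {"year": year, "holdings": dict(holdings), "migration_days": migration_days}
        )
    return report


if __name__ == "__main__":
    result = simulate(
        initial_tb={"TIFF": 10.0},
        yearly_growth_tb={"TIFF": 5.0},
        rules=[MigrationRule("TIFF", "JP2", start_year=2)],
        throughput_tb_per_day=0.5,
        years=3,
    )
    for record in result:
        print(record)
```

Even a model this small shows why such a simulator helps with HW and investment planning: postponing the migration by a year increases the volume to be migrated and therefore the time it takes.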
José Barateiro: Risk assessment in DP of e-science data and processes
- DP as risk management
- ISO 31000: definition of risk management
- similar to DRAMBORA
- there are many standards for risk management
- the ISO 31000 methodology broken down into individual steps
- TIMBUS project, http://timbusproject.net; one of the partners is SAP (Germany)

Mad talks
- the open source LTP software RODA is back; it is being developed within the SCAPE project; new features, development plans and the creation of a user community
- 4 posters on emulation: emulation within KEEP, emulation for library reading rooms, OPF ecosystem, registries
- TOTEM: a metadata standard for describing the technical environment for emulation
ANDS (Ross Wilkinson)
- data centres in Australia, at least 3, for different areas of life
- ANDS has existed for almost 3 years; funded by the Australian government
- a huge amount of data that will never be used or read by a human, only by automated data mining processes
- research data must be stored and preserved because it may no longer be possible to recreate it; it must be kept so that it can be reused, so that new conclusions can be drawn from it, and so that scientists have access to it
- this has to be done in cooperation; a single institution cannot do it alone
- who deals with the storage of scientific data in the Czech Republic? The Academy of Sciences? CESNET?
- similar data centres also exist in Great Britain
Rob Sharpe: Considerations for High Throughput Digital Preservation
Presentation by the company Tessella: their performance testing of ingest into SDB at FamilySearch.
- SDB has been under development since 2002, when the first customer was the National Archives (UK)
- new customer: the UK Parliament
- test with FamilySearch
- 20TB ingested per day; scanned materials; workflow with antivirus, characterization (PRONOM, JHOVE), etc.
- 1 package is roughly 1GB, i.e. 20 thousand packages per day
- 2 Dell PowerEdge R710 servers, together at most 20 000 pounds
- the limiting factor turned out to be the read speed of the disks that hold the data at the start of ingest; they therefore needed 130 parallel disks (50 thousand pounds)
- storing to tape is also slow; they therefore needed 8 parallel tape writes (30 thousand pounds)
- storage costs 100 pounds per TB
- 7.3 PB per year
- conclusion: writing and reading are slow, tools such as JHOVE and PRONOM are fast enough; storage costs also turned out to be high
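The figures in the list above can be cross-checked with a few lines of arithmetic. This is only a sketch: it assumes decimal units (1 TB = 1 000 000 MB), a load spread evenly over 24 hours, and a single stored copy; the function names are made up for this example.

```python
SECONDS_PER_DAY = 24 * 60 * 60


def sustained_rate_mb_per_s(tb_per_day, mb_per_tb=1_000_000):
    """Average transfer rate needed to move tb_per_day within 24 hours."""
    return tb_per_day * mb_per_tb / SECONDS_PER_DAY


def packages_per_day(tb_per_day, gb_per_package, gb_per_tb=1000):
    """Number of packages per day for a given package size."""
    return tb_per_day * gb_per_tb / gb_per_package


def yearly_volume_pb(tb_per_day, days=365, tb_per_pb=1000):
    """Volume accumulated in one year, in petabytes."""
    return tb_per_day * days / tb_per_pb


def yearly_storage_cost_gbp(tb_per_day, gbp_per_tb, days=365):
    """Cost of storing one year of ingest once (one copy, no overheads)."""
    return tb_per_day * days * gbp_per_tb


if __name__ == "__main__":
    print(f"sustained rate: {sustained_rate_mb_per_s(20):.1f} MB/s")
    print(f"packages per day: {packages_per_day(20, 1):.0f}")
    print(f"yearly volume: {yearly_volume_pb(20):.1f} PB")
    print(f"yearly storage cost: {yearly_storage_cost_gbp(20, 100):,.0f} GBP")
```

Under these assumptions 20TB per day corresponds to an average of roughly 231 MB/s, 20 thousand 1GB packages per day, and 7.3 PB per year, which at 100 pounds per TB is about 730 000 pounds per year for a single copy.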
For the ingest of data from the FamilySearch project they needed to guarantee a throughput of 20TB of data per day while keeping adequate data processing procedures as required by OAIS and by the customer. The aim of the project was to identify the bottlenecks when ingesting large volumes of data.

Processes such as generating or verifying hashes, identifying formats and extracting technical metadata usually require a fast storage system when volumes are large. In the FamilySearch project they want to ingest into SDB (content acquisition, content preparation, ingest: fixity check, content/metadata integrity check, characterization, i.e. identification and validation of formats and extraction of technical metadata) at up to 700MB per second.

They looked at how to parallelize such a massive workflow efficiently while minimizing costs. According to their findings, parallelization makes it possible to get around performance problems of tools such as DROID and JHOVE; contrary to their expectations, overall software performance was not a problem. The bigger problems are in the HW, which must be able to write fast enough.

In other words, the bottleneck was in the HW and in moving data from place to place rather than in the performance of the digital preservation tools.
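The fixity check mentioned above can be sketched as follows. This is a minimal illustration, not the SDB implementation: the function names, the choice of SHA-256, the chunk size and the thread pool are all assumptions. Reading in chunks keeps memory use constant, and the thread pool mirrors the observation that such steps parallelize well while storage read speed remains the limit.

```python
import hashlib
from concurrent.futures import ThreadPoolExecutor


def file_digest(path, algorithm="sha256", chunk_size=1024 * 1024):
    """Compute the hex digest of a file by streaming it in chunks."""
    digest = hashlib.new(algorithm)
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_fixity(path, expected_digest, algorithm="sha256"):
    """Return True if the file still matches the digest recorded earlier."""
    return file_digest(path, algorithm) == expected_digest.lower()


def verify_many(manifest, workers=4):
    """Check several files in parallel.

    manifest maps file paths to expected hex digests; the result maps
    each path to True (unchanged) or False (digest mismatch).
    """
    items = list(manifest.items())
    with ThreadPoolExecutor(max_workers=workers) as pool:
        results = list(pool.map(lambda item: verify_fixity(*item), items))
    return {path: ok for (path, _), ok in zip(items, results)}
```

A caller would build the manifest at ingest time with `file_digest` and rerun `verify_many` later; any `False` value marks a file whose content has changed.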
Benefit for the NK
No need to worry about the performance of software such as DROID or JHOVE.

Ross King: Evolving domains, problems and solutions for LT DP
- information about projects such as SCAPE
- programme: httpcordiseuropaeufp7icttelearn digicultreport research digital preservation_enpdf
- Stephan Strodl, Petar Petrov, Andreas Rauber (all Vienna University of Technology, Austria): a nice "Timeline for preservation projects", a whitepaper about the past of European DP
- funding for DP research is gradually growing; projects and money alone will not solve anything
- ARCOMEM: web archiving, socially driven web preservation model
  - social web analysis
  - archive enrichment
- ENSURE: evaluation between cost and value, automation of the preservation cycle; testbeds: healthcare, clinical trials, financial services
- SCAPE
  - preservation planning and action workflows, and how to make them scalable
  - building an infrastructure for scalable preservation actions
  - development of a policy based preservation planning tool with automated preservation watch
  - 3 testbeds: WA, large scale repositories, research data sets
- all the projects will produce prototype SW
- digital lifecycle approach
- preservation planning plays a role in all of these projects, together with virtualization
- a slide with trends in DP over recent years
- Research on Digital Preservation within projects co-funded by the European Union in the ICT
- Ensure
- Scape
- Wf4Ever, http://www.wf4ever-project.org/about
- Timbus: software alone is not enough; they focus on the context of organizations; LTP is not only about objects but also about services, etc.
- Totem
Benefit for the NK

Follow projects in the area of long-term preservation of digital data. The latest EU projects, such as SCAPE, will speed up the development of concrete tools for long-term preservation of digital data.

Record keeping in temporary command settings (Erik Borglund)
- preservation of documentation of crisis situations arising from the work of the police and similar bodies
- how to capture context: flipcharts, videos and written records can be kept, but what about the context? With analogue documents this is not a problem; the problem lies with digital material and conversations
- the national archive should take care of this, but it only accepts paper documents or, for example, photos from the scene
- question: is archiving the records the same as archiving the course of the proceedings in digital form?
webarchiving session
BnF
- 200TB of web-archived data
- 15 million ARC files
- they have to characterize and validate them, which is time-consuming
- they store them in the shared repository SPAR (the LTP system of the National Library of France), which has a capacity of 16PB
- they use JHOVE2 for characterization and are building a module for ARC files
- they do not want to characterize and validate the content of the ARC files, only to identify formats
- PREMIS in METS would be too long, so they will record only metadata at the level of the information package (AIP) that is the same for the whole package; that is, a property is expressed once and then only the information about which files it applies to is added, instead of repeating the information for every file
- they created a special metadata format
- i.e. they can ask the LTP system "give me all information packages that contain format XY" and the like; there is no need to index the metadata of the contents, which would take a long time; they use the same approach for digitized books
- different DP policies and validation levels for different types of WA data: complete harvests vs. thematic harvests
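The package-level approach described above (state a property once, then list the files it applies to) can be pictured with a small data structure. This is a hypothetical sketch only: it is not the BnF metadata format, and the class names, the `format` property name and the example identifiers are invented for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class PackageProperty:
    """One property stated once, plus the files it applies to."""
    name: str
    value: str
    files: list = field(default_factory=list)


@dataclass
class InformationPackage:
    """Hypothetical AIP carrying package-level metadata only."""
    package_id: str
    properties: list = field(default_factory=list)

    def formats(self):
        """Set of all format values recorded for this package."""
        return {p.value for p in self.properties if p.name == "format"}


def packages_with_format(packages, fmt):
    """Answer 'give me all information packages that contain format XY'."""
    return [p.package_id for p in packages if fmt in p.formats()]


if __name__ == "__main__":
    harvest_a = InformationPackage(
        "harvest-a",
        [
            PackageProperty("format", "text/html", ["rec1", "rec2", "rec3"]),
            PackageProperty("format", "image/gif", ["rec4"]),
        ],
    )
    harvest_b = InformationPackage(
        "harvest-b",
        [PackageProperty("format", "application/pdf", ["rec1"])],
    )
    print(packages_with_format([harvest_a, harvest_b], "image/gif"))
```

The query only inspects the short property list of each package, so it does not need an index over per-file metadata.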
NL NZ
- 2 harvests, 20TB in total
- they are working on metadata: how much metadata is a lot and how much is too little
- library policy says that as much metadata as possible must be stored, but the size of the metadata would then be a problem
- for selective web harvesting they have a finished workflow (WCT); everything is catalogued

IA
- 16 billion URLs
- the oldest from 1996
- growth is 3TB per day, 1PB per year
Euan Cochrane, Dirk von Suchodoletz: Replicating Installed Application and Information Environments onto Emulated or Virtualized Hardware
- capturing and keeping a complete environment on emulated HW
- e.g. take the prime minister's desktop environment and store it in the archive
- rendering problems
- computer forensics
- an option for preserving scientific data and records
- it is all about how to replicate an HDD and run the environment on it in a virtual environment
- solution
- they pulled the HDDs out of several old PCs > identify the HW requirements (HDD analysis > automatic estimate of the requirements, which are part of every PC environment) > choose the emulation/virtualization SW (tool registry such as TOTEM from the KEEP project) > convert the HDD into a disk image suitable for emulation > try to boot the disk image on the emulated HW > add drivers
- problems with licences, personal data protection and authenticity (20% of things change: colours, etc.)
- QEMU, SPARC processor emulator
Klaus Rechert: Remote Emulation for Migration Services in a Distributed Preservation Framework
- using emulation as a tool for migration
- migration tools for certain formats are often not available
- the digital object is put into an emulated environment (virtual machine); we then see it inside the emulated system, can open it in the original or another suitable application, save it in a different format and store it back in the virtual machine

Bram Lohman: Emulation as a Business Solution: the Emulation Framework (KEEP project)
- emulation framework: 7 emulators, 6 platforms (x86, Amiga and others), 23 file formats
- a solution for managing emulation tools
- setup of emulation processes
- an environment that contains emulators; if we load an application or a file into it, it should run as in the original environment
- the environment also contains a tool that shows, for a given file, which format it is and which environment is needed to run it, based on PRONOM; the environment can be prepared straight away and the file run in it
- it loads the SW image from the OPF application database, which is being built
Geoffrey Brown: Developing Virtual CD-ROM Collections: The Voyager Company Publications
- CD publications of one particular publisher, interactive applications for Mac; of 200 titles published, only about 50 are now available
- emulation on today's systems
- HDD snapshot directly in the emulator, i.e. it takes one click and is very fast
- SheepShaver emulator

Evaluation of Danish large migration project
- before 1998 they had no formats defined by law
- between 2005 and 2008 they introduced standards
- the evaluation covers the defined standards and the migration to them at the national archive
- the evaluation was done for the funder
- between 2005 and 2008 they spent 30 person-years on migration with 10-15 people; 190 thousand USD was invested; total costs 26 million USD
- in reality it is not much data: what they migrated was about 1777GB
- various parts of the archive: tapes, population data, data on CD-R, registries, and electronically populated data
- they could not read all the files, especially on tapes
- 5 different types of tape
- some had to be rescued at great expense
- total cost of producing the preservation standard: 10 man-years, 12 thousand USD, including the manual and implementation recommendations
- pilot: project planning and management, and verification of the information packages
- the aim of the pilot was to obtain a better budget and project plan
- some problematic data in old formats, such as old databases, needs clever people doing repetitive work over a long time; they needed good knowledge management to make this efficient
- migration method: they wrote requirements for the tool and a description of how manual migration should be done
- data preparation (restructure the data and register the metadata of the IPs) and preparation of documentation for the migrated IPs
- software development: in-house development
- they needed 50 person-years for 1

Conclusions
- migration of standard data is cheaper, migration from some standard tapes is cheaper, etc.
- most of the costs, 80%, went on non-standardized data, in building the migration software; developing a tool for migrating heterogeneous or non-standard data is the most expensive part
- what they learned: the old data had not been analysed sufficiently
- project management was loose; they lost money
- knowledge management: a good description of old data and of all its types, generations, locations, etc. does not exist at our institution and we will have difficulties with it; migration of old data at the NK will be problematic
Angela Dappert: robust migration workflow for offline media
- what an archival object is (nice slide): a CD is not an archival object, for them it is a hand-held carrier; a bit-stable object is better, since it can have a backup etc.; then comes the archival object, which has further metadata (logical preservation)
- a CD is not searchable, cannot easily be replicated, has a large manual overhead, and its rendering technology becomes obsolete very quickly
- Endangered Archives project: optical discs, CD-R, external HDs, tapes; 67 terabytes in total
- the OFFLINE hand-held carriers in the Endangered Archives project were very varied; they contained data with DRM, under copyright, and with a range of such problems
- the options they chose between for each data source:
- disk image: one file that contains everything on the carrier
- or extraction of selected files only
- how important is the carrier itself? We need some information about it; it may hold traces of deleted data, which we may want to keep. They made disk images of everything possible: hybrid DVDs, audio discs that also contained data, etc.
- which disk image format should they use? Not a single disk image format for all data, but a specific disk image format for each case
- they did it with disk copying robots; sometimes large scale disk copying robots could not be used, because they are good at producing CDs but not at ripping data from CDs
CO-FINANCED FROM THE EU STRUCTURAL FUNDS (EUROPEAN REGIONAL DEVELOPMENT FUND) THROUGH THE IOP
- They built their own application with disk stacks and some smaller robots. They used LIFO or FIFO; in the end they used FIFO, because LIFO had problems with the CD lifter.
At the KB they thought through a fairly complex workflow, how to describe it, and so on.
Each robot had its own PC.
They had problems with a number of things (see the presentation).
They did not find good software for managing images, only command-line tools, and non-technical staff would make a lot of mistakes with those.
It takes a lot of people before the content gets online.
The staff must be well trained and flexible, but also able to do tedious jobs: systematic and patient.
NOTE: important for NK, where the transfer of data from disks will also have to be addressed and has already been addressed in part.
KEEP project, Antonio Ciuffreda: towards integrated migration environment
- Disk transfer tool: converts a disk into an image file. The image also carries further metadata about the file system, the MD5 of the file, and so on.
- KEEP is producing a Transfer Tool Framework.
- Magnetic media: disk transfer tools for floppy disks, commercial and open source.
- Disk2FDI: commercial DOS tool; produces a very accurate floppy disk image, but it takes 1 hour and the result is very large, ten times larger than the floppy disk itself. He tested about 2260 disks; they tested emulation.
- Catweasel: commercial tool, a PCI card that runs on Linux and Win XP and has a GUI. High error rate but faster; the quality of the image file was low.
- Nibtools: free tool, G64 and D64, covers only the C64; DOS, Win, Linux, but command line only. It needs a Commodore disk drive and special cables. They tested a few disks; about half did not work afterwards in the emulator.
- Optical media: they used 5 transfer tools, all on the same CDs, DVDs and games:
1. Alcohol 120: commercial, can bypass DRM and so on, supports Win systems. 12 of 13 worked; 2 MB per second.
2. Daemon Tools: commercial, several types of image files (ISO, MDS, MDF), supports Win. Three of 13 did not work.
3. CloneCD: commercial, uses IMG or ISO, bypasses SafeDisc 3 protection, supports Win, has a GUI. 11 of 13 were OK.
4. BlindWrite: commercial, supports DVD, Blu-ray, WM, Xbox and other special discs. Generates ISO and some proprietary image formats. One did not work; transfer speed not given.
5. ImgBurn: does not read subchannel information into the image file (a film cannot be seeked, etc.). It is open source, generates DVD, BIN, CUE, IMG; Win and Linux. 4 did not work; it is fast.
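The disk transfer step described above (an image file accompanied by metadata such as an MD5 checksum) can be illustrated with a minimal sketch. It is not the KEEP transfer tool; the function name `describe_image` and the metadata keys are assumptions made for this example, and the temporary file only stands in for a real disk image.

```python
import hashlib
import os
import tempfile


def describe_image(path, chunk_size=1024 * 1024):
    """Return basic fixity metadata for a disk image file (hypothetical helper)."""
    md5 = hashlib.md5()
    with open(path, "rb") as handle:
        # Read in chunks so that large images do not have to fit in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            md5.update(chunk)
    return {
        "file_name": os.path.basename(path),
        "size_bytes": os.path.getsize(path),
        "md5": md5.hexdigest(),
    }


if __name__ == "__main__":
    # Stand-in for a real image: a small temporary file with known content.
    with tempfile.NamedTemporaryFile(suffix=".img", delete=False) as tmp:
        tmp.write(b"example disk image content")
    print(describe_image(tmp.name))
    os.remove(tmp.name)
```

Storing the checksum next to the image makes it possible to verify later that a copy made by a robot or a transfer tool is still bit-identical to the original image.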
Conclusions
For magnetic media: commercial and non-commercial tools do not differ in performance. Disk2FDI is accurate but very slow. KEEP will use NibTools.
Optical media: copy protection has to be considered. The tools perform similarly; there will always be errors in the images, between 30 and 10 percent. BlindWrite can handle game discs (Xbox etc.). KEEP will use ImgBurn because it is open source.
For complex images BlindWrite is better.
Benefit for NK
Consider whether NK should actually run a project to migrate the content of CDs and DVDs to online media. The talks present concrete experience with robotic processing and show the problems the teams had with designing the workflow, choosing the type of ISO image, and so on.
BnF web archiving
They have three layers:
Harvest definition: collection
Harvest instance: crawling metadata
ARC files
A collection will be, for example, the selective "elections 2000" crawl, then the individual harvest instances, and then the ARCs. They collect logs, config and reports. These are also stored in an ARC: a special metadata ARC for each crawling instance.
PREMIS
Object, agent, event
Objects: 1. ARC files and metadata ARCs; 2. harvest instances
Harvest event in PREMIS: event "creation of content files"
Events: reports as extensions of the event (host report and harvest report)
Agents: administrator, SW, institutions, organizations that perform the harvest
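The layering described above (harvest definition, harvest instance, ARC files) and the PREMIS object/agent/event triple can be pictured as a small data model. This is only a sketch: the class and field names are hypothetical and are not taken from the BnF schema or from the PREMIS data dictionary.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Agent:
    name: str
    kind: str  # e.g. "administrator", "software", "organization"


@dataclass
class Event:
    event_type: str  # e.g. "creation of content files"
    agents: List[Agent] = field(default_factory=list)
    reports: List[str] = field(default_factory=list)  # host report, harvest report


@dataclass
class ArcFile:
    file_name: str
    is_metadata_arc: bool = False  # True for the ARC holding logs, config, reports


@dataclass
class HarvestInstance:
    label: str
    arc_files: List[ArcFile] = field(default_factory=list)
    events: List[Event] = field(default_factory=list)


@dataclass
class HarvestDefinition:
    collection: str
    instances: List[HarvestInstance] = field(default_factory=list)

    def all_arc_files(self):
        """Flatten the ARC files of every harvest instance in the collection."""
        return [arc for inst in self.instances for arc in inst.arc_files]


if __name__ == "__main__":
    crawler = Agent("crawler", "software")
    instance = HarvestInstance(
        label="instance-1",
        arc_files=[ArcFile("content-0001.arc"), ArcFile("metadata-0001.arc", True)],
        events=[Event("creation of content files", [crawler], ["harvest report"])],
    )
    definition = HarvestDefinition("example collection", [instance])
    print([arc.file_name for arc in definition.all_arc_files()])
```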
ContainerMD
httpbibnumbnffrcontainerMD v1documentationcontainerMD v1html
special metadata for material from the web archive
httpbibnumbnffrcontainerMD v1
A different SLA for different types of material and for different data from different harvests; shared repository; they expect various benefits; skills for different formats do not need to be duplicated within the institution.
Next year there should also be a JHOVE2 module for WARC.
Memento: meta search
Benefit for NK
Their web archiving model could be used at NK.
Cost models: Danish National Library + TU Wien
- Stephan Strodl, TU Wien: they have their own cost model, but it seems to cover only the cost of small-scale automated preservation actions.
- Danish National Library: they built their own model, which should be universal and usable anywhere.
- They measured the cost of submission according to the PAIMAS standard.
- When calculating costs they use OAIS and PAIMAS: they map activities onto these models and then estimate the cost of processes accordingly.
- costmodelfordigitalpreservation.dk
Benefit for NK
Relevant to project 0136, which dealt with ways of estimating the costs of long-term storage.
Meet RODA: a Full-Fledged Digital Repository for Long-Term Preservation
- Originally a project of the Portuguese National Archives; we have been following it for several years. RODA is now supported, and partly further developed, by an independent company. So far RODA supports only an archival metadata format (EAD), but further development should also include library formats.
- RODA is now part of the SCAPE project, where it will be possible to develop the system further and scale it for use in mass production.
- http://redmine.keep.pt/projects/roda-public
Benefit for NK
Follow further development, possibly also for the INCAD + KNAV projects; for LTP development at smaller institutions this could be an interesting alternative in the future.
Report on a foreign business trip
Name and surname of the participant: Mgr. Andrea Fojtů (AF)
Workplace according to the organizational structure: 1.5 Department of Long-Term Preservation of Digital Data (ODODD)
Workplace assignment: 1.5.2 Department of Digital Repository Content Management
Reason for the trip: participation in the conference Aligning National Approaches to Digital Preservation
Place (city): Tallinn
Place (country): Estonia
Date (from-to): 22-26 May 2011
Detailed schedule:
Monday 23 May: conference registration; guided tour of the National Library; Keynote Address by Laura Campbell, Library of Congress (USA); Panel 1: Technical Alignment; Panel 2: Organizational Alignment
Tuesday 24 May: Keynote Address by Gunnar Sahlin, National Library of Sweden; Panel 3: Standards Alignment; Panel 4: Legal Alignment; Breakout Sessions for panels 3 & 4
Wednesday 25 May: Panel 5: Education Alignment; Panel 6: Economic Alignment; Breakout Sessions for panels 5 & 6; Synthesis; Closing remarks
Fellow travellers from NK: PhDr. Bohdana Stoklasová (BS), Ing. Tomáš Svoboda (TS)
Funding: IOP NDK
Goals of the trip: attending a conference with international participation; gaining contacts in the field of long-term preservation of digital documents and electronic legal deposit; a more detailed insight into the long-term preservation of digital documents, (especially) in national libraries
Fulfilment of the goals (specifically): The conclusions of the conference Aligning National Approaches to Digital Preservation largely mirror the conclusions of the workshop The Future of the Past: Shaping new visions for EU research in digital preservation (report available at httpcordiseuropaeufp7icttelearn digicultfuture of thepast_enpdf), for example regarding the missing economic model for the commercial sector, which would treat long-term preservation as an integral part of all its processes. Contact was established with staff of the national libraries of Estonia and Finland (staff working on web archiving and on long-term preservation in general).
Programme and further details: The main goal of the conference was to align national approaches to the long-term preservation of digital documents across all areas: technical, organizational, educational, standardization, economic and financial.
Materials brought back: conference programme, leaflets of exhibiting companies (Tessella, Equella, Guardtime) + other materials, notes
Date of report submission: 8 June 2011
Signature of the submitter:
Signature of the supervisor:
Posted on the intranet:
Received by the international department:
Annex to this report: conference notes in English
Annex: conference notes in English
Exploring What We Can Do Together: Strategic Alliance for International Collaboration (Laura Campbell)
- 185 digital preservation partners in more than 25 countries (education, research, LAM)
- strategic goals: National Content Stewardship Network (national digital collection, technical architecture, public policy, outreach)
- NDIIPP Content Domain Map: a mind map of geospatial, audiovisual, image & text, and web content
- then & now: cognitive surplus vs. digital libraries/digital preservation
- solution framework: actively working together, special interest groups, establishing a common index, international digital collection (freely available)
PANEL 1: Technical Alignment (The role of testing), Prof. Dr. Michael Seadle (Panel Head)
- to collaborate on requiring and implementing rigorous and independent tests
DNB Contribution to the Tallinn Alignment (Sabine Schrimpf)
- key theme is infrastructure: the network of hardware and software that permits the operation of application SW
- the question of interoperability is crucial (standards, technical specifications)
- SW elements/components of the DP infrastructure were compared to the pallets at railroads
  o Source: PARSE.Insight Roadmap 2020
- persistent ID resolvers, certification process
- kopal (ingest: koLibRI)
- nestor (German Network of Expertise)
- DP4Lib: Digital Preservation as a Service; reduce dependency between components
  o redundant storage at different locations
  o koLibRI modules
- LuKII: set up as an economical LOCKSS network in Germany
- SHAMAN
- APARSEN: wants to bring coherence and cohesion to digital preservation research
  o trends in DP research projects
  o modular DP systems
  o distributed as SOA
  o elimination of technology dependencies
EDINA: The UK LOCKSS Alliance (Adam Rusbridge)
- EDINA offers underlying technical support & coordination
- threats to digital stewardship:
  o failure (media, HW, SW, network, format obsolescence, natural disaster, economic/organizational failure)
  o attack (insider/outsider)
  o operator error
- source: Requirement to Digital Preservation
- projects PEPRS and PECAN help identify coverage and requirements for DP
Public testing (Michael Seadle)
- traditional physical archiving relies heavily on trusted institutions
- distrust, not trust, needs to be the basis of digital archiving; testing therefore plays a key role
- goals of testing: demonstrate functionality, reveal weaknesses, provide data for planning improvements
- key issues for testing: integrity, authenticity (can the origin or genuineness be shown?), usability (can migration/emulation be demonstrated?), access, financial integrity
- DrWho (drwho1com)
- bit stream testing is the most important; authenticity and usability may be impaired
  o the type of storage media, the number of copies, and the frequency of checking and replacement get us to the relevant results
  o no reliable metrics exist, however (what is an acceptable loss, etc.)
- without well-documented, peer-reviewed, publicly available test results, librarians are buying archiving systems on faith
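Bit stream testing across several copies, as mentioned above, can be sketched as a checksum comparison with a majority vote. This is a minimal illustration, not the procedure of any system named in the notes; `audit_copies`, the location names and the use of SHA-256 are assumptions, and the input is assumed to contain at least one copy.

```python
import hashlib
from collections import Counter


def audit_copies(copies):
    """Compare checksums of several copies of the same bit stream.

    `copies` maps a storage location name to the bytes held there.
    Returns (majority_digest, locations_that_disagree). If no digest is held
    by a strict majority, returns (None, all_locations).
    """
    digests = {loc: hashlib.sha256(data).hexdigest() for loc, data in copies.items()}
    digest, votes = Counter(digests.values()).most_common(1)[0]
    if votes * 2 <= len(copies):
        # No strict majority: the audit cannot say which copy is correct.
        return None, sorted(digests)
    damaged = sorted(loc for loc, d in digests.items() if d != digest)
    return digest, damaged


if __name__ == "__main__":
    copies = {
        "site_a": b"archival object",
        "site_b": b"archival object",
        "site_c": b"archival objecT",  # simulated bit rot
    }
    print(audit_copies(copies))
```

The sketch also shows why the number of copies matters: with only two copies a mismatch can be detected but not resolved, while three or more copies allow a damaged copy to be identified and replaced.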
Presentation without a title (Andy Rauber)
- evaluation vs. testing vs. benchmarking
- DP currently does evaluation rather than testing and is far from benchmarking (a few tests, but nowhere near a definition of benchmarks)
  o existing evaluations are not repeatable
  o focus on the simple things
  o building the frameworks before having clear test scenarios
- necessary to move towards comparative benchmarking
- what is needed: a commitment to a culture of benchmarking and comparative evaluation; an understanding of what we want to benchmark; benchmark data + ground truth; measurement scales and measures that remain constant; knowledge bases to collect these
Organizing digital preservation on an international level (Michelle Gallinger, NDIIPP)
- focus on a national DP agenda
- community driven, action oriented (National Digital Stewardship Alliance)
  o present a distributed national digital collection for the benefit of citizens
The European Research Arena (David Giaretta)
- technologies: GEANT, EGEE/EGI
- EU research projects: TIMBUS, BLOG4EVER, SCAPE, ENSURE, APARSEN, ARCOMEM, WF4EVER
  o SCIDIP-ES (2011-2014)
- Alliance for Permanent Access (APA) formed as a legal entity 3 years ago
  o opportunities for networking
- ISO 16363: Audit and Certification of Trustworthy Digital Repositories
- ISO 16919: Requirements for Bodies Providing Audit and Certification
Observation from the MetaArchive Cooperative program (Martin Halbert)
- distributed DP programs differ from other programs
  o replication of content, distribution of these replicated copies to distinct geographical locations, and network organization to connect these replicated copies
- MetaArchive established in 2003, funded by NDIIPP
  o seeks to foster broader awareness of digital preservation issues
- IIPC members are all institutions that focus on WA
  o 39 members: national and university libraries + other organizations (Internet Archive)
  o ISO standard WARC format for web archives + Heritrix and NutchWAX
  o growing membership (Africa & South America)
DAY 2

Keynote address: International and National Collaboration in the Digital Age (Gunnar Sahlin)
- 2012: a new law for e-deposit
- Samsök (search together) in 2005 (upgrade, new system)
- SwePub and long-term preservation
- Consortium of the Swedish research libraries for licensing e-journals and databases (ICOLC)
- Open Access and e-publishing (all universities have their repositories for e-pubs)
- NL: aggregator for Europeana (TEL, Apres, Athena, EUscreen)
  o common system for the preservation of digital materials
  o common search portal for materials from the Swedish National Library and the Swedish National Archive
Raivo Ruusalepp
- standards: RAC, DSA, CIDOC (CRM), PAIMAS, ISAD(G), DDI
- use of information security standards for digital preservation
- information security: administrative and technical (physical = data security vs. IT = communication)
- company implemented security measurement with typical cyber crime scenarios
- survey of security:
  o provision for information security in national legislation and development plans (1/2 of the respondents: ISO 27000 series, only 2 formal audits, the rest are looking into it; 1/2 of the respondents do not use standards or formal measurements to control information security)
  o IT & disaster plan: 65 (data recovery from the off-site location tested: 0)
- alignment:
  o better use of community standards for information security and preservation
  o agreement on security requirements
Standards-based approach to preservation planning (Matthew Woollard)
- ISO 27001: very expensive implementation, 100 000 pounds
- Basic: Data Seal of Approval Guidelines (helps understand your business better)
- Audit and Certification of Trustworthy Digital Repositories
- Memorandum of Understanding to Create a European Framework
- ISO 16363 external; DSA or ISO 16363 self-audit
Best Practices & Standards (Bram van der Werf)
- self-assessment: trust; audits/certification: trust
- ISO 30300 (draft): Record Management
PANEL 4: Legal Alignment

Legal deposit & Web Archiving (Adrienne Muir)
- legal deposit provisions: purpose, scope, deposit mechanisms, roles & responsibilities & liability, access provisions, sanctions
- implementation: definitions, scope, offline/online, freely available/pay wall, technology neutral/incremental
- legal deposit vs. voluntary (interim/hybrid approach, model agreements and licenses, flexibility)
- other legal issues: intellectual property rights, preservation, access, unlawful material, privacy/data protection
- voluntary approaches have disadvantages but may be necessary and can be useful
Breakout Session
- standards bring along alignment by themselves, but only if you use them to the full, not halfway
- depends on community (users); enforceable or voluntary compliance (standards)
- next steps for standards alignment:
  o corpora as benchmarks
  o export/import completeness
  o educational standards
  o validation tools
  o accredited training courses to accredit auditors
  o framework standards
DAY 3

PANEL 5: Educational Alignment
- key elements of the DCC Curation Lifecycle Model
- Framework for the Education Alignment (USA: grad programs for digital preservation, new models for grad programs, internship programs (diverse knowledge), workshops)
  o sharing tools
  o national programs
- related issues to digital preservation:
  o nature of costs and business models
  o strategies for selection & appraisal
  o ground roles and responsibilities
  o effectiveness and demand for services
- focus of the panel: factors influencing the actual sustainability of a digital archive
- 2 considerations: collaboration + user demand
- challenges + gaps: span national boundaries, public + private funding, education, exportation, DP certification, competition, funding gaps, policy, selection criteria, roles & responsibilities, standards
- Magazzini Digitali: e-legal deposit in Italy
- PADI: a failed project (discontinued in 2010)
Presentation without a title (Neil Grindley)
- how much does it cost to manage information? what institutional financial strategies are required to facilitate effective preservation? what general economic frameworks are required to enable information to persist and be accessible?
- JISC 2010: Infrastructure for Education and Research Programme
- archival storage and preservation activities constitute a very small proportion of the overall costs: 15/31; access 55; outreach, acquisition, ingest
  o approx. 333 Euros for a set of 1000 records
- KRDS2 p. 83: future tool development supporting automation of ingest
Sustainable Preservation in North America: ADPNet & Friends (Aaron Trehub)
- solution: distributed digital preservation (in at least 3 copies vs. LOCKSS 6 copies)
- DDP + LOCKSS = PLN (open SW developed at Stanford)
- MetaArchive, COPPUL (Canada), ADPNet (www.adpn.org)
Business trip report
Project: Creation of the National Digital Library
CZ 10611000706386
Name and surname of the participant: Jan Hutař
Workplace according to the organizational structure: ODF 8.1
Position: head of division
Reason for the trip: attending the iPRES 2011 conference
Place (city): Singapore
Place (country): Singapore
Date (from-to): 30 October to 5 November 2011
Detailed schedule:
30-31 October: flight Prague > Dubai > Singapore
1 November: start of the conference, tutorials
2-4 November: conference
4-5 November: return flight Singapore > Dubai > Prague
Fellow travellers from NK: Mgr. Marek Melichar (funded from project 0136)
Funding: IOP, Creation of the National Digital Library
Relation to the project: gaining new information on digital preservation and on projects in other libraries; consultations with colleagues and companies
Goals of the trip: see the relation to the project; use all outputs for planning and running the NDK project; use them for the future handling of digital preservation at NK/NDK
Fulfilment of the goals: fulfilled; see the detailed notes below and the proceedings on SPS
Further details: SUMMARY AND BENEFIT FOR THE NDK PROJECT
- a noticeable rise of emulation as a solution for long-term preservation (in past years it was migration) > the two approaches, it seems, will complement each other
- a shift towards the preservation of complex data, databases and the like; NK does not address this yet
- many contributions are applicable to NK and NDK (web archiving and preservation at the national library of France, audits, emulation, information on the SDB system (Tessella) and on the RODA system, certification (see Rouchon), etc.)
- information on the problems and solutions of using tools such as JHOVE, PRONOM and others in a real environment
- 2 contributions on backing up optical discs, a current problem at NK as well; ideally follow the procedures described in the proceedings
- a clear need for international cooperation and for adherence to standards so that such cooperation is possible
- for details see below
Support of project publicity: N/A
Related materials
Material | Storage location
conference proceedings | SPS, folder with business trip reports
Date of report submission: 15 November 2011
Signature of the submitter:
Date, Signature
Signature of the supervisor: 15 November 2011
Posted on the intranet
Received by the international department
Seamus Ross: digital curation and preservation
- preserving data sets: use of statistical databases and research data vs. preservation of textual information
- personal data: it is no longer just photos in a box (Flickr etc.)
- banks: a lot of personal data, of future use to historians; it must be kept, and institutions are doing so
- see mckinsey.com, big data full report (PDF)
- why do DP at all (slide 16): future generations expect it; so that historians and scientists have sources; a legacy of the present for the future; an information ecosystem to enable storytelling
- the emphasis shifts from preserving textual information to preserving complex databases
A capability model for DP (Ch. Becker et al.)
- research within the SHAMAN and SCAPE projects
- SoS: systems of systems
- 3 kinds of systems (UML):
- DPS with DP as a functional requirement
- SoS: a business system, a system within a system; the data then flow into the DPS
- DPS where DP is not a functional requirement but the system does it anyway (DP-ready system): a business system with DP functionality
- but how to get DP into enterprise systems: a model for implementing DP in any system, developed within the SHAMAN project (capability-based reference architecture)
- governance, business and technical (operation) capability: a basis for decisions and for assessing the current state
- capability maturity model (CMM): assessment and improvement processes, as in SW development
Olivier Rouchon: certification and quality at CINES
- they store theses, digitized material, multimedia, documents, scientific data sets
- a data centre for the whole of France
- they have experts on formats and XML, 11 people
- 15TB of data
- certification: national law; CINES is the national centre for DP of theses, and they have a department, people and money for it; for the procedure and preparations see below
- preparation for certification: testing DRAMBORA, DSA, TRAC, ISO 16363 and ISO 16919
- step: 2009 DRAMBORA audit; 2 risk reviews per year on how the risks are being handled
- step: formalization of business processes, 14 processes according to ISO 9001
- management, operational and support processes (presented at iPRES 2010)
- 2009: external pre-audit, 2 people, 19 man-days, based on all available standards, carried out through documentation review and interviews
- 2010: SIAF audit, 4 months; the National Archives of France carry it out for every archive that stores public data, and they audit every 3 years; the report had 800 pages
- 2010: Data Seal of Approval, part of the EU framework for audit and certification of trusted repositories (MoU between the three certification activities)
- 2011: within the APARSEN project they also did ISO 16363; they went through that audit together with DANS and UKDA
- first internal, January and April 2011 (60 man-days), then external, 12 experts (KB, BL, NASA etc.) in June 2011 (3 days)
MoU: DSA > ISO 16363 as the second step (internal) > ISO 16363 extended
An audit gets faster the more of them you do, i.e. if it is done regularly it is not so time-consuming.
The National Library of New Zealand passed TRAC certification in autumn 2011.
Andreas Rauber: the impact of preservation actions on repositories
- what happens to the repository itself
- repository simulation: RepoSim
- intended for analysis and for testing migration: what happens when files grow, what happens when the repository holds more types of formats, etc.
- RepoSim simulator: flexible, irregular patterns
- so far an internal version: Hibernate, Java, MySQL
- it is possible to specify which formats the repository accepts, their description, ingest, settings, hypothetical tools (mainly for migration), and rules for preservation actions (migration to which format, which versions, which files, how many times, rules + filters)
- a virtual migration can be run; it produces charts that tell how long it will take, etc.
- what to migrate, how, to what and over what period is carried out virtually, and we see the result
- good for IT and HW planning
- good for planning different scenarios, comparing them with the expected development, and planning HW growth and investments
- still to be finished: the option to enter deletion policies, reports, etc.
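The kind of question such a simulator answers can be illustrated with a very small calculation. The sketch below is not RepoSim (described above as an internal Java/Hibernate/MySQL tool); `FileGroup`, `simulate_migration`, the tool throughput and the size factor are hypothetical names and numbers chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass
class FileGroup:
    fmt: str            # format label, e.g. "tiff"
    count: int          # number of files of this format in the repository
    avg_size_mb: float  # average file size in MB


def simulate_migration(groups, source_fmt, throughput_mb_per_s, size_factor=1.0):
    """Estimate duration and output volume of migrating one format.

    throughput_mb_per_s is the assumed speed of a hypothetical migration tool;
    size_factor is the assumed ratio of output size to input size.
    """
    selected = [g for g in groups if g.fmt == source_fmt]
    input_mb = sum(g.count * g.avg_size_mb for g in selected)
    return {
        "files": sum(g.count for g in selected),
        "input_mb": input_mb,
        "output_mb": input_mb * size_factor,
        "hours": input_mb / throughput_mb_per_s / 3600,
    }


if __name__ == "__main__":
    repository = [FileGroup("tiff", 1000, 36.0), FileGroup("pdf", 10, 1.0)]
    print(simulate_migration(repository, "tiff", throughput_mb_per_s=10, size_factor=0.5))
```

Running several such scenarios with different rules and growth assumptions is what makes this approach useful for HW and investment planning, as noted above.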
José Barateiro: Risk assessment in DP of e-science data and processes
- DP as risk management
- ISO 31000: definition of risk management
- similar to DRAMBORA
- there are many standards for risk management
- the ISO 31000 methodology broken down into individual steps
- TIMBUS project (http://timbusproject.net); one of the partners is SAP (Germany)
mad talks
- open source SW for LTP: RODA is back and is being developed within the SCAPE project; new features, development plans and the formation of a user community
- 4 posters on emulation: emulation within KEEP, emulation for library reading rooms, the OPF ecosystem, registries
- TOTEM: a metadata standard for describing the technical environment for emulation
ANDS (Ross Wilkinson)
- data centres in Australia, at least 3, for different areas of life
- ANDS has existed for almost 3 years; funded by the Australian government
- a huge amount of data will never be used or read by a human, only by automated mining processes
- research data must be stored and preserved because it may no longer be possible to recreate them; they must be kept so that they can be reused, so that new conclusions can be drawn from them, and so that scientists have them at their disposal
- this has to be done in cooperation; a single institution cannot do it alone
- who handles the storage of scientific data in the Czech Republic? The Academy of Sciences? CESNET?
- similar data centres also exist in Great Britain
Rob Sharpe: Considerations for High Throughput Digital Preservation
A presentation by Tessella on their performance testing of ingest into SDB at FamilySearch.
- SDB has been developed since 2002, when the first customer was the National Archives (UK)
- new customer: the UK Parliament
- test with FamilySearch
- 20TB ingest per day; scanned materials; a workflow with antivirus, characterization (PRONOM, JHOVE), etc.
- 1 package is roughly 1GB; 20 thousand packages per day
- 2 Dell PowerEdge R710 servers, total price at most 20,000 pounds
- the limiting factor turned out to be the read speed of the disks that hold the data at the start of ingest, so they needed 130 parallel disks (50 thousand pounds)
- storage on tape is also slow, so they needed 8 parallel tape writes (30 thousand pounds)
- storage costs 100 pounds per TB
- 7.3 PB per year
- conclusion: writing and reading are slow; tools such as JHOVE and PRONOM are fast enough; storage costs also proved to be high

To ingest the data from the FamilySearch project they had to sustain a throughput of 20TB of data per day while keeping adequate data processing procedures as required by OAIS and by the customer. The aim of the project was to identify the bottlenecks of ingesting large volumes of data.

Processes such as generating or checking hashes, identifying formats and extracting technical metadata usually require a large storage system and, at high volumes, a fast one. In the FamilySearch project they want to ingest into SDB (content acquisition, content preparation, ingest: fixity check, content/metadata integrity check, characterization, i.e. format identification and validation and extraction of technical metadata) at most 700MB per second.

They investigated how to parallelize such a massive workflow efficiently while minimizing costs. According to their findings, parallelization makes it possible to work around the performance problems of tools such as DROID and JHOVE; contrary to their expectations, overall software performance was not a problem. The bigger problems are in the HW, which must be able to write fast enough.

That is, the bottleneck was in the HW and in moving data from place to place rather than in the performance of the digital preservation tools.
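The figures quoted in the talk can be cross-checked with simple arithmetic. The sketch below assumes decimal units (1 TB = 1000 GB = 1,000,000 MB) and a 365-day year; the function name `ingest_figures` and the yearly storage cost it derives are illustrative and do not come from the presentation.

```python
SECONDS_PER_DAY = 24 * 60 * 60


def ingest_figures(tb_per_day=20, package_gb=1, cost_gbp_per_tb=100):
    """Derive average rates and yearly totals from a daily ingest volume.

    Decimal units are assumed: 1 TB = 1000 GB = 1,000,000 MB.
    """
    tb_per_year = tb_per_day * 365
    return {
        "avg_mb_per_s": tb_per_day * 1_000_000 / SECONDS_PER_DAY,
        "packages_per_day": tb_per_day * 1000 / package_gb,
        "pb_per_year": tb_per_year / 1000,
        "storage_gbp_per_year": tb_per_year * cost_gbp_per_tb,
    }


if __name__ == "__main__":
    for name, value in ingest_figures().items():
        print(f"{name}: {value:,.1f}")
```

With the default values this gives an average of roughly 231 MB per second, 20,000 packages per day and 7.3 PB per year, which agrees with the notes; the quoted maximum of 700 MB per second is about three times that average rate.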
Benefit for NK
There is no need to worry about the performance of SW such as DROID or JHOVE.
Ross King: Evolving domains, problems and solutions for LT DP
- information on projects such as SCAPE
- programme: httpcordiseuropaeufp7icttelearn digicultreport research digital preservation_enpdf
- Stephan Strodl (Vienna University of Technology, Austria), Petar Petrov (Vienna University of Technology, Austria), Andreas Rauber (Vienna University of Technology, Austria): a nice timeline for preservation projects, a whitepaper about the past of European DP
- Funding spent on DP research is gradually increasing. Projects and funding by themselves will not solve anything.
- ARCOMEM: archiving of web archives, socially driven web preservation model
- social web analysis
- archive enrichment
- ENSURE: evaluation between cost and value, automation of the preservation cycle; testbeds: healthcare, clinical trials, financial services
- SCAPE
- preservation planning and action workflows and how to make them scalable
- building an infrastructure for scalable preservation actions
- development of a policy-based preservation planning tool with automated preservation watch
- 3 testbeds: WA, large-scale repositories, research data sets
- all the projects will produce prototype SW
- digital lifecycle approach
- preservation planning plays a role in all these projects, together with virtualization
- a slide with DP trends over recent years
- Research on Digital Preservation within projects co-funded by the European Union in the ICT
- Ensure
- Scape
- Wf4Ever httpwwwwf4ever projectorgabout
- Timbus: SW is not enough; it focuses on the context of organizations; LTP is not only about objects but also about services, etc.
- Totem
Benefit for NK
Follow projects in the area of long-term preservation of digital data. The latest EU projects, such as SCAPE, will speed up the development of concrete tools for the long-term preservation of digital data.
Record keeping in temporary command settings (Erik Borglund)
- preservation of documentation on crisis situations arising from the activities of the police and similar bodies
- how to capture the context: flipcharts, videos and written records can be kept, but what about the context? With analogue documents this is not a problem; the problem lies with digital material and conversations
- the national archive should take care of this, but it accepts only paper documents or, for example, photos from the place of the meeting
- question: is archiving the records the same as archiving the course of the proceedings in digital form?
CO-FINANCED BY THE EU STRUCTURAL FUNDS (EUROPEAN REGIONAL DEVELOPMENT FUND) THROUGH THE IOP
Web archiving session
BnF: 200TB of web-archived data in 15 million ARC files. They have to characterize and validate them, which is time-consuming; the files are stored in a shared repository. SPAR (the LTP system of the French national library) has a capacity of 16PB. They use JHOVE2 for characterization and are building a module for ARC files. They do not want to characterize and validate the content of the ARC files, only identify formats.
PREMIS in METS would be too long, so they will record only metadata at the level of the information package (AIP) that are the same for the whole package. In other words, a property is stated once and then only a note is added saying which files it applies to, instead of repeating the same information for every file. They created a special metadata format, so they can ask the LTP system "give me all information packages that contain format XY" and similar questions. There is no need to index the metadata of the content itself, which would take a long time. They use the same approach for digitized books. Different DP policies and levels of validation apply to different types of web archive data: complete harvests vs thematic harvests.
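The package-level approach described above (state a property once and list the files it applies to) can be illustrated with a minimal Python sketch. This is not BnF's metadata format or the SPAR query interface; the function names, package identifiers and file names are hypothetical and serve only to show the idea.

```python
from collections import defaultdict


def group_files_by_format(file_formats):
    """Return {format: [files]} so each format is recorded once per package."""
    groups = defaultdict(list)
    for file_name, fmt in file_formats.items():
        groups[fmt].append(file_name)
    return {fmt: sorted(names) for fmt, names in groups.items()}


def packages_containing(packages, fmt):
    """Answer 'give me all information packages that contain format XY'."""
    return sorted(pid for pid, groups in packages.items() if fmt in groups)


# Hypothetical packages: identifiers and file names are made up for illustration.
packages = {
    "aip-1": group_files_by_format(
        {"a.html": "text/html", "b.html": "text/html", "c.png": "image/png"}
    ),
    "aip-2": group_files_by_format({"d.pdf": "application/pdf"}),
}

print(packages["aip-1"])
print(packages_containing(packages, "image/png"))
```

The query only touches the package-level summary, which is why the content metadata itself does not have to be indexed.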
NL NZ: 2 harvests, 20 TB in total. They are working on metadata: how much metadata is a lot and how much is too little? Library policy says that as much metadata as possible must be stored, but that would be a problem in terms of metadata size. For selective web harvesting they have a finished workflow in WCT; everything is catalogued.
IA: 16 billion URLs, the oldest from 1996. Growth is 3TB per day, 1PB per year.
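The two growth figures quoted for IA are consistent with each other, as a quick check shows. The sketch assumes decimal units (1 PB = 1000 TB) and a 365-day year; both are assumptions, not something stated in the talk.

```python
TB_PER_PB = 1000       # assumption: decimal units
DAYS_PER_YEAR = 365    # assumption: non-leap year


def yearly_growth_pb(tb_per_day, days=DAYS_PER_YEAR):
    """Convert a daily growth rate in TB into a yearly growth in PB."""
    return tb_per_day * days / TB_PER_PB


print(f"{yearly_growth_pb(3):.3f} PB per year")
```

Three terabytes a day comes to roughly 1.1 PB a year, which matches the rounded "1PB per year" in the notes.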
Euan Cochrane, Dirk von Suchodoletz: Replicating Installed Application and Information Environments onto Emulated or Virtualized Hardware
- capturing and keeping an entire environment on emulated hardware
- e.g. taking the desktop environment of the prime minister and storing it in an archive
- problems with rendering
- computer forensics
- an option for preserving scientific data and records
- the whole thing is about replicating an HDD and running the environment it holds in a virtual environment
- the solution: they pulled the HDDs out of several old PCs > identify the hardware requirements (analysis of the HDD > automatic estimate of the requirements, since this information is part of every PC environment) > choose the emulation/virtualization software (a tool registry such as TOTEM from the KEEP project) > convert the HDD into a disk image suitable for emulation > try to boot the disk image on the emulated hardware > add drivers
- problems with licences, personal data protection and authenticity (20 of the items change: colours etc.)
- QEMU, SPARC processor emulator
Klaus Rechert: Remote Emulation for Migration Services in a Distributed Preservation Framework
Use of emulation as a tool for migration.
Tools for migrating certain formats are often not available. The digital object is placed into an emulated environment (a virtual machine), where it becomes visible inside the emulated system. It can then be opened in the original or another suitable application, saved in a different format and stored again in the virtual machine.
Bram Lohman: Emulation as a Business Solution: the Emulation Framework (KEEP project)
- Emulation framework: 7 emulators, 6 platforms (x86, Amiga and others), 23 file formats.
- A solution for managing emulation tools and setting up emulation processes.
- An environment that contains the emulators; if an application or a file is loaded into it, it should run as in the original environment.
- The environment also contains a tool that shows, for a given file, what format it is and what environment is needed to run it, based on PRONOM. The environment can then be prepared directly and the file run in it.
- It loads a software image from an OPF application database that is being built.
Geoffrey Brown: Developing Virtual CD-ROM Collections: The Voyager Company Publications
- publications of one particular publisher on CD, interactive applications for Mac; of 200 titles published only about 50 are available today
- emulation on today's systems
- HDD snapshot directly in the emulator, i.e. it takes one click and is very fast
- SheepShaver emulator
Evaluation of a large Danish migration project
- Before 1998 formats were not set by law.
- Between 2005 and 2008 they introduced standards.
- The evaluation covers the defined standards and the migration to them in the national archive.
- The evaluation was done for the body that funded the work.
- Between 2005 and 2008 they spent 30 person-years on migration with 10-15 people; 190 thousand USD was invested; total costs were 26 million USD.
- In real terms they did not migrate much data: about 1777GB.
- Various parts of the archive: tapes, population data, data on CD-R, registries and electronically filled-in data.
- They could not read all the files, especially on tapes. There were 5 different types of tape. Some had to be rescued at great expense.
Total cost of producing the preservation standard: 10 man-years, 12 thousand USD, including the manual and implementation recommendations.
Pilot: project planning and management, and verification of the information packages.
The aim of the pilot was to get a better budget and project plan.
Some problematic data in old formats, such as old databases, need smart people who do repetitive, long-running work. They needed good knowledge management for this to be efficient.
Migration method: they wrote the requirements for a tool and a description of how manual migration should be done.
Data preparation (restructuring the data and registering the metadata of the IPs) and preparation of documentation for the migrated IPs.
Software development: in-house development.
They needed 50 person-years per 1
Conclusions
Migration of standard data is cheaper; migration from some standard tapes is cheaper, and so on.
Most of the costs, 80, fell on non-standardized data, in building the migration software. Developing a tool for migrating heterogeneous or non-standard data is the most expensive part.
Lessons learned: the old data had not been analysed sufficiently.
Project management was loose and they lost money.
Knowledge management: a good description of old data and of all their types, generations, locations etc. does not exist at our institution and this will cause difficulties. Migration of old data at NK will be problematic.
Angela Dappert: robust migration workflow for offline media
- What is an archival object (a nice slide): a CD is not an archival object, for them it is a hand-held carrier. A bit-stable object is better, since it can have a backup etc.; then comes the archival object, which carries further metadata (logical preservation).
- A CD is not searchable, cannot be replicated easily, has a large manual overhead, and its rendering technology becomes obsolete very quickly.
- The Endangered Archives project: optical disks, CD-R, external HDs, tapes, 67 terabytes in total.
- The offline hand-held carriers in the Endangered Archives project were very variable; they contained data with DRM, under copyright, and a number of similar problems.
- Options they chose between for each data source:
  o disk image: a single file that contains everything on the carrier
  o or extraction of selected files only
- How important is the carrier itself? We need some information about it; it may hold traces of deleted data and we may want to keep them. They made disk images of everything possible: hybrid DVDs, audio discs that also held data, etc.
- Which disk image should be used? Not a single disk image format for all data, but a dedicated disk image format for each special case.
- They did it with robots (disk copying robots). Sometimes large-scale disk copying robots could not be used: they are good at producing CDs but not at ripping data from CDs.
- They built their own application with disk stacks and some smaller robots. They used LIFO or FIFO and in the end chose FIFO; LIFO had problems with the CD lifter.
At the KB they thought through a fairly complex workflow, how to describe it, etc.
Each robot had its own PC.
They had problems with a number of things; see the presentation.
They did not find good software for managing the images, only command-line tools, and non-technical staff would make a lot of mistakes with those.
It takes a lot of people before the content gets online.
Staff have to be well trained and flexible, but also able to do tedious jobs: systematic, patient.
NOTE: important for NK, where the transfer of data from disks will also have to be addressed and has already been addressed.
KEEP project, Antonio Ciuffreda: towards an integrated migration environment
- Disk transfer tool: converts a disk into an image file. It also holds further metadata about the file system, the MD5 of the file, etc.
- KEEP is building a Transfer Tool Framework.
- Magnetic media: disk transfer tools for floppy disks, commercial and open source.
- Disk2FDI: commercial DOS tool; very accurate floppy disk image; it takes 1 hour and the result is very large, ten times larger than the floppy disk itself. Tested on about 2260 disks; they tested emulation.
- Catweasel: commercial tool; it is a PCI card, runs on Linux and Windows XP, has a GUI. High error rate but faster; image file quality was low.
- Nibtools: free tool; G64 and D64; covers only the C64; DOS, Windows, Linux, but command line only. Needs a Commodore disk drive and special cables. They tested a few disks; about half then did not work in the emulator.
- Optical media: they used 5 transfer tools, with the same CDs, DVDs and games for all of them.
  1. Alcohol 120: commercial; can bypass DRM etc.; supports Windows systems. Of 13 discs, 12 worked; 2MB per second.
  2. Daemon Tools: commercial; several types of image files (ISO, MDS, MDF); supports Windows; three of 13 did not work.
  3. CloneCD: commercial; uses IMG or ISO; bypasses SafeDisc 3 protection; supports Windows; has a GUI; 11 of 13 were OK.
  4. BlindWrite: commercial; supports DVD, Blu-ray, WM, Xbox and other special discs. Generates ISO and some proprietary ISO image formats; one disc did not work; download speed
  5. ImgBurn: does not read subchannel information into the image file (a film cannot be seeked, etc.); it is open source; generates DVD, BIN, CUE, IMG; Windows and Linux; 4 did not work; it is fast.
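The per-tool results for optical media can be tallied in a few lines. The sketch below assumes that every tool was run against the same 13 discs; the notes state this explicitly only for some of the tools, so the common sample size is an assumption. Failure counts are taken from the list above.

```python
SAMPLE_SIZE = 13  # assumption: the same 13 discs were used for every tool

# Number of discs that failed, per tool, as recorded in the notes.
failures = {
    "Alcohol 120": 1,
    "Daemon Tools": 3,
    "CloneCD": 2,
    "BlindWrite": 1,
    "ImgBurn": 4,
}


def success_rate(failed, total=SAMPLE_SIZE):
    """Share of discs that were imaged successfully."""
    return (total - failed) / total


for tool, failed in sorted(failures.items(), key=lambda item: item[1]):
    ok = SAMPLE_SIZE - failed
    print(f"{tool:13s} {ok:2d}/{SAMPLE_SIZE}  {success_rate(failed):.0%}")
```

Under that assumption the failure share ranges from 1/13 to 4/13 of the discs, depending on the tool.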
Conclusions
For magnetic media, the performance of commercial and non-commercial tools does not differ. Disk2FDI is accurate but very slow. KEEP will use NibTools.
Optical media: copy protection has to be taken into account. The tools perform similarly, and there will always be errors in the images, between 30 and 10 percent. BlindWrite can handle game discs (Xbox etc.). KEEP will use ImgBurn because it is open source.
For complex images BlindWrite is better.
Benefit for NK
Consider whether NK should really run a project to migrate the content of CDs and DVDs to online media. The talks present concrete experience with robotic processing and show the problems encountered in designing the workflow, choosing the ISO image type, etc.
BnF: web archiving
They have three layers:
Harvest definition: collection
Harvest instance: crawling metadata
ARC files
A collection will be, for example, a selective one (elections 2000), followed by the individual harvest instances and then the ARC files. They collect logs, config and reports. These are also stored in ARC: special metadata ARC files for each crawling instance.
PREMIS
Object, agent, event
Objects: 1. ARC files and metadata ARC files; 2. harvest instances
Harvest event in PREMIS: event = creation of content files
Events: reports as an extension of the event (host report and harvest report)
Agents: administrator, software, institutions, organizations that perform the harvest
containerMD
http://bibnum.bnf.fr/containerMD-v1/documentation/containerMD-v1.html
Special metadata for items from the web archive
http://bibnum.bnf.fr/containerMD-v1
A different SLA for different types of material and for different data from different harvests. Shared repository: they expect various benefits; skills for individual formats do not have to be duplicated within the institution.
Next year there should also be a JHOVE2 module for WARC.
Memento: a meta search engine
Benefit for NK
Their web archiving model could be used at NK.
Cost models: Danish national library + TU Wien
- Stephan Strodl (TU Vienna): they have their own cost model, but apparently only for the cost of small-scale automated preservation actions.
- The Danish national library built its own model, which is meant to be universal and usable anywhere.
- They measured the cost of submission according to the PAIMAS standard.
- When calculating costs they use OAIS and PAIMAS: they map activities onto these models and then estimate the cost of the processes accordingly.
- costmodelfordigitalpreservation.dk
Benefit for NK
Relevant to project 0136, which dealt with options for estimating the cost of long-term storage.
Meet RODA, a Full-Fledged Digital Repository for Long-Term Preservation
- Originally a project of the Portuguese national archive, which we have been following for several years. RODA is now supported by an independent company, which also partly continues its development. So far RODA supports only an archival metadata format (EAD), but further development should also cover library formats.
- RODA is now part of the SCAPE project, where it will be possible to develop the system further and scale it for use in mass production.
- http://redmine.keep.pt/projects/roda-public
Benefit for NK
Follow further development, possibly also for the INCAD + KNAV projects. For the development of LTP for smaller institutions this could become an interesting alternative in the future.
Report on a foreign business trip
Name and surname of the participant: Mgr. Andrea Fojtů (AF)
Workplace according to the organizational structure: 1.5 Department of Long-term Preservation of Digital Data (ODODD)
Workplace assignment: 1.5.2 Department of Digital Repository Content Management
Reason for the trip: participation in the conference Aligning National Approaches to Digital Preservation
Place (city): Tallinn
Place (country): Estonia
Date (from-to): 22-26 May 2011
Detailed schedule:
Monday 23 May: conference registration, guided tour of the National Library; Keynote Address by Laura Campbell, Library of Congress, USA; Panel 1: Technical Alignment; Panel 2: Organizational Alignment
Tuesday 24 May: Keynote Address by Gunnar Sahlin, National Library of Sweden; Panel 3: Standards Alignment; Panel 4: Legal Alignment; Breakout Sessions for panels 3 & 4
Wednesday 25 May: Panel 5: Education Alignment; Panel 6: Economic Alignment; Breakout Sessions for panels 5 & 6; Synthesis; Closing remarks
Fellow travellers from NK: PhDr. Bohdana Stoklasová (BS), Ing. Tomáš Svoboda (TS)
Funding: IOP NDK
Goals of the trip: attendance at a conference with international participation; gaining contacts in the area of long-term preservation of digital documents and electronic legal deposit; a more detailed insight into the issues of long-term preservation of digital documents, (mainly) in national libraries
Fulfilment of the goals (specifically): The conclusions of the conference Aligning National Approaches to Digital Preservation largely repeat the conclusions of the workshop The Future of the Past: Shaping new visions for EU research in digital preservation (report available at http://cordis.europa.eu/fp7/ict/telearn-digicult/future-of-the-past_en.pdf), for example regarding the missing economic model for the commercial sector, which would treat long-term preservation as an integral part of all its processes. Contact was established with staff of the national libraries of Estonia and Finland (staff working on web archiving and on long-term preservation in general).
Programme and further details: The main goal of the conference was to align national approaches to long-term preservation of digital documents across all areas, from technical, organizational and educational to standardization, economic and financial.
Materials brought back: conference programme, leaflets of exhibiting companies (Tessella, Equella, Guardtime) + further materials, notes
Date of submission of the report: 8 June 2011
Signature of the submitter:
Signature of the supervisor:
Posted on the Intranet:
Received by the international department:
Appendix to this report: notes from the conference in English

Appendix: Notes from the conference in English
Exploring What We Can Do Together: Strategic Alliance for International Collaboration, Laura Campbell
- 185 digital preservation partners in more than 25 countries (education, research, LAM)
- strategic goals: National Content Stewardship Network (national digital collection, technical architecture, public policy, outreach)
- NDIIPP Content Domain Map: a mind map of geospatial, audiovisual, image & text, and web content
- then & now: cognitive surplus vs digital libraries / digital preservation
- solution framework: actively working together, special interest groups, establishing a common index, an international digital collection (freely available)
PANEL 1: Technical Alignment (The role of testing), Prof. Dr. Michael Seadle (Panel Head)
- to collaborate on requiring and implementing rigorous and independent tests
DNB Contribution to the Tallinn Alignment, Sabine Schrimpf
- key theme is infrastructure
- a network of hardware and software that permits the operation of software applications
- the question of interoperability is crucial (standards, technical specifications)
- SW elements
- the components of the DP infrastructure were compared to pallets on railroads
  o source: PARSE.Insight Roadmap 2020
- persistent ID resolvers, certification process
- kopal (ingest: koLibRI)
- nestor (German Network of Expertise)
- DP4Lib: Digital Preservation as a Service; reduce dependency between components
  o redundant storage at different locations
  o koLibRI modules
- LuKII: set up as an economical LOCKSS network in Germany
- SHAMAN
- APARSEN: wants to bring coherence and cohesion to digital preservation research
  o trends in DP research projects
  o modular DP systems
  o distributed as SOA
  o elimination of technology dependencies
EDINA: The UK LOCKSS Alliance, Adam Rusbridge
- EDINA offers underlying technical support & coordination
- threats to digital stewardship
  o failure (media, HW, SW, network, format obsolescence, natural disaster, economic/organizational failure)
  o attack (insider/outsider)
  o operator error
- source: Requirement to Digital Preservation
- the projects PEPRS and PECAN help identify coverage and requirements for DP
Public testing, Michael Seadle
- traditional physical archiving relies heavily on trusted institutions
- distrust, not trust, needs to be the basis of digital archiving; testing therefore plays a key role
- goals of testing: demonstrate functionality, reveal weaknesses, provide data for planning improvements
- key issues for testing: integrity, authenticity (can the origin or genuineness be shown?), usability (can migration/emulation be demonstrated?), access, financial integrity
- DrWho (drwho1.com)
- bit stream testing is the most important; authenticity and usability may be impaired
  o the type of storage media, the number of copies + the frequency of checking and replacement get us to the relevant results
  o no reliable metrics exist, however (what is an acceptable loss, etc.)
- without well-documented, peer-reviewed, publicly available test results, librarians are buying archiving systems on faith
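Bit stream testing of the kind described above usually comes down to recomputing a checksum for every stored copy and comparing it with the recorded value. The following is a minimal sketch of such a fixity check using Python's standard `hashlib`; the helper names and the two temporary "copies" are hypothetical, and SHA-256 is chosen here only as an example digest.

```python
import hashlib
import tempfile
from pathlib import Path


def sha256_of(path, chunk_size=1 << 20):
    """Compute the SHA-256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def check_copies(copies, expected_digest):
    """Return the paths of copies whose digest no longer matches the record."""
    return [str(path) for path in copies if sha256_of(path) != expected_digest]


# Demonstration with two hypothetical copies of the same bit stream.
with tempfile.TemporaryDirectory() as tmp:
    copy_a = Path(tmp) / "copy_a.bin"
    copy_b = Path(tmp) / "copy_b.bin"
    copy_a.write_bytes(b"archived bit stream")
    copy_b.write_bytes(b"archived bit stream")
    recorded = sha256_of(copy_a)

    healthy = check_copies([copy_a, copy_b], recorded)
    copy_b.write_bytes(b"archived bit strean")  # simulate silent corruption
    damaged = check_copies([copy_a, copy_b], recorded)

print(healthy, [Path(p).name for p in damaged])
```

How often such a check runs and how many copies it covers are exactly the parameters the talk says lack agreed metrics.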
Presentation without a title, Andy Rauber
- evaluation vs testing vs benchmarking
- DP: testing and evaluation rather than benchmarking; far from benchmarking (a few tests, but not near a definition of benchmarks)
  o existing evaluations are not repeatable
  o focus on the simple things
  o building the frameworks before having clear test scenarios
- necessary to move towards comparative benchmarking
- what is needed: a commitment that we want a culture of benchmarking and comparative evaluation; an understanding of what we want to benchmark; benchmark data + ground truth; measurement scales and measures that remain constant; knowledge bases to collect these
Organizing digital preservation on an international level, Michelle Gallinger (NDIIPP)
- focus on a national DP agenda
- community driven, action oriented (National Digital Stewardship Alliance)
  o present a distributed national digital collection for the benefit of citizens
The European Research Arena, David Giaretta
- technologies: GEANT, EGEE/EGI
- EU research projects: TIMBUS, BLOG4EVER, SCAPE, ENSURE, APARSEN, ARCOMEM, WF4EVER
  o SCIDIP-ES (2011-2014)
- Alliance for Permanent Access (APA), formed as a legal entity 3 years ago
  o opportunities for networking
- ISO 16363: Audit and Certification of Trustworthy Digital Repositories
- ISO 16919: Requirements for Bodies Providing Audit and Certification
Observation from the MetaArchive Cooperative program, Martin Halbert
- distributed DP programs differ from other programs
  o replication of content, distribution of these replicated copies to distinct geographical locations, and a network organization to connect these replicated copies
- MetaArchive: established in 2003, funded by NDIIPP
  o seeks to foster broader awareness of digital preservation issues
- IIPC: members are all institutions that focus on WA
  o 39 members: national and university libraries + other organizations (Internet Archive)
  o ISO standard WARC format for web archives + Heritrix and NutchWAX
  o growing membership (Africa & South America)
DAY 2
Keynote address: International and National Collaboration in the digital age, Gunnar Sahlin
- 2012: a new law for e-deposit
- Samsök (search together) in 2005 (upgrade, new system)
- SwePub and long-term preservation
- consortium of the Swedish research libraries for licensing e-journals and databases (ICOLC)
- Open Access and e-publishing (all universities have their repositories for e-pubs)
- NL: aggregator for Europeana, TEL, Apres, Athena, EUscreen
  o common system for the preservation of digital materials
  o common search portal for materials from the Swedish National Library and the Swedish National Archive
Raivo Ruusalepp
- standards: RAC, DSA, CIDOC (CRM), PAIMAS, ISAD(G), DDI
- use of information security standards for digital preservation
- information security: administrative and technical (physical = data security vs IT = communication)
- a company implemented security measurement with typical cyber crime scenarios
- survey of security
  o provision for information security in national legislation and development plans (12 of the respondents: ISO 27000 series; only 2 formal audits, the rest are looking into it; ½ of the respondents do not use standards or formal measurements to control information security)
  o IT & disaster plan: 65 (data recovery from the off-site location tested: 0)
- alignment
  o better use of community standards for information security and preservation
  o agreement on security requirements
Standards based approach to preservation planning, Matthew Woollard
- ISO 27001: very expensive implementation, 100 000 pounds
- Basic: Data Seal of Approval Guidelines (helps you understand your business better)
- Audit and Certification of Trustworthy Digital Repositories
- Memorandum of Understanding to Create a European Framework
- ISO 16363 external, DSA or ISO 16363 self-audit
Best Practices & Standards, Bram van der Werf
- self-assessment: trust; audits/certification: trust
- ISO 30300 (draft): Record Management
PANEL 4: Legal Alignment
Legal deposit & Web Archiving, Adrienne Muir
- legal deposit provisions: purpose, scope, deposit mechanisms, roles & responsibilities & liability, access provisions, sanctions
- implementation: definitions, scope, offline/online, freely available/pay wall, technology neutral, incremental
- legal deposit vs voluntary (interim/hybrid approach, model agreements and licenses, flexibility)
- other legal issues: intellectual property rights, preservation, access, unlawful material, privacy/data protection
- voluntary approaches have disadvantages but may be necessary and can be useful
Breakout Session
- standards bring along alignment by themselves, but only if you use them to the full, not halfway
- depends on the community (users): enforceable or voluntary compliance (standards)
- next steps for standard alignment
  o corpora as benchmarks
  o export/import completeness
  o educational standards
  o validation tools
  o accredited training courses to accredit auditors
  o framework standards
DAY 3
PANEL 5: Educational Alignment
- key elements of the DCC Curation Lifecycle Model
- Framework for the Education Alignment (USA: grad programs for digital preservation, new models for grad programs, internship programs (diverse knowledge), workshops)
  o sharing tools
  o national programs
- related issues to digital preservation
  o nature of costs and business models
  o strategies for selection & appraisal
  o ground roles and responsibilities
  o effectiveness and demand for services
- focus of the panel: factors influencing the actual sustainability of a digital archive
- 2 considerations: collaboration + user demand
- challenges + gaps: span national boundaries, public + private funding, education, exportation, DP certification, competition, funding gaps, policy, selection criteria, roles & responsibilities, standards
- Magazzini Digitali: e-legal deposit in Italy
- PADI: a failed project (discontinued in 2010)
Presentation without a title, Neil Grindley
- how much does it cost to manage information? what institutional financial strategies are required to facilitate effective preservation? what general economic frameworks are required to enable information to persist and be accessible?
- JISC 2010: Infrastructure for Education and Research Programme
- archival storage and preservation activities constitute a very small proportion of the overall costs: 1531; access 55; outreach, acquisition, ingest
  o approx. 333 Euros for a set of 1000 records
- KRDS2 p. 83: future tool development supporting automation of ingest
Sustainable Preservation in North America: ADPNet & Friends, Aaron Trehub
- solution: distributed digital preservation (in at least 3 copies vs LOCKSS 6 copies)
- DDP + LOCKSS = PLN; open software developed at Stanford
- MetaArchive, COPPUL (Canada), ADPNet: www.adpn.org
Further details: SUMMARY AND BENEFIT FOR THE NDK PROJECT
- a noticeable rise of emulation as a solution for long-term preservation (in previous years it was migration) > the two approaches will apparently complement each other
- a shift towards the preservation of complex data, databases etc.; NK does not address this yet
- many contributions are applicable to NK and NDK (web archiving and preservation at the national library of France, audits, emulation, information on the SDB system (Tessella) and on the RODA system, certification, see Rouchon, etc.)
- information on problems and solutions when tools such as JHOVE, PRONOM and others are used in a real environment
- 2 contributions on backing up optical disks, a current problem at NK as well; ideally follow the procedures described in the proceedings
- a clear need for international cooperation and for adherence to standards so that such cooperation is possible
- for details see below
Support for project publicity: N/A
Related materials
Material: conference proceedings. Storage location: SPS, folder with business trip reports.
Date of submission of the report: 15 November 2011
Signature of the submitter:
Date, signature:
Signature of the supervisor: 15 November 2011
Posted on the intranet:
Received by the international department:
Seamus Ross: digital curation and preservation
- preserving data sets: use of statistical databases and research data vs preservation of textual information
- personal data: no longer just photos in a box (Flickr etc.)
- banks: a lot of personal data, future use by historians; it has to be kept, and the institutions do keep it
- see mckinsey.com, big data, full report (pdf)
- so why do DP (slide 16): future generations expect it; for historians and scientists, so that they have sources; a legacy about the present for the future; an information ecosystem to enable storytelling
- the emphasis moves from preserving textual information to preserving complex databases
A capability model for DP, Ch. Becker et al.
- research within the SHAMAN and SCAPE projects
- SoS: systems of systems
- 3 kinds of systems (UML):
  o DPS with DP as a functional requirement
  o SoS: a business system, a system within a system; the data then flows into the DPS
  o DPS where DP is not a functional requirement but the system does it anyway (DP-ready system): a business system with DP functionality
- but how to get DP into enterprise systems? a model for implementing DP in any system, developed within SHAMAN: capability-based reference architecture
- governance, business and technical (operation) capability: a basis for decisions and for assessing the current state
- capability maturity model (CMM): processes for assessment and improvement, as in software development
Olivier Rouchon: certification and quality at CINES
- they store theses, digitized items, multimedia, documents and scientific data sets
- a data centre for the whole of France
- they have format experts (XML), 11 people
- 15TB of data
- certification: by national law CINES is the national centre for DP of theses; they have a department, people and money for it; for the procedure and preparations see below
- preparation for certification: testing DRAMBORA, DSA, TRAC, ISO 16363 and ISO 16919
- step: 2009 DRAMBORA audit; 2 risk reviews per year, checking how the risks are being dealt with
- step: formalization of business processes, 14 processes according to ISO 9001: management, operational and support processes (presented at iPRES 2010)
- 2009 external pre-audit: 2 people, 19 man-days, based on all available standards, carried out by checking documentation and by interviews
- 2010 SIAF audit: 4 months; carried out by the National Archives of France for every archive that stores public data; they audit every 3 years; the report had 800 pages
- 2010 Data Seal of Approval: part of the EU framework for audit and certification of trusted repositories (MoU between the three certification activities)
- 2011: within the APARSEN project they also did ISO 16363; they went through the audit together with DANS and UKDA
- first internal, January and April 2011 (60 man-days), then external, 12 experts (KB, BL, NASA etc.) in June 2011 (3 days)
MoU: DSA > ISO 16363 as the second step (internal) > ISO 16363 extended
An audit gets faster the more of them you do, i.e. if it is done regularly it is not so time-consuming.
The national library of New Zealand passed TRAC certification in autumn 2011.
Andreas Rauber: impact of preservation actions on repositories
- what happens to the repository itself
- repository simulation: RepoSim
- for analysis and for testing migration: what happens when files grow, what happens when the repository holds more types of formats, etc.
- RepoSim: a flexible simulator, handles irregular patterns
- so far an internal version: Hibernate, Java, MySQL
- it is possible to specify which formats the repository accepts, their description, ingest, settings, hypothetical tools (mainly for migration), and rules for preservation actions (migration to which format, which version, which files, how many times; rules + filters)
- a virtual migration can be run; it produces charts that say how long it will take, etc.
- what to migrate, how, to what and for how long is all run virtually and the result can be inspected
- good for IT and hardware planning
- good for planning different scenarios, comparing them with the expected development, and planning hardware growth and investment
- still to be finished: the option to define deletion policies, reports, etc.
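The idea of a virtual migration that reports how long a planned action would take can be shown with a toy estimate. This is not RepoSim and does not reflect its data model; the class, the holdings and the per-file timings below are all hypothetical values chosen for illustration.

```python
from dataclasses import dataclass


@dataclass
class FormatHolding:
    name: str                 # format label
    file_count: int           # number of files held in this format
    seconds_per_file: float   # assumed mean migration time per file


def migration_hours(holdings, formats_to_migrate, parallel_workers=1):
    """Estimate wall-clock hours for migrating the selected formats."""
    total_seconds = sum(
        h.file_count * h.seconds_per_file
        for h in holdings
        if h.name in formats_to_migrate
    )
    return total_seconds / parallel_workers / 3600


# Hypothetical repository content and timings.
holdings = [
    FormatHolding("TIFF", 200_000, 1.8),
    FormatHolding("DOC", 50_000, 0.9),
    FormatHolding("PDF", 120_000, 0.5),
]

for workers in (1, 4, 16):
    hours = migration_hours(holdings, {"TIFF", "DOC"}, workers)
    print(f"{workers:2d} workers: {hours:6.1f} h")
```

Running the same rule set with different worker counts or file volumes is the kind of scenario comparison the talk recommends for hardware and investment planning.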
José Barateiro: Risk assessment in DP of e-science data and processes
- DP as risk management.
- ISO 31000: definition of risk management, similar to DRAMBORA.
- There are many standards for risk management.
- The ISO 31000 methodology broken down into individual steps.
- TIMBUS project, http://timbusproject.net; one of the partners is SAP (Germany).
Mad talks
- Open-source software for LTP: RODA is back; it is being developed within the SCAPE project, with new features and plans for further development and for building a user community.
- 4 posters on emulation: emulation within KEEP, emulation for library reading rooms, OPF ecosystem registry.
- TOTEM: a metadata standard for describing the technical environment for emulation.
ANDS, Ross Wilkinson
- Data centres in Australia: at least 3, for different areas of life.
- ANDS has existed for almost 3 years; the money comes from the Australian government.
- A huge amount of data will never be used or read by a human, only mined by automated processes.
- Research data must be stored and protected because it may no longer be possible to recreate them; they must be kept so that they can be reused, so that new conclusions can be drawn from them, and so that scientists have them available.
- This has to be done collaboratively; it cannot be done by a single institution on its own.
- Who handles the storage of scientific data in the Czech Republic? The Academy of Sciences? CESNET?
- Similar data centres also exist in Great Britain.
Rob Sharpe: Considerations for High Throughput Digital Preservation
A presentation by Tessella on their performance testing of ingest into SDB at FamilySearch.
- SDB has been in development since 2002, when the first customer was the National Archives UK.
- New customer: the UK Parliament.
- Test with FamilySearch.
- 20 TB ingested per day; scanned materials; workflow with antivirus, characterization (PRONOM, JHOVE), etc.
- 1 package is roughly 1 GB; 20 thousand packages per day.
- 2 Dell PowerEdge R710 servers, together costing at most 20,000 pounds.
- The limiting factor turned out to be the read speed of the disks on which the data sit at the start of ingest; they therefore needed 130 parallel disks (50 thousand pounds).
- Storing to tape is also slow; they therefore needed 8 parallel tape writes (30 thousand pounds).
- Storage costs 100 pounds per TB.
- 7.3 PB per year (20 TB per day over 365 days).
- Conclusion: writing and reading are slow; tools such as JHOVE and PRONOM are fast enough; storage costs also turned out to be high.
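The figures quoted in the talk can be cross-checked with a few lines of arithmetic. The sketch below assumes decimal units (1 TB = 10^12 bytes) and ingest on every day of the year; the function names are illustrative only.

```python
SECONDS_PER_DAY = 24 * 60 * 60
BYTES_PER_TB = 10 ** 12   # assumption: decimal terabytes
BYTES_PER_MB = 10 ** 6    # assumption: decimal megabytes


def sustained_mb_per_s(tb_per_day):
    """Average throughput needed to ingest `tb_per_day` terabytes daily."""
    return tb_per_day * BYTES_PER_TB / SECONDS_PER_DAY / BYTES_PER_MB


def yearly_volume_pb(tb_per_day, days=365):
    """Volume accumulated over a year, in petabytes."""
    return tb_per_day * days / 1000


def yearly_storage_cost(tb_per_day, pounds_per_tb, days=365):
    """Cost of storing one year of ingest at a flat price per TB."""
    return tb_per_day * days * pounds_per_tb


if __name__ == "__main__":
    print(round(sustained_mb_per_s(20), 1), "MB/s on average")
    print(yearly_volume_pb(20), "PB per year")
    print(yearly_storage_cost(20, 100), "pounds per year for storage")
```

At 20 TB per day the average rate is a little over 230 MB/s, and at 100 pounds per TB one year of ingest costs about 730,000 pounds to store, which is far more than the servers, disks and tape drives listed above.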
For ingesting the data from the FamilySearch project they needed to guarantee a throughput of 20 TB of data per day while keeping adequate data-processing procedures as required by OAIS and by the customer. The aim of the project was to identify the bottlenecks in ingesting large volumes of data.
Processes such as generating or checking hashes, identifying formats and extracting technical metadata usually require a fast storage system when volumes are large. In the FamilySearch project they want to ingest into SDB (content acquisition, content preparation, ingest: fixity check, content/metadata integrity check, characterization, i.e. format identification and validation and extraction of technical metadata) at up to 700 MB per second.
The question is therefore how to parallelize such a massive workflow efficiently while minimizing costs. According to their findings, parallelization makes it possible to work around performance problems of tools such as DROID and JHOVE; overall, and contrary to their expectations, software performance was not a problem. The bigger problems are in the HW, which has to be able to write fast enough.
In other words, the bottleneck was in the HW and in moving data from place to place rather than in the performance of the digital preservation tools.
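The parallelization described above can be sketched for one ingest step, the fixity check. This is only an illustration using the Python standard library, not the SDB workflow; the function names and the default worker count are assumptions.

```python
import hashlib
from concurrent.futures import ThreadPoolExecutor


def sha256_of(path, chunk_size=1024 * 1024):
    """Compute the SHA-256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def check_fixity(expected, workers=8):
    """Check many files in parallel.

    `expected` maps a file path to its recorded SHA-256 digest.
    Returns the list of paths whose current digest does not match.
    """
    paths = list(expected)
    with ThreadPoolExecutor(max_workers=workers) as pool:
        actual = list(pool.map(sha256_of, paths))
    return [p for p, digest in zip(paths, actual) if digest != expected[p]]
```

Adding workers only helps while the storage can deliver data fast enough, which matches the finding that the bottleneck lay in disk and tape speed rather than in the tools.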
Benefit for the NK
There is no need to worry about the performance of software such as DROID or JHOVE.
Ross King: Evolving domains, problems and solutions for LT DP
- Information on projects such as SCAPE.
- Programme: http://cordis.europa.eu/fp7/ict/telearn-digicult/report-research-digital-preservation_en.pdf
Stephan Strodl, Petar Petrov, Andreas Rauber (all Vienna University of Technology, Austria): a nice timeline of preservation projects; a white paper about the past of European DP
- Funding spent on DP research is gradually growing. Projects and funding will not solve anything.
- ARCOMEM: web archiving; socially driven web preservation model; social web analysis; archive enrichment.
- ENSURE: evaluation between cost and value; automation of the preservation cycle; testbeds: healthcare, clinical trials, financial services.
- SCAPE: preservation planning and action workflows and how to make them scalable; creating an infrastructure for scalable preservation actions; developing a policy-based preservation planning tool with automatic preservation watch; 3 testbeds: web archives, large-scale repositories, research data sets.
- All the projects will produce prototype software.
- Digital lifecycle approach.
- Preservation planning plays a role in all these projects, together with virtualization.
- Slide with trends in DP over recent years.
- Research on Digital Preservation within projects co-funded by the European Union in the ICT programme: ENSURE; SCAPE; Wf4Ever (http://www.wf4ever-project.org/about); TIMBUS (software alone is not enough; it focuses on the organizational context; LTP is not only about objects but also about services, etc.); TOTEM.
Benefit for the NK
Follow the projects in the field of long-term preservation of digital data. The latest EU projects, such as SCAPE, will speed up the development of concrete tools for long-term preservation of digital data.
Erik Borglund: Record keeping in temporary command settings
- Preserving the documentation of crisis situations that arises from the work of the police etc.
- How to capture the context: flipcharts, videos and written notes can be kept, and the context of analogue documents is not a problem; the problem lies with digital material and conversations.
- The national archive should take care of this, but it only accepts paper documents or, for example, photographs from the scene.
- The question of archiving file material is the same as that of archiving the course of the proceedings in digital form.
Web archiving session
BnF
- 200 TB of web-archived data, 15 million ARC files; they have to characterize and validate them, which is time-consuming.
- They store them in a shared repository; SPAR (the LTP system of the French national library) has a capacity of 16 PB.
- They use JHOVE2 for characterization and are building a module for ARC files; they do not want to characterize and validate the content of the ARC files, only identify the formats.
- PREMIS in METS would be too long, so they will record only metadata at the level of the information package (AIP) that are the same for the whole package. That is, a property is stated once and then only the information about which files it applies to is added, instead of repeating that information for every file.
- They created a special metadata format. They can therefore ask the LTP system "give me all information packages that contain format XY" and the like; the metadata of the contents do not need to be indexed, which would take a long time. They take the same approach for digitized books.
- Different DP policies and validation levels for different types of web archive data: complete (domain) harvests vs. thematic harvests.
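The package-level approach described for BnF (state a property once and list the files it applies to) can be illustrated as follows. This is a sketch of the idea only, not the BnF metadata format; the function names and the sample identifiers are hypothetical.

```python
from collections import defaultdict


def summarise_package(files):
    """Group the files of one package by format.

    `files` is an iterable of (file_id, format_name) pairs.
    Each format is stated once, followed by the files it applies to.
    """
    by_format = defaultdict(list)
    for file_id, fmt in files:
        by_format[fmt].append(file_id)
    return dict(by_format)


def packages_containing(packages, fmt):
    """Answer "give me all packages that contain format XY".

    `packages` maps a package id to its summary from summarise_package().
    """
    return sorted(pid for pid, summary in packages.items() if fmt in summary)


if __name__ == "__main__":
    packages = {
        "aip-001": summarise_package([("f1", "text/html"), ("f2", "image/gif"),
                                      ("f3", "text/html")]),
        "aip-002": summarise_package([("f1", "application/pdf")]),
    }
    print(packages["aip-001"])
    print(packages_containing(packages, "text/html"))
```

The query only touches the per-package summaries, so the individual file-level metadata never have to be indexed.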
NL NZ
- 2 harvests, 20 TB in total.
- They are working on the metadata question: how much metadata is a lot and how much is too little. The library's policy says that as much metadata as possible must be stored, but that would be a problem in terms of metadata size.
- For selective web harvesting they have a finished workflow in WCT; everything is catalogued.
IA
- 16 billion URLs, the oldest from 1996.
- Growth is 3 TB per day, 1 PB per year.
Euan Cochrane, Dirk von Suchodoletz: Replicating Installed Application and Information Environments onto Emulated or Virtualized Hardware
- Capturing and preserving a whole environment on emulated HW.
- For example, take the desktop environment of the prime minister and store it in the archive.
- Problems with rendering.
- Computer forensics.
- An option for preserving scientific data and records.
- It is all about how to replicate an HDD and run the environment that is on it in a virtual environment.
- Solution: they pulled the HDDs out of several old PCs > identify the HW requirements (HDD analysis > the requirements are estimated automatically; this information is part of every PC environment) > select the emulation/virtualization SW (a tool registry such as TOTEM from the KEEP project) > convert the HDD into a disk image suitable for emulation > try to boot the disk image on the emulated HW > add drivers.
- Problems with licences, personal data protection and authenticity (20% of things change: colours etc.).
- QEMU, SPARC processor emulator.
Klaus Rechert: Remote Emulation for Migration Services in a Distributed Preservation Framework
Using emulation as a tool for migration.
For certain formats migration tools are often not available. We place the digital object into an emulated environment (a virtual machine); we then see it inside the emulated system, can open it in the original or another suitable application, save it in a different format, and store the result back in the virtual machine.
Bram Lohman: Emulation as a Business Solution: the Emulation Framework (KEEP project)
- Emulation framework: 7 emulators, 6 platforms (x86, Amiga, etc.), 23 file formats.
- A solution for managing emulation tools and setting up emulation processes.
- An environment that contains the emulators; if we load an application or a file into it, it should run as in the original environment.
- The environment also contains a tool that shows, on the basis of PRONOM, what format a file is and what environment is needed to run it; that environment can then be prepared straight away and the file run in it.
- It loads the SW image from the OPF application database, which is being built.
Geoffrey Brown: Developing Virtual CD-ROM Collections: The Voyager Company Publications
- Publications of one particular publisher on CD, interactive applications for the Mac; of 200 titles published only about 50 are available today.
- Emulation on present-day systems.
- HDD snapshot directly in the emulator, i.e. it takes one click and is very fast.
- SheepShaver emulator.
Evaluation of the Danish large migration project
Before 1998 formats were not laid down by law.
Between 2005 and 2008 they introduced standards.
The evaluation concerns the standards that were set and the migration to them in the national archive.
The evaluation was done for the body that funded the work.
Between 2005 and 2008 they spent 30 person-years on migration with 10 to 15 people working on it; 190 thousand USD was invested; total costs were 26 million USD.
In real terms it is not much data; they migrated about 1777 GB.
Various parts of the archive: tapes, population data, data on CD-R, registries and electronically populated data.
They could not read all the files, especially those on tape.
5 different types of tape.
Some had to be rescued at great expense.
Total cost of producing the preservation standard: 10 man-years, 12 thousand USD, including the manual and implementation recommendations.
Pilot: project planning and management, and verification of the information packages.
The aim of the pilot was to obtain a better budget and project plan.
Some problematic data in old formats, such as old databases, need clever people who do repetitive work over a long period; they needed good knowledge management to make this efficient.
Migration method: they wrote the requirements for the tool and a description of how manual migration should be done.
Data preparation (restructuring the data and registering the metadata of the IPs) and preparation of the documentation of the migrated IPs.
Software development: in-house.
They needed 50 person-years per 1
Conclusions
Migration of standard data is cheaper, migration from some standard tapes is cheaper, etc.
Most of the costs (80%) fell on non-standardized data, in producing the migration software. Developing a tool for migrating heterogeneous or non-standard data is the most expensive part.
What they learned: the old data had not been analysed sufficiently.
Project management was loose and they lost money.
Knowledge management: a good description of the old data and of all their types, generations, locations, etc. does not exist at our institution and we will have difficulties with this; migration of old data at the NK will be problematic.
Angela Dappert: robust migration workflow for offline media
- What is an archival object? Nice slide: a CD is not an archival object; for them it is a hand-held carrier. A bit-stable object is better, since it can have a backup etc.; on top of it comes the archival object, which has further metadata (logical preservation).
- A CD is not searchable, cannot easily be replicated, has a large manual overhead, and its rendering technology becomes obsolete very quickly.
- The Endangered Archives project: optical disks, CD-R, external HDs, tapes; 67 terabytes in total.
- The offline hand-held carriers in the Endangered Archives project were very variable; they contained data with DRM, under copyright, and a range of similar problems.
- Options they chose between for each data source:
- A disk image: a single file that contains everything on the carrier.
- Or extraction of only some of the files.
- How important is the carrier itself? We need some information about it; it may hold traces of deleted data and we may want to keep them. They made disk images of everything possible: hybrid DVDs, audio discs that also held data, etc.
- Which disk image format should they use? Not one disk image format for all data; a specific disk image format for each type.
- They did it with robots (disk copying robots). In some cases large-scale disk copying robots could not be used: they are good at producing CDs but not at ripping data from CDs.
- They built their own application with disk stacks and some smaller robots; they tried LIFO and FIFO and in the end used FIFO, because LIFO had problems with the CD lifter.
At the KB they thought through a fairly complex workflow, how to describe it, etc.
Each robot had its own PC.
They had problems with a number of things; see the presentation.
They did not find good software for managing the images, only command-line tools, and non-technical staff would make a lot of mistakes with those.
It takes a lot of people before the content gets online.
The staff must be well trained and flexible, but also able to do tedious jobs: systematic, patient.
NOTE: important for the NK, where the transfer of data from disks will also have to be dealt with, and already has been.
KEEP project, Antonio Ciuffreda: towards integrated migration environment
- Disk transfer tool: converts a disk into an image file. The result also carries further metadata about the file system, the MD5 of the file, etc.
- KEEP is producing a Transfer Tool Framework.
- Magnetic media: disk transfer tools for floppy disks, both commercial and open source.
- Disk2FDI: commercial DOS tool; very accurate floppy disk image; it takes 1 hour and the whole image is then very large, ten times larger than the floppy disk itself. About 2260 disks were tested; they tested emulation.
- Catweasel: commercial tool; it is a PCI card, runs on Linux and Win XP, has a GUI. High error rate but faster; image file quality was low.
- NibTools: free tool; G64 and D64; covers only the C64; DOS, Win, Linux, but command line only. It needs a Commodore disk drive and special cables. They tested a few disks; about half of them then did not work in the emulator.
- Optical media: they used 5 transfer tools, all on the same set of CDs, DVDs and games.
1. Alcohol 120%: commercial; can bypass DRM etc.; supports Windows systems. 12 of 13 worked; 2 MB per second.
2. Daemon Tools: commercial; several image file types (ISO, MDS, MDF); supports Windows; three of 13 did not work.
3. CloneCD: commercial; uses IMG or ISO; bypasses SafeDisc 3 protection; supports Windows; has a GUI; 11 of 13 were OK.
4. BlindWrite: commercial; supports DVD, Blu-ray, WM, Xbox and other special discs. Generates ISO and some proprietary formats; one of the ISO images did not work; read speed: not stated.
5. ImgBurn: does not read subchannel information into the image file (so, for example, a film cannot be seeked); it is open source; generates DVD, BIN, CUE, IMG; Win and Linux; 4 did not work; it is fast.
Conclusions
Magnetic media: there is no difference in performance between commercial and non-commercial tools; Disk2FDI is accurate but very slow. KEEP will use NibTools.
Optical media: copy protection has to be taken into account; the tools perform similarly; there will always be errors in the images, between 10 and 30 percent; BlindWrite can handle game discs, Xbox, etc. KEEP will use ImgBurn because it is open source.
For complex images BlindWrite is better.
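What a disk transfer tool records alongside the image (size, MD5, basic provenance) can be sketched as below. This is not the KEEP Transfer Tool Framework; it simply copies a readable source (a device node or an existing file) byte for byte and writes a JSON sidecar, and the sidecar layout and function name are assumptions.

```python
import hashlib
import json


def make_image(source_path, image_path, chunk_size=1024 * 1024):
    """Copy `source_path` to `image_path` and record size and MD5.

    Returns the metadata record, which is also written next to the image
    as `<image_path>.json` (a hypothetical sidecar convention).
    """
    md5 = hashlib.md5()
    size = 0
    with open(source_path, "rb") as src, open(image_path, "wb") as dst:
        while True:
            chunk = src.read(chunk_size)
            if not chunk:
                break
            dst.write(chunk)
            md5.update(chunk)
            size += len(chunk)
    record = {
        "source": source_path,
        "image": image_path,
        "size_bytes": size,
        "md5": md5.hexdigest(),
    }
    with open(image_path + ".json", "w", encoding="utf-8") as fh:
        json.dump(record, fh, indent=2)
    return record
```

A plain byte copy like this does not capture subchannel data or work around copy protection, which is exactly where the specialised tools compared above differ.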
Benefit for the NK
Consider whether the NK should really run a project to migrate the content of CDs and DVDs to online media. The presentations give concrete experience with robotic processing and show the problems they had with designing the workflow, choosing the type of ISO image, etc.
BnF web archiving
They have three layers:
Harvest definition (collection)
Harvest instance (crawling metadata)
ARC files
A collection is, for example, the selective harvest "Elections 2000"; below it are the individual harvest instances, and below those the ARC files.
They collect the logs, the config and the report.
These are also stored in an ARC file: special ARC metadata for each crawling instance.
PREMIS
Object, agent, event
Objects: 1. ARC files and metadata ARC files; 2. harvest instances
Harvest event in PREMIS: event "creation of content files"
Events: reports as extensions of the event (host report and harvest report)
Agents: administrator, software, institutions and organizations that perform the harvest
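The object / agent / event structure described above can be sketched with plain data classes. This is neither the PREMIS schema nor the BnF implementation; every class, field and identifier below is a hypothetical simplification meant only to show how a harvest instance, its ARC files, the harvest event and the agents relate.

```python
from dataclasses import dataclass, field


@dataclass
class Agent:
    identifier: str
    agent_type: str  # e.g. "administrator", "software", "organization"
    name: str


@dataclass
class Event:
    identifier: str
    event_type: str                              # e.g. "creation of content files"
    agent_ids: list = field(default_factory=list)
    reports: dict = field(default_factory=dict)  # e.g. host and harvest reports


@dataclass
class HarvestInstance:
    """Object level 2: one crawl; its ARC files are objects of level 1."""
    identifier: str
    collection: str
    arc_files: list = field(default_factory=list)
    event_ids: list = field(default_factory=list)


def describe_harvest(collection, instance_id, arc_names, agents, reports):
    """Build the linked records for one harvest instance."""
    event = Event(
        identifier=f"{instance_id}-event",
        event_type="creation of content files",
        agent_ids=[a.identifier for a in agents],
        reports=dict(reports),
    )
    instance = HarvestInstance(
        identifier=instance_id,
        collection=collection,
        arc_files=list(arc_names),
        event_ids=[event.identifier],
    )
    return instance, event
```

Keeping the reports on the event rather than on each ARC file mirrors the note that host and harvest reports are treated as extensions of the harvest event.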
containerMD
http://bibnum.bnf.fr/containerMD-v1/documentation/containerMD-v1.html
Special metadata for material from the web archive.
http://bibnum.bnf.fr/containerMD-v1
Different SLAs for different types of material and for data from different harvests. Shared repository: they expect various benefits; skills for the different formats do not have to be duplicated within the institution.
A JHOVE2 module for WARC should also exist next year.
Memento: meta search engine
Benefit for the NK
Their web archiving model could be used at the NK.
Cost models: Danish national library + TU Wien
- Stephan Strodl, TU Vienna: they have their own cost model, but apparently only for small-scale automated preservation action costs.
- The Danish national library built its own model, which is meant to be universal and usable anywhere.
- They measured the cost of submission according to the PAIMAS standard.
- When calculating costs they use OAIS and PAIMAS: they map activities onto these models and estimate the process costs accordingly.
- costmodelfordigitalpreservation.dk
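The principle of mapping activities onto OAIS functions and then pricing them can be shown in a few lines. This is not the Danish cost model; the activity names, the hours and the flat hourly rate are made-up illustrations.

```python
from collections import defaultdict


def cost_by_function(activities, hourly_rate):
    """Sum estimated cost per OAIS functional entity.

    `activities` is an iterable of (activity, oais_function, hours).
    A single flat `hourly_rate` is an assumption made for simplicity.
    """
    totals = defaultdict(float)
    for _activity, function, hours in activities:
        totals[function] += hours * hourly_rate
    return dict(totals)


if __name__ == "__main__":
    # Hypothetical activities mapped to OAIS functional entities.
    activities = [
        ("negotiate submission agreement", "Ingest", 40),
        ("validate submitted packages", "Ingest", 60),
        ("format migration", "Preservation Planning", 120),
        ("media refresh", "Archival Storage", 30),
    ]
    for function, cost in sorted(cost_by_function(activities, 50.0).items()):
        print(f"{function}: {cost:.0f}")
```

Once activities are mapped this way, costs can be compared between institutions function by function, which is what makes a model usable outside the place where it was developed.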
Benefit for the NK
Relevant to project 0136, which dealt with the options for estimating the costs of long-term storage.
Meet RODA, a Full-Fledged Digital Repository for Long-Term Preservation
- Originally a project of the Portuguese national archive, which we have been following for several years. RODA is now supported, and partly also further developed, by an independent company. So far RODA supports only the archival metadata format (EAD), but further development should also include library formats.
- RODA is now part of the SCAPE project, within which the system can be developed further and scaled for use in mass production.
- http://redmine.keep.pt/projects/roda-public
Benefit for the NK
Follow further development, possibly also for the INCAD + KNAV projects; for developing LTP for smaller institutions this could be an interesting alternative in the future.
Report on a foreign business trip
Name of the participant: Mgr. Andrea Fojtů (AF)
Workplace according to the organizational structure: 15 Department of Long-Term Preservation of Digital Data (ODODD)
Workplace of assignment: 152 Department of Digital Repository Content Management
Reason for the trip: participation in the conference Aligning National Approaches to Digital Preservation
Place (city): Tallinn
Place (country): Estonia
Dates (from-to): 22-26 May 2011
Detailed schedule:
Monday 23 May: conference registration; guided tour of the National Library; Keynote Address by Laura Campbell, Library of Congress, USA; Panel 1: Technical Alignment; Panel 2: Organizational Alignment
Tuesday 24 May: Keynote Address by Gunnar Sahlin, National Library of Sweden; Panel 3: Standards Alignment; Panel 4: Legal Alignment; Breakout Sessions for panels 3 & 4
Wednesday 25 May: Panel 5: Education Alignment; Panel 6: Economic Alignment; Breakout Sessions for panels 5 & 6; Synthesis; Closing remarks
Fellow travellers from the NK: PhDr. Bohdana Stoklasová (BS), Ing. Tomáš Svoboda (TS)
Funding: IOP NDK
Aims of the trip: attendance at a conference with international participation; making contacts in the field of long-term preservation of digital documents and electronic legal deposit; a more detailed insight into the issues of long-term preservation of digital documents, (especially) in national libraries
Fulfilment of the aims (specifically): The conclusions of the conference Aligning National Approaches to Digital Preservation largely mirror the conclusions of the workshop The Future of the Past: Shaping new visions for EU research in digital preservation (report available at http://cordis.europa.eu/fp7/ict/telearn-digicult/future-of-the-past_en.pdf), for example regarding the missing economic model for the commercial sector, which would treat long-term preservation as an inseparable part of all its processes. Contact was established with staff of the national libraries of Estonia and Finland (staff working on web archiving and on long-term preservation in general).
Programme and further details: The main aim of the conference was to align national approaches to long-term preservation of digital documents across all areas, from technical, organizational and educational to standardization, economic and financial.
Materials brought back: conference programme, leaflets of the exhibiting companies (Tessella, Equella, Guardtime) + other materials, notes
Date of submission of the report: 8 June 2011
Signature of the submitter:
Signature of the superior:
Posted on the Intranet:
Accepted by the International Department:
Appendix to this report: conference notes in English
Appendix: Conference notes in English
Exploring What We Can Do Together: Strategic Alliance for International Collaboration, Laura Campbell
- 185 digital preservation partners in more than 25 countries (education, research, LAM)
- strategic goals: National Content Stewardship Network (national digital collection, technical architecture, public policy, outreach)
- NDIIPP Content Domain Map: a mind map of geospatial, audiovisual, image & text, and web content
- then & now: cognitive surplus vs. digital libraries / digital preservation
- solution framework: actively working together, special interest groups, establishing a common index, international digital collection (freely available)
PANEL 1: Technical Alignment (The role of testing), Prof. Dr. Michael Seadle (Panel Head)
to collaborate on requiring and implementing rigorous and independent tests
DNB Contribution to the Tallinn Alignment, Sabine Schrimpf
- key theme is infrastructure: the network of hardware and software that permits the operation of application software
- the question of interoperability is crucial (standards, technical specifications)
- SW elements / components of the DP infrastructure were compared to the pallets used on railroads
o Source: PARSE.Insight Roadmap 2020
- persistent ID resolvers, certification process
- kopal (ingest: koLibRI)
- nestor (German Network of Expertise)
- DP4Lib: Digital Preservation as a Service; reduce dependency between components
o redundant storage at different locations
o koLibRI modules
- LuKII: set up as an economical LOCKSS network in Germany
- SHAMAN
- APARSEN: wants to bring coherence and cohesion to digital preservation research
o trends in DP research projects
o modular DP systems
o distributed as SOA
o elimination of technology dependencies
EDINA: The UK LOCKSS Alliance, Adam Rusbridge
- EDINA offers underlying technical support & coordination
- threats to digital stewardship:
o failure (media, HW, SW, network, format obsolescence, natural disaster, economic/organizational failure)
o attack (insider/outsider)
o operator error
- source: Requirement to Digital Preservation
- the projects PEPRS and PECAN help identify coverage and requirements for DP
Public testing, Michael Seadle
- traditional physical archiving relies heavily on trusted institutions
- distrust, not trust, needs to be the basis of digital archiving; testing therefore plays a key role
- goals of testing: demonstrate functionality, reveal weaknesses, provide data for planning improvements
- key issues for testing: integrity, authenticity (can the origin or genuineness be shown?), usability (can migration/emulation be demonstrated?), access, financial integrity
- DrWho (drwho1com)
- bit-stream testing is the most important; authenticity and usability may be impaired
o the type of storage media, the number of copies and the frequency of checking and replacement get us to the relevant results
o no reliable metrics exist, however (what is an acceptable loss, etc.)
- without well-documented, peer-reviewed, publicly available test results, librarians are buying archiving systems on faith
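A bit-stream test over several copies can be as simple as comparing digests and flagging the copies that disagree with the majority. The sketch below is an illustration of that idea only; it is not taken from the talk or from any particular archiving system, and the function name is hypothetical.

```python
import hashlib
from collections import Counter


def audit_copies(copies):
    """Compare the copies of one object held at different locations.

    `copies` maps a location name to the bytes stored there.
    Returns (majority_digest, damaged_locations). If no digest is held by
    more than half of the copies, returns (None, all locations), because
    the audit cannot tell which copy is correct.
    """
    digests = {loc: hashlib.sha256(data).hexdigest()
               for loc, data in copies.items()}
    majority, votes = Counter(digests.values()).most_common(1)[0]
    if votes <= len(copies) / 2:
        return None, sorted(digests)
    damaged = sorted(loc for loc, d in digests.items() if d != majority)
    return majority, damaged
```

With three copies one damaged copy can be identified and replaced; with two copies a mismatch can only be detected, which is one reason the number of copies and the checking frequency matter for the results.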
Presentation without a title, Andy Rauber
- evaluation vs. testing vs. benchmarking
- DP testing is evaluation rather than testing, and far from benchmarking (a few tests, but nowhere near a definition of benchmarks)
o existing evaluations are not repeatable
o focus on the simple things
o building the frameworks before having clear test scenarios
- necessary to move towards comparative benchmarking
- what is needed: a commitment to a culture of benchmarking and comparative evaluation; an understanding of what we want to benchmark; benchmark data + ground truth; measurement scales and measures that remain constant; knowledge bases to collect these
Organizing digital preservation on an international level, Michelle Gallinger (NDIIPP)
- focus on a national DP agenda
- community driven, action oriented (National Digital Stewardship Alliance)
o present a distributed national digital collection for the benefit of citizens
The European Research Arena, David Giaretta
- technologies: GEANT, EGEE/EGI
- EU research projects: TIMBUS, BLOG4EVER, SCAPE, ENSURE, APARSEN, ARCOMEM, WF4EVER
o SCIDIP-ES (2011-2014)
- Alliance for Permanent Access (APA): formed as a legal entity 3 years ago
o opportunities for networking
- ISO 16363: Audit and Certification of Trustworthy Digital Repositories
- ISO 16919: Requirements for Bodies Providing Audit and Certification
Observation from the MetaArchive Cooperative program, Martin Halbert
- distributed DP programs are different from other programs
o replication of content, distribution of the replicated copies to distinct geographical locations, and a network organization to connect the replicated copies
- MetaArchive: established in 2003, funded by NDIIPP
o seeks to foster broader awareness of digital preservation issues
- IIPC: members are all institutions that focus on WA
o 39 members: national and university libraries + other organizations (Internet Archive)
o ISO standard WARC format for web archives + Heritrix and NutchWAX
o growing membership (Africa & South America)
DAY 2
Keynote address: International and National Collaboration in the digital age, Gunnar Sahlin
- 2012: a new law for e-deposit
- Samsök (search together) in 2005 (upgrade, new system)
- Swepub and long-term preservation
- Consortium of the Swedish research libraries for licensing e-journals and databases (ICOLC)
- Open Access and e-publishing (all universities have their repositories for e-pubs)
- NL as aggregator for Europeana: TEL, Apres, Athena, EU Screen
o common system for the preservation of digital materials
o common search portal for materials from the Swedish National Library and the Swedish National Archive
Raivo Ruusalepp
- standards: RAC, DSA, CIDOC (CRM), PAIMAS, ISA (DG), DDI
- use of information security standards for digital preservation
- information security: administrative and technical (physical = data security vs. IT = communication)
- a company implemented security measurement with typical cyber-crime scenarios
- survey of security:
o provision for information security in national legislation and development plans (12 of the respondents: ISO 27000 series; only 2: formal audits; the rest are looking into it; ½ of the respondents do not use standards or formal measurements to control information security)
o IT & disaster plan: 65 (data recovery from the off-site location tested: 0)
- alignment:
o better use of community standards for information security and preservation
o agreement on security requirements
Standards based approach to preservation planning, Matthew Woollard
- ISO 27001: very expensive implementation, 100 000 pounds
- Basic: Data Seal of Approval Guidelines (helps you understand your business better)
- Audit and Certification of Trustworthy Digital Repositories
- Memorandum of Understanding to Create a European Framework
- ISO 16363 external; DSA or ISO 16363 self-audit
Best Practices & Standards, Bram van der Werf
- self-assessment: trust
- audits/certification: trust
- ISO 30300 (draft): Record Management
PANEL 4 Legal Alignment
Legal deposit & Web Archiving, Adrienne Muir
- legal deposit provisions: purpose, scope, deposit mechanisms, roles & responsibilities & liability, access provisions, sanctions
- implementation: definitions, scope (offline/online, freely available/pay wall), technology neutral, incremental
- legal deposit vs. voluntary (interim/hybrid approach, model agreements and licences, flexibility)
- other legal issues: intellectual property rights (preservation, access), unlawful material, privacy/data protection
- voluntary approaches have disadvantages but may be necessary and can be useful
Breakout Session
- standards bring alignment by themselves, but only if you use them to the full, not halfway
- depends on the community (users): enforceable or voluntary compliance (standards)
- next steps for standards alignment:
o corpora as benchmarks
o export/import completeness
o educational standards
o validation tools
o accredited training courses to accredit auditors
o framework standards
DAY 3
PANEL 5: Educational Alignment
- key elements of the DCC Curation Lifecycle Model
- Framework for the Education Alignment (USA: grad programs for digital preservation, new models for grad programs, internship programs (diverse knowledge), workshops)
o sharing tools
o national programs
- related issues to digital preservation:
o nature of costs and business models
o strategies for selection & appraisal
o ground roles and responsibilities
o effectiveness and demand for services
- focus of the panel: factors influencing the actual sustainability of a digital archive
- 2 considerations: collaboration + user demand
- challenges + gaps: span national boundaries, public + private funding, education, exportation, DP certification, competition, funding gaps, policy, selection criteria, roles & responsibilities, standards
- Magazzini Digitali: e-legal deposit in Italy
- PADI: a failed project (discontinued in 2010)
Presentation without a title (Neil Grindley)
- how much does it cost to manage information?
- what institutional financial strategies are required to facilitate effective preservation?
- what general economic frameworks are required to enable information to persist and be accessible?
- JISC 2010 Infrastructure for Education and Research Programme
- archival storage and preservation activities constitute a very small proportion of the overall costs (1531 access 55 outreach acquisition ingest)
o approx. 333 Euros for a set of 1000 records
- KRDS2, p. 83: future tool development supporting automation of ingest

Sustainable Preservation in North America: ADPNet & Friends (Aaron Trehub)
- solution: distributed digital preservation (in at least 3 copies vs LOCKSS 6 copies)
- DDP + LOCKSS = PLN; open SW developed at Stanford
- MetaArchive, COPPUL (Canada), ADPNet (www.adpn.org)
CO-FINANCED BY THE EU STRUCTURAL FUNDS (EUROPEAN REGIONAL DEVELOPMENT FUND) THROUGH THE IOP
Received by the International Department
Seamus Ross: Digital curation and preservation
- preserving data sets: use of statistical databases and research data vs protection of textual information
- personal data: no longer just photos in a box (Flickr etc.)
- banks: a lot of personal data, usable in the future by historians, so it must be preserved; institutions do this as well
- see mckinsey.com, big data full report (PDF)
- so why do DP (slide 16): future generations expect it; so that historians and researchers have some sources; a record of the present for the future; information ecosystem to enable storytelling
- the emphasis shifts from protecting textual information to protecting complex databases
A capability model for DP (Ch. Becker et al.)
- research within the SHAMAN and SCAPE projects
- SoS: systems of systems
- 3 kinds of systems (UML)
- DPS with DP as a functional requirement
- SoS: a business system, a system within a system; the data then flows into the DPS
- DPS where DP is not a functional requirement but which does it anyway (DP-ready system): a business system with DP functionality
- but how to get DP into enterprise systems? A model for implementing DP in any system, within the SHAMAN project: capability-based reference architecture
- governance, business and technical (operation) capabilities: a basis for decision-making and for assessing the current state
- capability maturity model (CMM): processes for assessment and improvement, as in SW development
Olivier Rouchon: Certification and quality at CINES
- they store theses, digitized material, multimedia, documents, scientific data sets
- a data centre for the whole of France
- they have experts on formats and XML; 11 people
- 15 TB of data
- certification: national law; CINES is the national centre for DP of theses; they have a department, staff and funding for it; procedure and preparations below
- preparation for certification: testing DRAMBORA, DSA, TRAC, ISO 16363 and ISO 16919
- step: 2009 DRAMBORA audit; risks are reviewed twice a year, along with progress in resolving them
- step: formalization of business processes; 14 processes according to ISO 9001
- management, operational and support processes (presented at iPRES 2010)
- 2009: external pre-audit; 2 people, 19 man-days; based on all available standards, carried out through a review of documentation and interviews
- 2010: SIAF audit; 4 months; performed by the National Archives of France for every archive that stores public data; the audit is repeated every 3 years; the report ran to 800 pages
- 2010: Data Seal of Approval; part of the EU framework for audit and certification of trusted repositories (MoU between the three certification initiatives)
- 2011: within the APARSEN project they also did ISO 16363; they went through that audit together with DANS and UKDA
- first internal, January and April 2011 (60 man-days), then external, 12 experts (KB, BL, NASA etc.), in June 2011 (3 days)
MoU: DSA > ISO 16363 as a second step (internal) > ISO 16363 extended
An audit gets faster the more of them you do, i.e. if it is done regularly it is not so time-consuming.

The National Library of New Zealand passed TRAC certification in autumn 2011.
Andreas Rauber: Impact of preservation actions on repositories
- what happens to the repository itself
- repository simulation with RepoSim, for analysis and for testing migration: what happens when files grow in size, what if the repository holds more types of formats, etc.
- RepoSim: a flexible simulator that handles irregular patterns
- so far an internal version (Hibernate, Java, MySQL)
- you can specify which formats the repository accepts, their description, ingest settings, hypothetical tools (mainly for migration), and rules for preservation actions (migration to which format, which version, which files, how many times; rules + filters)
- a virtual migration can be run; it produces graphs that show how long it will take, etc.
- what to migrate, how, to what and over what period: it runs virtually and we see the result
- good for IT and HW planning
- good for planning different scenarios, comparison with expected development, planning HW expansion and investments
- still to be added: the option to define deletion policies, reports, etc.
José Barateiro: Risk assessment in DP of e-science data and processes
- DP as risk management
- ISO 31000: definition of risk management, similar to DRAMBORA
- there are many standards for risk management
- the ISO 31000 methodology broken down into individual steps
- TIMBUS project (http://timbusproject.net); one of the partners is SAP (Germany)
Mad talks
- open source SW for LTP: RODA is back and is being developed within the SCAPE project; new features, plans for further development and for building a user community
- 4 posters on emulation: emulation within KEEP, emulation for reading rooms in libraries, OPF ecosystem registry
- TOTEM: a metadata standard for describing the technical environment for emulation
ANDS (Ross Wilkinson)
- data centres in Australia, at least 3, for different areas of life
- ANDS has existed for almost 3 years; funded by the Australian government
- a huge amount of data will never be used or read by a human, only mined by automated processes
- research data must be stored and protected, because it may no longer be possible to recreate them; they must be kept so that they can be reused, new conclusions can be drawn from them, and they are available to researchers
- this must be done in cooperation; it cannot be done by a single institution alone
- who deals with the storage of scientific data in the Czech Republic? Academy of Sciences, CESNET
- similar data centres also exist in the United Kingdom
Rob Sharpe: Considerations for High Throughput Digital Preservation
Presentation by Tessella on their performance testing of ingest into SDB at FamilySearch.
- SDB has been developed since 2002, when the first customer was the National Archives (UK)
- new customer: UK Parliament
- test with FamilySearch
- 20 TB ingested per day; scanned materials; workflow with antivirus, characterization (PRONOM, JHOVE), etc.
- 1 package is roughly 1 GB; 20 thousand packages per day
- 2 Dell PowerEdge R710 servers, together costing at most 20,000 pounds
- the limiting factor turned out to be the read speed of the disks holding the data at the start of ingest; they therefore needed 130 parallel disks (50 thousand pounds)
- writing to tape was also slow; they therefore needed 8 parallel tape writes (30 thousand pounds)
- storage costs 100 pounds per TB
- 7.3 PB per year
- conclusion: writing and reading are slow; tools such as JHOVE and PRONOM are fast enough; storage costs also proved to be high

For ingesting data from the FamilySearch project they needed to sustain a throughput of 20 TB of data per day while keeping adequate data-processing procedures as required by OAIS and by the client. The aim of the project was to identify the bottlenecks in ingesting large volumes of data.

Processes such as generating or checking hashes, identifying formats and extracting technical metadata usually require a fast storage system when volumes are large. In the FamilySearch project they want to ingest into SDB (content acquisition, content preparation, ingest, fixity check, content/metadata integrity check, characterization, i.e. format identification and validation and extraction of technical metadata) at up to 700 MB per second.

They addressed how to parallelize such a massive workflow efficiently while minimizing costs. According to their findings, parallelization makes it possible to work around performance problems of tools such as DROID and JHOVE; overall, contrary to their expectations, software performance was not a problem. The bigger problems are in the HW, which must be able to write fast enough.

That is, the bottleneck was in the HW and in moving data from place to place rather than in the performance of the digital preservation tools.
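The figures quoted above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming decimal units (1 TB = 10^12 bytes, 1 TB = 1000 GB) and a 365-day year; the function names are illustrative and do not come from the Tessella presentation.

```python
# Back-of-the-envelope check of the ingest figures quoted in the notes.
# Assumptions (not from the presentation): decimal units, 365-day year.

SECONDS_PER_DAY = 24 * 60 * 60


def sustained_mb_per_s(tb_per_day):
    """Average throughput in MB/s needed to ingest tb_per_day terabytes daily."""
    return tb_per_day * 10**12 / SECONDS_PER_DAY / 10**6


def packages_per_day(tb_per_day, package_gb):
    """Number of packages of package_gb gigabytes that make up the daily volume."""
    return tb_per_day * 1000 / package_gb


def yearly_volume_pb(tb_per_day, days=365):
    """Volume accumulated in a year, in petabytes."""
    return tb_per_day * days / 1000


def yearly_storage_cost(tb_per_day, cost_per_tb, days=365):
    """Cost of storing one year of ingest at a flat price per terabyte."""
    return tb_per_day * days * cost_per_tb


if __name__ == "__main__":
    daily_tb = 20  # 20 TB per day, as quoted in the notes
    print("sustained MB/s:", round(sustained_mb_per_s(daily_tb), 1))
    print("packages/day:", packages_per_day(daily_tb, 1))
    print("PB per year:", yearly_volume_pb(daily_tb))
    print("pounds per year at 100/TB:", yearly_storage_cost(daily_tb, 100))
```

Under these assumptions, 20 TB per day corresponds to a sustained average of roughly 231 MB/s, which sits well below the 700 MB/s peak quoted for the workflow, and a year of ingest reproduces the 7.3 PB figure.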
Benefit for the NK
No need to worry about the performance of software such as DROID or JHOVE.
Ross King: Evolving domains, problems and solutions for LT DP
- information on SCAPE and similar projects
- programme: http://cordis.europa.eu/fp7/ict/telearn-digicult/report-research-digital-preservation_en.pdf (Stephan Strodl, Petar Petrov and Andreas Rauber, all Vienna University of Technology, Austria). A nice timeline of preservation projects; a white paper on the past of European DP.
- funding spent on DP research is growing steadily; projects and funding alone will not solve anything
- ARCOMEM: archiving of web archives; socially driven web preservation model
- social web analysis
- archive enrichment
- ENSURE: evaluation between cost and value; automation of the preservation cycle; testbeds: healthcare, clinical trials, financial services
- SCAPE
- preservation planning and action workflows: how to make them scalable
- building an infrastructure for scalable preservation actions
- development of a policy-based preservation planning tool with automated preservation watch
- 3 testbeds: WA, large-scale repositories, research data sets
- all projects will produce prototype SW
- digital lifecycle approach
- preservation planning plays a role in all of these projects, together with virtualization
- slide with trends in DP over recent years
- Research on Digital Preservation within projects co-funded by the European Union in the ICT
- ENSURE
- SCAPE
- Wf4Ever (http://www.wf4ever-project.org/about)
- TIMBUS: SW is not enough; it focuses on the context of organizations; LTP is not only about objects but also about services etc.
- TOTEM

Benefit for the NK

Follow projects in the field of long-term preservation of digital data. Recent EU projects such as SCAPE will speed up the development of concrete tools for long-term preservation of digital data.
Record keeping in temporary command settings (Erik Borglund)
- preservation of documentation of crisis situations arising from the work of the police etc.
- how to capture context: flipcharts, videos and minutes can be preserved; context is not a problem for analogue documents, the problem lies with digital material and conversations
- the national archive should take care of this, but it accepts only paper documents or, e.g., photos from the scene
- the question: is archiving the file material the same as archiving the course of the proceedings in digital form?
Web archiving session

BnF
- 200 TB of web-archived data
- 15 million ARC files
- they must characterize and validate them, which is time-consuming
- stored in a shared repository
- SPAR (the LTP system of the French national library) has a capacity of 16 PB
- they use JHOVE2 for characterization and are building a module for ARC files
- they do not want to characterize and validate the content of the ARCs, only identify formats
- PREMIS in METS would be too long, so they will record only metadata at the information package (AIP) level that are the same for the whole package; that is, a property is expressed once and then only the information about which files match it is added, instead of repeating the same information for every file
- they created a special metadata format
- i.e. they can ask the LTP system "give me all information packages that contain format XY" and so on, without having to index the metadata of the contents, which would take a long time; they take the same approach for digitized books
- different DP policies and levels of validation for different types of WA data: complete harvests vs thematic harvests
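The package-level approach described above (state a property once, then list the files it applies to) can be illustrated with a small sketch. This is not BnF's metadata format or the SPAR API; the data layout, function names and sample values are all hypothetical.

```python
from collections import defaultdict


def group_by_property(file_properties):
    """Turn per-file properties into a package-level record.

    Input:  {file_name: {property_name: value}}
    Output: {(property_name, value): sorted list of file names}
    Each distinct property value is stated once, followed by the files it covers.
    """
    index = defaultdict(list)
    for file_name, props in file_properties.items():
        for key, value in props.items():
            index[(key, value)].append(file_name)
    return {pair: sorted(names) for pair, names in index.items()}


def packages_containing(packages, key, value):
    """Answer 'which packages contain format XY' from package-level records only."""
    return sorted(pid for pid, record in packages.items() if (key, value) in record)


if __name__ == "__main__":
    # Hypothetical package contents, for illustration only.
    aip_1 = group_by_property({
        "page1": {"format": "text/html"},
        "page2": {"format": "text/html"},
        "logo": {"format": "image/png"},
    })
    aip_2 = group_by_property({"photo": {"format": "image/png"}})
    print(aip_1[("format", "text/html")])
    print(packages_containing({"aip-1": aip_1, "aip-2": aip_2}, "format", "image/png"))
```

The query only touches one small record per package, which is the point made in the notes: the repository can answer format questions without indexing the metadata of every contained file.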
NL NZ
- 2 harvests, 20 TB in total
- they are dealing with metadata: how much metadata is a lot and how much is too little
- library policy says that as much metadata as possible must be stored, but that would be a problem in terms of metadata size
- for the selective web harvest they have a finished WCT workflow; everything is catalogued

IA
- 16 billion URLs
- the oldest from 1996
- growth is 3 TB per day, 1 PB per year
Euan Cochrane, Dirk von Suchodoletz: Replicating Installed Application and Information Environments onto Emulated or Virtualized Hardware
- capturing and preserving a complete environment on emulated HW
- e.g. take the desktop environment of the prime minister and store it in the archive
- problems with rendering
- computer forensics
- an option for preserving scientific data and records
- the whole thing is about how to replicate an HDD and run the environment stored on it in a virtual environment
- solution
- they pulled the HDDs out of several old PCs > identify the HW requirements (analysis of the HDD > automatic estimate of requirements; this is part of every PC environment) > select emulation/virtualization SW (tool registry such as TOTEM from the KEEP project) > convert the HDD into a disk image suitable for emulation > try to boot the disk image on the emulated HW > add drivers
- problems with licences, personal data protection, authenticity (20% of things change, colours etc.)
- QEMU, SPARC processor emulator
Klaus Rechert: Remote Emulation for Migration Services in a Distributed Preservation Framework
- use of emulation as a tool for migration
- migration tools for certain formats are often not available
- we insert the digital object into an emulated environment (virtual machine); we then see it in the environment of the emulated system, can open it in the original or another suitable application, save it in a different format, and store it back in the virtual machine
Bram Lohman: Emulation as a Business Solution: the Emulation Framework (KEEP project)
- emulation framework: 7 emulators, 6 platforms (x86, Amiga, etc.), 23 file formats
- a solution for managing emulation tools and setting up emulation processes
- an environment that contains emulators; if we load an application or a file into it, it should run as in the original environment
- the environment also contains a tool that shows, for a given file, what format it is and what environment is needed to run it, based on PRONOM; the environment can then be prepared straight away and the file run in it
- it loads the SW image from an application database at OPF, which is being built
Geoffrey Brown: Developing Virtual CD-ROM Collections: The Voyager Company Publications
- publications of one particular publisher on CD, interactive applications for Mac; of 200 titles published, only about 50 are available now
- emulation on today's systems
- HDD snapshot directly in the emulator, i.e. one click and very fast
- SheepShaver emulator
Evaluation of a large Danish migration project
Before 1998 they had no formats prescribed by law.
Between 2005 and 2008 they introduced standards.
The evaluation concerns the prescribed standards and migration to them in the national archive.
The evaluation was done for the funder.
Between 2005 and 2008 they spent 30 person-years on migration; they had 10-15 people on it; 190 thousand USD was invested; total costs 26 million USD.

In real terms it is not much data: they migrated about 1777 GB.

Various parts of the archive: tapes, population data, data on CD-R, registries, and electronically populated data.
They could not read all the files, especially on tapes; there were 5 different types of tape.
Some had to be rescued at great expense.
Total cost of producing the preservation standard: 10 man-years, 12 thousand USD, including the manual and implementation recommendations.

Pilot: project planning and management, and verifying the information packages.

The aim of the pilot was to obtain a better budget and project plan.

Some problematic data in old formats, such as old databases, need smart people doing repetitive, long-running work; they needed good knowledge management to make this efficient.

Migration method: they wrote requirements for the tool and a description of how manual migration should be done.

Data preparation (restructuring data and registering IP metadata) and preparation of documentation for the migrated IPs.

Software development: in-house development.

They needed 50 person-years for 1

Conclusions

Migration of standard data is cheaper, migration from some standard tapes is cheaper, etc.

Most of the costs (80%) went on non-standardized data, in building the migration software. Developing a tool for migrating heterogeneous or non-standard data is the most expensive part.

Lessons learned: the old data had not been analysed sufficiently.

Project management was loose; they lost money.

Knowledge management: a good description of old data and all their types, generations, locations, etc. does not exist at our institution and we will have difficulties with it; migration of old data at the NK will be problematic.
Angela Dappert: Robust migration workflow for offline media
- What is an archival object: nice slide. A CD is not an archival object; for them it is a hand-held carrier. A bit-stable object is better, since it can have a backup etc., and leads to an archival object with additional metadata (logical preservation).
- A CD is not searchable, cannot be replicated easily, has a large manual overhead, and its rendering technology becomes obsolete very quickly.
- Endangered Archives project: optical disks, CD-R, external HDs, tapes; 67 terabytes in total.
- The offline hand-held carriers in the Endangered Archives project were very variable; they contained data with DRM, under copyright, and with a range of such problems.
- Options they chose between for each data source:
- Disk image: a single file containing everything on the carrier.
- Or extraction of selected files only.
- How important is the carrier itself? We need some information about it; it may contain traces of deleted data, which we may want to keep. They made disk images of everything possible: hybrid DVDs, audio discs that also held data, etc.
- Which disk image should they use? Not a single disk image format for all data; a special disk image format for each type.
- They did it with robots (disk copying robots); sometimes large-scale disk copying robots could not be used: they are good at producing CDs but not at ripping data from CDs.
- They built their own application with disk stacks and some smaller robots; they tried LIFO and FIFO and ended up using FIFO, as LIFO had problems with the CD lifter.

At the KB they thought through a fairly complex workflow, how to describe it, etc.

Each robot had its own PC.

They had problems with a number of things; see the presentation.

They did not find good SW for managing images, only command-line tools, with which non-technical staff would make many mistakes.

It takes a lot of people before the content gets online.

Staff must be well trained and flexible, but also able to do tedious jobs: systematic, patient.

NOTE: important for the NK, where the transfer of data from discs will also need to be addressed and has already been addressed.
KEEP project, Antonio Ciuffreda: Towards an integrated migration environment

- Disk transfer tool: converts a disk into an image file. The image also carries further metadata about the file system, the MD5 of the file, etc.
- KEEP is producing a Transfer Tool Framework.
- Magnetic media: disk transfer tools for floppy disks, commercial and open source.
- Disk2FDI: commercial DOS tool; produces a very accurate image of the floppy disk, but takes 1 hour and the result is very large, ten times larger than the floppy disk itself. Tested on about 2260 disks; they tested emulation.
- Catweasel: commercial tool, a PCI card; runs on Linux and Win XP, has a GUI. High error rate but faster; image file quality was low.
- Nibtools: free tool; G64 and D64; covers only C64; DOS, Win, Linux, but command line only. Needs a Commodore disk drive and special cables. They tested a few disks; about half then did not work in the emulator.
- Optical media: they used 5 transfer tools, with the same CDs, DVDs and games for all of them.
1. Alcohol 120%: commercial; can bypass DRM etc.; supports Win systems. 12 of 13 worked; 2 MB per second.
2. Daemon Tools: commercial; several image file types (ISO, MDS, MDF); supports Win; 3 of 13 did not work.
3. CloneCD: commercial; uses IMG or ISO; bypasses SafeDisc 3 protection; supports Win; has a GUI; 11 of 13 were OK.
4. BlindWrite: commercial; supports DVD, Blu-ray, WM, Xbox and other special discs. Generates ISO and some proprietary image formats; one did not work; transfer speed
5. ImgBurn: does not read subchannel information into the image file (so a film cannot be seeked, etc.); it is open source; generates DVD, BIN, CUE, IMG; Win and Linux; 4 did not work; it is fast.

Conclusions

For magnetic media: the performance of commercial and non-commercial tools does not differ; Disk2FDI is accurate but very slow. KEEP will use NibTools.

Optical media: copy protection must be considered; the tools perform similarly; there will always be errors in the images, between 30 and 10 percent. BlindWrite can handle game discs (Xbox etc.). KEEP will use ImgBurn because it is open source.

For complex images BlindWrite is better.
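The idea of a transfer tool that records metadata such as an MD5 alongside the image file can be sketched as follows. This is only an illustration, not the KEEP Transfer Tool Framework: the record fields and the function name `image_record` are assumptions made for the example, and only Python's standard `hashlib` and `io` modules are used.

```python
import hashlib
import io


def image_record(name, stream, chunk_size=1024 * 1024):
    """Read a disk image from a binary stream and return a small metadata record.

    The stream is read in chunks so that large images do not have to fit in memory.
    The field names are illustrative, not a KEEP or PREMIS schema.
    """
    md5 = hashlib.md5()
    size = 0
    while True:
        chunk = stream.read(chunk_size)
        if not chunk:
            break
        md5.update(chunk)
        size += len(chunk)
    return {"file_name": name, "size_bytes": size, "md5": md5.hexdigest()}


if __name__ == "__main__":
    # An in-memory stand-in for a disk image; with a real file use open(path, "rb").
    fake_image = io.BytesIO(b"hello")
    print(image_record("floppy-0001.img", fake_image))
```

Storing the digest at transfer time gives a reference value against which later copies of the image can be checked.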
Benefit for the NK

Consider whether it would be appropriate for the NK to actually run a project to migrate the content of CDs and DVDs to online media. The speakers present concrete experience with robotic processing and show what problems they had in designing the workflow, choosing the type of ISO image, etc.
BnF: web archiving
They have three layers:

Harvest definition: collection

Harvest instance: crawling, metadata
ARC files

A collection is, for example, the selective "elections 2000" harvest; then come the individual harvest instances, and then the ARC files.
They collect logs, configuration and reports.
These are also stored in ARC: special ARC metadata for each crawl instance.

PREMIS

Object, agent, event

Objects: 1. ARC files and metadata ARCs 2. harvest instances

Harvest event in PREMIS: event = creation of content files

Events: reports as extensions of the event (host report and harvest report)

Agents: administrator, SW, institutions, organizations that perform the harvest

containerMD

http://bibnum.bnf.fr/containerMD-v1/documentation/containerMD-v1.html

special metadata for material from the web archive

http://bibnum.bnf.fr/containerMD-v1

Different SLAs for different types of material and for different data from different harvests; shared repository; they expect various benefits; skills for different formats need not be duplicated within the institution.

Next year a JHOVE2 module for WARC should also exist.
Memento: meta-search

Benefit for the NK

Their model of web archiving could be used at the NK.
Cost models: Danish national library + TU Wien
- Stephan Strodl, TU Wien: they have their own cost model, but apparently only for the cost of small-scale automated preservation actions
- The Danish national library built its own model, intended to be universal and usable anywhere
- They measured the cost of submission according to the PAIMAS standard
- When calculating costs they use OAIS and PAIMAS: they map activities onto these models and then estimate process costs accordingly
- costmodelfordigitalpreservation.dk

Benefit for the NK

Relevant to project 0136, which dealt with options for estimating the costs of long-term storage.
Meet RODA, a Full-Fledged Digital Repository for Long-Term Preservation

- Originally a project of the Portuguese national archive; we have been following it for several years. RODA is now supported, and partly further developed, by an independent company. So far RODA supports only an archival metadata format (EAD), but further development should include library formats as well.
- RODA is now part of the SCAPE project, where the system can be developed further and scaled for use in mass production.
- http://redmine.keep.pt/projects/roda-public

Benefit for the NK

Follow further development, possibly also for the INCAD + KNAV projects; for developing LTP for smaller institutions this could be an interesting alternative in the future.
Report on a business trip abroad

Name of traveller: Mgr. Andrea Fojtů (AF)
Workplace according to organizational structure: 1.5 Department of Long-term Preservation of Digital Data (ODODD)
Workplace assignment: 1.5.2 Digital Repository Content Management Department
Reason for trip: Participation in the conference Aligning National Approaches to Digital Preservation
Place (city): Tallinn
Place (country): Estonia
Date (from-to): 22-26 May 2011
Detailed schedule:
Monday 23 May: conference registration; guided tour of the National Library; Keynote Address by Laura Campbell, Library of Congress (USA); Panel 1: Technical Alignment; Panel 2: Organizational Alignment
Tuesday 24 May: Keynote Address by Gunnar Sahlin, National Library of Sweden; Panel 3: Standards Alignment; Panel 4: Legal Alignment; Breakout Sessions for panels 3 & 4
Wednesday 25 May: Panel 5: Education Alignment; Panel 6: Economic Alignment; Breakout Sessions for panels 5 & 6; Synthesis; Closing remarks
Fellow travellers from the NK: PhDr. Bohdana Stoklasová (BS), Ing. Tomáš Svoboda (TS)
Funding: IOP NDK
Trip objectives: Attendance at a conference with international participation; establishing contacts in the area of long-term preservation of digital documents and electronic legal deposit; a more detailed insight into long-term preservation of digital documents, (especially) in national libraries.
Fulfilment of trip objectives (specifically): The conclusions of the conference Aligning National Approaches to Digital Preservation largely repeat the conclusions of the workshop The Future of the Past: Shaping new visions for EU research in digital preservation (report available at http://cordis.europa.eu/fp7/ict/telearn-digicult/future-of-the-past_en.pdf), e.g. regarding the missing economic model for the commercial sector, which would treat long-term preservation as an integral part of all its processes. Contact was established with staff of the national libraries of Estonia and Finland (staff working on web archiving and on long-term preservation in general).
Programme and further details: The main aim of the conference was to align national approaches to long-term preservation of digital documents across all areas, from technical, organizational and educational through to standardization, economic and financial.
Materials brought back: conference programme, leaflets of exhibiting companies (Tessella, Equella, Guardtime) + other materials, notes
Date report submitted: 8 June 2011
Signature of person submitting the report:
Signature of supervisor:
Posted on the intranet:
Received by the International Department:
Appendix to this report: Conference notes in English

Appendix: Conference notes in English
Exploring What We Can Do Together: Strategic Alliance for International Collaboration (Laura Campbell)
- 185 digital preservation partners in more than 25 countries (education, research, LAM)
- strategic goals: National Content Stewardship Network (national digital collection, technical architecture, public policy, outreach)
- NDIIPP Content Domain Map: a mind map of geospatial, audiovisual, image & text, and web content
- then & now: cognitive surplus vs digital libraries/digital preservation
- solution framework: actively working together, special interest groups, establishing a common index, international digital collection (freely available)
PANEL 1: Technical Alignment (The role of testing), Prof. Dr. Michael Seadle (Panel Head)
- to collaborate on requiring and implementing rigorous and independent tests

DNB Contribution to the Tallinn Alignment (Sabine Schrimpf)
- key theme is infrastructure
- network of hardware and software that permits operation of application SW
- question of interoperability is crucial (standards, technical specifications)
- SW elements
- components of the DP infrastructure were compared to pallets at railroads
o Source: PARSE.Insight Roadmap 2020
- Persistent ID resolvers, certification process
- kopal (ingest: koLibRI)
- nestor (German Network of Expertise)
- DP4lib: Digital Preservation as a Service; reduce dependency between components
o redundant storage at different locations
o koLibRI modules
- LuKII: set up as an economical LOCKSS network in Germany
- SHAMAN
- APARSEN: wants to bring coherence and cohesion to digital preservation research
o trends in DP research projects
o modular DP systems
o distributed as SOA
o elimination of technology dependencies
EDINA: The UK LOCKSS Alliance (Adam Rusbridge)
- EDINA offers underlying technical support & coordination
- threats to digital stewardship:
o failure (media, HW, SW, network, format obsolescence, natural disaster, economic/organizational failure)
o attack (insider/outsider)
o operator error
- source: Requirement to Digital Preservation
- projects PEPRS and PECAN help identify coverage and requirements for DP
Public testing (Michael Seadle)
- traditional physical archiving relies heavily on trusted institutions
- distrust, not trust, needs to be the basis of digital archiving; testing therefore plays a key role
- goals of testing: demonstrate functionality, reveal weaknesses, provide data for planning improvements
- key issues for testing: integrity, authenticity (can the origin or genuineness be shown?), usability (can migration/emulation be demonstrated?), access, financial integrity
- DrWho (drwho1.com)
- bit stream testing is the most important; authenticity and usability may be impaired
o the type of storage media, the number of copies + the frequency of checking and replacement get us to the relevant results
o no reliable metrics exist, however (what is an acceptable loss, etc.)
- without well-documented, peer-reviewed, publicly available test results, librarians are buying archiving systems on faith
Presentation without a title, Andy Rauber
- evaluation vs testing vs benchmarking
- DP testing today is evaluation rather than testing, and far from benchmarking (a few tests, but nowhere near a definition of benchmarks)
o existing evaluations are not repeatable
o focus on the simple things
o building the frameworks before having clear test scenarios
- necessary to move towards comparative benchmarking
- what is needed: a commitment that we want a culture of benchmarking and comparative evaluation; an understanding of what we want to benchmark; benchmark data + ground truth; measurement scales and measures that remain constant; knowledge bases to collect these
Organizing digital preservation on an international level, Michelle Gallinger (NDIIPP)
- focus on a national DP agenda
- community-driven, action-oriented (National Digital Stewardship Alliance)
o present a distributed national digital collection for the benefit of citizens
The European Research Arena, David Giaretta
- technologies: GEANT, EGEE/EGI
- EU research projects: TIMBUS, BLOG4EVER, SCAPE, ENSURE, APARSEN, ARCOMEM, WF4EVER
o SCIDIP-ES (2011-2014)
- Alliance for Permanent Access (APA): formed as a legal entity 3 years ago
o opportunities for networking
- ISO 16363: Audit and Certification of Trustworthy Digital Repositories
- ISO 16919: Requirements for Bodies Providing Audit and Certification
Observation from the MetaArchive Cooperative program, Martin Halbert
- distributed DP programs differ from other programs
o replication of content, distribution of these replicated copies to distinct geographical locations, and network organization to connect these replicated copies
- MetaArchive: established in 2003, funded by NDIIPP
o seeks to foster broader awareness of digital preservation issues
- IIPC: members are all institutions that focus on WA
o 39 members: national and university libraries + other organizations (Internet Archive)
o ISO standard WARC format for web archives + Heritrix and NutchWAX
o growing membership (Africa & South America)
DAY 2
Keynote address: International and National Collaboration in the Digital Age, Gunnar Sahlin
- 2012: a new law for e-deposit
- Samsök (search together) in 2005 (upgrade, new system)
- Swepub and long-term preservation
- consortium of the Swedish research libraries for licensing e-journals and databases (ICOLC)
- Open Access and e-publishing (all universities have their repositories for e-pubs)
- NL as aggregator for Europeana, TEL, Apres, Athena, EUscreen
o common system for the preservation of digital materials
o common search portal for materials from the Swedish National Library and Swedish National Archive
Raivo Ruusalepp
- standards: RAC, DSA, CIDOC (CRM), PAIMAS, ISA (DG), DDI
- use of information security standards for digital preservation
- information security: administrative and technical (physical = data security vs IT = communication)
- company implemented security measurement with typical cyber-crime scenarios
- survey of security:
o provision for information security in national legislation and development plans (12 of the respondents: ISO 27000 series; only 2 formal audits, the rest are looking into it; ½ of the respondents do not use standards or formal measurements to control information security)
o IT & disaster plan: 65 (data recovery from the off-site location tested: 0)
- alignment:
o better use of community standards for information security and preservation
o agreement on security requirements
Standards-based approach to preservation planning, Matthew Woollard
- ISO 27001: very expensive implementation (100 000 pounds)
- Basic: Data Seal of Approval Guidelines (helps understand your business better)
- Audit and Certification of Trustworthy Digital Repositories
- Memorandum of Understanding to Create a European Framework
- ISO 16363 external; DSA or ISO 16363 self-audit
Best Practices & Standards, Bram van der Werf
- self-assessment: trust
- audits/certification: trust
- ISO 30300 (draft): Records Management
PANEL 4 Legal Alignment
Legal deposit & Web Archiving, Adrienne Muir
- legal deposit provisions: purpose, scope, deposit mechanisms, roles & responsibilities & liability, access provisions, sanctions
- implementation: definitions, scope (offline/online, freely available/pay wall), technology-neutral, incremental
- legal deposit vs voluntary (interim/hybrid approach, model agreements and licenses, flexibility)
- other legal issues: intellectual property rights, preservation, access, unlawful material, privacy/data protection
- voluntary approaches have disadvantages but may be necessary and can be useful
Breakout Session
- standards bring along alignment by themselves, but only if you use them to the full, not halfway
- depends on community (users): enforceable or voluntary compliance (standards)
- next steps for standards alignment:
o corpora as benchmarks
o export/import completeness
o educational standards
o validation tools
o accredited training courses to accredit auditors
o framework standards
DAY 3
PANEL 5 Educational Alignment
- key elements of the DCC Curation Lifecycle Model
- framework for the Education Alignment (USA: grad programs for digital preservation, new models for grad programs, internship programs (diverse knowledge), workshops)
o sharing tools
o national programs
- issues related to digital preservation:
o nature of costs and business models
o strategies for selection & appraisal
o ground roles and responsibilities
o effectiveness and demand for services
- focus of the panel: factors influencing the actual sustainability of a digital archive
- 2 considerations: collaboration + user demand
- challenges + gaps: span national boundaries, public + private funding, education, exportation, DP certification, competition, funding gaps, policy, selection criteria, roles & responsibilities, standards
- Magazzini Digitali: e-legal deposit in Italy
- PADI: a failed project (discontinued in 2010)
Presentation without a title, Neil Grindley
- how much does it cost to manage information?
- what institutional financial strategies are required to facilitate effective preservation?
- what general economic frameworks are required to enable information to persist and be accessible?
- JISC 2010: Infrastructure for Education and Research Programme
- archival storage and preservation activities constitute a very small proportion of the overall costs: 1531 access 55 outreach acquisition ingest
o approx 333 Euros for a set of 1000 records
- KRDS2 p 83: future tool development supporting automation of ingest
Sustainable Preservation in North America: ADPNet & Friends, Aaron Trehub
- solution: distributed digital preservation (in at least 3 copies vs LOCKSS 6 copies)
- DDP + LOCKSS = PLN; open SW developed at Stanford
- MetaArchive, COPPUL (Canada), ADPNet (wwwadpnorg)
CO-FINANCED BY THE EU STRUCTURAL FUNDS (EUROPEAN REGIONAL DEVELOPMENT FUND) THROUGH THE IOP
- 2010: the Data Seal of Approval is part of the EU framework for audit and certification of trusted repositories (an MoU between the three certification activities)
- 2011: within the APARSEN project they also did ISO 16363; together with DANS and UKDA they went through that audit
- first internal, January and April 2011 (60 man-days), then external with 12 experts (KB, BL, NASA, etc.) in June 2011 (3 days)
MoU: DSA -> ISO 16363 as the second step (internal) -> ISO 16363 extended
The audit gets faster the more of them you do, i.e. if it is done regularly it is not so time-consuming.
The National Library (NK) of New Zealand passed TRAC certification in autumn 2011.
Andreas Rauber: the impact of preservation actions on the repository
- what happens to the repository itself
- repository simulation: RepoSim
- for analysing and testing migration: what happens when files grow in size, what if the repository holds more format types, etc.
- RepoSim: a flexible simulator, irregular patterns
- so far an internal version (Hibernate, Java, MySQL)
- you can specify which formats it accepts, their description, ingest settings, hypothetical tools (mainly for migration), and rules for preservation actions (migration to which format, which version, which files, how many times; rules + filters)
- option to run a virtual migration; it produces charts that tell how long it will take, etc.
- what to migrate, how, to what, and for how long all runs virtually, and we see the result
- good for IT and HW planning
- good for planning various scenarios, comparison with the expected development, planning HW growth and investments
- they still have to add the option to enter deletion policies, reports, etc.
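The idea of a "virtual migration" can be illustrated with a very small sketch. This is not RepoSim and does not use its data model; the class names, fields, and the throughput figure are all hypothetical, chosen only to show how holdings plus a migration rule yield a duration estimate without touching any real file.

```python
from dataclasses import dataclass


@dataclass
class FormatHolding:
    """Hypothetical description of what the repository holds in one format."""
    fmt: str
    file_count: int
    avg_size_mb: float


@dataclass
class MigrationRule:
    """Hypothetical rule: migrate source_fmt to target_fmt with a tool
    that processes tool_mb_per_s megabytes per second (assumed figure)."""
    source_fmt: str
    target_fmt: str
    tool_mb_per_s: float


def simulate_migration(holdings, rules):
    """Estimate hours needed per rule; nothing is actually migrated."""
    by_fmt = {h.fmt: h for h in holdings}
    estimate = {}
    for rule in rules:
        holding = by_fmt.get(rule.source_fmt)
        if holding is None:
            continue  # the rule matches nothing in the repository
        total_mb = holding.file_count * holding.avg_size_mb
        seconds = total_mb / rule.tool_mb_per_s
        estimate[(rule.source_fmt, rule.target_fmt)] = seconds / 3600
    return estimate


if __name__ == "__main__":
    holdings = [FormatHolding("TIFF", 1000, 36.0)]
    rules = [MigrationRule("TIFF", "JPEG2000", 10.0)]
    for (src, dst), hours in simulate_migration(holdings, rules).items():
        print(f"{src} -> {dst}: about {hours:.1f} h")
```

A real simulator would also model growth over time, several tools per format, and filters; the sketch keeps only the core arithmetic.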
José Barateiro: Risk assessment in DP of e-science data and processes
- DP as risk management
- ISO 31000: definition of risk management
- similar to DRAMBORA
- there are many standards for risk management
- the ISO 31000 methodology broken down into individual steps
- TIMBUS project (httptimbusprojectnet); one of the partners is SAP (Germany)
mad talks
- RODA, the open-source SW for LTP, is back; it is being developed within the SCAPE project: new features, development plans, and the formation of a user community
- 4 posters on emulation: emulation within KEEP, emulation for reading rooms in libraries, OPF ecosystem registry
- TOTEM: a metadata standard for describing the technical environment for emulation
ANDS, Ross Wilkinson
- data centres in Australia: at least 3, for different areas of life
- ANDS has existed for almost 3 years; funded by the Australian government
- huge volumes of data will never be used or read by a human, only mined by automated processes
- research data must be stored and preserved, because it may no longer be possible to recreate them, in such a way that they can be reused, new conclusions can be drawn from them, and scientists have them available
- this must be done in cooperation; it cannot be done by a single institution alone
- who deals with storing scientific data in the Czech Republic? The Academy of Sciences, CESNET?
- similar data centres also exist in Great Britain
Rob Sharpe: Considerations for High Throughput Digital Preservation
Presentation by the company Tessella: their performance testing of ingest into SDB at FamilySearch
- SDB has been in development since 2002, when the first customer was the National Archives UK
- new customer: the UK Parliament
- test with FamilySearch
- 20 TB ingested per day; scanned materials; workflow with antivirus, characterization (PRONOM, JHOVE), etc.
- 1 package is roughly 1 GB; 20 thousand packages per day
- 2 Dell PowerEdge R710 servers, total price max 20,000 pounds
- the limiting factor turned out to be the read speed of the disks on which the data are stored at the start of ingest; they therefore needed 130 parallel disks (50 thousand pounds)
- storing to tape is also slow; they therefore needed 8 parallel tape writes (30 thousand pounds)
- storage costs 100 pounds per TB
- 7.3 PB per year
- conclusion: writing and reading are slow; tools such as JHOVE and PRONOM are fast enough; storage costs also turned out to be high
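The figures in this list can be cross-checked with simple arithmetic. The sketch below uses only numbers from the notes (20 TB per day, about 1 GB per package, 100 pounds per TB); decimal units (1 TB = 1000 GB) and a 365-day year are assumptions, and the function names are illustrative.

```python
# Figures from the notes; decimal units and a 365-day year are assumed.
TB_PER_DAY = 20
PACKAGE_GB = 1
STORAGE_GBP_PER_TB = 100
SECONDS_PER_DAY = 24 * 60 * 60


def sustained_mb_per_s(tb_per_day):
    """Average throughput needed to move tb_per_day within 24 hours."""
    return tb_per_day * 1_000_000 / SECONDS_PER_DAY


def packages_per_day(tb_per_day, package_gb):
    """Number of packages of package_gb gigabytes in one day's ingest."""
    return tb_per_day * 1000 / package_gb


def yearly_volume_pb(tb_per_day, days=365):
    """Yearly ingest volume in petabytes."""
    return tb_per_day * days / 1000


def yearly_storage_cost_gbp(tb_per_day, gbp_per_tb, days=365):
    """Cost of storing one year's ingest once, at a flat price per TB."""
    return tb_per_day * days * gbp_per_tb


if __name__ == "__main__":
    print(f"average rate: {sustained_mb_per_s(TB_PER_DAY):.0f} MB/s")
    print(f"packages per day: {packages_per_day(TB_PER_DAY, PACKAGE_GB):.0f}")
    print(f"volume per year: {yearly_volume_pb(TB_PER_DAY):.1f} PB")
    print(f"storage per year: {yearly_storage_cost_gbp(TB_PER_DAY, STORAGE_GBP_PER_TB):,} GBP")
```

Under these assumptions the daily volume corresponds to the 20 thousand packages and the yearly volume quoted in the notes, and the average rate works out to a few hundred MB/s.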
For ingesting data from the FamilySearch project they needed to ensure a throughput of 20 TB of data per day while maintaining adequate data-processing procedures as required by OAIS and the client. The project was about identifying the bottlenecks in ingesting large volumes of data.
Processes such as generating or checking hashes, identifying formats, and extracting technical metadata usually require a fast storage system when volumes are large. In the FamilySearch project they want to ingest into SDB (content acquisition, content preparation, ingest, fixity check, content metadata integrity check, characterization, i.e. format identification and validation and extraction of technical metadata) at most 700 MB per second.
In other words: how to parallelize such a massive workflow efficiently while minimizing costs. According to their findings, parallelization makes it possible to work around performance problems of tools such as DROID and JHOVE; overall, software performance was, contrary to their expectations, not a problem. The bigger problems are in the HW, which has to be able to write fast enough.
That is, the bottleneck was in the HW and in moving data from place to place rather than in the performance of digital preservation tools.
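As an illustration of a parallelized fixity check, here is a minimal sketch using only the Python standard library. It is not Tessella's SDB workflow; the function names, the choice of SHA-256, and the worker count are assumptions made for the example.

```python
import hashlib
from concurrent.futures import ThreadPoolExecutor


def file_digest(path, algorithm="sha256", chunk_size=1024 * 1024):
    """Hash a file in fixed-size chunks so large files never sit in memory."""
    digest = hashlib.new(algorithm)
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_fixity(expected, workers=4):
    """Check many files in parallel.

    expected maps a file path to the hex digest recorded at ingest.
    Returns the paths whose current digest no longer matches.
    """
    paths = list(expected)
    with ThreadPoolExecutor(max_workers=workers) as pool:
        actual = list(pool.map(file_digest, paths))
    return [p for p, d in zip(paths, actual) if d != expected[p]]
```

With a checker like this, adding workers helps only until the storage system can no longer deliver data fast enough, which matches the observation above that the bottleneck was the hardware rather than the tools.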
Benefit for NK
Do not be afraid of the performance of SW such as DROID or JHOVE.
Ross King: Evolving domains, problems and solutions for LT DP
- info on projects such as SCAPE
- programme: httpcordiseuropaeufp7icttelearn digicultreport research digital preservation_enpdf (Stephan Strodl, Petar Petrov, Andreas Rauber; Vienna University of Technology, Austria)
- a nice timeline for preservation projects; a whitepaper about the past of European DP
- funding spent on DP research is gradually growing; projects and funding will not solve anything
- ARCOMEM: archiving of web archives, a socially driven web preservation model
- social web analysis
- archive enrichment
- ENSURE: evaluation between cost and value, automation of the preservation cycle; testbeds: healthcare, clinical trials, financial services
- SCAPE
- preservation planning and action workflows: how to make them scalable
- building an infrastructure for scalable preservation actions
- developing a policy-based preservation planning tool with automated preservation watch
- 3 testbeds: WA, large-scale repositories, research data sets
- all projects will produce prototype SW
- digital lifecycle approach
- preservation planning plays a role in all of these projects, together with virtualization
- slide with trends in DP over recent years
- Research on Digital Preservation within projects co-funded by the European Union in the ICT
- Ensure
- Scape
- Wf4Ever (httpwwwwf4ever projectorgabout)
- Timbus: SW alone is not enough; it focuses on the organizational context; LTP is not only about objects but also about services, etc.
- Totem
Benefit for NK
Follow projects in the area of long-term preservation of digital data. The latest EU projects such as SCAPE will speed up the development of concrete tools for long-term preservation of digital data.
Record keeping in temporary command settings, Erik Borglund
- preservation of documentation on crisis situations arising from the activities of the police, etc.
- how to capture context: flipcharts, videos, and minutes can be kept, but what about the context?
- analogue documents are not a problem; the problem is with digital material and conversations
- the national archive should take care of this, but it only accepts paper documents or, e.g., photos from the meeting venue
- a question: is archiving file records the same thing as archiving the course of a meeting in digital form?
webarchiving session
BnF
- 200 TB of web-archived data
- 15 million ARC files; they have to be characterized and validated, which is time-consuming
- they store them in a shared repository
- SPAR (the LTP system of the French national library) has a capacity of 16 PB
- they use JHOVE2 for characterization and are creating a module for ARC files
- they do not want to do characterization and validation for the content of ARC files, only format identification
- PREMIS in METS would be too long, so they will record only metadata at the level of the information package (AIP) that are the same for the whole package; that is, a property is expressed once and then only the information about which files it applies to is added, instead of repeating that information for every file
- they created a special metadata format
- i.e. they are able to ask the LTP system "give me all information packages that contain format XY", etc.; there is no need to index the metadata of the contents, which would take a long time
- they take the same approach for digitized books
- different DP policies and validation levels for different types of WA data: complete harvests vs thematic harvests
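The package-level approach (state a property once, then list the files it applies to) can be sketched as follows. This is only an illustration of the idea, not the BnF metadata format; the function names and the sample data are hypothetical.

```python
from collections import defaultdict


def group_files_by_format(file_formats):
    """Turn per-file records into one entry per format.

    file_formats maps a file name to its identified format.
    Returns a dict mapping each format to the sorted list of files.
    """
    groups = defaultdict(list)
    for name, fmt in file_formats.items():
        groups[fmt].append(name)
    return {fmt: sorted(names) for fmt, names in groups.items()}


def packages_containing(packages, fmt):
    """Answer 'give me all information packages that contain format XY'.

    packages maps a package id to the grouped metadata produced above,
    so the lookup never has to scan per-file records.
    """
    return sorted(pid for pid, groups in packages.items() if fmt in groups)


if __name__ == "__main__":
    aip1 = group_files_by_format({"a.html": "text/html", "b.html": "text/html",
                                  "c.gif": "image/gif"})
    aip2 = group_files_by_format({"d.pdf": "application/pdf"})
    packages = {"AIP-1": aip1, "AIP-2": aip2}
    print(aip1)
    print(packages_containing(packages, "image/gif"))
```

The saving comes from the number of metadata entries growing with the number of distinct formats in a package rather than with the number of files.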
NL NZ
- 2 harvests, 20 TB in total
- they are working on metadata: how much metadata is a lot and how much is too little
- library policy says that as much metadata as possible must be stored, which would, however, be a problem in terms of metadata size
- for selective web harvesting they have a finished workflow (WCT); everything is catalogued
IA
- 16 billion URLs
- the oldest from 1996
- growth is 3 TB per day, 1 PB per year
Euan Cochrane, Dirk von Suchodoletz: Replicating Installed Application and Information Environments onto Emulated or Virtualized Hardware
- capturing and preserving a whole environment on emulated HW
- e.g. take the desktop environment of the prime minister and store it in an archive
- problems with rendering
- computer forensics
- an option for preserving scientific data and records
- the whole thing is about how to replicate an HDD and run the environment on it in a virtual environment
- solution: they pulled the HDDs out of several old PCs -> identify the HW requirements (analysis of the HDD -> automatic estimate of the requirements; this is part of every PC environment) -> select emulation/virtualization SW (a tool registry such as TOTEM from the KEEP project) -> convert the HDD into a disk image suitable for emulation -> try to boot the disk image on the emulated HW -> add drivers
- problems with licences, personal data protection, and authenticity (20 things change: colours, etc.)
- QEMU, SPARC processor emulator
Klaus Rechert: Remote Emulation for Migration Services in a Distributed Preservation Framework
- use of emulation as a tool for migration
- tools for migrating certain formats are often not available
- we insert the digital object into the emulated environment (virtual machine); we then see it within the emulated system, can open it in the original or another suitable application, save it in a different format, and store it again in the virtual machine
Bram Lohman: Emulation as a Business Solution, the Emulation Framework (KEEP project)
- emulation framework: 7 emulators, 6 platforms (x86, Amiga, etc.), 23 file formats
- a solution for managing emulation tools
- setup of emulation processes
- an environment that contains emulators; if we load an application or file into it, it should run as in the original environment
- the environment also contains a tool that, based on PRONOM, shows for a file what format it is and what environment is needed to run it; the environment can be prepared right away and the file run in it
- it loads a SW image from the OPF application database, which is being built
Geoffrey Brown: Developing Virtual CD-ROM Collections: The Voyager Company Publications
- publications of a particular publisher on CD, interactive applications for Mac; of 200 published, only about 50 are available now
- emulation on today's systems
- HDD snapshot directly in the emulator, i.e. it takes one click and is very fast
- SheepShaver emulator
Evaluation of a large Danish migration project
- before 1998 they had no formats prescribed by law
- between 2005 and 2008 they introduced standards
- the evaluation concerns the prescribed standards and the migration to them in the national archive
- they did the evaluation for the funder
- between 2005 and 2008 they spent 30 person-years on the migration; they had 10-15 people on it; 190 thousand USD was invested; total costs 26 million USD
In reality it is not much data: what they migrated was about 1777 GB.
Various parts of the archive: tapes, population data, data on CD-R, registries, and data filled in electronically. They could not read all the files, especially on tapes. There were 5 different types of tape. Some had to be rescued at great expense.
Total cost of producing the preservation standard: 10 man-years, 12 thousand USD, including the manual and implementation recommendations.
Pilot: project planning and management, and verification of the information packages.
The aim of the pilot was to get a better budget and project plan.
Some problematic data in old formats, such as old databases, need smart people doing repetitive, long-running work; they needed good knowledge management for it to be efficient.
Migration method: they wrote requirements for the tool and a description of how manual migration should be done.
Data preparation (restructuring data and registering metadata of the IPs) and preparation of documentation for the migrated IPs.
Software development: in-house development.
They needed 50 person-years for 1
Conclusions
Migration of standard data is cheaper; migration from some standard tapes is cheaper, etc.
Most of the costs (80) went on non-standardized data when producing migration software. Developing a tool for migrating heterogeneous or non-standard data is the most expensive part.
What they learned: the old data had not been analysed sufficiently.
Project management was loose; they lost money.
Knowledge management: a good description of old data and of all their types, generations, locations, etc. does not exist at our institution and we will have trouble with it; migration of old data at NK will be problematic.
Angela Dappert: robust migration workflow for offline media
- What is an archival object? A nice slide: a CD is not an archival object; for them it is a hand-held carrier. Better is a bit-stable object, which can have a backup, etc., and then an archival object, which has additional metadata (logical preservation).
- A CD is not searchable, cannot easily be replicated, has a large manual overhead, and its rendering technology becomes obsolete very quickly.
- Endangered Archives project: optical disks, CD-R, external HDs, tapes; 67 terabytes in total
- The offline hand-held carriers in the Endangered Archives project were very variable; they contained data with DRM, under copyright, and with a range of such problems
- The options they decided between for each data source:
- a disk image: one file that contains everything on the carrier
- or extraction of only some files
- How important is the carrier itself? We need to have some information about it; it may hold traces of deleted data, and we may want to have them. They made disk images of all sorts of things: hybrid DVDs, audio discs that also held data, etc.
- Which disk image format should they use? Not just one disk image format for all data; a special disk image format for each type
- They did it with robots (disk copying robots); in some cases large-scale disk copying robots could not be used: they are good at producing CDs but not at ripping data from CDs
- They built their own application with disk stacks and some smaller robots; they tried LIFO and FIFO and in the end used FIFO, as LIFO had problems with the CD lifter
At the KB they thought through a fairly complex workflow, how to describe it, etc.
They had a PC at each robot.
They had problems with a number of things; see the presentation.
They did not find good SW for managing images, only command-line tools, and non-technical staff would make a lot of blunders with those.
It takes a lot of people before the content gets online.
They must be well trained and flexible, but also able to do tedious jobs: systematic, patient.
NOTE: important for NK, where transfer of data from disks will also have to be addressed and already has been.
KEEP project, Antonio Ciuffreda: towards an integrated migration environment
- Disk transfer tool: converts a disk into an image file; it also contains additional metadata about the file system, the MD5 of the file, etc.
- KEEP is producing a Transfer Tool Framework
- Magnetic media: disk transfer tools for floppy disks, commercial and open source
- Disk2FDI: commercial DOS tool; a very accurate image of the floppy disk; it takes 1 hour and the result is very large, ten times larger than the floppy disk itself. They tested about 2260 disks and tested emulation
- Catweasel: commercial tool; a PCI card; runs on Linux and Win XP; has a GUI. High error rate, but faster; image file quality was low
- Nibtools: free tool; G64 and D64; covers only C64; DOS, Win, Linux, but command line only. Needs a Commodore disk drive and special cables. They tested a few disks; about half then did not work in the emulator
- Optical media: they used 5 transfer tools, with the same CDs, DVDs, and games for all of them:
1 Alcohol 120: commercial; can bypass DRM, etc.; supports Win systems. 12 of 13 worked; 2 MB per second
2 Daemon Tools: commercial; several types of image files (ISO, MDS, MDF); supports Win; three of 13 did not work
3 CloneCD: commercial; uses IMG or ISO; bypasses SafeDisc 3 protection; supports Win; has a GUI; 11 of 13 were OK
4 BlindWrite: commercial; supports DVD, Blu-ray, WM, Xbox, and other special discs. Generates ISO and some proprietary ISO image formats; one did not work; transfer speed [value missing]
5 ImgBurn: does not read subchannel information into the image file (a film cannot be seeked, etc.); it is open source; generates DVD, BIN, CUE, IMG; Win and Linux; 4 did not work; it is fast
Conclusions
For magnetic media, commercial and non-commercial tools do not differ in performance; Disk2FDI is accurate but very slow. KEEP will use NibTools.
Optical: keep copy protection in mind; the tools perform similarly; there will always be errors in the images, between 30 and 10 percent; BlindWrite can handle game discs (Xbox, etc.). KEEP will use ImgBurn because it is open source.
For complex images BlindWrite is better.
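The test results quoted for the five optical-media tools can be turned into failure rates with a few lines of code. The counts below come from the notes; that BlindWrite was tested on the same 13 discs as the others is an assumption based on the remark that all tools got the same discs, and the function names are illustrative.

```python
TOTAL_DISCS = 13  # same set of discs for every tool, per the notes

# Number of discs whose image did not work, per tool.
FAILED = {
    "Alcohol 120": 1,
    "Daemon Tools": 3,
    "CloneCD": 2,
    "BlindWrite": 1,  # assumes the same 13 discs
    "ImgBurn": 4,
}


def failure_rate(failed, total=TOTAL_DISCS):
    """Share of discs that could not be imaged successfully."""
    return failed / total


def ranked_by_failure(failed_counts, total=TOTAL_DISCS):
    """Tools sorted from most to least reliable (ties broken by name)."""
    rates = ((tool, failure_rate(n, total)) for tool, n in failed_counts.items())
    return sorted(rates, key=lambda item: (item[1], item[0]))


if __name__ == "__main__":
    for tool, rate in ranked_by_failure(FAILED):
        print(f"{tool}: {rate:.0%} of discs failed")
```

On this small sample the rates fall roughly within the range given in the conclusions; with only 13 discs, the differences between tools should be read with caution.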
Benefit for NK
Consider whether NK should really run a project to migrate the content of CDs and DVDs to online media. Concrete experience with robotic processing is presented here, showing what problems they had with designing the workflow, choosing the type of ISO image, etc.
BnF web archiving
They have three layers:
- Harvest definition: collection
- Harvest instance: crawling metadata
- ARC files
A collection will be, for example, the selective "elections 2000" collection, then the individual harvest instances, and then the ARC files. They collect logs, config, and reports. These are also stored in an ARC: a special metadata ARC for each crawling instance.
PREMIS
Object, agent, event
Objects:
1 ARC files and metadata ARCs
2 harvest instances
Harvest event in PREMIS: event = creation of content files
Events: reports as extensions of the event (host report and harvest report)
Agents: administrator, SW, institutions, organizations that perform the harvest
ContainerMD
httpbibnumbnffrcontainerMD v1documentationcontainerMD v1html
special metadata for items from the web archive
httpbibnumbnffrcontainerMD v1
Different SLAs for different types of material, for different data from different harvests. Shared repository: they expect various benefits; skills for different formats need not be duplicated within the institution.
Next year a JHOVE2 module for WARC should also exist.
Memento: a meta search engine
Benefit for NK
Their web archiving model could be used at NK.
Cost models: Danish national library + TU Wien
- Stephan Strodl, TU Wien: they have their own cost model, but apparently only for small-scale automated preservation action costs
- The Danish national library built its own model, which is meant to be universal and usable anywhere
- They measured the cost of submission according to the PAIMAS standard
- When calculating costs they use OAIS and PAIMAS: they map activities onto these models and then estimate the process costs accordingly
- Costmodelfordigitalpreservationdk
Benefit for NK
Relevant to project 0136, which dealt with options for estimating the costs of long-term storage.
Meet RODA, a Full-Fledged Digital Repository for Long-Term Preservation
- Originally a project of the Portuguese national archive; we have been following it for several years. RODA is now supported by an independent company, which also partly continues its development. So far RODA supports only the archival metadata format (EAD), but further development should also cover library formats
- RODA is now part of the SCAPE project, where the system can be developed further and scaled for use in mass production
- httpredminekeepptprojectsroda public
Benefit for NK
Follow further development, possibly also for the INCAD + KNAV projects; for LTP development at smaller institutions this could be an interesting alternative in the future.
Report from a foreign business trip
Name and surname of the traveller: Mgr. Andrea Fojtů (AF)
Workplace according to the organizational structure: 15 Department of Long-Term Preservation of Digital Data (ODODD)
Workplace, unit: 152 Department of Digital Repository Content Management
Reason for the trip: participation in the conference Aligning National Approaches to Digital Preservation
Place, city: Tallinn
Place, country: Estonia
Dates (from-to): 22-26 May 2011
Detailed schedule:
Monday 23 May: conference registration, guided tour of the National Library; Keynote Address by Laura Campbell (Library of Congress, USA); Panel 1: Technical Alignment; Panel 2: Organizational Alignment
Tuesday 24 May: Keynote Address by Gunnar Sahlin (National Library of Sweden); Panel 3: Standards Alignment; Panel 4: Legal Alignment; Breakout Sessions for panels 3 & 4
Wednesday 25 May: Panel 5: Education Alignment; Panel 6: Economic Alignment; Breakout Sessions for panels 5 & 6; Synthesis; Closing remarks
Fellow travellers from NK: PhDr. Bohdana Stoklasová (BS), Ing. Tomáš Svoboda (TS)
Funding: IOP NDK
Aims of the trip: attendance at a conference with international participation; gaining contacts in the area of long-term preservation of digital documents and electronic legal deposit; a more detailed insight into the issues of long-term preservation of digital documents, (particularly) in national libraries
Fulfilment of the aims (specifically): The conclusions of the conference Aligning National Approaches to Digital Preservation largely mirror the conclusions of the workshop The Future of the Past: Shaping new visions for EU research in digital preservation (report available at httpcordiseuropaeufp7icttelearn digicultfuture of thepast_enpdf), e.g. regarding the missing economic model for the commercial sector, which would see long-term preservation as an inseparable part of all its processes. Contact was established with staff of the national libraries of Estonia and Finland (staff working on web archiving and long-term preservation in general).
Programme and further details: The main aim of the conference was to align national approaches to long-term preservation of digital documents across all areas, from technical, organizational, and educational through to standardization, economic, and financial.
Materials brought back: conference programme, leaflets of exhibiting companies (Tessella, Equella, Guardtime) + other materials, notes
Date of submission of the report: 8 June 2011
Signature of the person submitting the report:
Signature of the superior:
Posted on the intranet:
Received by the international department:
Attachment to this report: notes from the conference in English
Attachment: Notes from the conference in English
Exploring What We Can Do Together: Strategic Alliance for International Collaboration (Laura Campbell)
- 185 digital preservation partners in more than 25 countries (education, research, LAM)
- strategic goals: National Content Stewardship Network (national digital collection, technical architecture, public policy, outreach)
- NDIIPP Content Domain Map: a mind map of geospatial, audiovisual, image & text and web content
- then & now: cognitive surplus vs. digital libraries / digital preservation
- solution framework: actively working together, special interest groups, establishing a common index, international digital collection (freely available)
PANEL 1: Technical Alignment (The role of testing), Prof. Dr. Michael Seadle (Panel Head)
- to collaborate on requiring and implementing rigorous and independent tests
DNB Contribution to the Tallinn Alignment (Sabine Schrimpf)
- key theme is infrastructure
- network of hardware and software that permits the operation of application SW
- question of interoperability is crucial (standards, technical specifications)
- SW elements
- components of the DP infrastructure were compared to pallets on railroads
  o source: PARSE.Insight Roadmap 2020
- persistent ID resolvers, certification process
- kopal (ingest, koLibRI)
- nestor (German Network of Expertise)
- DP4Lib: Digital Preservation as a Service, reduce dependency between components
  o redundant storage at different locations
  o koLibRI modules
- LuKII: set up as an economical LOCKSS network in Germany
- SHAMAN
- APARSEN: wants to bring coherence and cohesion to digital preservation research
  o trends in DP research projects
  o modular DP systems
  o distributed as SOA
  o elimination of technology dependencies
EDINA: The UK LOCKSS Alliance (Adam Rusbridge)
- EDINA offers underlying technical support & coordination
- threats to digital stewardship
  o failure (media, HW, SW, network, format obsolescence, natural disaster, economic/organizational failure)
  o attack (insider/outsider)
  o operator error
- source: Requirements for Digital Preservation
- projects PEPRS and PECAN help identify coverage and requirements for DP
Public testing (Michael Seadle)
- traditional physical archiving relies heavily on trusted institutions
- distrust, not trust, needs to be the basis of digital archiving; testing therefore plays a key role
- goals of testing: demonstrate functionality, reveal weaknesses, provide data for planning improvements
- key issues for testing: integrity, authenticity (can the origin or genuineness be shown?), usability (can migration/emulation be demonstrated?), access, financial integrity
- DrWho (drwho1com)
- bit stream testing is the most important; authenticity and usability may be impaired
  o the type of storage media, the number of copies and the frequency of checking and replacement get us to the relevant results
  o no reliable metrics exist, however (what is an acceptable loss, etc.)
- without well documented, peer reviewed, publicly available test results, librarians are buying archiving systems on faith
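The point about the number of copies and the frequency of checking can be illustrated with a small bit-stream audit. The following is only a sketch in Python, assuming each copy can be read fully into memory; the function names and the location labels are hypothetical and do not come from the talk.

```python
import hashlib
from collections import Counter


def sha256_of(data: bytes) -> str:
    """Return the SHA-256 digest of one copy as a hex string."""
    return hashlib.sha256(data).hexdigest()


def audit_copies(copies: dict) -> dict:
    """Compare the checksums of all copies of one object.

    `copies` maps a (hypothetical) storage location to the bytes read there.
    The digest shared by most copies is treated as the reference; on a tie,
    the digest seen first wins, so a tie should be reviewed by hand.
    """
    digests = {location: sha256_of(data) for location, data in copies.items()}
    reference, votes = Counter(digests.values()).most_common(1)[0]
    damaged = sorted(loc for loc, d in digests.items() if d != reference)
    return {"reference": reference, "agreeing": votes, "damaged": damaged}


report = audit_copies({
    "site-a": b"archived bit stream",
    "site-b": b"archived bit stream",
    "site-c": b"archived bit strean",  # simulated corruption in one copy
})
print(report["damaged"])
```

Running such a check on a schedule gives the kind of data on loss and repair that the talk says is still missing as an agreed metric.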
Presentation without a title (Andy Rauber)
- evaluation vs. testing vs. benchmarking
- DP today: evaluation rather than testing, and far from benchmarking (a few tests, but nowhere near a definition of benchmarks)
  o existing evaluations are not repeatable
  o focus on the simple things
  o building the frameworks before having clear test scenarios
- necessary to move towards comparative benchmarking
- what is needed: a commitment to a culture of benchmarking and comparative evaluation; an understanding of what we want to benchmark; benchmark data + ground truth; measurement scales and measures that remain constant; knowledge bases to collect these
Organizing digital preservation on an international level (Michelle Gallinger, NDIIPP)
- focus on a national DP agenda
- community driven, action oriented (National Digital Stewardship Alliance)
  o present a distributed national digital collection for the benefit of citizens
The European Research Arena (David Giaretta)
- technologies: GEANT, EGEE/EGI
- EU research projects: TIMBUS, BLOG4EVER, SCAPE, ENSURE, APARSEN, ARCOMEM, WF4EVER
  o SCIDIP-ES (2011-2014)
- Alliance for Permanent Access (APA): formed as a legal entity 3 years ago
  o opportunities for networking
- ISO 16363: Audit and Certification of Trustworthy Digital Repositories
- ISO 16919: Requirements for Bodies Providing Audit and Certification
Observation from the MetaArchive Cooperative program (Martin Halbert)
- distributed DP programs differ from other programs
  o replication of content, distribution of these replicated copies to distinct geographical locations, and network organization to connect these replicated copies
- MetaArchive: established in 2003, funded by NDIIPP
  o seeks to foster broader awareness of digital preservation issues
- IIPC: members are all institutions that focus on WA
  o 39 members: national and university libraries + other organizations (Internet Archive)
  o ISO standard WARC format for web archives + Heritrix and NutchWAX
  o growing membership (Africa & South America)
DAY 2
Keynote address: International and National Collaboration in the digital age (Gunnar Sahlin)
- 2012: a new law for e-deposit
- Samsök (search together) in 2005 (upgrade, new system)
- Swepub and long-term preservation
- Consortium of the Swedish research libraries for licensing e-journals and databases (ICOLC)
- Open Access and e-publishing (all universities have their repositories for e-pubs)
- NL: aggregator for Europeana, TEL, Apres, Athena, EUscreen
  o common system for the preservation of digital materials
  o common search portal for materials from the Swedish National Library and Swedish National Archive
Raivo Ruusalepp
- standards: RAC, DSA, CIDOC (CRM), PAIMAS, ISAD(G), DDI
- use of information security standards for digital preservation
- information security: administrative and technical (physical = data security vs. IT = communication)
- company implemented security measurement with typical cyber crime scenarios
- survey of security
  o provision for information security in national legislation and development plans (12 of the respondents use the ISO 27000 series, only 2 have formal audits, the rest are looking into it; ½ of the respondents do not use standards or formal measurements to control information security)
  o IT & disaster plan: 65% (data recovery from the off-site location tested: 0%)
- alignment
  o better use of community standards for information security and preservation
  o agreement on security requirements
Standards based approach to preservation planning (Matthew Woollard)
- ISO 27001: very expensive implementation, GBP 100,000
- Basic: Data Seal of Approval Guidelines (helps you understand your business better)
- Audit and Certification of Trustworthy Digital Repositories
- Memorandum of Understanding to Create a European Framework
- ISO 16363 external, DSA or ISO 16363 self audit
Best Practices & Standards (Bram van der Werf)
- self assessment: trust; audits/certification: trust
- ISO 30300 (draft): Record Management
PANEL 4: Legal Alignment
Legal deposit & Web Archiving (Adrienne Muir)
- legal deposit provisions: purpose, scope, deposit mechanisms, roles & responsibilities & liability, access provisions, sanctions
- implementation: definitions, scope, offline/online, freely available/paywall, technology neutral/incremental
- legal deposit vs. voluntary (interim/hybrid approach, model agreements and licenses, flexibility)
- other legal issues: intellectual property rights, preservation, access, unlawful material, privacy/data protection
- voluntary approaches have disadvantages but may be necessary and can be useful
Breakout Session
- standards bring along alignment by themselves, but only if you use them to the full, not halfway
- depends on community (users): enforceable or voluntary compliance (standards)
- next steps for standards alignment
  o corpora as benchmarks
  o export/import completeness
  o educational standards
  o validation tools
  o accredited training courses to accredit auditors
  o framework standards
DAY 3
PANEL 5: Educational Alignment
- key elements of the DCC Curation Lifecycle Model
- Framework for the Education Alignment (USA: grad programs for digital preservation, new models for grad programs, internship programs (diverse knowledge), workshops)
  o sharing tools
  o national programs
- issues related to digital preservation
  o nature of costs and business models
  o strategies for selection & appraisal
  o ground roles and responsibilities
  o effectiveness and demand for services
- focus of the panel: factors influencing the actual sustainability of a digital archive
- 2 considerations: collaboration + user demand
- challenges + gaps: span national boundaries, public + private funding, education, exportation, DP certification, competition, funding gaps, policy, selection criteria, roles & responsibilities, standards
- Magazzini Digitali: e-legal deposit in Italy
- PADI: a failed project (discontinued in 2010)
Presentation without a title (Neil Grindley)
- how much does it cost to manage information? what institutional financial strategies are required to facilitate effective preservation? what general economic frameworks are required to enable information to persist and be accessible?
- JISC 2010: Infrastructure for Education and Research Programme
- archival storage and preservation activities constitute a very small proportion of the overall costs: 15%; access 31%; outreach, acquisition and ingest 55%
  o approx. 333 euros for a set of 1000 records
- KRDS2, p. 83: future tool development supporting automation of ingest
Sustainable Preservation in North America: ADPNet & Friends (Aaron Trehub)
- solution: distributed digital preservation (at least 3 copies vs. 6 copies in LOCKSS)
- DDP + LOCKSS = PLN; open SW developed at Stanford
- MetaArchive, COPPUL (Canada), ADPNet: www.adpn.org
CO-FINANCED BY THE EU STRUCTURAL FUNDS (EUROPEAN REGIONAL DEVELOPMENT FUND) THROUGH THE IOP
ANDS (Ross Wilkinson)
- data centres in Australia, at least 3, for different areas of life
- ANDS has existed for almost 3 years; funded by the Australian government
- enormous volumes of data will never be used or read by a human, only by automated data mining processes
- research data must be stored and preserved, because it may no longer be possible to re-create them, in a way that allows them to be reused, lets new conclusions be drawn from them, and keeps them available to researchers
- this has to be done collaboratively; a single institution cannot do it on its own
- who handles the storage of scientific data in the Czech Republic? The Academy of Sciences? CESNET?
- similar data centres also exist in the United Kingdom
Rob Sharpe: Considerations for High Throughput Digital Preservation
Presentation by Tessella: their performance testing of ingest into SDB at FamilySearch
- SDB has been developed since 2002, when the first customer was the National Archives (UK)
- new customer: the UK Parliament
- test with FamilySearch
- 20 TB ingested per day: scanned materials, workflow with antivirus, characterization (PRONOM, JHOVE) etc.
- 1 package is roughly 1 GB, i.e. 20,000 packages per day
- 2 Dell PowerEdge R710 servers, total price at most GBP 20,000
- the limiting factor turned out to be the read speed of the disks on which the data sit at the start of ingest; they therefore needed 130 disks in parallel (GBP 50,000)
- writing to tape is also slow; they therefore needed 8 parallel tape writes (GBP 30,000)
- storage costs GBP 100 per TB
- 7.3 PB per year
- conclusion: writing and reading are slow; tools such as JHOVE and PRONOM are fast enough; storage costs also turned out to be high
To ingest the data from the FamilySearch project they needed to sustain a throughput of 20 TB per day while keeping adequate data-processing procedures as required by OAIS and by the client. The project set out to identify the bottlenecks in ingesting large volumes of data.
Processes such as generating or checking hashes, identifying formats and extracting technical metadata usually require a fast storage system at large volumes. In the FamilySearch project they want to ingest into SDB (content acquisition, content preparation, ingest, fixity check, content/metadata integrity check, characterization, i.e. format identification and validation and extraction of technical metadata) at most 700 MB per second.
The question was how to parallelize such a massive workflow efficiently while minimizing costs. According to their findings, parallelization makes it possible to get around the performance problems of tools such as DROID and JHOVE; contrary to their expectations, overall software performance was not a problem. The bigger problems are in the HW, which has to be able to write fast enough.
In other words, the bottleneck was in the HW and in moving data from place to place rather than in the performance of the digital preservation tools.
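The figures quoted in the presentation can be cross-checked with a few lines of arithmetic. The sketch below is in Python and assumes decimal units (1 TB = 10^12 bytes) and a 365-day year; the variable names are illustrative only.

```python
# Decimal units are an assumption; the presentation does not say which were used.
MB, GB, TB, PB = 10**6, 10**9, 10**12, 10**15
SECONDS_PER_DAY = 24 * 60 * 60

daily_ingest_bytes = 20 * TB        # 20 TB ingested per day
package_bytes = 1 * GB              # one package is roughly 1 GB
storage_cost_gbp_per_tb = 100       # GBP 100 per TB stored

packages_per_day = daily_ingest_bytes // package_bytes
average_rate_mb_s = daily_ingest_bytes / SECONDS_PER_DAY / MB
yearly_volume_pb = daily_ingest_bytes * 365 / PB
yearly_storage_cost_gbp = daily_ingest_bytes * 365 / TB * storage_cost_gbp_per_tb

print(f"packages per day:     {packages_per_day}")
print(f"average ingest rate:  {average_rate_mb_s:.1f} MB/s")
print(f"volume per year:      {yearly_volume_pb:.1f} PB")
print(f"storage cost per year: GBP {yearly_storage_cost_gbp:,.0f}")
```

Under these assumptions the 20,000 packages per day and 7.3 PB per year match the notes, and the average rate of roughly 230 MB/s sits well below the stated 700 MB/s ceiling, which leaves headroom for peaks.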
Benefit for NK
There is no need to worry about the performance of SW such as DROID or JHOVE.
Ross King: Evolving domains, problems and solutions for LT DP
- information on projects such as SCAPE
- programme: http://cordis.europa.eu/fp7/ict/telearn-digicult/report-research-digital-preservation_en.pdf (Stephan Strodl, Petar Petrov and Andreas Rauber, all Vienna University of Technology, Austria). A nice timeline of preservation projects; a white paper about the past of European DP
- funding spent on DP research is gradually growing; projects and funding alone will not solve anything
- ARCOMEM: archiving for web archives, a socially driven web preservation model
- social web analysis
- archive enrichment
- ENSURE: evaluation between cost and value, automation of the preservation cycle; testbeds: healthcare, clinical trials, financial services
- SCAPE
- preservation planning and action workflows: how to make them scalable
- building an infrastructure for scalable preservation actions
- development of a policy-based preservation planning tool with automated preservation watch
- 3 testbeds: WA, large scale repositories, research data sets
- all the projects will produce prototype SW
- digital lifecycle approach
- preservation planning plays a role in all these projects, together with virtualization
- slide with trends in DP over recent years
- Research on Digital Preservation within projects co-funded by the European Union in the ICT
- Ensure
- Scape
- Wf4Ever: http://www.wf4ever-project.org/about
- Timbus: SW alone is not enough; they focus on the context of organizations; LTP is not only about objects but also about services etc.
- Totem
Benefit for NK
Follow projects in the field of long-term preservation of digital data. The latest EU projects such as SCAPE will speed up the development of concrete tools for long-term preservation of digital data.
Record keeping in temporary command settings (Erik Borglund)
- preservation of documentation of crisis situations arising from the activities of the police etc.
- how to capture context? flipcharts, videos and written records can be kept; the context of analogue documents is not a problem, the problem lies with digital items and conversations
- the national archive should take care of this, but it accepts only paper documents or, for example, photographs from the scene
- the question is whether archiving records material is the same as archiving the course of the proceedings in digital form
Web archiving session
BnF
- 200 TB of web-archived data
- 15 million ARC files
- they have to characterize and validate them, which is time-consuming
- they store them in a shared repository
- SPAR (the LTP system of the French national library) has a capacity of 16 PB
- they use JHOVE2 for characterization and are building a module for ARC files
- they do not want to characterize and validate the content of the ARC files, only identify formats
- PREMIS in METS would be too long, so they will record only metadata at the level of the information package (AIP) that are the same for the whole package; that is, a property is stated once and is then followed only by the list of files it applies to, instead of repeating the information for every file
- they created a special metadata format
- this lets them ask the LTP system "give me all information packages that contain format XY" and the like, without having to index the metadata of the contents, which would take a long time; they take the same approach for digitized books
- different DP policies and validation levels for different types of WA data: complete harvests vs. thematic harvests
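The package-level approach (state a property once, then list the files it applies to) can be pictured as a simple grouping. This is a rough Python sketch under that reading of the notes; it is not BnF's metadata format, and all function names, package identifiers and file names are hypothetical.

```python
from collections import defaultdict


def summarize_formats(file_formats: dict) -> dict:
    """Turn {file name: format} into {format: [file names]} for one package.

    Each format is stated once, followed by the files it applies to,
    instead of repeating the format for every file.
    """
    by_format = defaultdict(list)
    for name, fmt in file_formats.items():
        by_format[fmt].append(name)
    return {fmt: sorted(names) for fmt, names in by_format.items()}


def packages_containing(packages: dict, fmt: str) -> list:
    """Answer 'give me all information packages that contain format X'."""
    return sorted(pid for pid, summary in packages.items() if fmt in summary)


# Hypothetical packages and files, for illustration only.
packages = {
    "aip-001": summarize_formats({"a.html": "text/html", "b.html": "text/html",
                                  "c.gif": "image/gif"}),
    "aip-002": summarize_formats({"d.pdf": "application/pdf"}),
}
print(packages_containing(packages, "image/gif"))
```

The query only touches the per-package summaries, which is why no index over the individual content files is needed.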
NL NZ
- 2 harvests, 20 TB in total
- they are working on metadata: how much metadata is a lot and how much is too little?
- library policy says that as much metadata as possible must be stored, which would however be a problem in terms of metadata size
- for selective web harvests they have a finished workflow, WCT; everything is catalogued
IA
- 16 billion URLs
- the oldest from 1996
- the growth is 3 TB per day, 1 PB per year
Euan Cochrane, Dirk von Suchodoletz: Replicating Installed Application and Information Environments onto Emulated or Virtualized Hardware
- capturing and preserving a complete environment on emulated HW
- e.g. taking the prime minister's desktop environment and storing it in an archive
- rendering problems
- computer forensics
- an option for preserving scientific data and records
- the whole thing is about how to replicate an HDD and run the environment stored on it in a virtual environment
- solution:
- they pulled the HDDs out of several old PCs > identify the HW requirements (HDD analysis > automatic estimate of the requirements; this is part of every PC environment) > choose emulation/virtualization SW (a tool registry such as TOTEM from the KEEP project) > convert the HDD into a disk image suitable for emulation > try to boot the disk image on emulated HW > add drivers
- problems with licences, personal data protection and authenticity (in 20% of cases colours change, etc.)
- QEMU, SPARC processor emulator
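The step "try to boot the disk image on emulated HW" could look roughly like the following. This is a hedged Python sketch that only assembles a QEMU command line; the talk names QEMU but not the options used, so the choice of `qemu-system-i386`, the memory size, the raw image format and the file name are all assumptions.

```python
import shlex


def qemu_boot_command(image_path: str, memory_mb: int = 256,
                      binary: str = "qemu-system-i386") -> list:
    """Build a command line that boots a raw disk image in QEMU.

    -snapshot makes QEMU write changes to temporary files, so the archived
    image itself is not modified during the test boot.
    Note: a comma inside image_path would have to be doubled for -drive.
    """
    return [
        binary,
        "-m", str(memory_mb),
        "-drive", f"file={image_path},format=raw",
        "-snapshot",
    ]


# Hypothetical image name; pass the list to subprocess.run() to actually boot.
command = qemu_boot_command("old_pc_hdd.img", memory_mb=128)
print(shlex.join(command))
```

Keeping the command as a list rather than a string avoids shell quoting problems if the workflow later launches it from a script.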
Klaus Rechert: Remote Emulation for Migration Services in a Distributed Preservation Framework
- using emulation as a tool for migration
- tools for migrating certain formats are often not available
- the digital object is placed in an emulated environment (virtual machine); we then see it inside the emulated system, can open it in the original or another suitable application, save it in a different format, and store it back in the virtual machine
Bram Lohman: Emulation as a Business Solution: the Emulation Framework (KEEP project)
- emulation framework: 7 emulators, 6 platforms (x86, Amiga etc.), 23 file formats
- a solution for managing emulation tools
- setup of emulation processes
- an environment that contains emulators; if an application or file is loaded into it, it should run as in its original environment
- the environment also contains a tool that shows, for a given file, what format it is and what environment is needed to run it, based on PRONOM; that environment can then be prepared right away and the file run in it
- it loads the SW image from an application database that OPF is building
Geoffrey Brown: Developing Virtual CD-ROM Collections: The Voyager Company Publications
- publications of one particular publisher on CD, interactive applications for Mac; of 200 published titles only about 50 are available today
- emulation on today's systems
- HDD snapshot directly in the emulator, i.e. it takes one click and is very fast
- SheepShaver emulator
Evaluation of a large Danish migration project
- Before 1998 they had no formats laid down by law.
- Between 2005 and 2008 they introduced standards.
- The evaluation concerns the established standards and the migration to them in the national archive.
- The evaluation was carried out for the funder.
- Between 2005 and 2008 they spent 30 person-years on migration, with 10-15 people working on it; 190 thousand USD was invested; total costs 26 million USD.
- In reality it is not much data: what they migrated was about 1777 GB.
- Various parts of the archive: tapes, population data, data on CD-R, registries and data filled in electronically.
- They could not read all the files, especially on tapes.
- 5 different types of tape.
- Some had to be rescued at great expense.
Total cost of producing the preservation standard: 10 man-years, 12 thousand USD, including the manual and implementation recommendations
Pilot: project planning and management, and verification of the information packages
The aim of the pilot was to obtain a better budget and project plan
Some problematic data in old formats, such as old databases, need smart people doing repetitive, long-running work; they needed good knowledge management for this to be efficient
Method of migration: they wrote the requirements for the tool and a description of how manual migration should be done
Data preparation (restructure data and register metadata of IPs) and preparation of documentation for the migrated IPs
Software development: in-house development
They needed 50 person-years per 1
Conclusions
Migration of standard data is cheaper, migration from some standard tapes is cheaper, etc.
Most of the costs (80%) fell on non-standardized data, in producing the migration software. Developing a tool for migrating heterogeneous or non-standard data is the most expensive part.
What they learned: they had not analysed the old data sufficiently
Project management: it was loose, and they lost money
Knowledge management: a good description of the old data and all their types, generations, locations etc. does not exist at NK and we will have trouble with this; migration of old data at NK will be problematic
Angela Dappert: robust migration workflow for offline media
- What is an archival object? Nice slide: a CD is not an archival object, for them it is a hand-held carrier; better is a bit-stable object, which can have a backup etc., leading to an archival object, which has additional metadata (logical preservation)
- A CD is not searchable, cannot easily be replicated, has a large manual overhead, and its rendering technology becomes obsolete very quickly
- Endangered Archives project: optical disks, CD-R, external HDs, tapes; 67 terabytes in total
- The offline hand-held carriers in the Endangered Archives project were very variable; they contained data with DRM, under copyright, and a number of similar problems
- Options they chose between for each data source:
- a disk image, a single file containing everything on the carrier
- or extraction of only some files
- How important is the carrier itself? We need some information about it; it may hold traces of deleted data that we may want to keep. They made disk images of everything possible: hybrid DVDs, audio discs that also held data, etc.
- Which disk image should they use? Not a single disk image format for all data; a specific disk image format for each
- They did it with robots (disk copying robots); large-scale disk copying robots sometimes could not be used: they are good at producing CDs but not at ripping data from CDs
- They built their own application with disk stacks and some smaller robots; they tried LIFO and FIFO and in the end used FIFO, because LIFO had problems with the CD lifter
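The difference between the two stack policies is only the order in which loaded discs are taken for copying. A minimal Python illustration follows; the function name and the disc labels are hypothetical, and the mechanical problems with the lifter that decided the matter in the project are of course not modelled.

```python
from collections import deque


def processing_order(loaded_discs: list, policy: str) -> list:
    """Return the order in which discs leave the stack.

    FIFO takes the disc that was loaded first; LIFO takes the one loaded last.
    """
    if policy not in ("FIFO", "LIFO"):
        raise ValueError("policy must be 'FIFO' or 'LIFO'")
    stack = deque(loaded_discs)
    order = []
    while stack:
        order.append(stack.popleft() if policy == "FIFO" else stack.pop())
    return order


discs = ["disc-1", "disc-2", "disc-3"]  # in loading order
print(processing_order(discs, "FIFO"))
print(processing_order(discs, "LIFO"))
```

With FIFO the processing log follows the loading order, which makes it easier to match images to the physical carriers afterwards.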
At KB they thought through a fairly complex workflow, how to describe it, etc.
Each robot had its own PC
They had problems with a number of things, see the presentation
They did not find good SW for image management, only command-line tools, and non-technical staff would make a lot of mistakes with those
It takes a lot of people before the content gets online
The staff must be well trained and flexible, but also able to do tedious jobs: systematic, patient
NOTE: important for NK, where the transfer of data from discs will also have to be dealt with, and has already been dealt with
KEEP project, Antonio Ciuffreda: towards integrated migration environment
- Disk transfer tool: converts a disk into an image file. The result also carries additional metadata about the file system, the MD5 of the file, etc.
- KEEP is building a Transfer Tool Framework
- Magnetic media: disk transfer tools for floppy disks, commercial and open source
- Disk2FDI: commercial DOS tool; produces a very precise floppy disk image, but takes 1 hour and the result is very large, ten times larger than the floppy disk itself. Tested on about 2260 disks; they tested emulation
- Catweasel: commercial tool; it is a PCI card, runs on Linux and Win XP, has a GUI. High error rate, but faster; the quality of the image files was low
- Nibtools: free tool; G64 and D64, covers only the C64; DOS, Win, Linux, but command line only. Needs a Commodore disk drive and special cables. They tested a few disks; about half of them then did not work in the emulator
- Optical media: they used 5 transfer tools, all on the same CDs, DVDs and games
1. Alcohol 120%: commercial, can bypass DRM etc., supports Win systems. 12 of 13 worked; 2 MB per second
2. Daemon Tools: commercial, several image file types (ISO, MDS, MDF), supports Win; three of 13 did not work
3. CloneCD: commercial, uses IMG or ISO, bypasses SafeDisc 3 protection, supports Win, has a GUI; 11 of 13 were OK
4. BlindWrite: commercial, supports DVD, Blu-ray, WM, Xbox and other special discs. Generates ISO and some proprietary ISO image formats; one did not work; download speed
5. ImgBurn: does not read subchannel information into the image file (a film cannot be seeked, etc.); it is open source; generates DVD, BIN, CUE, IMG; Win and Linux; 4 did not work; it is fast
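The basic idea of a disk transfer tool, as described at the start of this talk (copy the carrier into an image file and record metadata such as an MD5), can be sketched as follows. This is an illustrative Python sketch, not the KEEP tool; the function name, the JSON sidecar file and its fields are assumptions, and a real tool would also record file system information and handle read errors.

```python
import hashlib
import json
from pathlib import Path


def transfer_to_image(source_path, image_path, chunk_size=1024 * 1024):
    """Copy a readable source (device or file) into an image file.

    The MD5 is computed while copying, so the data are read only once.
    A small JSON sidecar with the metadata is written next to the image.
    """
    md5 = hashlib.md5()
    size = 0
    with open(source_path, "rb") as src, open(image_path, "wb") as dst:
        while chunk := src.read(chunk_size):
            dst.write(chunk)
            md5.update(chunk)
            size += len(chunk)
    metadata = {
        "source": str(source_path),
        "image": str(image_path),
        "size_bytes": size,
        "md5": md5.hexdigest(),
    }
    Path(str(image_path) + ".json").write_text(json.dumps(metadata, indent=2))
    return metadata
```

On Linux the source could be a block device such as an optical drive, provided the process has read access; copy-protected discs, as the notes point out, need specialised tools.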
Conclusions
For magnetic media, commercial and non-commercial tools do not differ in performance; Disk2FDI is precise but very slow. KEEP will use NibTools.
Optical media: copy protection has to be kept in mind; the tools perform similarly; there will always be errors in the images, between 30 and 10 percent; BlindWrite can handle game discs (Xbox etc.). KEEP will use ImgBurn because it is open source.
For complex images BlindWrite is better.
Benefit for NK
Consider whether it would be appropriate for NK to actually run a project to migrate the content of CDs and DVDs to online media. The presenters share concrete experience with robotic processing and show the problems they had with designing the workflow, choosing the type of ISO image, etc.
BnF web archiving
They have three layers:
Harvest definition: collection
Harvest instance: crawling metadata
ARC files
A collection will be, for example, "selective, elections 2000"; then come the individual harvest instances, and then the ARC files. They collect the logs, config and reports. These are also stored in ARC, in a special metadata ARC for each crawling instance.
PREMIS
Object, agent, event
Objects: 1. ARC files and metadata ARCs, 2. harvest instances
Harvest event in PREMIS: event = creation of content files
Events: reports as an extension of the event (host report and harvest report)
Agents: administrator, SW, and the institutions/organizations that perform the harvest
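The mapping of a harvest onto PREMIS objects, events and agents can be pictured with a small data model. The following is a rough Python sketch, not BnF's schema and not PREMIS XML; all class names, field names and identifiers are hypothetical.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Agent:
    name: str
    agent_type: str  # e.g. "administrator", "software", "organization"


@dataclass
class PremisObject:
    identifier: str
    object_type: str  # e.g. "ARC file", "metadata ARC", "harvest instance"


@dataclass
class Event:
    event_type: str
    agents: List[Agent] = field(default_factory=list)
    objects: List[PremisObject] = field(default_factory=list)
    reports: List[str] = field(default_factory=list)  # extension: host/harvest report


# One harvest instance producing one content ARC and one metadata ARC.
harvest = Event(
    event_type="creation of content files",
    agents=[Agent("crawler", "software"), Agent("harvesting institution", "organization")],
    objects=[
        PremisObject("harvest-0001", "harvest instance"),
        PremisObject("harvest-0001-00001.arc", "ARC file"),
        PremisObject("harvest-0001-metadata.arc", "metadata ARC"),
    ],
    reports=["host report", "harvest report"],
)
print([o.identifier for o in harvest.objects if o.object_type == "ARC file"])
```

The point of the model is that logs, configuration and reports travel with the harvest event rather than being repeated for every ARC file.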
ContainerMD
http://bibnum.bnf.fr/containerMD-v1/documentation/containerMD-v1.html
special metadata for items from the web archive
http://bibnum.bnf.fr/containerMD-v1
different SLAs for different types of material and for different data from different harvests; with a shared repository they expect various benefits: skills for different formats need not be duplicated within the institution
next year there should also be a JHOVE2 module for WARC
Memento: a meta search engine
Benefit for NK
Their web archiving model could be used at NK.
Cost models: Danish national library + TU Wien
- Stephan Strodl, TU Wien: they have their own cost model, but it seems to cover only the cost of small-scale automated preservation actions
- Danish national library: they built their own model, which is meant to be universal and usable anywhere
- They measured the cost of submission according to the PAIMAS standard
- When calculating costs they use OAIS and PAIMAS: they map activities onto these models and then estimate the costs of the processes accordingly
- costmodelfordigitalpreservation.dk
Benefit for NK
Relevant to project 0136, which dealt with the options for estimating the costs of long-term storage.
Meet RODA, a Full-Fledged Digital Repository for Long-Term Preservation
- Originally a project of the Portuguese national archive, which we have been following for several years. The RODA system is now supported, and partly also further developed, by an independent company. So far RODA supports only an archival metadata format (EAD), but further development should also cover library formats
- RODA is now part of the SCAPE project, within which the system can be further developed and scaled for use in mass production
- http://redmine.keep.pt/projects/roda-public
Benefit for NK
Follow further development; possibly also relevant to the INCAD + KNAV projects. For developing LTP for smaller institutions this could be an interesting alternative in the future.
Report on a foreign business trip
Name and surname of the participant: Mgr. Andrea Fojtů (AF)
Workplace (per organizational structure): 1.5 Department of Long-Term Preservation of Digital Data (ODODD)
Workplace (assignment): 1.5.2 Department of Digital Repository Content Management
Reason for the trip: participation in the conference Aligning National Approaches to Digital Preservation
Place (city): Tallinn
Place (country): Estonia
Dates (from-to): 22-26 May 2011
Detailed schedule:
Monday 23 May: conference registration, guided tour of the National Library; Keynote Address by Laura Campbell (Library of Congress, USA); Panel 1: Technical Alignment; Panel 2: Organizational Alignment
Tuesday 24 May: Keynote Address by Gunnar Sahlin (National Library of Sweden); Panel 3: Standards Alignment; Panel 4: Legal Alignment; Breakout Sessions for panels 3 & 4
Wednesday 25 May: Panel 5: Education Alignment; Panel 6: Economic Alignment; Breakout Sessions for panels 5 & 6; Synthesis; Closing remarks
Fellow travellers from NK: PhDr. Bohdana Stoklasová (BS), Ing. Tomáš Svoboda (TS)
Funding: IOP NDK
Purpose of the trip: attendance at a conference with international participation; making contacts in the field of long-term preservation of digital documents and electronic legal deposit; a more detailed insight into the issues of long-term preservation of digital documents, especially in national libraries
Fulfilment of the trip's goals (specifically): The conclusions of the conference Aligning National Approaches to Digital Preservation largely mirror the conclusions of the workshop The Future of the Past: Shaping new visions for EU research in digital preservation (report available at http://cordis.europa.eu/fp7/ict/telearn-digicult/future-of-the-past_en.pdf), e.g. regarding the missing economic model for the commercial sector that would treat long-term preservation as an integral part of all its processes. Contact was established with staff of the national libraries of Estonia and Finland (staff working on web archiving and on long-term preservation in general).
Programme and further details: The main goal of the conference was to align national approaches to long-term preservation of digital documents across all areas, from technical, organizational and educational to standardization, economic and financial.
Materials brought back: conference programme, leaflets of exhibiting companies (Tessella, Equella, Guardtime) + further materials, notes
Date of report submission: 8.6.2011
Signature of the submitter:
Signature of the supervisor:
Posted on the Intranet:
Received by the international department:
Annex to this report: Conference notes in English
Annex: Conference notes in English
Exploring What We Can Do Together Strategic Alliance for International Collaboration Laura Campbell
- 185 digital preservation partners in more than 25 countries (education, research, LAM)
- strategic goals: National Content Stewardship Network (national digital collection, technical architecture, public policy, outreach)
- NDIIPP Content Domain Map: a mind map of geospatial, audiovisual, image & text, and web content
- then & now: cognitive surplus vs digital libraries / digital preservation
- solution framework: actively working together, special interest groups, establishing a common index, international digital collection (freely available)
PANEL 1 Technical Alignment (The role of testing) Prof Dr Michael Seadle (Panel Head)
to collaborate on requiring and implementing rigorous and independent tests
DNB Contribution to the Tallinn Alignment Sabine Schrimpf
- key theme is infrastructure: a network of hardware and software that permits the operation of application SW
- the question of interoperability is crucial (standards, technical specifications)
- SW elements / components of the DP infrastructure were compared to pallets on railroads
o source: PARSE.Insight Roadmap 2020
- persistent ID resolvers, certification process
- kopal (ingest: koLibRI)
- nestor (German Network of Expertise)
- DP4Lib (Digital Preservation as a Service): reduce dependency between components
o redundant storage at different locations
o koLibRI modules
- LuKII: set up as an economical LOCKSS network in Germany
- SHAMAN
- APARSEN: wants to bring coherence and cohesion to digital preservation research
o trends in DP research projects
o modular DP systems
o distributed as SOA
o elimination of technology dependencies
EDINA THE UK LOCKSS Alliance Adam Rusbridge
- EDINA offers underlying technical support & coordination
- threats to digital stewardship:
o failure (media, HW, SW, network, format obsolescence, natural disaster, economic/organizational failure)
o attack (insider/outsider)
o operator error
- source: Requirement to Digital Preservation
- projects PEPRS and PECAN help identify coverage and requirements for DP
Public testing Michael Seadle
- traditional physical archiving relies heavily on trusted institutions
- distrust, not trust, needs to be the basis of digital archiving; testing therefore plays a key role
- goals of testing: demonstrate functionality, reveal weaknesses, provide data for planning improvements
- key issues for testing: integrity, authenticity (can the origin or genuineness be shown?), usability (can migration/emulation be demonstrated?), access, financial integrity
- DrWho (drwho1com)
- bit stream testing is the most important; authenticity and usability may be impaired
o the type of storage media, the number of copies + the frequency of checking and replacement get us to the relevant results
o no reliable metrics exist, however (what is an acceptable loss, etc.)
- without well documented, peer reviewed, publicly available test results, librarians are buying archiving systems on faith
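
The bit stream testing mentioned in these notes can be illustrated with a minimal sketch. It assumes that several copies of one object are reachable as local files and that a copy is suspect when its checksum differs from the most common one. The function names `sha256_of` and `audit_copies` are hypothetical and not taken from the talk. If there is a tie between digests, the digest of the first listed copy wins.

```python
import hashlib
from collections import Counter


def sha256_of(path, chunk_size=1 << 20):
    """Return the SHA-256 hex digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def audit_copies(paths):
    """Compare the digests of several copies of one object.

    Returns (majority_digest, damaged_paths). A copy counts as damaged
    when its digest differs from the most common digest.
    """
    digests = {str(p): sha256_of(p) for p in paths}
    majority, _ = Counter(digests.values()).most_common(1)[0]
    damaged = sorted(p for p, d in digests.items() if d != majority)
    return majority, damaged
```

Running such an audit on a schedule and logging how often copies need repair would give the kind of data on copies and checking frequency that the notes say is still missing.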
Presentation without a title Andy Rauber
- evaluation vs testing vs benchmarking
- DP today does evaluation rather than testing and is far from benchmarking (a few tests, but nowhere near a definition of benchmarks)
o existing evaluations are not repeatable
o focus on the simple things
o building the frameworks before having clear test scenarios
- necessary to move towards comparative benchmarking
- what is needed: a commitment to a culture of benchmarking and comparative evaluation; an understanding of what we want to benchmark; benchmark data + ground truth; measurement scales and measures that remain constant; knowledge bases to collect these
Organizing digital preservation on an international level Michelle Gallinger (NDIIPP)
- focus on a national DP agenda
- community driven, action oriented (National Digital Stewardship Alliance)
o present a distributed national digital collection for the benefit of citizens
The European Research Arena David Giaretta
- technologies: GEANT, EGEE/EGI
- EU research projects: TIMBUS, BLOG4EVER, SCAPE, ENSURE, APARSEN, ARCOMEM, WF4EVER
o SCIDIP-ES (2011-2014)
- Alliance for Permanent Access (APA) formed as a legal entity 3 years ago
o opportunities for networking
- ISO 16363 Audit and Certification of Trustworthy Digital Repositories
- ISO 16919 Requirements for Bodies Providing Audit and Certification
Observation from the MetaArchive Cooperative program Martin Halbert
- distributed DP programs differ from other programs
o replication of content, distribution of these replicated copies to distinct geographical locations, and network organization to connect these replicated copies
- MetaArchive established in 2003, funded by NDIIPP
o seeks to foster broader awareness of digital preservation issues
- IIPC: members are all institutions that focus on WA
o 39 members: national + university libraries + other organizations (Internet Archive)
o ISO standard WARC format for web archives + Heritrix and NutchWAX
o growing membership (Africa & South America)
DAY 2
Keynote address
International and National Collaboration in the digital age Gunnar Sahlin
- 2012: a new law for e-deposit
- Samsök (search together) in 2005 (upgrade, new system)
- SwePub and long term preservation
- Consortium of the Swedish research libraries for licensing e-journals and databases (ICOLC)
- Open Access and e-publishing (all universities have their repositories for e-pubs)
- NL as aggregator for Europeana (TEL, Apres, Athena, EUscreen)
o common system for the preservation of digital materials
o common search portal for materials from the Swedish National Library and Swedish National Archive
Raivo Ruusalepp
- standards: RAC, DSA, CIDOC (CRM), PAIMAS, ISAD(G), DDI
- use of information security standards for digital preservation
- information security: administrative and technical (physical = data security vs IT = communication)
- company implemented security measurement with typical cyber crime scenarios
- survey of security
o provision for information security in national legislation and development plans (12 of the respondents ISO 27000 series; only 2 formal audits; the rest are looking into it; 1/2 of the respondents do not use standards or formal measurements to control information security)
o IT & disaster plan 65 (data recovery from the off-site location tested 0)
- alignment
o better use of community standards for information security and preservation
o agreement on security requirements
Standards based approach to preservation planning Matthew Woollard
- ISO 27001: very expensive implementation (100 000 pounds)
- Basic: Data Seal of Approval Guidelines (helps understand your business better)
- Audit and Certification of Trustworthy Digital Repositories
- Memorandum of Understanding to Create a European Framework
- ISO 16363 external audit; DSA or ISO 16363 self audit
Best Practices amp Standards Bram van der Werf
- self assessment: trust
- audits/certification: trust
- ISO 30300 (draft) Record Management
PANEL 4 Legal Alignment
Legal deposit & Web Archiving Adrienne Muir
- legal deposit provisions: purpose, scope, deposit mechanisms, roles & responsibilities & liability, access provisions, sanctions
- implementation: definitions, scope (offline/online, freely available/pay wall), technology neutral, incremental
- legal deposit vs voluntary (interim/hybrid approach, model agreements and licenses, flexibility)
- other legal issues: intellectual property rights, preservation, access, unlawful material, privacy/data protection
- voluntary approaches have disadvantages but may be necessary and can be useful
Breakout Session
- standards bring along alignment by themselves, but only if you use them to the full, not halfway
- depends on community (users): enforceable or voluntary compliance (standards)
- next steps for standard alignment:
o corpora as benchmarks
o export/import completeness
o educational standards
o validation tools
o accredited training courses to accredit auditors
o framework standards
DAY 3
PANEL 5 Educational Alignment
- key elements of the DCC Curation Lifecycle Model
- Framework for the Education Alignment (USA: grad programs for digital preservation, new models for grad programs, internship programs (diverse knowledge), workshops)
o sharing tools
o national programs
- related issues to digital preservation
o nature of costs and business models
o strategies for selection & appraisal
o ground roles and responsibilities
o effectiveness and demand for services
- focus of the panel: factors influencing the actual sustainability of a digital archive
- 2 considerations: collaboration + user demand
- challenges + gaps: span national boundaries, public + private funding, education, exportation, DP certification, competition, funding gaps, policy, selection criteria, roles & responsibilities, standards
- Magazzini Digitali: e-legal deposit in Italy
- PADI: a failed project (discontinued in 2010)
Presentation without a title Neil Grindley
- how much does it cost to manage information? what institutional financial strategies are required to facilitate effective preservation? what general economic frameworks are required to enable information to persist and be accessible?
- JISC 2010 Infrastructure for Education and Research Programme
- archival storage and preservation activities constitute a very small proportion of the overall costs: 1531 access 55 outreach acquisition ingest
o approx 333 Euros for a set of 1000 records
- KRDS2 p 83: future tool development supporting automation of ingest
Sustainable Preservation in North America ADPNet amp Friends Aaron Trehub
- solution: distributed digital preservation (in at least 3 copies vs LOCKSS 6 copies)
- DDP + LOCKSS = PLN (Private LOCKSS Network); open SW developed at Stanford
- MetaArchive, COPPUL (Canada), ADPNet (www.adpn.org)
CO-FINANCED FROM THE EU STRUCTURAL FUNDS (EUROPEAN REGIONAL DEVELOPMENT FUND) THROUGH THE IOP
Do not be afraid of the performance of SW such as DROID or JHOVE.
Ross King: Evolving domains, problems and solutions for LT DP
- information about projects such as SCAPE
- programme: http://cordis.europa.eu/fp7/ict/telearn-digicult/report-research-digital-preservation_en.pdf (Stephan Strodl, Petar Petrov and Andreas Rauber, all Vienna University of Technology, Austria)
- a nice timeline of preservation projects; a white paper about the past of European DP
- funding spent on DP research is gradually growing; projects and funding will not solve anything
- ARCOMEM: archiving by web archives, socially driven web preservation model
- social web analysis
- archive enrichment
- ENSURE: evaluation between cost and value, automation of the preservation cycle; testbeds: healthcare, clinical trials, financial services
- SCAPE
- preservation planning and action workflows: how to make them scalable
- building an infrastructure for scalable preservation actions
- development of policy based preservation planning tools with automatic preservation watch
- 3 testbeds: WA, large scale repositories, research data sets
- all projects will produce prototype SW
- digital lifecycle approach
- preservation planning plays a role in all these projects, together with virtualization
- slide with trends in DP over recent years
- Research on Digital Preservation within projects co-funded by the European Union in the ICT
- Ensure
- Scape
- Wf4Ever: http://www.wf4ever-project.org/about
- Timbus: SW is not enough; it concentrates on the context of organizations; LTP is not only about objects but also about services etc.
- Totem
Benefit for NK
Follow projects in the area of long-term preservation of digital data. The latest EU projects such as SCAPE will speed up the development of concrete tools for long-term preservation of digital data.
Record keeping in temporary command settings Erik Borglund
- preservation of documentation on crisis situations arising from the activities of the police and similar bodies
- how to capture context: flipcharts, videos and written records can be kept, but the context of analogue documents is not the problem; the problem is with digital items and conversations
- the national archive should take care of this, but it only accepts paper documents or, for example, photos from the site of the meeting
- question: is archiving file material the same as archiving the course of a meeting in digital form?
webarchiving session
BnF
- 200 TB of web-archived data, 15 million ARC files
- they have to characterize and validate them, which is time-consuming
- they store them in a shared repository
- SPAR (the LTP system of the French national library) has a capacity of 16 PB
- they use JHOVE2 for characterization and are building a module for ARC files
- they do not want to characterize and validate the content of the ARC files, only identify the formats
- PREMIS in METS would be too long, so they will record only metadata at the level of the information package (AIP) that are the same for the whole package; that is, a property is stated once and only the information about which files it applies to is added, instead of repeating the information for every file
- they created a special metadata format
- they are therefore able to ask the LTP system: give me all information packages that contain format XY, etc.; there is no need to index the metadata of the contents, which would take a long time
- they take the same approach for digitized books
- different DP policies and levels of validation for different types of WA data (complete harvests vs thematic harvests)
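
The package-level approach described for BnF (state a property once and list the files it applies to) can be sketched as a small data transformation. This is only an illustration under assumptions: the function names `summarize_package` and `packages_containing`, the dictionary layout and the sample identifiers are hypothetical and do not reflect the actual SPAR metadata format.

```python
from collections import defaultdict


def summarize_package(file_formats):
    """Turn a per-file mapping {file name: format} into a package-level
    mapping {format: sorted list of file names}, so each format is
    stated once instead of being repeated for every file."""
    by_format = defaultdict(list)
    for name, fmt in file_formats.items():
        by_format[fmt].append(name)
    return {fmt: sorted(names) for fmt, names in by_format.items()}


def packages_containing(packages, fmt):
    """Return the ids of packages whose summary mentions the format."""
    return sorted(pid for pid, summary in packages.items() if fmt in summary)


# Hypothetical sample data: two information packages.
packages = {
    "aip-1": summarize_package({"a.arc": "ARC", "b.arc": "ARC", "crawl.log": "text/plain"}),
    "aip-2": summarize_package({"scan.tif": "TIFF"}),
}
```

With this layout, a question such as "which packages contain format XY" only touches the package summaries and never the content of the individual files.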
NL NZ
- 2 harvests, 20 TB in total
- they are dealing with metadata: how much metadata is a lot and how much is too little
- library policy says that as much metadata as possible must be stored, but that would be a problem in terms of metadata size
- for selective web harvesting they have a finished workflow (WCT); everything is catalogued
IA
- 16 billion URLs
- the oldest from 1996
- growth is 3 TB per day, 1 PB per year
Euan Cochrane, Dirk von Suchodoletz: Replicating Installed Application and Information Environments onto Emulated or Virtualized Hardware
- capturing and preserving a complete environment on emulated HW
- e.g. take the desktop environment of the prime minister and store it in the archive
- problems with display
- computer forensics
- an option for the preservation of scientific data and records
- the whole thing is about how to replicate an HDD and run the environment stored on it in a virtual environment
- solution
- they pulled the HDDs out of several old PCs > identify the HW requirements (analysis of the HDD > automatic estimate of requirements; it is part of every PC environment) > select emulation/virtualization SW (a tool registry such as TOTEM from the KEEP project) > convert the HDD into a disk image suitable for emulation > try to boot the disk image on emulated HW > add drivers
- problems with licences, personal data protection and authenticity (20% of things change, colours etc.)
- QEMU SPARC processor emulator
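
The last steps of the workflow above (convert the disk dump, then try to boot it on emulated hardware) could look roughly like the following sketch. It assumes that QEMU is installed and that the captured machine was an x86 PC; the talk itself only mentions QEMU as a SPARC emulator, so the `qemu-system-i386` target, the memory size and the helper names are assumptions. The functions only build the command lines and do not run anything.

```python
def convert_command(raw_image, qcow2_image):
    """Command line that converts a raw disk dump to the qcow2 format."""
    return ["qemu-img", "convert", "-f", "raw", "-O", "qcow2",
            raw_image, qcow2_image]


def boot_command(disk_image, memory_mb=256, image_format="qcow2"):
    """Command line that boots a disk image on emulated x86 hardware.

    The -snapshot option makes QEMU write changes to a temporary file,
    so the archived image itself stays unmodified.
    """
    return ["qemu-system-i386",
            "-m", str(memory_mb),
            "-drive", "file={},format={}".format(disk_image, image_format),
            "-snapshot"]
```

Either list can be passed to `subprocess.run(command, check=True)` on a machine where QEMU is available.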
Klaus Rechert: Remote Emulation for Migration Services in a Distributed Preservation Framework
- use of emulation as a tool for migration
- tools for migrating certain formats are often not available
- the digital object is placed into the emulated environment (virtual machine); we then see it inside the emulated system, can open it in the original or another suitable application, save it in a different format and store it again in the virtual machine
Bram Lohman: Emulation as a Business Solution: the Emulation Framework (KEEP project)
- emulation framework: 7 emulators, 6 platforms (x86, Amiga and others), 23 file formats
- a solution for managing emulation tools
- setup of emulation processes
- an environment that contains emulators; if we load an application or a file into it, it should run as in the original environment
- the environment also contains a tool that shows, for a given file, what format it is and what environment is needed to run it, based on PRONOM; the environment can then be prepared directly and the file run in it
- it loads the SW image from the OPF application database, which is being built
Geoffrey Brown: Developing Virtual CD-ROM Collections: The Voyager Company Publications
- publications of one particular publisher on CD, interactive applications for Mac; of 200 published titles only about 50 are available today
- emulation on today's systems
- HDD snapshot directly in the emulator, i.e. it takes one click and is very fast
- SheepShaver emulator
Evaluation of Danish large migration project
Before 1998 they had no formats prescribed by law.
Between 2005 and 8 they introduced standards.
The evaluation concerns the prescribed standards and the migration into them at the national archive.
The evaluation was done for the funder.
Between 2005 and 8 they spent 30 person years on the migration, with 10-15 people working on it; 190 thousand USD was invested; total costs 26 million USD.
In reality it is not much data: what they migrated was about 1777 GB.
Various parts of the archive: tapes, population data, data on CD-R, registries and data filled in electronically.
They could not read all the files, especially on tapes.
5 different types of tapes.
Some had to be rescued at great expense.
Total cost of producing the preservation standard: 10 man years, 12 thousand USD, including the manual and implementation recommendations.
Pilot: project planning and management, and verification of the information packages.
The goal of the pilot was to obtain a better budget and project plan.
Some problematic data in old formats, such as old databases, need smart people who do repetitive, long-running work; they needed good knowledge management to make this efficient.
Migration method: they wrote the requirements for the tool and a description of how manual migration should be done.
Data preparation (restructure the data and register the metadata of the IP) and preparation of the documentation of the migrated IPs.
Software development: in-house development.
They needed 50 person years for 1
Conclusions
Migration of standard data is cheaper, migration from some standard tapes is cheaper, etc.
Most of the costs (80%) fell on non-standardized data, in the production of migration software. Developing a tool for migrating heterogeneous or non-standard data is the most expensive part.
What they learned: they had not analysed the old data sufficiently.
Project management was loose; they lost money.
Knowledge management: a good description of old data and all their types, generations, locations etc. does not exist at our institution and we will have difficulties with it; migration of old data at NK will be problematic.
Angela Dappert: robust migration workflow for offline media
- What is an archival object (nice slide): a CD is not an archive object, for them it is a hand held carrier; a bit stable object is better, because it can have a backup etc., and leads to an archival object that has additional metadata (logical preservation)
- A CD is not searchable, cannot be replicated easily, has a large manual overhead, and the rendering technology becomes obsolete very quickly
- Project Endangered Archives: optical disks, CD-R, external HD, tapes; 67 terabytes in total
- The OFFLINE hand held carriers in the Endangered Archives project were very variable; they contained data with DRM, under copyright, and a number of such problems
- The options they decided between for each data source:
- Disk image: one file that contains everything that is on the carrier
- Or extraction of selected files only
- How important is the carrier itself? We need some information about it; it may hold traces of deleted data that we may want to keep. They made disk images of everything possible: hybrid DVDs, audio discs that also held data, etc.
- Which disk image should they use? Not a single disk image format for all data; a special disk image format for each type
- They did it with robots (disk copying robots); sometimes large scale disk copying robots could not be used, because they are good at producing CDs but not at ripping data from CDs
- They built their own application with disk stacks and some smaller robots; they tried LIFO and FIFO and finally used FIFO, because LIFO had problems with the CD lifter
At KB they thought through a fairly complex workflow for how to describe it, etc.
They had a PC at each robot.
They had problems with a number of things (see presentation).
They did not find good SW for image management, only command line tools, and non-technical staff would make a lot of mistakes.
It takes a lot of people before the content gets online.
Staff must be well trained and flexible, but also able to do tedious jobs: systematic, patient.
NOTE: important for NK, where the transfer of data from discs will also have to be addressed and has already been addressed.
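
The difference between the two picking policies mentioned for the disc robots can be shown with a short sketch. The function name `processing_order` and the representation of a disc stack as a list in loading order are assumptions made for illustration; they do not describe the actual robot software.

```python
from collections import deque


def processing_order(discs, policy):
    """Return the order in which a robot would pick discs from a stack.

    discs are listed in loading order; "fifo" picks the first loaded
    disc first, "lifo" picks the most recently loaded disc first.
    """
    if policy not in ("fifo", "lifo"):
        raise ValueError("policy must be 'fifo' or 'lifo'")
    stack = deque(discs)
    order = []
    while stack:
        order.append(stack.popleft() if policy == "fifo" else stack.pop())
    return order
```

With FIFO the processing log follows the loading order, which makes it easier to match log entries to the physical discs.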
KEEP project, Antonio Ciuffreda: towards integrated migration environment
- Disk transfer tool: converts a disk into an image file; it also records further metadata about the file system, the MD5 of the file, etc.
- KEEP is producing a Transfer Tool Framework
- Magnetic media: disk transfer tools for floppy disks, commercial and open source
- Disk2FDI: commercial DOS tool, very accurate floppy disk image; it takes 1 hour and the result is very large, ten times larger than the floppy disk itself. Tested on about 2260 disks; they tested emulation
- Catweasel: commercial tool, a PCI card, runs on Linux and Win XP, has a GUI. High error rate but faster; the quality of the image file was low
- Nibtools: free tool, G64 and D64, covers only C64; DOS, Win, Linux, but command line only. Needs a Commodore disk drive and special cables. They tested a few disks; about half then did not work in the emulator
- Optical media: they used 5 transfer tools, all with the same CDs, DVDs and games
1 Alcohol 120%: commercial, can bypass DRM etc., supports Win systems. 12 of 13 worked; 2 MB per second
2 Daemon Tools: commercial, several types of image files (ISO, MDS, MDF), supports Win; three of 13 did not work
3 CloneCD: commercial, uses IMG or ISO, bypasses SafeDisc 3 protection, supports Win, has a GUI; 11 of 13 were OK
4 BlindWrite: commercial, supports DVD, Blu-ray, WM, Xbox and other special discs. Generates ISO and a proprietary ISO image format; one did not work; download speed
5 ImgBurn: does not read subchannel information into the image file (a film cannot be fast-forwarded etc.); open source; generates DVD, BIN, CUE, IMG; Win and Linux; 4 did not work; it is fast
Conclusions
For magnetic media, commercial and non-commercial tools do not differ in performance; Disk2FDI is accurate but very slow. KEEP will use NibTools.
Optical: copy protection must be considered; the tools have similar performance; there will always be errors in the images, between 30 and 10 percent; BlindWrite can handle game discs, Xbox etc. KEEP will use ImgBurn because it is open source.
For complex images BlindWrite is better.
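
What a disk transfer tool of the kind described above does at its core (copy the carrier into an image file and record metadata such as an MD5 checksum) can be sketched as follows. This is a simplified illustration, not the KEEP Transfer Tool Framework: the function name `transfer_to_image`, the JSON sidecar file and its field names are assumptions, and the sketch ignores copy protection, read errors and subchannel data.

```python
import hashlib
import json
import os


def transfer_to_image(source, image_path, chunk_size=1 << 20):
    """Copy a byte source to an image file and describe the result.

    source is a path to a readable file or device node. Returns a dict
    with the image name, size in bytes and MD5 checksum, and writes the
    same dict as JSON next to the image.
    """
    md5 = hashlib.md5()
    size = 0
    with open(source, "rb") as src, open(image_path, "wb") as dst:
        for chunk in iter(lambda: src.read(chunk_size), b""):
            dst.write(chunk)
            md5.update(chunk)
            size += len(chunk)
    record = {"image": os.path.basename(image_path),
              "size_bytes": size,
              "md5": md5.hexdigest()}
    with open(image_path + ".json", "w", encoding="utf-8") as sidecar:
        json.dump(record, sidecar, indent=2)
    return record
```

Recomputing the checksum of the stored image later and comparing it with the sidecar record gives a simple check that the image has not changed since the transfer.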
Benefit for NK
Consider whether NK should indeed run a project to migrate the content of CDs and DVDs to online media. The presentations give concrete experience with robotic processing and show what problems the teams had with designing the workflow, choosing the type of ISO image, etc.
BnF web archiving
They have three layers:
- Harvest definition: collection
- Harvest instance: crawling metadata
- ARC files
A collection will be, for example, the selective harvest "elections 2000", then the individual harvest instances and then the ARC files.
They collect logs, config and reports.
These are also stored in ARC: a special metadata ARC for each crawling instance.
PREMIS
Object, agent, event
Objects:
1 ARC files and metadata ARCs
2 harvest instances
Harvest event in PREMIS: event creation of content files
Events: reports as extensions of the event (host report and harvest report)
Agents: administrator, SW, institutions and organizations that perform the harvest
ContainerMD
http://bibnum.bnf.fr/containerMD-v1/documentation/containerMD-v1.html
special metadata for items from the web archive
http://bibnum.bnf.fr/containerMD-v1
Different SLAs for different types of material and for different data from different harvests; shared repository; they expect different benefits; skills for different formats do not need to be duplicated within the institution.
Next year there should also be a JHOVE2 module for WARC.
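
The object, event and agent entities noted above for the BnF harvests can be pictured with a small data model. This is only a sketch of the relationships as they appear in these notes; the class and field names are hypothetical and are not the PREMIS schema or the BnF implementation.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Agent:
    name: str
    kind: str  # e.g. "administrator", "software", "organization"


@dataclass
class Event:
    event_type: str  # e.g. "harvest"
    agents: List[Agent]
    reports: List[str] = field(default_factory=list)  # host/harvest reports


@dataclass
class PreservedObject:
    identifier: str
    kind: str  # "arc file", "metadata arc" or "harvest instance"
    events: List[Event] = field(default_factory=list)


def objects_touched_by(objects, agent_name):
    """Identifiers of objects with an event performed by the named agent."""
    return [o.identifier for o in objects
            if any(a.name == agent_name for e in o.events for a in e.agents)]
```

Attaching the host and harvest reports to the event rather than to each ARC file mirrors the idea in the notes that reports are extensions of the harvest event.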
Memento: meta search engine
Benefit for NK
Their web archiving model could be used at NK.
Cost models: Danish national library + TU Wien
- Stephan Strodl, TU Wien: they have their own cost model, but only small scale; automated preservation action cost seems
- Danish national library: they built their own model, which should be universal and usable anywhere
- They measured the cost of submission according to the PAIMAS standard
- When calculating costs they use OAIS and PAIMAS: they map activities onto these models and then estimate the prices of the processes accordingly
- costmodelfordigitalpreservation.dk
Benefit for NK
Relevant to project 0136, which dealt with options for estimating the costs of long-term storage.
Meet RODA, a Full-Fledged Digital Repository for Long-Term Preservation
- Originally a project of the Portuguese national archive, which we have been following for several years. The RODA system is now supported by an independent company, which also partly continues its development. So far RODA supports only the archival metadata format (EAD), but further development should also include library formats
- RODA is now part of the SCAPE project, where it will be possible to develop the system further and scale it for use in large-scale production
- http://redmine.keep.pt/projects/roda-public
Benefit for NK
Follow further development, possibly also for the INCAD + KNAV projects; for LTP development in smaller institutions this could be an interesting alternative in the future.
Report on a foreign business trip
Name of the trip participant: Mgr. Andrea Fojtů (AF)
Pracovit dle organiza niacute struktury 15 Odd leniacute dlouhodobeacute ochrany digitaacutelniacutech dat (ODODD)Pracovit za azeniacute 152 Odd leniacute spraacutevy obsahu digitaacutelniacuteho repozitaacute eD vod cesty Uacute ast na konferenci Aligning National Approches to Digital
preservationMiacutesto m sto TallinnMiacutesto zem EstonskoDatum (od do) 22 265 2011Podrobnyacute asovyacute harmonogram Pond liacute 235 registrace na konferenci komentovanaacute prohliacutedka
Naacuterodniacute knihovny Keynote Address by Laura Campbell Kongresovaacute knihovna USAPanel 1 Technical AlignmentPanel 2 Organizational Alignment
Uacuteteryacute 245 Keynote Address by Gunnar Sahlin Naacuterodniacuteknihovna veacutedskaPanel 3 Standards AlignmentPanel 4 Legal AlignmentBreakout Sessions for panels 3 amp 4
St eda 255Panel 5 Education AlignmentPanel 6 Economic AlignmentBreakout Sessions for panels 5amp6SynthesisClosing remarks
Spolucestujiacuteciacute z NK PhDr Bohdana Stoklasovaacute (BS)Ing Tomaacute Svoboda (TS)
Finan niacute zajit niacute IOP NDKCiacutele cesty P iacutetomnost na konferenci s mezinaacuterodniacute uacute astiacute ziacuteskaacuteniacute kontakt
pro oblast dlouhodobeacute ochrany digitaacutelniacuteho dokument apovinneacuteho elektronickeacuteho vyacutetisku podrobn jiacute vhled toproblematiky dlouhodobeacute ochrany digitaacutelniacutech dokument(zejmeacutena) v naacuterodniacutech knihovnaacutech
Pln niacute ciacutel cesty (konkreacutetn ) Zaacutev ry konference Aligning National Approaches to DigitalPreservation vesm s kopiacuterujiacute zaacutev ry wokshopu The Future ofthe Past Shaping new visions for EU research in digitalpreservation (zpraacuteva dostupnaacute nahttpcordiseuropaeufp7icttelearn digicultfuture of thepast_enpdf) nap v p iacutepadeacute chyb jiacuteciacute ekonomickeacuteho modelu prokomer niacute sfeacuteru kteraacute by dlouhodobou ochranu vniacutemala jakoneodd litelnou sou aacutest vech svyacutech proces Byl navaacutezaacuten kontakt s pracovniacuteky naacuterodniacutech knihoven Estonska aFinska (pracovniacuteciacute pro archivaci webu a dlouhodobou ochranuobecn )
Program a daliacute podrobn jiacute informace Hlavniacutem ciacutelem konference bylo sjednotit naacuterodniacute postupy v oblastidlouhodobeacute ochrany digitaacutelniacutech dokument nap iacute vemioblastmi od technickyacutech organiza niacutech vzd laacutevaciacutech a postandardiza niacute ekonomickeacute a finan niacute
P ivezeneacute materiaacutely konferen niacute program letaacuteky vystavujiacuteciacutech firem (Tessella EquellaGuardtime) + daliacute materiaacutely zaacutepisky
Datum p edloeniacute zpraacutevy 862011Podpis p edkladatele zpraacutevy
Podpis nad iacutezeneacuteho
Vloeno na IntranetP ijato v mezinaacuterodniacutem odd leniacute
P iacuteloha k teacuteto zpraacutev Poznaacutemky z konference v anglickeacutem jazyce
P iacuteloha Poznaacutemky z konference v anglickeacutem jazyce
Exploring What We Can Do Together Strategic Alliance for International Collaboration Laura Campbell
185 digital preservation partners in more than 25 countries (education research LAM)strategic goals National Content Stewardship Network (national digital collection technicalarchitecture public policy outreachNDIIPP Content Domain Map a mind map of geospatial audiovisual imageamptext and webcontentthen amp now cognitive surplus vs digital librariesdigital preservationsolution framework actively working together special interest groups establishing acommon index international digital collection (freely available)
PANEL 1Technical Alignment (The role of testing) Prof Dr Michael Seadle (Panel Head)
to collaborate on requiring and implementing rigorous and independent tests
DNB Contribution to the Tallinn Alignment Sabine Schrimpf
key theme is infrastructurenetwork of hard and software that permits operation of application of SWquestion of interoperability is crucial (standards technical specifications)SW elementsComponents of the DP infrastructure was compared to the pallets at railroads
o Source PARSEInsight Roadmap 2020PersistentID resolvers certification processkopal (ingest KoLIBri)nestor (German Network of Expertise)DP4Lib Digital Preservation as a Service reduce dependency between components
o redundant storage at different locationso KOLiBRI Modules
LUKII set up as an economical LOCKSS network in GermanySHAMANAPARSEN wants to bring coherence and cohesion to the digital preservation research
o trends in DP research projectso modular DP systemso distributed as SOAo elimination of technology dependencies
EDINA: The UK LOCKSS Alliance (Adam Rusbridge)
- EDINA offers underlying technical support & coordination
- threats to digital stewardship:
  o failure (media, HW, SW, network, format obsolescence, natural disaster, economic/organizational failure)
  o attack (insider/outsider)
  o operator error
- source: Requirement to Digital Preservation
- projects PEPRS and PECAN help identify coverage and requirements for DP
Public testing (Michael Seadle)
- traditional physical archiving relies heavily on trusted institutions
- distrust, not trust, needs to be the basis of digital archiving; testing therefore plays a key role
- goals of testing: demonstrate functionality, reveal weaknesses, provide data for planning improvements
- key issues for testing: integrity, authenticity (can the origin or genuineness be shown?), usability (can migration/emulation be demonstrated?), access, financial integrity
- DrWho (drwho1com)
- bit stream testing is the most important; authenticity and usability may be impaired
  o the type of storage media, the number of copies + the frequency of checking and replacement get us to the relevant results
  o no reliable metrics exist, however (what is an acceptable loss, etc.)
- without well documented, peer reviewed, publicly available test results, librarians are buying archiving systems on faith
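The remark on bit stream testing (number of copies plus frequency of checking) can be illustrated with a minimal sketch. It assumes the replicas are ordinary files and that the digest shared by most copies is the correct one; the function names `sha256_of` and `audit_copies` are hypothetical and do not come from the talk.

```python
import hashlib
from collections import Counter


def sha256_of(path, chunk_size=1 << 20):
    """Return the SHA-256 hex digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def audit_copies(paths):
    """Compare replicas of one object; return (majority_digest, damaged_paths).

    Assumption: the digest shared by most copies is treated as correct.
    """
    digests = {str(p): sha256_of(p) for p in paths}
    majority, _count = Counter(digests.values()).most_common(1)[0]
    damaged = sorted(p for p, d in digests.items() if d != majority)
    return majority, damaged
```

Running such an audit on a schedule gives the kind of data (how many copies, how often checked, how many failures) that the talk says is still missing as a reliable metric.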
Presentation without a title (Andy Rauber)
- evaluation vs. testing vs. benchmarking
- DP today: evaluation rather than testing, and far from benchmarking (a few tests, but nowhere near a definition of benchmarks)
  o existing evaluations are not repeatable
  o focus on the simple things
  o building the frameworks before having clear test scenarios
- necessary to move towards comparative benchmarking
- what is needed: a commitment that we want a culture of benchmarking and comparative evaluation; an understanding of what we want to benchmark; benchmark data + ground truth; measurement scales and measures that remain constant; knowledge bases to collect these
Organizing digital preservation on an international level (Michelle Gallinger, NDIIPP)
- focus on a national DP agenda
- community driven, action oriented (National Digital Stewardship Alliance)
  o present a distributed national digital collection for the benefit of citizens
The European Research Arena (David Giaretta)
- technologies: GEANT, EGEE/EGI
- EU research projects: TIMBUS, BLOG4EVER, SCAPE, ENSURE, APARSEN, ARCOMEM, WF4EVER
  o SCIDIP-ES (2011-2014)
- Alliance for Permanent Access (APA): formed as a legal entity 3 years ago
  o opportunities for networking
- ISO 16363: Audit and Certification of Trustworthy Digital Repositories
- ISO 16919: Requirements for Bodies Providing Audit and Certification
Observation from the MetaArchive Cooperative program (Martin Halbert)
- distributed DP programs differ from other programs
  o replication of content, distribution of these replicated copies to distinct geographical locations, and network organization to connect these replicated copies
- MetaArchive: established in 2003, funded by NDIIPP
  o seeks to foster broader awareness of digital preservation issues
- IIPC: members are all institutions that focus on WA
  o 39 members: national and university libraries + other organizations (Internet Archive)
  o ISO standard WARC format for web archives + Heritrix and NutchWAX
  o growing membership (Africa & South America)
DAY 2
Keynote address: International and National Collaboration in the Digital Age (Gunnar Sahlin)
- 2012: a new law for e-deposit
- Samsök (search together) in 2005 (upgrade, new system)
- SwePub and long-term preservation
- Consortium of the Swedish research libraries for licensing e-journals and databases (ICOLC)
- Open Access and e-publishing (all universities have their repositories for e-pubs)
- NL: aggregator for Europeana; TEL, Apres, Athena, EUscreen
  o common system for the preservation of digital materials
  o common search portal for materials from the Swedish National Library and Swedish National Archive
Raivo Ruusalepp
- standards: RAC, DSA, CIDOC (CRM), PAIMAS, ISAD(G), DDI
- use of information security standards for digital preservation
- information security: administrative and technical (physical = data security vs. IT = communication)
- a company implemented security measurement with typical cyber crime scenarios
- survey of security:
  o provision for information security in national legislation and development plans (12 of the respondents ISO 27000 series, only 2 formal audits, the rest are looking into it; ½ of the respondents do not use standards or formal measurements to control information security)
  o IT & disaster plan: 65 (data recovery from the off-site location tested: 0)
- alignment:
  o better use of community standards for information security and preservation
  o agreement on security requirements
Standards based approach to preservation planning (Matthew Woollard)
- ISO 27001: very expensive implementation (100 000 pounds)
- Basic: Data Seal of Approval Guidelines (helps understand your business better)
- Audit and Certification of Trustworthy Digital Repositories
- Memorandum of Understanding to Create a European Framework: ISO 16363 external audit, DSA, or ISO 16363 self audit
Best Practices & Standards (Bram van der Werf)
- self assessment, trust, audits/certification, trust
- ISO 30300 (draft): Record Management
PANEL 4: Legal Alignment
Legal deposit & Web Archiving (Adrienne Muir)
- legal deposit provisions: purpose, scope, deposit mechanisms, roles & responsibilities & liability, access provisions, sanctions
- implementation: definitions, scope (offline/online, freely available/pay wall), technology neutral, incremental
- legal deposit vs. voluntary (interim/hybrid approach, model agreements and licenses, flexibility)
- other legal issues: intellectual property rights, preservation, access, unlawful material, privacy/data protection
- voluntary approaches have disadvantages but may be necessary and can be useful
Breakout Session
- standards bring along alignment by themselves, but only if you use them to the full, not halfway
- depends on community (users); enforceable or voluntary compliance (standards)
- next steps for standard alignment:
  o corpora as benchmarks
  o export/import completeness
  o educational standards
  o validation tools
  o accredited training courses to accredit auditors
  o framework standards
DAY 3
PANEL 5: Educational Alignment
- key elements of the DCC Curation Lifecycle Model
- Framework for the Education Alignment (USA: grad programs for digital preservation, new models for grad programs, internship programs (diverse knowledge), workshops)
  o sharing tools
  o national programs
- related issues to digital preservation:
  o nature of costs and business models
  o strategies for selection & appraisal
  o ground roles and responsibilities
  o effectiveness and demand for services
- focus of the panel: factors influencing the actual sustainability of a digital archive
- 2 considerations: collaboration + user demand
- challenges + gaps: span national boundaries, public + private funding, education, exportation, DP certification, competition, funding gaps, policy, selection criteria, roles & responsibilities, standards
- Magazzini Digitali: e-legal deposit in Italy
- PADI: a failed project (discontinued in 2010)
Presentation without a title (Neil Grindley)
- how much does it cost to manage information? what institutional financial strategies are required to facilitate effective preservation? what general economic frameworks are required to enable information to persist and be accessible?
- JISC 2010 Infrastructure for Education and Research Programme
- archival storage and preservation activities constitute a very small proportion of the overall costs: 15 %; access 31 %; outreach, acquisition and ingest 55 %
  o approx. 333 euros for a set of 1000 records
- KRDS2, p. 83: future tool development supporting automation of ingest
Sustainable Preservation in North America: ADPNet & Friends (Aaron Trehub)
- solution: distributed digital preservation (in at least 3 copies vs. LOCKSS 6 copies)
- DDP + LOCKSS = PLN; open SW developed at Stanford
- MetaArchive, COPPUL (Canada), ADPNet (www.adpn.org)
CO-FINANCED FROM THE EU STRUCTURAL FUNDS (EUROPEAN REGIONAL DEVELOPMENT FUND) THROUGH THE IOP
Web archiving session
BnF
- 200 TB of web-archived data in 15 million ARC files; these have to be characterized and validated, which is time-consuming
- the data are stored in a shared repository
- SPAR (the LTP system of the French national library) has a capacity of 16 PB
- they use JHOVE2 for characterization and are building a module for ARC files; they do not want to characterize and validate the content of the ARC files, only identify formats
- PREMIS in METS would be too long, so they will record only metadata at the level of the information package (AIP) that are the same for the whole package; that is, a property is expressed once and then only the list of files it applies to is added, instead of repeating the information for every file
- they created a special metadata format for this
- they can therefore ask the LTP system "give me all information packages that contain format XY" and the like, without having to index the metadata of the content itself, which would take a long time; they take the same approach for digitized books
- different DP policies and levels of validation for different types of web archive data: complete harvests vs. thematic harvests
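The package-level approach described for BnF (state a property once per package and list the files it applies to, then query packages by format) can be sketched as follows. This is only an illustration under assumptions: the function names, package identifiers and format labels are hypothetical and do not reflect BnF's actual metadata format.

```python
from collections import defaultdict


def summarize_package(file_formats):
    """Group files by format so each format is stated once per package.

    file_formats: dict mapping file name -> format label.
    Returns a dict mapping format label -> sorted list of file names.
    """
    by_format = defaultdict(list)
    for name, fmt in file_formats.items():
        by_format[fmt].append(name)
    return {fmt: sorted(names) for fmt, names in by_format.items()}


def packages_containing(packages, fmt):
    """Return the ids of packages whose summary mentions the given format."""
    return sorted(pid for pid, summary in packages.items() if fmt in summary)


# Hypothetical example data: two AIPs with a handful of files each.
packages = {
    "aip-001": summarize_package(
        {"a.html": "text/html", "b.html": "text/html", "c.gif": "image/gif"}
    ),
    "aip-002": summarize_package({"d.pdf": "application/pdf"}),
}
```

A query such as `packages_containing(packages, "image/gif")` only touches the per-package summaries, which is the point of the design: the content itself never has to be indexed.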
NLNZ
- 2 harvests, 20 TB in total
- they are dealing with metadata: how much metadata is a lot and how much is too little; library policy says that as much metadata as possible must be stored, but that would be a problem given the size of the metadata
- for selective web harvesting they have a finished workflow in WCT; everything is catalogued
IA
- 16 billion URLs
- the oldest from 1996
- growth is 3 TB per day, 1 PB per year
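The two growth figures quoted for the Internet Archive are consistent with each other, as a quick check shows. The 365-day year and decimal units (1 PB = 1000 TB) are assumptions, not figures from the talk.

```python
TB_PER_DAY = 3          # figure from the notes
DAYS_PER_YEAR = 365     # assumption: non-leap year
TB_PER_PB = 1000        # assumption: decimal units

tb_per_year = TB_PER_DAY * DAYS_PER_YEAR
pb_per_year = tb_per_year / TB_PER_PB
print(f"{tb_per_year} TB/year = {pb_per_year:.3f} PB/year")
```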
Euan Cochrane, Dirk von Suchodoletz: Replicating Installed Application and Information Environments onto Emulated or Virtualized Hardware
- capturing and preserving a complete environment on emulated HW
- e.g. taking the desktop environment of a prime minister and storing it in an archive
- problems with display
- computer forensics
- an option for preserving scientific data and records
- the whole thing is about how to replicate an HDD and run the environment stored on it in a virtual environment
- solution:
- they pulled the HDDs out of several old PCs > identify the HW requirements (analysis of the HDD > automatic estimate of the requirements; this information is part of every PC environment) > select the emulation/virtualization SW (tool registry such as TOTEM from the KEEP project) > convert the HDD to a disk image suitable for emulation > try to boot the disk image on emulated HW > add drivers
- problems with licences, personal data protection and authenticity (20 things change, colours etc.)
- QEMU, SPARC processor emulator
Klaus Rechert: Remote Emulation for Migration Services in a Distributed Preservation Framework
- use of emulation as a tool for migration
- migration tools are often not available for certain formats
- the digital object is placed in an emulated environment (virtual machine), where it is visible in the environment of the emulated system; it can be opened in the original or another suitable application, saved in a different format and stored back in the virtual machine
Bram Lohman: Emulation as a Business Solution: the Emulation Framework (KEEP project)
- emulation framework: 7 emulators, 6 platforms (x86, Amiga and others), 23 file formats
- a solution for managing emulation tools and setting up emulation processes
- an environment that contains emulators; if an application or file is loaded into it, it should run as in the original environment
- the environment also contains a tool that shows what format a file is in and what environment is needed to run it, based on PRONOM; the environment can then be prepared directly and the file run in it; it loads the SW image from the application database that OPF is building
Geoffrey Brown: Developing Virtual CD-ROM Collections: The Voyager Company Publications
- publications of one particular publisher on CD, interactive applications for Mac; of 200 published titles only about 50 are available now
- emulation on today's systems
- HDD snapshot directly in the emulator, i.e. it takes one click and is very fast
- SheepShaver emulator
Evaluation of a large Danish migration project
- Before 1998 they had no formats laid down by law
- Between 2005 and 2008 they introduced standards
- The evaluation concerns the defined standards and the migration to them in the national archive
- The evaluation was done for the funder
- Between 2005 and 2008 they spent 30 person-years on the migration; they had 10-15 people working on it; 190 thousand USD was invested; total costs were 26 million USD
- It is not much data: what they actually migrated was about 1777 GB
- Various parts of the archive: tapes, population data, data on CD-R, registries and electronically filled-in data
- They could not read all the files, especially those on tapes; there were 5 different types of tape; some had to be rescued at great expense
- Total cost of producing the preservation standard: 10 man-years, 12 thousand USD, including the manual and implementation recommendations
- Pilot: planning and project management, and verifying the information packages
- The aim of the pilot was to obtain a better budget and project plan
- Some problematic data in old formats, such as old databases, need smart people who do repetitive, long-running work; they needed good knowledge management to make this efficient
- Migration method: they wrote the requirements for the tool and a description of how manual migration should be done
- Data preparation (restructuring the data and registering the metadata of the IPs) and preparation of documentation for the migrated IPs
- Software development: in-house
- They needed 50 person-years for 1
Conclusions
- migration of standard data is cheaper, migration from some standard tapes is cheaper, etc.
- Most of the costs (80 %) went on non-standardized data, in producing the migration software. Developing a tool for migrating heterogeneous or non-standard data is the most expensive part.
- What they learned: the old data had not been analysed sufficiently
- Project management was loose; they lost money
- Knowledge management: a good description of the old data and of all their types, generations, locations etc. does not exist at the NK and we will have difficulties with this; migration of old data at the NK will be problematic
Angela Dappert: robust migration workflow for offline media
- What is an archival object (nice slide): a CD is not an archival object, for them it is a hand-held carrier; a bit-stable object is better, since it can have a backup etc.; from there they move on to the archival object, which has further metadata (logical preservation)
- A CD is not searchable, cannot easily be replicated and has a large manual overhead; rendering technology becomes obsolete very quickly
- Endangered Archives project: optical disks, CD-R, external HDs, tapes; 67 terabytes in total
- The offline hand-held carriers in the Endangered Archives project were very variable; they contained data with DRM, under copyright, and with a number of similar problems
- Options they chose between for each data source:
  o disk image: one file that contains everything on the carrier
  o or extraction of only some files
- How important is the carrier itself? Do we need to keep some information about it? It may hold traces of deleted data, which we may want to have. They made disk images of all sorts of things: hybrid DVDs, audio discs that also held data, etc.
- Which disk image format should be used? Not one disk image format for all data, but a specific disk image format for each kind
- They did it with disk copying robots; in some cases large-scale disk copying robots could not be used, because they are good at producing CDs but not at ripping data from CDs
- They built their own application with disk stacks and some smaller robots; they tried LIFO and FIFO and in the end used FIFO, because LIFO had problems with the CD lifter
- At the KB they thought through a fairly complex workflow, how to describe it, etc.
- Each robot had its own PC
- They had problems with a number of things (see the presentation)
- They did not find good SW for managing the images, only command-line tools, with which non-technical staff would make a lot of mistakes
- It takes a lot of people before the data gets online
- The staff must be well trained and flexible, but also able to do tedious jobs: systematic, patient
- NOTE: important for the NK, where the transfer of data from disks will also have to be dealt with, and has already been dealt with
KEEP project, Antonio Ciuffreda: Towards integrated migration environment
- Disk transfer tool: converts a disk to an image file, which also contains further metadata about the file system, the MD5 of the file, etc.
- KEEP is producing a Transfer Tool Framework
- Magnetic media: disk transfer tools for floppy disks, commercial and open source
  o Disk2FDI: commercial DOS tool; produces a very accurate image of the floppy disk; it takes 1 hour and the result is very large, ten times bigger than the floppy disk itself. Tested on about 2260 disks; they tested emulation
  o Catweasel: commercial tool, a PCI card; runs on Linux and Windows XP, has a GUI. High error rate, but faster; the quality of the image file was low
  o NibTools: free tool; G64 and D64 formats, covers only the C64; DOS, Windows, Linux, but command line only. Needs a Commodore disk drive and special cables. They tested a few disks; about half then did not work in the emulator
- Optical media: they used 5 transfer tools, all on the same CDs, DVDs and games
  1. Alcohol 120%: commercial; can bypass DRM etc.; supports Windows systems. 12 of 13 worked; 2 MB per second
  2. Daemon Tools: commercial; several types of image files (ISO, MDS, MDF); supports Windows; three of 13 did not work
  3. CloneCD: commercial; uses IMG or ISO; bypasses SafeDisc 3 protection; supports Windows; has a GUI; 11 of 13 were OK
  4. BlindWrite: commercial; supports DVD, Blu-ray, WM, Xbox and other special discs. Generates ISO and some proprietary image formats; one image did not work; transfer speed
  5. ImgBurn: does not read subchannel information into the image file (a film cannot be seeked, etc.); it is open source; generates DVD, BIN, CUE, IMG; Windows and Linux; 4 did not work; it is fast
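The failure counts reported for the optical media tools can be turned into rates with a few lines of arithmetic. The notes give a total of 13 test discs for three of the tools; the sketch assumes the same total for BlindWrite and ImgBurn, which is an assumption and not stated in the notes.

```python
TOTAL_DISCS = 13  # stated for three tools; assumed for BlindWrite and ImgBurn

# Number of test discs whose image did not work, per tool (from the notes).
failures = {
    "Alcohol 120%": 1,   # 12 of 13 worked
    "Daemon Tools": 3,   # three of 13 did not work
    "CloneCD": 2,        # 11 of 13 were OK
    "BlindWrite": 1,     # one did not work
    "ImgBurn": 4,        # 4 did not work
}

failure_rate = {tool: round(100 * n / TOTAL_DISCS, 1) for tool, n in failures.items()}
for tool, rate in sorted(failure_rate.items(), key=lambda item: item[1]):
    print(f"{tool}: {rate} % failed")
```

Under that assumption the rates range from roughly 8 % to roughly 31 %, which is close to the error range of 10 to 30 percent quoted in the conclusions.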
Conclusions
- For magnetic media the performance of commercial and non-commercial tools does not differ; Disk2FDI is accurate but very slow. KEEP will use NibTools.
- Optical media: keep copy protection in mind; the tools perform similarly; there will always be errors in the images, between 30 and 10 percent; BlindWrite can handle game discs (Xbox etc.). KEEP will use ImgBurn because it is open source.
- For complex images BlindWrite is better.
Benefit for the NK
Consider whether it would be appropriate for the NK to actually run a project to migrate the content of CDs and DVDs to online media. Concrete experience with robotic processing was presented here, showing what problems they had with designing the workflow, choosing the type of ISO image, etc.
BnF: web archiving
They have three layers:
- Harvest definition: collection
- Harvest instance: crawling metadata
- ARC files
A collection would be, for example, the selective harvest "Elections 2000", followed by the individual harvest instances and then the ARC files. They collect logs, config and reports. These are also stored in an ARC file: a special metadata ARC for every crawling instance.
PREMIS
- Object, agent, event
- Objects: 1. ARC files and metadata ARCs; 2. harvest instances
- Harvest event in PREMIS: event = creation of content files
- Events: reports as an extension of the event (host report and harvest report)
- Agents: administrator, SW, institutions, organizations that perform the harvest
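The object/agent/event structure described above can be sketched as a small data model. This is a simplified illustration only: the class and field names are hypothetical, the example identifiers are invented, and nothing here reproduces the actual PREMIS schema or BnF's implementation.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class PremisObject:
    identifier: str
    kind: str  # e.g. "ARC file", "metadata ARC", "harvest instance"


@dataclass
class Agent:
    name: str
    role: str  # e.g. "administrator", "software", "institution"


@dataclass
class Event:
    event_type: str
    objects: List[PremisObject] = field(default_factory=list)
    agents: List[Agent] = field(default_factory=list)
    reports: List[str] = field(default_factory=list)  # e.g. host report, harvest report


# Hypothetical harvest event linking two objects, two agents and two reports.
harvest = Event(
    event_type="creation of content files",
    objects=[
        PremisObject("example-0001.arc", "ARC file"),
        PremisObject("example-0001-metadata.arc", "metadata ARC"),
    ],
    agents=[Agent("crawler", "software"), Agent("BnF", "institution")],
    reports=["host report", "harvest report"],
)
```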
containerMD
- http://bibnum.bnf.fr/containerMD-v1/documentation/containerMD-v1.html
- special metadata for material from the web archive
- http://bibnum.bnf.fr/containerMD-v1
- different SLAs for different types of material and for different data from different harvests; from the shared repository they expect various benefits: skills for different formats do not have to be duplicated within the institution
- next year there should also be a JHOVE2 module for WARC
- Memento: meta search
Benefit for the NK
Their web archiving model could be used at the NK.
Cost models: Danish national library + TU Wien
- Stephan Strodl, TU Wien: they have their own cost model, but apparently only for the cost of small-scale automated preservation actions
- Danish national library: they built their own model, which is meant to be universal and usable anywhere
- They measured the cost of submission according to the PAIMAS standard
- When calculating costs they use OAIS and PAIMAS: they map activities onto these models and then estimate the prices of the processes accordingly
- costmodelfordigitalpreservation.dk
Benefit for the NK
Relevant to project 0136, which dealt with options for estimating the costs of long-term storage.
Meet RODA, a Full-Fledged Digital Repository for Long-Term Preservation
- Originally a project of the Portuguese national archive; we have been following it for several years. The RODA system is now supported by an independent company, which also partly continues to develop it. So far RODA supports only the archival metadata format (EAD), but further development should also include library formats.
- RODA is now part of the SCAPE project, where it will be possible to develop the system further and scale it for use in mass production
- http://redmine.keep.pt/projects/roda-public
Benefit for the NK
Follow further development, possibly also for the INCAD + KNAV projects; for the development of LTP for smaller institutions this could be an interesting alternative in the future.
Report on a business trip abroad
Name of the traveller: Mgr. Andrea Fojtů (AF)
Workplace according to the organizational structure: 1.5 Department of Long-Term Preservation of Digital Data (ODODD)
Workplace, unit: 1.5.2 Digital Repository Content Management Department
Reason for the trip: Participation in the conference Aligning National Approaches to Digital Preservation
Place, city: Tallinn
Place, country: Estonia
Date (from-to): 22-26 May 2011
Detailed schedule:
Monday 23 May: conference registration, guided tour of the National Library, Keynote Address by Laura Campbell (Library of Congress, USA), Panel 1 Technical Alignment, Panel 2 Organizational Alignment
Tuesday 24 May: Keynote Address by Gunnar Sahlin (National Library of Sweden), Panel 3 Standards Alignment, Panel 4 Legal Alignment, Breakout Sessions for panels 3 & 4
Wednesday 25 May: Panel 5 Education Alignment, Panel 6 Economic Alignment, Breakout Sessions for panels 5 & 6, Synthesis, Closing remarks
Fellow travellers from the NK: PhDr. Bohdana Stoklasová (BS), Ing. Tomáš Svoboda (TS)
Funding: IOP NDK
Aims of the trip: Attending a conference with international participation; gaining contacts in the field of long-term preservation of digital documents and electronic legal deposit; gaining a more detailed insight into the issues of long-term preservation of digital documents, especially in national libraries
Fulfilment of the aims (specifically): The conclusions of the conference Aligning National Approaches to Digital Preservation largely mirror the conclusions of the workshop The Future of the Past: Shaping new visions for EU research in digital preservation (report available at http://cordis.europa.eu/fp7/ict/telearn-digicult/future-of-the-past_en.pdf), e.g. as regards the missing economic model for the commercial sector, which would treat long-term preservation as an integral part of all its processes. Contact was established with staff of the national libraries of Estonia and Finland (staff working on web archiving and on long-term preservation in general).
Programme and further details: The main aim of the conference was to align national approaches to the long-term preservation of digital documents across all areas, from technical, organizational and educational to standardization, economic and financial.
Materials brought back: conference programme, leaflets of exhibiting companies (Tessella, Equella, Guardtime) + other materials, notes
Date of submission of the report: 8 June 2011
Signature of the person submitting the report:
Supervisor's signature:
Posted on the intranet:
Received by the International Department:
Appendix to this report: Conference notes in English
CO-FINANCED FROM THE EU STRUCTURAL FUNDS (EUROPEAN REGIONAL DEVELOPMENT FUND) THROUGH THE IOP
Klaus Rechert: Remote Emulation for Migration Services in a Distributed Preservation Framework
- Use of emulation as a tool for migration.
- Migration tools are often unavailable for certain formats.
- The digital object is placed into an emulated environment (a virtual machine). We then see it inside the emulated system, can open it in the original or another suitable application, save it in a different format, and store it back in the virtual machine.
Bram Lohman: Emulation as a Business Solution: the Emulation Framework (KEEP project)
- Emulation framework: 7 emulators, 6 platforms (x86, Amiga, etc.), 23 file formats.
- A solution for managing emulation tools and setting up emulation processes.
- An environment that contains emulators; if an application or file is loaded into it, it should run as it did in its original environment.
- The environment also contains a tool that shows, for a given file, what format it is and what environment is needed to run it, based on PRONOM. The environment can then be prepared directly and the file run in it; the software image is loaded from the OPF application database, which is being built.
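The format-to-environment lookup described above can be illustrated with a minimal Python sketch. It is not the KEEP Emulation Framework API: the `Environment` class, the `ENVIRONMENTS` table, the format identifiers and the image names are all hypothetical, and the keys only stand in for identifiers that would come from PRONOM.

```python
from dataclasses import dataclass
from typing import Dict, Optional


@dataclass(frozen=True)
class Environment:
    platform: str        # emulated platform, e.g. "x86" or "Amiga"
    emulator: str        # emulator to start (hypothetical name)
    software_image: str  # software image to load (hypothetical file name)


# Hypothetical registry keyed by format identifier. In the framework described
# in the talk the identifier would come from PRONOM; these keys are made up.
ENVIRONMENTS: Dict[str, Environment] = {
    "example/word-processor-doc": Environment("x86", "emulator-a", "office-suite.img"),
    "example/amiga-picture": Environment("Amiga", "emulator-b", "workbench.img"),
}


def select_environment(format_id: str) -> Optional[Environment]:
    """Return the environment registered for a format, or None if unknown."""
    return ENVIRONMENTS.get(format_id)


if __name__ == "__main__":
    for format_id in ("example/amiga-picture", "example/unknown"):
        env = select_environment(format_id)
        if env is None:
            print(f"{format_id}: no emulation environment registered")
        else:
            print(f"{format_id}: run {env.software_image} on {env.emulator} ({env.platform})")
```

Returning `None` for an unknown format keeps the "no suitable environment" case explicit, which is where a curator would have to step in.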
Geoffrey Brown: Developing Virtual CD-ROM Collections: The Voyager Company Publications
- Publications of one particular publisher on CD, interactive applications for Mac; of 200 titles published, only about 50 are available today.
- Emulation on today's systems.
- HDD snapshot directly in the emulator, i.e., it takes one click and is very fast.
- SheepShaver emulator.
Evaluation of a large Danish migration project
- Before 1998 the formats were not set by law.
- Between 2005 and 2008 they introduced standards.
- The evaluation concerns the established standards and the migration to them at the national archive.
- The evaluation was done for the funder.
- Between 2005 and 2008 they spent 30 person-years on migration, with 10-15 people working on it; 190 thousand USD invested; total costs 26 million USD.
- It is not much data: what they actually migrated was about 1777 GB.
- Various parts of the archive: tapes, population data, data on CD-R, registries and data electronically [...]
- They could not read all files, especially those on tapes.
- 5 different tape types.
- Some had to be rescued at great expense.
Total cost of producing the preservation standard: 10 man-years, 12 thousand USD, including the manual and implementation recommendations.
Pilot: project planning and management, and verification of the information packages.
The aim of the pilot was to obtain a better budget and project plan.
Some problematic data in old formats, such as old databases, need smart people doing repetitive, long-running work; good knowledge management was needed to make this efficient.
Migration method: they wrote requirements for the tool and a description of how manual migration should be done.
Data preparation (restructuring the data and registering the metadata of the IPs) and preparation of documentation for the migrated IPs.
Software development: in-house.
They needed 50 person-years per 1 [...]
Conclusions
Migrating standardized data is cheaper; migrating from some standard tapes is cheaper, etc.
Most of the costs (80 %) went to non-standardized data, in building the migration software. Developing a tool for migrating heterogeneous or non-standard data is the most expensive part.
Lessons learned: the old data had not been analyzed sufficiently.
Project management was loose; they lost money.
Knowledge management: a good description of old data and all their types, generations, locations, etc. does not exist at our institution and we will have trouble with that; migration of old data at NK will be problematic.
Angela Dappert: robust migration workflow for offline media
- What is an archival object (nice slide): a CD is not an archival object; for them it is a hand-held carrier. A bit-stable object is better, since it can have backups etc.; an archival object additionally carries further metadata (logical preservation).
- A CD is not searchable, cannot be replicated easily, has a large manual overhead, and its rendering technology becomes obsolete very quickly.
- Endangered Archives project: optical disks, CD-R, external HDs, tapes; 67 terabytes in total.
- The offline hand-held carriers in the Endangered Archives project were highly variable; they contained data with DRM, under copyright, and with a range of similar problems.
- Options they chose between for each data source:
- Disk image: a single file containing everything on the carrier.
- Or extraction of selected files only.
- How important is the carrier itself? We need to keep some information about it; it may hold traces of deleted data, which we may want to keep. They made disk images of everything possible: hybrid DVDs, audio discs that also held data, etc.
- Which disk image format should they use? Not a single disk image format for all data, but a specific disk image format for each case.
- They did it with disk-copying robots; large-scale disk-copying robots sometimes could not be used, because they are good at producing CDs but not at ripping data from CDs.
- They built their own application with disk stacks and some smaller robots, using LIFO or FIFO; in the end they used FIFO, because LIFO had problems with the CD lifter.
At the KB they thought through a fairly complex workflow, how to describe it, etc.
Each robot had its own PC.
They had problems with a number of things (see the presentation).
They did not find good software for managing images, only command-line tools, and non-technical staff would make a lot of mistakes with those.
It takes a lot of people before the content gets online.
Staff must be well trained and flexible, but also able to do tedious jobs: systematic, patient.
NOTE: important for NK, where data transfer from disks will also have to be addressed and has already been addressed.
KEEP project, Antonio Ciuffreda: towards an integrated migration environment
- Disk transfer tool: converts a disk into an image file. The image also carries additional metadata about the file system, the MD5 of the file, etc.
- KEEP is producing a Transfer Tool Framework.
- Magnetic media: disk transfer tools for floppy disks, commercial and open source.
- Disk2FDI: commercial DOS tool; very accurate floppy disk image; takes 1 hour and the result is very large, ten times larger than the floppy disk itself. About 2260 disks tested; emulation tested.
- Catweasel: commercial tool, a PCI card; runs on Linux and Windows XP; has a GUI. High error rate but faster; image file quality was low.
- Nibtools: free tool; G64 and D64 formats; covers only the C64; DOS, Windows, Linux, but command line only. Needs a Commodore disk drive and special cables. They tested a few disks; about half then did not work in the emulator.
- Optical media: 5 transfer tools were tried, each on the same CDs, DVDs and games.
1. Alcohol 120%: commercial; can bypass DRM etc.; supports Windows systems. 12 of 13 worked; 2 MB per second.
2. DAEMON Tools: commercial; several image file types (ISO, MDS, MDF); supports Windows; three of 13 did not work.
3. CloneCD: commercial; uses IMG or ISO; bypasses SafeDisc 3 protection; supports Windows; has a GUI; 11 of 13 were OK.
4. BlindWrite: commercial; supports DVD, Blu-ray, WM, Xbox and other special discs. Generates ISO and some proprietary ISO image formats; one did not work; transfer speed [...]
5. ImgBurn: does not read subchannel information into the image file (a film cannot be seeked, etc.); open source; generates DVD, BIN, CUE, IMG; Windows and Linux; 4 did not work; it is fast.
Conclusions
For magnetic media, commercial and non-commercial tools do not differ in performance; Disk2FDI is accurate but very slow. KEEP will use NibTools.
Optical media: keep copy protection in mind; the tools perform similarly; there will always be errors in the images, between 30 and 10 percent. BlindWrite handles game discs (Xbox etc.). KEEP will use ImgBurn because it is open source.
For complex images BlindWrite is better.
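The idea of a disk image accompanied by metadata such as the file system and an MD5 checksum can be sketched as follows. This is only an illustration under assumptions, not the KEEP transfer tool: the `describe_image` function, the record fields and the sample file name are hypothetical, and an in-memory byte stream stands in for a real image file.

```python
import hashlib
import io
import json
from typing import BinaryIO, Dict, Union

CHUNK_SIZE = 1 << 20  # read the image in 1 MiB chunks to keep memory use flat


def describe_image(name: str, stream: BinaryIO, file_system: str) -> Dict[str, Union[str, int]]:
    """Hash an image stream and return a small metadata record for it."""
    md5 = hashlib.md5()
    size = 0
    for chunk in iter(lambda: stream.read(CHUNK_SIZE), b""):
        md5.update(chunk)
        size += len(chunk)
    return {
        "image_file": name,          # hypothetical field names
        "file_system": file_system,  # supplied by the operator, not detected
        "size_bytes": size,
        "md5": md5.hexdigest(),
    }


if __name__ == "__main__":
    # An in-memory stream stands in for an image file opened with open(path, "rb").
    fake_image = io.BytesIO(b"abc")
    print(json.dumps(describe_image("floppy-0001.img", fake_image, "FAT12"), indent=2))
```

Storing the record next to the image lets a later fixity check recompute the checksum and compare it with the value taken at transfer time.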
Benefit for NK
Consider whether NK should really run a project to migrate the content of CDs and DVDs to online media. The speakers presented concrete experience with robotic processing and showed the problems they had in designing the workflow, choosing the type of ISO image, etc.
BnF web archiving
They have three layers:
- Harvest definition: collection
- Harvest instance: crawling metadata
- ARC files
A collection would be, for example, the selective collection "Elections 2000"; below it come the individual harvest instances, and then the ARCs.
They collect logs, configuration and reports.
These are also stored in an ARC: a special metadata ARC for each crawling instance.
PREMIS
Object, agent, event
Objects: 1. ARC files and metadata ARCs; 2. harvest instances
Harvest event in PREMIS: the event "creation of content files"
Events: reports as extensions of the event (host report and harvest report)
Agents: administrator, software, institutions and organizations that perform the harvest
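The object, agent and event relationships noted above can be pictured with a small Python data model. This is a simplified sketch, not the PREMIS schema and not the BnF implementation: the class names, field names and type labels are assumptions chosen for illustration.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Agent:
    name: str
    agent_type: str  # e.g. "administrator", "software", "organization"


@dataclass
class PreservationObject:
    identifier: str
    object_type: str  # "arc_file", "metadata_arc" or "harvest_instance"


@dataclass
class Event:
    event_type: str
    objects: List[PreservationObject]
    agents: List[Agent]
    reports: List[str] = field(default_factory=list)  # reports attached as extensions


def record_harvest(instance_id: str, arc_ids: List[str], agents: List[Agent]) -> Event:
    """Build one harvest event linking the instance, its ARCs and its agents."""
    objects = [PreservationObject(instance_id, "harvest_instance")]
    objects += [PreservationObject(arc_id, "arc_file") for arc_id in arc_ids]
    # One metadata ARC per crawling instance holds logs, configuration and reports.
    objects.append(PreservationObject(f"{instance_id}-metadata", "metadata_arc"))
    return Event(
        event_type="creation of content files",
        objects=objects,
        agents=agents,
        reports=["host report", "harvest report"],
    )


if __name__ == "__main__":
    crawler = Agent("crawler", "software")
    event = record_harvest("harvest-001", ["part-1.arc", "part-2.arc"], [crawler])
    for obj in event.objects:
        print(obj.object_type, obj.identifier)
```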
ContainerMD
http://bibnum.bnf.fr/containerMD-v1/documentation/containerMD-v1.html
Special metadata for web archive material
http://bibnum.bnf.fr/containerMD-v1
Different SLAs for different types of material and for different data from different harvests. Shared repository: they expect various benefits; skills for different formats need not be duplicated within the institution.
Next year there should also be a JHOVE2 module for WARC.
Memento: a meta search engine
Benefit for NK
Their web archiving model could be used at NK.
Cost models: Danish national library + TU Wien
- Stephan Strodl, TU Wien: they have their own cost model, but apparently only for small-scale automated preservation action costs.
- Danish national library: they built their own model, which is meant to be universal and usable anywhere.
- They measured the cost of submission according to the PAIMAS standard.
- When calculating costs they use OAIS and PAIMAS: they map activities onto these models and then estimate process costs accordingly.
- costmodelfordigitalpreservation.dk
Benefit for NK
Relevant to project 0136, which addressed options for estimating the costs of long-term storage.
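The approach of mapping activities onto OAIS functional entities and estimating costs from them can be sketched in a few lines of Python. This is not the Danish cost model: the `Activity` type, the sample activities, the hours and the hourly rate are invented for illustration; only the entity names (Ingest, Archival Storage) come from OAIS.

```python
from collections import defaultdict
from typing import Dict, Iterable, NamedTuple


class Activity(NamedTuple):
    name: str
    oais_entity: str     # OAIS functional entity the activity is mapped to
    person_hours: float  # estimated effort (illustrative numbers only)


def estimate_costs(activities: Iterable[Activity], hourly_rate: float) -> Dict[str, float]:
    """Sum the estimated cost of all activities per OAIS functional entity."""
    totals: Dict[str, float] = defaultdict(float)
    for activity in activities:
        totals[activity.oais_entity] += activity.person_hours * hourly_rate
    return dict(totals)


if __name__ == "__main__":
    plan = [
        Activity("validate submission", "Ingest", 10.0),
        Activity("generate descriptive metadata", "Ingest", 5.0),
        Activity("refresh storage media", "Archival Storage", 2.0),
    ]
    for entity, cost in estimate_costs(plan, hourly_rate=20.0).items():
        print(f"{entity}: {cost:.2f}")
```

Grouping by functional entity makes it easy to see which part of the archive dominates the estimate once real effort figures are entered.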
Meet RODA: a Full-Fledged Digital Repository for Long-Term Preservation
- Originally a project of the Portuguese national archive; we have been following it for several years. RODA is now supported by an independent company, which also partly continues its development. So far RODA supports only the archival metadata format (EAD), but further development should cover library formats as well.
- RODA is now part of the SCAPE project, where the system can be developed further and scaled for use in large-scale production.
- http://redmine.keep.pt/projects/roda-public
Benefit for NK
Follow further development, possibly also for the INCAD + KNAV projects; for LTP development at smaller institutions this could be an interesting alternative in the future.
Report on a Foreign Business Trip
Name of participant: Mgr. Andrea Fojtů (AF)
Workplace (per organizational structure): 15 Department of Long-Term Preservation of Digital Data (ODODD)
Workplace (assignment): 152 Digital Repository Content Management Department
Purpose of trip: Participation in the conference Aligning National Approaches to Digital Preservation
Place (city): Tallinn
Place (country): Estonia
Date (from-to): 22-26 May 2011
Detailed schedule:
Monday 23 May: conference registration; guided tour of the National Library; Keynote Address by Laura Campbell (Library of Congress, USA); Panel 1: Technical Alignment; Panel 2: Organizational Alignment
Tuesday 24 May: Keynote Address by Gunnar Sahlin (National Library of Sweden); Panel 3: Standards Alignment; Panel 4: Legal Alignment; Breakout Sessions for panels 3 & 4
Wednesday 25 May: Panel 5: Education Alignment; Panel 6: Economic Alignment; Breakout Sessions for panels 5 & 6; Synthesis; Closing remarks
Fellow travellers from NK: PhDr. Bohdana Stoklasová (BS), Ing. Tomáš Svoboda (TS)
Funding: IOP NDK
Trip objectives: attendance at a conference with international participation; gaining contacts in the field of long-term preservation of digital documents and electronic legal deposit; a more detailed insight into long-term preservation of digital documents, (especially) in national libraries
Fulfilment of trip objectives (specifically): The conclusions of the conference Aligning National Approaches to Digital Preservation largely mirror the conclusions of the workshop The Future of the Past: Shaping new visions for EU research in digital preservation (report available at http://cordis.europa.eu/fp7/ict/telearn-digicult/future-of-the-past_en.pdf), e.g. regarding the missing economic model for the commercial sector, which would treat long-term preservation as an integral part of all its processes. Contact was established with staff of the national libraries of Estonia and Finland (staff working on web archiving and on long-term preservation in general).
Programme and further details: The main aim of the conference was to align national approaches to long-term preservation of digital documents across all areas, from technical, organizational and educational through standardization to economic and financial.
Materials brought back: conference programme, leaflets of exhibiting companies (Tessella, Equella, Guardtime) + other materials, notes
Date of submission of the report: 8 June 2011
Signature of submitter:
Signature of supervisor:
Posted on the Intranet:
Received by the international department:
Attachment to this report: Conference notes in English
Appendix: Conference notes in English
Exploring What We Can Do Together: Strategic Alliance for International Collaboration (Laura Campbell)
- 185 digital preservation partners in more than 25 countries (education, research, LAM)
- strategic goals: National Content Stewardship Network (national digital collection, technical architecture, public policy, outreach)
- NDIIPP Content Domain Map: a mind map of geospatial, audiovisual, image & text, and web content
- then & now: cognitive surplus vs. digital libraries / digital preservation
- solution framework: actively working together, special interest groups, establishing a common index, international digital collection (freely available)
PANEL 1: Technical Alignment (The role of testing), Prof. Dr. Michael Seadle (Panel Head)
- to collaborate on requiring and implementing rigorous and independent tests
DNB Contribution to the Tallinn Alignment (Sabine Schrimpf)
- key theme is infrastructure
- network of hardware and software that permits the operation of application software
- the question of interoperability is crucial (standards, technical specifications)
- SW elements
- components of the DP infrastructure were compared to pallets on railroads
  o source: PARSE.Insight Roadmap 2020
- persistent ID resolvers, certification process
- kopal (ingest: koLibRI)
- nestor (German Network of Expertise)
- DP4lib: Digital Preservation as a Service; reduce dependency between components
  o redundant storage at different locations
  o koLibRI modules
- LuKII: set up as an economical LOCKSS network in Germany
- SHAMAN
- APARSEN wants to bring coherence and cohesion to digital preservation research
  o trends in DP research projects
  o modular DP systems
  o distributed as SOA
  o elimination of technology dependencies
EDINA: The UK LOCKSS Alliance (Adam Rusbridge)
- EDINA offers underlying technical support & coordination
- threats to digital stewardship
  o failure (media, HW, SW, network, format obsolescence, natural disaster, economic/organizational failure)
  o attack (insider/outsider)
  o operator error
- source: Requirement to Digital Preservation
- projects PEPRS and PECAN help identify coverage and requirements for DP
Public testing (Michael Seadle)
- traditional physical archiving relies heavily on trusted institutions
- distrust, not trust, needs to be the basis of digital archiving; testing therefore plays a key role
- goals of testing: demonstrate functionality, reveal weaknesses, provide data for planning improvements
- key issues for testing: integrity, authenticity (can the origin or genuineness be shown?), usability (can migration/emulation be demonstrated?), access, financial integrity
- DrWho (drwho1.com)
- bit-stream testing is the most important; authenticity and usability may be impaired
  o the type of storage media, the number of copies, and the frequency of checking and replacement get us to the relevant results
  o no reliable metrics exist, however (what is an acceptable loss, etc.)
- without well-documented, peer-reviewed, publicly available test results, librarians are buying archiving systems on faith
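The bit-stream testing mentioned above, where several copies are checked periodically against a known-good value, can be illustrated with a short Python sketch. The talk named no tool or hash function; SHA-256, the function names and the site names are assumptions for the example.

```python
import hashlib
from typing import Dict, List


def sha256_hex(data: bytes) -> str:
    """Return the SHA-256 digest of a byte string as hex."""
    return hashlib.sha256(data).hexdigest()


def audit_copies(copies: Dict[str, bytes], expected_digest: str) -> List[str]:
    """Return the names of copies whose digest no longer matches the reference."""
    return sorted(
        name for name, data in copies.items()
        if sha256_hex(data) != expected_digest
    )


if __name__ == "__main__":
    original = b"archived bit stream"
    reference = sha256_hex(original)       # digest recorded at ingest
    copies = {
        "site-a": original,
        "site-b": original,
        "site-c": b"archived bit strean",  # simulated single-character corruption
    }
    print("damaged copies:", audit_copies(copies, reference))
```

How often such an audit runs and how many copies it covers are exactly the parameters the notes identify as lacking agreed metrics.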
Presentation without a title (Andy Rauber)
- evaluation vs. testing vs. benchmarking
- in DP: evaluation rather than testing, and far from benchmarking (a few tests, but nowhere near a definition of benchmarks)
  o existing evaluations are not repeatable
  o focus on the simple things
  o building the frameworks before having clear test scenarios
- necessary to move towards comparative benchmarking
- what is needed: a commitment that we want a culture of benchmarking and comparative evaluation; an understanding of what we want to benchmark; benchmark data + ground truth; measurement scales and measures that remain constant; knowledge bases to collect these
Organizing digital preservation on an international level (Michelle Gallinger, NDIIPP)
- focus on a national DP agenda
- community-driven, action-oriented (National Digital Stewardship Alliance)
  o present a distributed national digital collection for the benefit of citizens
The European Research Arena (David Giaretta)
- technologies: GEANT, EGEE/EGI
- EU research projects: TIMBUS, BLOG4EVER, SCAPE, ENSURE, APARSEN, ARCOMEM, WF4EVER
  o SCIDIP-ES (2011-2014)
- Alliance for Permanent Access (APA): formed as a legal entity 3 years ago
  o opportunities for networking
- ISO 16363: Audit and Certification of Trustworthy Digital Repositories
- ISO 16919: Requirements for Bodies Providing Audit and Certification
Observation from the MetaArchive Cooperative program (Martin Halbert)
- distributed DP programs differ from other programs
  o replication of content, distribution of the replicated copies to distinct geographical locations, and a network organization to connect these replicated copies
- MetaArchive: established in 2003, funded by NDIIPP
  o seeks to foster broader awareness of digital preservation issues
- IIPC: members are all institutions that focus on WA
  o 39 members: national and university libraries + other organizations (Internet Archive)
  o ISO standard WARC format for web archives + Heritrix and NutchWAX
  o growing membership (Africa & South America)
DAY 2
Keynote address: International and National Collaboration in the Digital Age (Gunnar Sahlin)
- 2012: a new law for e-deposit
- Samsök (search together) in 2005 (upgrade, new system)
- SwePub and long-term preservation
- consortium of the Swedish research libraries for licensing e-journals and databases (ICOLC)
- Open Access and e-publishing (all universities have their repositories for e-pubs)
- NL as aggregator for Europeana: TEL, Apres, Athena, EUscreen
  o common system for the preservation of digital materials
  o common search portal for materials from the Swedish National Library and the Swedish National Archive
Raivo Ruusalepp
- standards: RAC, DSA, CIDOC (CRM), PAIMAS, ISAD(G), DDI
- use of information security standards for digital preservation
- information security: administrative and technical (physical = data security vs. IT = communication)
- company-implemented security measurement with typical cyber-crime scenarios
- survey of security
  o provision for information security in national legislation and development plans (12 of the respondents: ISO 27000 series; only 2 formal audits, the rest are looking into it; ½ of the respondents do not use standards or formal measurements to control information security)
  o IT & disaster plan: 65 (data recovery from the off-site location tested: 0)
- alignment
  o better use of community standards for information security and preservation
  o agreement on security requirements
Standards-based approach to preservation planning (Matthew Woollard)
- ISO 27001: very expensive implementation, 100 000 pounds
- Basic Data Seal of Approval Guidelines (helps you understand your business better)
- Audit and Certification of Trustworthy Digital Repositories
- Memorandum of Understanding to Create a European Framework
- ISO 16363 external; DSA or ISO 16363 self-audit
Best Practices & Standards (Bram van der Werf)
- self-assessment: trust
- audits/certification: trust
- ISO 30300 (draft): Records Management
PANEL 4: Legal Alignment
Legal deposit & Web Archiving (Adrienne Muir)
- legal deposit provisions: purpose, scope, deposit mechanisms, roles & responsibilities & liability, access provisions, sanctions
- implementation: definitions, scope (offline/online, freely available/paywall), technology-neutral, incremental
- legal deposit vs. voluntary (interim/hybrid approach, model agreements and licenses, flexibility)
- other legal issues: intellectual property rights, preservation, access, unlawful material, privacy/data protection
- voluntary approaches have disadvantages but may be necessary and can be useful
Breakout Session
- standards bring along alignment by themselves, but only if you use them to the full, not halfway
- depends on community (users): enforceable or voluntary compliance (standards)
- next steps for standards alignment
  o corpora as benchmarks
  o export/import completeness
  o educational standards
  o validation tools
  o accredited training courses to accredit auditors
  o framework standards
DAY 3
PANEL 5: Educational Alignment
- key elements of the DCC Curation Lifecycle Model
- Framework for the Education Alignment (USA: grad programs for digital preservation, new models for grad programs, internship programs (diverse knowledge), workshops)
  o sharing tools
  o national programs
- related issues to digital preservation
  o nature of costs and business models
  o strategies for selection & appraisal
  o ground roles and responsibilities
  o effectiveness and demand for services
- focus of the panel: factors influencing the actual sustainability of a digital archive
- 2 considerations: collaboration + user demand
- challenges + gaps: span national boundaries, public + private funding, education, exportation, DP certification, competition, funding gaps, policy, selection criteria, roles & responsibilities, standards
- Magazzini Digitali and legal deposit in Italy
- PADI: a failed project (discontinued in 2010)
Presentation without a title (Neil Grindley)
- how much does it cost to manage information? what institutional financial strategies are required to facilitate effective preservation? what general economic frameworks are required to enable information to persist and be accessible?
- JISC 2010: Infrastructure for Education and Research Programme
- archival storage and preservation activities constitute a very small proportion of the overall costs: 15 %; access 31 %; outreach, acquisition, ingest 55 %
  o approx. 333 Euros for a set of 1000 records
- KRDS2, p. 83: future tool development supporting automation of ingest
Sustainable Preservation in North America: ADPNet & Friends (Aaron Trehub)
- solution: distributed digital preservation (in at least 3 copies vs. LOCKSS 6 copies)
- DDP + LOCKSS = PLN; open SW developed at Stanford
- MetaArchive, COPPUL (Canada), ADPNet (www.adpn.org)
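Why distributed preservation keeps at least three copies can be shown with a simplified majority-vote sketch. This is not the LOCKSS polling protocol: it only shows that with three or more replicas a damaged minority copy can be detected and replaced without a separate reference checksum. All names are hypothetical.

```python
import hashlib
from collections import Counter
from typing import Dict, Optional


def digest(data: bytes) -> str:
    """Return the SHA-256 digest of a replica as hex."""
    return hashlib.sha256(data).hexdigest()


def majority_digest(replicas: Dict[str, bytes]) -> Optional[str]:
    """Return the digest held by a strict majority of replicas, if there is one."""
    if not replicas:
        return None
    counts = Counter(digest(data) for data in replicas.values())
    winner, votes = counts.most_common(1)[0]
    return winner if votes * 2 > len(replicas) else None


def repair(replicas: Dict[str, bytes]) -> Dict[str, bytes]:
    """Overwrite minority copies with the majority content; unchanged without a majority."""
    winner = majority_digest(replicas)
    if winner is None:
        return dict(replicas)
    good = next(data for data in replicas.values() if digest(data) == winner)
    return {site: good for site in replicas}


if __name__ == "__main__":
    replicas = {"site-a": b"content", "site-b": b"content", "site-c": b"c0ntent"}
    print(repair(replicas))
```

With only two copies a disagreement cannot be resolved this way, which is one practical argument for the three-copy minimum noted above.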
SPOLUFINANCOVAacuteNO ZE STRUKTURAacuteLNIacuteCH FOND EU (EVROPSKEacuteHO FONDU PRO REGIONAacuteLNIacute ROZVOJ) PROST EDNICTVIacuteM IOP
9
Celkoveacute naacuteklady na vyrobeniacute preservation standardu 10 men years 12 tis USD v etnmanuaacutelu a implementa niacutech doporu eni
Pilot plaacutenovaniacute a management projektu a ov it informa niacute baliacute ky
Ciacutelem bylo v pilotu ziacuteskat lepiacute budget a plaacuten of projekt
N kteraacute problematickaacute data ve staryacutech formaacutetech jako stareacute databaacuteze atd pot ebujiacute chytreacute lidi kteryacuted lajiacute repetitivniacute praciacute trvajiacuteciacute dlouhopot ebovali dobry knowledge management aby to byloefektivniacute
Zp sob migrace napsali poadavky na nastroj a popis toho jak by se mela d lat manuaacutelniacute migrace
P iacuteprava dat (restructure data a registrovat metadata of IP) a p iacuteprava dokumentace t chmigrovanyacutech IP
Vyacutevoj softwaru inhouse development
Pot ebovali 50 person years na 1
Zaacutev ry
migrace standardniacutech dat je levn jiacute migrace z n kteryacutech pasek standardniacutech je levn jiacute atd
V tina 80 nakladu padla na nestardizovanaacute data p i vyacuterob softwaru na migraci Vyacutevojnaacutestroje na migraci heterogenniacutech dat nebo nestandardniacutech dat je nejdraiacute
Co se nau ili nem li dostate neacute analyzovanaacute stara data
Projekt management m li loose ztratili peniacuteze
Knowledge management dobry popis staryacutech dat a vech jejich typu generaci umiacutest ni atd u naacutes neexistuje a budeme s tiacutem miacutet potiacutee migrace staryacutech dat v NK budeproblematickaacute
Angela Dappert rubust migration workflow pro offline media- Co je archival object hezky slide cd neniacute archive object je to pro ne hand held carrier
lepiacute je bit stable object ten m e miacutet backup atd a k archivniacutemu objektu kteryacute maacute daliacutemetadat logical preservation
- Cd neniacute searchable nedaacute se snadno replikovat ma large manual overhead renderingtechnology zastaraacutevaacute velmi rychle
- Projekt endangered archives optical disks cdr external HD tapes celkem 67 terrabytes- OFFLINE hand held nosi e byly v tom projektu endangered archives velmi variabilniacute
obsahovaly data s drm pod copyrightem a radou t ch probleacutem - Moznosti mezi kteryacutemi se rozhodovali u kadeacuteho zdroje dat - Disk image jeden soubor kteryacute obsahuje vechno co na n m je- Nebo extrakce jen n kteryacutech souboru- Jak d leityacute je ten vlastniacute nosi Pot ebujeme o n m miacutet n jakeacute informace m ou tam byt
stopy po smazaniacute n jakyacutech dat a chceme je t eba miacutet Disk image d lali ze veho moneacuteho hybridniacute dvd Zvuky kde byla i data atd
- Jakyacute disk image byl m li pouiacutet Ne jen jeden formaacutet disk image pro vechna data pro kadeacutespeciaacutelniacute disk image formaacutet
- D lali to robotama disk copying robots n kdy large scal disk copying robots nelo pouiacutetumiacute dob e vyraacuteb t cd ale ne ripovat data z cd
SPOLUFINANCOVAacuteNO ZE STRUKTURAacuteLNIacuteCH FOND EU (EVROPSKEacuteHO FONDU PRO REGIONAacuteLNIacute ROZVOJ) PROST EDNICTVIacuteM IOP
10
- Ud lali si svoji aplikaci s diska stacks a n jakeacute meniacute roboty pouiacutevali LIFO nebo FIFOnakonec pouili fifo lifo mel probleacutemy se zveda kou CD
V Kb promysleli pom rn sloiteacute workflow jak to popsat atd
U kadeacuteho robota m li PC
Probleacutemy m li s radou v ci see presentation
Nenali doby sw pro management imagu jen command liny ale netechnicky staff by nasekalradu bot
Je to hodn lidi ne se to dostane na online
Musiacute byt dob e vychovaniacute flexibiln ale taky umet d lat tidieus jobs systematic patient
POZOR d leiteacute pro NK kde se p evod dat z disk bude takeacute eit a u i eil
SPOLUFINANCOVAacuteNO ZE STRUKTURAacuteLNIacuteCH FOND EU (EVROPSKEacuteHO FONDU PRO REGIONAacuteLNIacute ROZVOJ) PROST EDNICTVIacuteM IOP
11
Keep propjekt Antonio Cuiffreda towards integrated migrationenvironment
- Disk transfer tool P evede disk na image file Obsahuje tady daliacute metadata o file systeacutemua md5 souboru atd
- Keep vyraacuteb jiacute Transfer tool Framework- Magnetic media disk transfer tools for flopy disk komer niacute a opensource- Disk2FDI komer niacute DOS tool velmi p esnyacute image floppy disku trvaacute mu to 1 hodinu a celyacute
to je pak velmi velkyacute desetkraacutet vetiacute ne byl vlastniacute floppy disk Testoval asi 2260 diskutestovali emulaci
- Catweasel komer niacute nastroj je to PCI card bezi na linuxech a win xp ma gui Velkachybovost ale rychlejsi image file kvalita byla nizka
- Nibtools free tool G64 a D54 covers ony C64 dos win linux ale to command linPot ebuje commodor disk drive a special cables Testovali par disku asi p lka nefungovalapak v emulaacutetoru
- Optical media pouili 5 transfer tools u vech stejny cd a dvd a games1 Alcohol 120 komer niacute umiacute obchaacutezet drm atd support Win systems
Ze 13 fungovalo 12 2MB za sekundu2 Deamon tools commercial n kolik typu image files ISOP MDS MDF support win tri ze 13
nbefungovaly3 CloneCD commercial pouiacutevaacute IMG nebo ISO obchaacuteziacute safedisk3 protection support Win
ma gui 11 bylo ok ze 134 Blindwrite commercial podporuje dvd blue ray WM Xbox a daliacute speciaacutelniacute disky
Generuje ISO a n jakyacute proprietaacuterniacute formaacutety iso imagu jeden nefungoval rychlost stahovaniacute5 ImgBurn ne te do image file subchannel informace (nelze posouvat film atd) je to
opensource generuje dvd bin cue img win a linux 4 nefungovali je rychlyacute
Conclusions
Magnetic media: there is no difference in performance between commercial and non-commercial tools. Disk2FDI is accurate but very slow. KEEP will use Nibtools.
Optical media: copy protection has to be kept in mind. The tools perform similarly, and there will always be errors in the images, between 10 and 30 percent. BlindWrite handles game discs (Xbox etc.). KEEP will use ImgBurn because it is open source.
For complex images BlindWrite is the better choice.
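The error range quoted in the conclusions can be checked against the per-tool results in these notes, where each tool was run on 13 discs. The sketch below only redoes that arithmetic. The failure counts are read from the notes above (for CloneCD and Alcohol 120 they are derived from the number of discs that worked), and the function name is illustrative.

```python
DISCS_TESTED = 13

# Failed discs per tool, as recorded in the notes.
FAILURES = {
    "Alcohol 120": 1,   # 12 of 13 worked
    "Daemon Tools": 3,
    "CloneCD": 2,       # 11 of 13 were OK
    "BlindWrite": 1,
    "ImgBurn": 4,
}


def failure_rates(failures, total):
    """Return the failure rate of each tool in percent, rounded to one decimal."""
    return {tool: round(100 * count / total, 1) for tool, count in failures.items()}


if __name__ == "__main__":
    for tool, rate in failure_rates(FAILURES, DISCS_TESTED).items():
        print(f"{tool}: {rate} % of discs failed")
```

The rates run from roughly 8 to 31 percent, which is consistent with the range given in the conclusions.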
Benefit for NK
Consider whether NK should really set up a project to migrate the content of CDs and DVDs to online media. The presenters shared concrete experience with robotic processing and showed the problems they had in designing the workflow, choosing the type of ISO image, etc.
BnF web archiving
They have three layers:
Harvest definition: collection
Harvest instance: crawling metadata
ARC files
A collection would be, for example, the selective "Elections 2000" harvest; below it are the individual harvest instances, and below those the ARC files. They collect logs, configuration and reports. These are also stored in ARC, as special metadata ARCs for each crawling instance.
PREMIS
Object, agent, event
Objects: 1. ARC files and metadata ARCs, 2. harvest instances
Harvest event as a PREMIS event: creation of content files
Events: reports as extensions of the event (host report and harvest report)
Agents: administrator, software, institutions and organizations that perform the harvest
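The object, agent and event structure noted above can be pictured as a small data model. The following is only an illustrative sketch of how the three entity types relate in a harvest. The class and field names are assumptions made for this example and do not reproduce the PREMIS schema or the BnF implementation.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Agent:
    name: str
    role: str  # e.g. "administrator", "software", "institution"


@dataclass
class ArchivedObject:
    identifier: str
    kind: str  # "arc_file", "metadata_arc" or "harvest_instance"


@dataclass
class HarvestEvent:
    harvest_instance: ArchivedObject
    created_files: List[ArchivedObject]
    agents: List[Agent]
    reports: List[str] = field(default_factory=list)  # host report, harvest report
    event_type: str = "creation of content files"


def describe(event):
    """Summarise one harvest event in a single line."""
    files = ", ".join(obj.identifier for obj in event.created_files)
    who = ", ".join(agent.name for agent in event.agents)
    return f"{event.event_type}: {event.harvest_instance.identifier} -> [{files}] by {who}"


if __name__ == "__main__":
    event = HarvestEvent(
        harvest_instance=ArchivedObject("elections-2000/crawl-01", "harvest_instance"),
        created_files=[ArchivedObject("crawl-01.arc", "arc_file"),
                       ArchivedObject("crawl-01-meta.arc", "metadata_arc")],
        agents=[Agent("crawler", "software")],
        reports=["host report", "harvest report"],
    )
    print(describe(event))
```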
ContainerMD
httpbibnumbnffrcontainerMD v1documentationcontainerMD v1html
special metadata for items from the web archive
httpbibnumbnffrcontainerMD v1
A different SLA applies to different types of material and to data from different harvests. From the shared repository they expect various benefits; skills for different formats do not have to be duplicated within the institution.
Next year there should also be a JHOVE2 module for WARC.
Memento: a meta search engine
Benefit for NK
Their web archiving model could be used at NK.
Cost models: Danish National Library + TU Wien
- Stephan Strodl (TU Wien): they have their own cost model, but it seems to cover only the cost of small-scale automated preservation actions.
- The Danish National Library built its own model, which is meant to be universal and usable anywhere.
- They measured the cost of submission according to the PAIMAS standard.
- When calculating costs they use OAIS and PAIMAS: they map activities onto these models and estimate process costs accordingly.
- Costmodelfordigitalpreservationdk
Benefit for NK
Relevant to project 0136, which dealt with ways of estimating the costs of long-term storage.
Meet RODA: a Full-Fledged Digital Repository for Long-Term Preservation
- Originally a project of the Portuguese National Archive, which we have been following for several years. RODA is now supported, and partly further developed, by an independent company. So far RODA supports only the archival metadata format (EAD), but further development should also cover library formats.
- RODA is now part of the SCAPE project, where the system can be developed further and scaled for use in mass production.
- httpredminekeepptprojectsroda public
Benefit for NK
Follow further development, possibly also for the INCAD + KNAV projects. For developing LTP at smaller institutions this could be an interesting alternative in the future.
Report from a foreign business trip
Name and surname of the participant: Mgr. Andrea Fojt (AF)
Workplace according to the organizational structure: 1.5 Department of Long-Term Preservation of Digital Data (ODODD)
Workplace assignment: 1.5.2 Digital Repository Content Management Department
Reason for the trip: participation in the conference Aligning National Approaches to Digital Preservation
Place, city: Tallinn
Place, country: Estonia
Date (from-to): 22-26 May 2011
Detailed schedule:
Monday 23 May: registration, guided tour of the National Library, Keynote Address by Laura Campbell (Library of Congress, USA), Panel 1 Technical Alignment, Panel 2 Organizational Alignment
Tuesday 24 May: Keynote Address by Gunnar Sahlin (National Library of Sweden), Panel 3 Standards Alignment, Panel 4 Legal Alignment, Breakout Sessions for panels 3 & 4
Wednesday 25 May: Panel 5 Education Alignment, Panel 6 Economic Alignment, Breakout Sessions for panels 5 & 6, Synthesis, Closing remarks
Fellow travellers from NK: PhDr. Bohdana Stoklasová (BS), Ing. Tomáš Svoboda (TS)
Funding: IOP NDK
Goals of the trip: attendance at a conference with international participation; gaining contacts in the field of long-term preservation of digital documents and electronic legal deposit; a more detailed insight into the issues of long-term preservation of digital documents, especially in national libraries
Fulfilment of the goals (specifically): The conclusions of the conference Aligning National Approaches to Digital Preservation largely mirror the conclusions of the workshop The Future of the Past: Shaping new visions for EU research in digital preservation (report available at httpcordiseuropaeufp7icttelearn digicultfuture of thepast_enpdf), for example regarding the missing economic model for the commercial sector, which would treat long-term preservation as an integral part of all its processes. Contact was established with staff of the national libraries of Estonia and Finland (staff responsible for web archiving and for long-term preservation in general).
Program and further details: The main goal of the conference was to align national approaches to long-term preservation of digital documents across all areas, from technical, organizational and educational through standardization to economic and financial.
Materials brought back: conference program, leaflets of the exhibiting companies (Tessella, Equella, Guardtime) + other materials, notes
Date of submission of the report: 8 June 2011
Signature of the submitter:
Signature of the supervisor:
Posted on the intranet:
Received in the international department:
Appendix to this report: Conference notes in English
Appendix: Conference notes in English
Exploring What We Can Do Together: Strategic Alliance for International Collaboration (Laura Campbell)
- 185 digital preservation partners in more than 25 countries (education, research, LAM)
- strategic goals: National Content Stewardship Network (national digital collection, technical architecture, public policy, outreach)
- NDIIPP Content Domain Map: a mind map of geospatial, audiovisual, image & text, and web content
- then & now: cognitive surplus vs. digital libraries/digital preservation
- solution framework: actively working together, special interest groups, establishing a common index, international digital collection (freely available)
PANEL 1: Technical Alignment (The role of testing), Prof. Dr. Michael Seadle (Panel Head)
- to collaborate on requiring and implementing rigorous and independent tests
DNB Contribution to the Tallinn Alignment (Sabine Schrimpf)
- key theme is infrastructure: the network of hardware and software that permits the operation of software applications
- the question of interoperability is crucial (standards, technical specifications, SW elements)
- components of the DP infrastructure were compared to the pallets at railroads
o Source: PARSE.Insight Roadmap 2020
- persistent ID resolvers, certification process
- kopal (ingest: koLibRI)
- nestor (German Network of Expertise)
- DP4Lib: Digital Preservation as a Service, reduce dependency between components
o redundant storage at different locations
o koLibRI modules
- LuKII: set up as an economical LOCKSS network in Germany
- SHAMAN
- APARSEN: wants to bring coherence and cohesion to digital preservation research
o trends in DP research projects
o modular DP systems
o distributed as SOA
o elimination of technology dependencies
EDINA: The UK LOCKSS Alliance (Adam Russbridge)
- EDINA offers underlying technical support & coordination
- threats to digital stewardship
o failure (media, HW, SW, network, format obsolescence, natural disaster, economic/organizational failure)
o attack (insider/outsider)
o operator error
- source: Requirement to Digital Preservation
- projects PEPRS and PECAN help identify coverage and requirements for DP
Public testing (Michael Seadle)
- traditional physical archiving relies heavily on trusted institutions
- distrust, not trust, needs to be the basis of digital archiving; testing therefore plays a key role
- goals of testing: demonstrate functionality, reveal weaknesses, provide data for planning improvements
- key issues for testing: integrity, authenticity (can the origin or genuineness be shown?), usability (can migration/emulation be demonstrated?), access, financial integrity
- DrWho (drwho1com)
- bit-stream testing is the most important; authenticity and usability may be impaired
o the type of storage media, the number of copies, and the frequency of checking and replacement get us to the relevant results
o no reliable metrics exist, however (what is an acceptable loss, etc.)
- without well-documented, peer-reviewed, publicly available test results, librarians are buying archiving systems on faith
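Bit-stream testing of the kind mentioned in this talk usually comes down to recomputing a checksum for every stored copy and comparing it with a reference value. The sketch below is a minimal illustration under that assumption. The function names, the choice of SHA-256 and the site labels are made up for the example and are not taken from the talk.

```python
import hashlib
import os
import tempfile


def sha256_of(path, chunk_size=1024 * 1024):
    """Compute the SHA-256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def audit_copies(expected_digest, copies):
    """Return the sorted names of copies that are missing or no longer match."""
    damaged = []
    for location, path in copies.items():
        if not os.path.exists(path) or sha256_of(path) != expected_digest:
            damaged.append(location)
    return sorted(damaged)


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as workdir:
        copies = {}
        # Two copies of the same object; the second has one altered byte.
        for site, payload in [("site_a", b"WARC data"), ("site_b", b"WARC dat4")]:
            copies[site] = os.path.join(workdir, site + ".bin")
            with open(copies[site], "wb") as f:
                f.write(payload)
        expected = hashlib.sha256(b"WARC data").hexdigest()
        print(audit_copies(expected, copies))
```

How often such an audit runs and how many copies it covers are exactly the parameters for which the talk notes that no agreed metrics exist.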
Presentation without a title (Andy Rauber)
- evaluation vs. testing vs. benchmarking
- DP testing: what is done today is evaluation rather than testing, and far from benchmarking (a few tests, but nowhere near a definition of benchmarks)
o existing evaluations are not repeatable
o focus on the simple things
o building the frameworks before having clear test scenarios
- necessary to move towards comparative benchmarking
- what is needed: a commitment to a culture of benchmarking and comparative evaluation; an understanding of what we want to benchmark; benchmark data + ground truth; measurement scales and measures that remain constant; knowledge bases to collect these
Organizing digital preservation on an international level (Michelle Gallinger, NDIIPP)
- focus on a national DP agenda
- community driven, action oriented (National Digital Stewardship Alliance)
o present a distributed national digital collection for the benefit of citizens
The European Research Arena (David Giaretta)
- technologies: GEANT, EGEE/EGI
- EU research projects: TIMBUS, BLOG4EVER, SCAPE, ENSURE, APARSEN, ARCOMEM, WF4EVER
o SCIDIP-ES (2011-2014)
- Alliance for Permanent Access (APA) formed as a legal entity 3 years ago
o opportunities for networking
- ISO 16363: Audit and Certification of Trustworthy Digital Repositories
- ISO 16919: Requirements for Bodies Providing Audit and Certification
Observation from the MetaArchive Cooperative program (Martin Halbert)
- distributed DP programs differ from other programs
o replication of content, distribution of these replicated copies to distinct geographical locations, and a network organization to connect these replicated copies
- MetaArchive: established in 2003, funded by NDIIPP
o seeks to foster broader awareness of digital preservation issues
- IIPC: members are all institutions that focus on WA
o 39 members: national and university libraries + other organizations (Internet Archive)
o ISO standard WARC format for web archives + Heritrix and NutchWAX
o growing membership (Africa & South America)
DAY 2
Keynote address: International and National Collaboration in the digital age (Gunnar Sahlin)
- 2012: a new law for e-deposit
- Samsök (search together) in 2005 (upgrade, new system)
- Swepub and long-term preservation
- consortium of the Swedish research libraries for licensing e-journals and databases (ICOLC)
- Open Access and e-publishing (all universities have their repositories for e-pubs)
- NL as aggregator for Europeana, TEL, Apres, Athena, EU screen
o common system for the preservation of digital materials
o common search portal for materials from the Swedish National Library and the Swedish National Archive
Raivo Ruusalepp
- standards: RAC, DSA, CIDOC (CRM), PAIMAS, ISAD(G), DDI
- use of information security standards for digital preservation
- information security: administrative and technical (physical = data security vs. IT = communication)
- company-implemented security measurement with typical cyber-crime scenarios
- survey of security
o provision for information security in national legislation and development plans (12 of the respondents; ISO 27000 series: only 2 formal audits, the rest are looking into it; ½ of the respondents do not use standards or formal measurements to control information security)
o IT & disaster plan: 65 (data recovery from the off-site location tested: 0)
- alignment
o better use of community standards for information security and preservation
o agreement on security requirements
Standards-based approach to preservation planning (Matthew Woollard)
- ISO 27001: very expensive implementation, 100 000 pounds
- Basic: Data Seal of Approval Guidelines (helps you understand your business better)
- Audit and Certification of Trustworthy Digital Repositories
- Memorandum of Understanding to Create a European Framework
- ISO 16363 external; DSA or ISO 16363 self-audit
Best Practices & Standards (Bram van der Werf)
- self-assessment: trust
- audits/certification: trust
- ISO 30300 (draft): Records Management
PANEL 4: Legal Alignment
Legal deposit & Web Archiving (Adrienne Muir)
- legal deposit provisions: purpose, scope, deposit mechanisms, roles & responsibilities & liability, access provisions, sanctions
- implementation: definitions, scope (offline/online, freely available/paywall), technology neutral, incremental
- legal deposit vs. voluntary (interim/hybrid approach, model agreements and licenses, flexibility)
- other legal issues: intellectual property rights, preservation, access, unlawful material, privacy/data protection
- voluntary approaches have disadvantages but may be necessary and can be useful
Breakout Session
- standards bring along alignment by themselves, but only if you use them to the full, not halfway
- depends on the community (users): enforceable or voluntary compliance (standards)
- next steps for standards alignment
o corpora as benchmarks
o export/import completeness
o educational standards
o validation tools
o accredited training courses to accredit auditors
o framework standards
DAY 3
PANEL 5: Educational Alignment
- key elements of the DCC Curation Lifecycle Model
- Framework for the Education Alignment (USA: grad programs for digital preservation, new models for grad programs, internship programs (diverse knowledge), workshops)
o sharing tools
o national programs
- issues related to digital preservation
o nature of costs and business models
o strategies for selection & appraisal
o ground roles and responsibilities
o effectiveness and demand for services
- focus of the panel: factors influencing the actual sustainability of a digital archive
- 2 considerations: collaboration + user demand
- challenges + gaps: span national boundaries, public + private funding, education, exportation, DP certification, competition, funding gaps, policy, selection criteria, roles & responsibilities, standards
- Magazzini Digitali: e-legal deposit in Italy
- PADI: a failed project (discontinued in 2010)
Presentation without a title (Neil Grindley)
- how much does it cost to manage information? what institutional financial strategies are required to facilitate effective preservation? what general economic frameworks are required to enable information to persist and be accessible?
- JISC 2010: Infrastructure for Education and Research Programme
- archival storage and preservation activities constitute a very small proportion of the overall costs: 15, against 31 for access and 55 for outreach, acquisition and ingest
o approx. 333 Euros for a set of 1000 records
- KRDS2, p. 83: future tool development supporting automation of ingest
Sustainable Preservation in North America: ADPNet & Friends (Aaron Trehub)
- solution: distributed digital preservation (in at least 3 copies vs. LOCKSS 6 copies)
- DDP + LOCKSS = PLN; open SW developed at Stanford
- MetaArchive, COPPUL (Canada), ADPNet (wwwadpnorg)
- They built their own application with disc stacks and some smaller robots. They tried LIFO and FIFO and ended up using FIFO, because LIFO had problems with the CD lifter.
At KB they thought through a fairly complex workflow, how to describe it, etc.
They had a PC at each robot.
They had problems with a number of things; see the presentation.
They did not find good software for managing images, only command-line tools, and non-technical staff would make a lot of mistakes with those.
It takes a lot of people before the content gets online.
Staff must be well trained and flexible, but also able to do tedious jobs: systematic and patient.
POZOR d leiteacute pro NK kde se p evod dat z disk bude takeacute eit a u i eil
SPOLUFINANCOVAacuteNO ZE STRUKTURAacuteLNIacuteCH FOND EU (EVROPSKEacuteHO FONDU PRO REGIONAacuteLNIacute ROZVOJ) PROST EDNICTVIacuteM IOP
11
Keep propjekt Antonio Cuiffreda towards integrated migrationenvironment
- Disk transfer tool P evede disk na image file Obsahuje tady daliacute metadata o file systeacutemua md5 souboru atd
- Keep vyraacuteb jiacute Transfer tool Framework- Magnetic media disk transfer tools for flopy disk komer niacute a opensource- Disk2FDI komer niacute DOS tool velmi p esnyacute image floppy disku trvaacute mu to 1 hodinu a celyacute
to je pak velmi velkyacute desetkraacutet vetiacute ne byl vlastniacute floppy disk Testoval asi 2260 diskutestovali emulaci
- Catweasel komer niacute nastroj je to PCI card bezi na linuxech a win xp ma gui Velkachybovost ale rychlejsi image file kvalita byla nizka
- Nibtools free tool G64 a D54 covers ony C64 dos win linux ale to command linPot ebuje commodor disk drive a special cables Testovali par disku asi p lka nefungovalapak v emulaacutetoru
- Optical media pouili 5 transfer tools u vech stejny cd a dvd a games1 Alcohol 120 komer niacute umiacute obchaacutezet drm atd support Win systems
Ze 13 fungovalo 12 2MB za sekundu2 Deamon tools commercial n kolik typu image files ISOP MDS MDF support win tri ze 13
nbefungovaly3 CloneCD commercial pouiacutevaacute IMG nebo ISO obchaacuteziacute safedisk3 protection support Win
ma gui 11 bylo ok ze 134 Blindwrite commercial podporuje dvd blue ray WM Xbox a daliacute speciaacutelniacute disky
Generuje ISO a n jakyacute proprietaacuterniacute formaacutety iso imagu jeden nefungoval rychlost stahovaniacute5 ImgBurn ne te do image file subchannel informace (nelze posouvat film atd) je to
opensource generuje dvd bin cue img win a linux 4 nefungovali je rychlyacute
Zaacutev ry
Pro magnetickaacute media komer niacute a nekomer niacute vyacutekon neniacute rozdiacutelnyacute disk2FDi je p esnyacute ale velmipomalyacute Keep pouije NibTools
Optical myslet na ochranu proti kopiacuterovaniacute majiacute podobny vyacutekon vdycky budou chyby v t chimages mezi 30 a 10 proc blindWrite umi herniacute disky xbox atd Keep pouije ImgBurn protoe je toopen source
Pro komplex images je lepiacute Blindwrite
P iacutenos pro NK
Zvaacuteit zda by v NK nebylo vhodneacute opravdu ud lat projekt na migraci obsahuCD a DVD na online media Zde prezentujiacute konkreacutetniacute zkuenosti s robotickyacutemzpracovaacutevaacuteniacute a ukazujiacute jakeacute probleacutemy m li s vymyacuteleniacutem workflow volboutypu ISO image atd
BNF archivace webuMajiacute tri vrstvy
Harvest definition collection
Harvest instance crawling metadata
SPOLUFINANCOVAacuteNO ZE STRUKTURAacuteLNIacuteCH FOND EU (EVROPSKEacuteHO FONDU PRO REGIONAacuteLNIacute ROZVOJ) PROST EDNICTVIacuteM IOP
12
ARC files
Collection bude naprilad selektivni volby 2000 pak jednotliveacute harvest instances a pak arcySbiacuterajiacute logy config a reportTohle skladuji takeacute v arcu specialni arc metadata pro kadyacute crawling instance
Premis
Object agent event
Objects1 arc files a metadata arcy2 harvest instances
Harvest event in premis event creation of content files
Events reporty jako extense eventu host report a harvest report
Agents afdministator sw instituitiolns organizations kteryacute perfomujou harvrst
ContainerMD
httpbibnumbnffrcontainerMD v1documentationcontainerMD v1html
zvlaacutetniacute metadata pro v ci z Web Archivu
httpbibnumbnffrcontainerMD v1
odlisny SLA pro ruzny typy materialu pro ruznaacute data z ruznych sklizni shared repository ocekavajiruzne benefity sklils pro ruzne formaty neniacute t eba v instituci dublovat
pristi rok by merl existovsat taky jhov2 modul pro warc
SPOLUFINANCOVAacuteNO ZE STRUKTURAacuteLNIacuteCH FOND EU (EVROPSKEacuteHO FONDU PRO REGIONAacuteLNIacute ROZVOJ) PROST EDNICTVIacuteM IOP
13
memento Meta vyhledava
P iacutenos pro NK
Jejich model archivace webu by se dal vyuiacutet v NK
Cost models daacutenskaacute NK + TUWien- Stephan strodl TU Viden majiacute sv j cost model ale jen small scale automated preservation
action cost se zda- Daacutenska naacuterodniacute knihovna d lali sv j model kteryacute by mel byt univerzaacutelniacute a pouitelnyacute
kdekoli- M ili cost of submission podle standardu paimas- P i po iacutetaacuteni cost pouiacutevajiacute oais a paimas mapuji aktivity na tyhle modely a pak podle toho
odhaduji ceny procesu- Costmodelfordigitalpreservationdk
P iacutenos pro NK
K projektu 0136 tam se eily monosti odhadovaacuteniacute naacuteklad na dlouhodobeacute uloeniacute
Meet RODA a Full Fledged Digital Repository for Long TermPreservation
- P vodn projekt Portugalskeacuteho naacuterodniacuteho archivu sledujeme a n kolik let Te systeacutemRODA podporuje nezaacutevislaacute firma a aacuteste n ho takeacute daacutele vyviacutejiacute Zatiacutem RODA podporuje pouzearchivniacute formaacutet metadat (EAD) ale daliacute vyacutevoj by m l zahrnout i knihovnickeacute formaacutety
- RODA je te sou aacutestiacute projektu SCAPE kde bude moneacute systeacutem daacutele vyviacutejet a kaacutelovat propouitiacute v masivniacute produkci
- httpredminekeepptprojectsroda public
P iacutenos pro NK
Sledovat daliacute vyacutevoj monaacute i pro projekty INCAD + KNAV pro vyacutevoj LTP pro meniacute instituce by tohlemohla byacutet v budoucnu zajiacutemavaacute alternativa
Zpraacuteva ze zahrani niacute sluebniacute cesty
Jmeacuteno a p iacutejmeniacute uacute astniacuteka cesty Mgr Andrea Fojt (AF)
Pracovit dle organiza niacute struktury 15 Odd leniacute dlouhodobeacute ochrany digitaacutelniacutech dat (ODODD)Pracovit za azeniacute 152 Odd leniacute spraacutevy obsahu digitaacutelniacuteho repozitaacute eD vod cesty Uacute ast na konferenci Aligning National Approches to Digital
preservationMiacutesto m sto TallinnMiacutesto zem EstonskoDatum (od do) 22 265 2011Podrobnyacute asovyacute harmonogram Pond liacute 235 registrace na konferenci komentovanaacute prohliacutedka
Naacuterodniacute knihovny Keynote Address by Laura Campbell Kongresovaacute knihovna USAPanel 1 Technical AlignmentPanel 2 Organizational Alignment
Uacuteteryacute 245 Keynote Address by Gunnar Sahlin Naacuterodniacuteknihovna veacutedskaPanel 3 Standards AlignmentPanel 4 Legal AlignmentBreakout Sessions for panels 3 amp 4
St eda 255Panel 5 Education AlignmentPanel 6 Economic AlignmentBreakout Sessions for panels 5amp6SynthesisClosing remarks
Spolucestujiacuteciacute z NK PhDr Bohdana Stoklasovaacute (BS)Ing Tomaacute Svoboda (TS)
Finan niacute zajit niacute IOP NDKCiacutele cesty P iacutetomnost na konferenci s mezinaacuterodniacute uacute astiacute ziacuteskaacuteniacute kontakt
pro oblast dlouhodobeacute ochrany digitaacutelniacuteho dokument apovinneacuteho elektronickeacuteho vyacutetisku podrobn jiacute vhled toproblematiky dlouhodobeacute ochrany digitaacutelniacutech dokument(zejmeacutena) v naacuterodniacutech knihovnaacutech
Pln niacute ciacutel cesty (konkreacutetn ) Zaacutev ry konference Aligning National Approaches to DigitalPreservation vesm s kopiacuterujiacute zaacutev ry wokshopu The Future ofthe Past Shaping new visions for EU research in digitalpreservation (zpraacuteva dostupnaacute nahttpcordiseuropaeufp7icttelearn digicultfuture of thepast_enpdf) nap v p iacutepadeacute chyb jiacuteciacute ekonomickeacuteho modelu prokomer niacute sfeacuteru kteraacute by dlouhodobou ochranu vniacutemala jakoneodd litelnou sou aacutest vech svyacutech proces Byl navaacutezaacuten kontakt s pracovniacuteky naacuterodniacutech knihoven Estonska aFinska (pracovniacuteciacute pro archivaci webu a dlouhodobou ochranuobecn )
Program a daliacute podrobn jiacute informace Hlavniacutem ciacutelem konference bylo sjednotit naacuterodniacute postupy v oblastidlouhodobeacute ochrany digitaacutelniacutech dokument nap iacute vemioblastmi od technickyacutech organiza niacutech vzd laacutevaciacutech a postandardiza niacute ekonomickeacute a finan niacute
P ivezeneacute materiaacutely konferen niacute program letaacuteky vystavujiacuteciacutech firem (Tessella EquellaGuardtime) + daliacute materiaacutely zaacutepisky
Datum p edloeniacute zpraacutevy 862011Podpis p edkladatele zpraacutevy
Podpis nad iacutezeneacuteho
Vloeno na IntranetP ijato v mezinaacuterodniacutem odd leniacute
P iacuteloha k teacuteto zpraacutev Poznaacutemky z konference v anglickeacutem jazyce
P iacuteloha Poznaacutemky z konference v anglickeacutem jazyce
Exploring What We Can Do Together Strategic Alliance for International Collaboration Laura Campbell
185 digital preservation partners in more than 25 countries (education research LAM)strategic goals National Content Stewardship Network (national digital collection technicalarchitecture public policy outreachNDIIPP Content Domain Map a mind map of geospatial audiovisual imageamptext and webcontentthen amp now cognitive surplus vs digital librariesdigital preservationsolution framework actively working together special interest groups establishing acommon index international digital collection (freely available)
PANEL 1Technical Alignment (The role of testing) Prof Dr Michael Seadle (Panel Head)
to collaborate on requiring and implementing rigorous and independent tests
DNB Contribution to the Tallinn Alignment Sabine Schrimpf
key theme is infrastructurenetwork of hard and software that permits operation of application of SWquestion of interoperability is crucial (standards technical specifications)SW elementsComponents of the DP infrastructure was compared to the pallets at railroads
o Source PARSEInsight Roadmap 2020PersistentID resolvers certification processkopal (ingest KoLIBri)nestor (German Network of Expertise)DP4Lib Digital Preservation as a Service reduce dependency between components
o redundant storage at different locationso KOLiBRI Modules
LUKII set up as an economical LOCKSS network in GermanySHAMANAPARSEN wants to bring coherence and cohesion to the digital preservation research
o trends in DP research projectso modular DP systemso distributed as SOAo elimination of technology dependencies
EDINA THE UK LOCKSS Alliance Adam Russbridge
EDINA offers underlying technical support amp coordinationthreats to digital stewardship
o failure (media HW SW network format obsolesce natural disastereconomicorganization failure)
o attack (insideroutsider)o operator error
source Requirement to Digital Preservationprojects PEPRS and PECAN help identify coverage and requirements for DP
Public testing Michael Seadle
traditional physical archiving relies heavily on trusted institutionsdistrust not trust need to be the basis of digital archiving testing therefore plays a key rolegoals of testing demonstrate functionality reveal weaknesses provide data for planningimprovementskey issues for testing integrity authenticity (can the origin or geniuses be shown) usability(can migrationemulation be demonstrated) access financial integrityDrWho (drwho1com)bit stream testing is the most important authenticity and usability may be impaired
o the type of storage media the number of copies + frequencies of checking andreplacement get us to the relevant results
o no reliable metrics exists however (what is an acceptable loss etc)without well documented peer reviewed publicly available test results librarians are buyingarchiving systems on faith
Presentation without a title Andy Rauber
evaluation vs testing vs benchmarkingDP testing and testing evaluation rather than testing far from benchmarking (few tests butnot near a definition of benchmarks)
o existing evaluations are not repeatableo focus on the simple thingso building the frameworks before having clear test scenarios
necessary to move towards comparative benchmarkingwhat is needed commit that we want a culture of benchmarking and comparativeevaluation understanding of what we want to benchmark benchmark data + ground truthmeasurement scales and measures that remain constant knowledge bases to collect these
Organizing digital preservation on an international level Michelle Gallinger (NDIIPP)
focus on an national DP agendacommunity driven action oriented (National Digital Stewardship Alliance)
o present a distributed national digital collection for the benefit of citizens
The European Research Arena David Giaretta
technologies GEANT EGEEEGIEU research projects TIMBUS BLOG4EVER SCAPE ENSURE APARSEN ARCOMEM WF4EVER
o SCIDIP ES (2011 2014)Alliance for Permanent Access (APA) formed as a legal entity 3 years ago
o opportunities for networkingISO 16363 Audit and Certification of Trustworthy Digital repositoriesISO 16919 Requirements for Bodies Providing Audit and Certification
Observation from the MetaArchive Cooperative program Martin Halbert
distributed DP programs different from other programso replication of content distribution of these replicated copies to distinct geographical
locations and network organization to connect these replicated copiesMetaArchive established in 2003 funded by NDIIPP
o seeks to foster broader awareness to digital preservation issuesIIPC members are all institutions that focus on WA
o 39 members national university libraries + other organizations (Internet Archive)
o ISO standard WARC format for web archives + Heretrix and Nutchwaxo growing membership (Africa amp South America)
DAY 2
Keynote addressInternational and National Collaboration in the digital age Gunnar Sahlin
2012 a new law for e depositSamsoumlk (search together) in 2005 (upgrade new system)Swepub and long term preservationConsortium of the Swedish research libraries for licensing e journals and databases (ICOLC)Open Access and e publishing (all universities have their repositories for e pubs)NL aggregator for the Europeana TEL Apres Athena EU screen
o common system for the preservation of digital materialso common search portal for materials from the Swedish National Library and Swedish
National Archive
Raivo Ruusalepp
standard RAC DSA CIDOC (CRM) PAIMAS ISA (DG) DDIuse of information security standards for digital preservationinformation security administrative and technical (physical = data security vs IT =communication)company implemented security measurement with typical cyber crime scenariossurvey of security
o provision for information security in national legislation and development plane (12of the respondents ISO 27000 series only 2 formal audits the rest are looking intoit frac12 of the respondents do not use standards or formal measurements to controlinformation security)
o IT amp disaster plan 65 (data recovery from the off site location tested 0)alignment
o better use of community standards for information security and preservationo agreement on security requirements
Standards based approach to preservation planning MatthewWoollard
ISO 27001 very expensive implementation 100 000 poundBasic Data Seal of Approval Guidelines (helps understand your business better)Audit And Certification of Trustworthy Digital RepositoriesMemorandum of Understanding to Create a European FrameworkISO 16363 external DSA or ISO 16363 self audit
Best Practices amp Standards Bram van der Werf
self assessment trust auditscertification trustISO 30300 (draft) Record Management
PANEL 4 Legal Alignment
Legal deposit amp Web Archiving Adrienne Muirlegal deposit provisions purpose scope deposit mechanisms roles amp responsibilities ampliability access provisions sanctionsimplementation definitions scope offlineonline freely availablepay well technologyneutralincrementallegal deposit vs voluntary (interimhybrid approach model agreements and licensesflexibility)other legal issues intellectual property rights preservation access unlawful materialprivacydata protectionvoluntary approaches have disadvantages but maybe necessary and can be useful
Breakout Session
standards bring along alignment by themselves, but only if you use them to the full, not halfway
depends on community (users): enforceable or voluntary compliance (standards)
next step for standard alignment
  o corpora as benchmarks
  o export/import completeness
  o educational standards
  o validation tools
  o accredited training courses to accredit auditors
  o framework standards
DAY 3
PANEL 5 Educational Alignment
key elements of the DCC Curation Lifecycle Model
Framework for the Education Alignment (USA: grad programs for digital preservation, new models for grad programs, internship programs (diverse knowledge), workshops)
  o sharing tools
  o national programs
related issues to digital preservation
  o nature of costs and business models
  o strategies for selection & appraisal
  o ground roles and responsibilities
  o effectiveness and demand for services
focus of the panel: factors influencing the actual sustainability of a digital archive
2 considerations: collaboration + user demand
challenges + gaps: span national boundaries, public + private funding, education, exportation, DP certification, competition, funding gaps, policy, selection criteria, roles & responsibilities, standards
Magazzini Digitali e legal deposit in Italy
PADI: a failed project (discontinued in 2010)
Presentation without a title - Neil Grindley
how much does it cost to manage information; what institutional financial strategies are required to facilitate effective preservation; what general economic frameworks are required to enable information to persist and be accessible
JISC 2010 Infrastructure for Education and Research Programme
archival storage and preservation activities constitute a very small proportion of the overall costs: 15%, against 31% for access and 55% for outreach, acquisition and ingest
  o approx 333 Euros for a set of 1000 records
KRDS2 p 83: future tool development supporting automation of ingest
Sustainable Preservation in North America: ADPNet & Friends - Aaron Trehub
solution: distributed digital preservation (in at least 3 copies vs LOCKSS 6 copies)
DDP + LOCKSS = PLN; open SW developed at Stanford
MetaArchive, COPPUL (Canada), ADPNet (www.adpn.org)
CO-FINANCED FROM THE EU STRUCTURAL FUNDS (EUROPEAN REGIONAL DEVELOPMENT FUND) THROUGH THE IOP
KEEP project - Antonio Ciuffreda: towards an integrated migration environment
- Disk transfer tool: converts a disk into an image file. The image also carries further metadata about the file system, the md5 of the file, etc.
- KEEP is producing a Transfer Tool Framework
- Magnetic media: disk transfer tools for floppy disks, commercial and open source
- Disk2FDI: commercial DOS tool, produces a very accurate image of a floppy disk, but takes 1 hour and the result is very large, ten times bigger than the floppy disk itself. About 2260 disks were tested; emulation was tested too
- Catweasel: commercial tool, a PCI card that runs on Linux and Win XP and has a GUI. High error rate but faster; the quality of the image file was low
- Nibtools: free tool, G64 and D64, covers only C64; DOS, Win, Linux, but command line only. Needs a Commodore disk drive and special cables. A few disks were tested; about half did not work afterwards in the emulator
- Optical media: 5 transfer tools were used, all on the same CDs, DVDs and games
  1. Alcohol 120%: commercial, can bypass DRM etc., supports Win systems. 12 of 13 worked; 2 MB per second
  2. Daemon Tools: commercial, several image file types (ISO, MDS, MDF), supports Win; three of 13 did not work
  3. CloneCD: commercial, uses IMG or ISO, bypasses SafeDisc 3 protection, supports Win, has a GUI; 11 of 13 were OK
  4. BlindWrite: commercial, supports DVD, Blu-ray, WM, Xbox and other special discs. Generates ISO and some proprietary ISO image formats; one did not work; download speed
  5. ImgBurn: does not read subchannel information into the image file (a film cannot be seeked etc.); it is open source, generates DVD, BIN, CUE, IMG; Win and Linux; 4 did not work; it is fast
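The per-tool results above can be turned into failure rates with a few lines of arithmetic. This is a small Python sketch, not something shown in the presentation: the counts are the ones recorded in these notes (out of 13 test discs), while the names `TESTED`, `working` and `failure_rate` are purely illustrative.

```python
# Discs read successfully per tool, out of 13 test discs, as recorded in the notes.
TESTED = 13
working = {
    "Alcohol 120%": 12,
    "Daemon Tools": 10,   # three of 13 did not work
    "CloneCD": 11,
    "BlindWrite": 12,     # one did not work
    "ImgBurn": 9,         # four did not work
}


def failure_rate(ok: int, total: int = TESTED) -> float:
    """Percentage of discs that could not be imaged, rounded to one decimal."""
    return round(100 * (total - ok) / total, 1)


# Failure rate in percent for every tool.
rates = {tool: failure_rate(ok) for tool, ok in working.items()}

if __name__ == "__main__":
    for tool, rate in sorted(rates.items(), key=lambda item: item[1]):
        print(f"{tool:<14} {rate:5.1f} % failed")
```

The resulting spread, from roughly 8 to 31 percent, is of the same order as the "between 30 and 10 percent" error range quoted in the conclusions.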
Conclusions
For magnetic media, the performance of commercial and non-commercial tools does not differ; Disk2FDI is accurate but very slow. KEEP will use NibTools.
Optical media: copy protection has to be taken into account; the tools perform similarly and there will always be errors in the images, between 30 and 10 percent. BlindWrite can handle game discs (Xbox etc.). KEEP will use ImgBurn because it is open source.
For complex images BlindWrite is better.
Benefit for NK
Consider whether it would be appropriate for NK to actually run a project on migrating the content of CDs and DVDs to online media. The presenters share concrete experience with robotic processing and show what problems they had with designing the workflow, choosing the type of ISO image, etc.
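A migration workflow of this kind would record fixity metadata for every image it produces, much as the disk transfer tool stores the md5 of the file. The following is a minimal Python sketch under that assumption; it is not the KEEP tool, and the function name `image_fixity` and the returned field names are hypothetical.

```python
import hashlib
from pathlib import Path


def image_fixity(image_path: Path, chunk_size: int = 1 << 20) -> dict:
    """Return basic fixity metadata (name, size, MD5) for a disk image file.

    The file is read in chunks so that large CD/DVD images do not have to
    fit in memory.
    """
    digest = hashlib.md5()
    size = 0
    with image_path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
            size += len(chunk)
    return {
        "file": image_path.name,
        "size_bytes": size,
        "md5": digest.hexdigest(),
    }
```

MD5 is used here only because the notes mention it; a production workflow might prefer a stronger hash such as SHA-256, which `hashlib` also provides.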
BNF web archiving
They have three layers:
Harvest definition: collection
Harvest instance: crawling metadata
ARC files
A collection will be, for example, a selective one (elections 2000), then the individual harvest instances and then the ARCs. They collect logs, config and reports. These are also stored in ARC: special ARC metadata for each crawling instance.
PREMIS
Object, agent, event
Objects: 1. ARC files and metadata ARCs, 2. harvest instances
Harvest event in PREMIS: event = creation of content files
Events: reports as extensions of the event (host report and harvest report)
Agents: administrator, software, institutions, organizations that perform the harvest
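The object/agent/event mapping described above can be pictured as a small data model. The Python sketch below only illustrates the relationships in these notes; it is not the PREMIS schema and not BNF's implementation, and all class names, field names and identifiers in it are assumptions.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class Agent:
    identifier: str
    kind: str  # e.g. "administrator", "software", "institution"


@dataclass
class PremisObject:
    identifier: str
    kind: str  # "arc_file", "metadata_arc" or "harvest_instance"


@dataclass
class Event:
    event_type: str                 # e.g. "creation of content files"
    objects: List[PremisObject]     # what the event produced or touched
    agents: List[Agent]             # who or what performed it
    reports: Dict[str, str] = field(default_factory=dict)  # event extensions


def example_harvest_event() -> Event:
    """Build one hypothetical harvest event linking objects and agents."""
    crawler = Agent("agent:crawler", "software")
    library = Agent("agent:library", "institution")
    instance = PremisObject("harvest:2000-elections-01", "harvest_instance")
    content = PremisObject("arc:0001", "arc_file")
    metadata = PremisObject("arc:0001-meta", "metadata_arc")
    return Event(
        event_type="creation of content files",
        objects=[instance, content, metadata],
        agents=[crawler, library],
        reports={"host_report": "hosts.txt", "harvest_report": "harvest.txt"},
    )
```

Keeping the host and harvest reports as a dictionary on the event mirrors the note that reports are handled as extensions of the event rather than as separate objects.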
ContainerMD
httpbibnumbnffrcontainerMD v1documentationcontainerMD v1html
special metadata for items from the web archive
httpbibnumbnffrcontainerMD v1
different SLAs for different types of material and for different data from different harvests; shared repository; they expect different benefits; skills for different formats do not have to be duplicated within the institution
next year there should also be a JHOVE2 module for WARC
Memento: meta search engine
Benefit for NK
Their web archiving model could be used at NK.
Cost models: Danish national library + TU Wien
- Stephan Strodl, TU Wien: they have their own cost model, but only small scale; automated preservation action cost seems [...]
- Danish national library: they built their own model, which should be universal and usable anywhere
- They measured the cost of submission according to the PAIMAS standard
- When calculating costs they use OAIS and PAIMAS, map activities onto these models and then estimate the costs of processes accordingly
- costmodelfordigitalpreservation.dk
Benefit for NK
Relevant to project 0136: the session dealt with options for estimating the costs of long-term storage.
Meet RODA, a Full-Fledged Digital Repository for Long-Term Preservation
- Originally a project of the Portuguese national archive; we have been following it for several years. The RODA system is now supported by an independent company, which also partly continues its development. So far RODA supports only the archival metadata format (EAD), but further development should also include library formats
- RODA is now part of the SCAPE project, where it will be possible to develop the system further and scale it for use in mass production
- httpredminekeepptprojectsroda public
Benefit for NK
Follow further development, possibly also for the INCAD + KNAV projects; for developing LTP for smaller institutions this could be an interesting alternative in the future.
Report on a foreign business trip
Name and surname of the trip participant: Mgr. Andrea Fojtů (AF)
Workplace according to the organizational structure: 15 Department of Long-Term Preservation of Digital Data (ODODD)
Workplace assignment: 152 Department of Digital Repository Content Management
Reason for the trip: participation in the conference Aligning National Approaches to Digital Preservation
Place, city: Tallinn
Place, country: Estonia
Date (from-to): 22-26 May 2011
Detailed schedule:
Monday 23 May: registration for the conference, guided tour of the National Library, Keynote Address by Laura Campbell (Library of Congress, USA), Panel 1 Technical Alignment, Panel 2 Organizational Alignment
Tuesday 24 May: Keynote Address by Gunnar Sahlin (National Library of Sweden), Panel 3 Standards Alignment, Panel 4 Legal Alignment, Breakout Sessions for panels 3 & 4
Wednesday 25 May: Panel 5 Education Alignment, Panel 6 Economic Alignment, Breakout Sessions for panels 5 & 6, Synthesis, Closing remarks
Fellow travellers from NK: PhDr. Bohdana Stoklasová (BS), Ing. Tomáš Svoboda (TS)
Funding: IOP NDK
Objectives of the trip: attendance at a conference with international participation; gaining contacts in the area of long-term preservation of digital documents and electronic legal deposit; a more detailed insight into the issues of long-term preservation of digital documents (especially) in national libraries
Fulfilment of the objectives (specifically): The conclusions of the conference Aligning National Approaches to Digital Preservation largely copy the conclusions of the workshop The Future of the Past: Shaping new visions for EU research in digital preservation (report available at httpcordiseuropaeufp7icttelearn digicultfuture of thepast_enpdf), for example regarding the missing economic model for the commercial sector, which would treat long-term preservation as an integral part of all its processes. Contact was established with staff of the national libraries of Estonia and Finland (staff for web archiving and for long-term preservation in general).
Programme and further details: The main aim of the conference was to align national approaches to long-term preservation of digital documents across all areas, from technical, organizational and educational to standardization, economic and financial.
Materials brought back: conference programme, leaflets of exhibiting companies (Tessella, Equella, Guardtime) + other materials, notes
Date of submission of the report: 8 June 2011
Signature of the person submitting the report
Signature of the superior
Posted on the Intranet
Received by the international department
Appendix to this report: conference notes in English
Appendix: Conference notes in English
Exploring What We Can Do Together: Strategic Alliance for International Collaboration - Laura Campbell
185 digital preservation partners in more than 25 countries (education, research, LAM)
strategic goals: National Content Stewardship Network (national digital collection, technical architecture, public policy, outreach)
NDIIPP Content Domain Map: a mind map of geospatial, audiovisual, image & text and web content
then & now: cognitive surplus vs digital libraries/digital preservation
solution framework: actively working together, special interest groups, establishing a common index, international digital collection (freely available)
PANEL 1
Technical Alignment (The role of testing) - Prof. Dr. Michael Seadle (Panel Head)
to collaborate on requiring and implementing rigorous and independent tests
DNB Contribution to the Tallinn Alignment - Sabine Schrimpf
key theme is infrastructure
network of hardware and software that permits the operation of application SW
question of interoperability is crucial (standards, technical specifications)
SW elements
components of the DP infrastructure were compared to the pallets at railroads
  o Source: PARSE.Insight Roadmap 2020
Persistent ID resolvers, certification process
kopal (ingest: koLibRI)
nestor (German Network of Expertise)
DP4lib: Digital Preservation as a Service; reduce dependency between components
  o redundant storage at different locations
  o koLibRI modules
LuKII: set up as an economical LOCKSS network in Germany
SHAMAN
APARSEN: wants to bring coherence and cohesion to digital preservation research
  o trends in DP research projects
  o modular DP systems
  o distributed as SOA
  o elimination of technology dependencies
EDINA: The UK LOCKSS Alliance - Adam Rusbridge
EDINA offers underlying technical support & coordination
threats to digital stewardship
  o failure (media, HW, SW, network, format obsolescence, natural disaster, economic/organizational failure)
  o attack (insider/outsider)
  o operator error
source: Requirements for Digital Preservation
projects PEPRS and PECAN help identify coverage and requirements for DP
Public testing - Michael Seadle
traditional physical archiving relies heavily on trusted institutions
distrust, not trust, needs to be the basis of digital archiving; testing therefore plays a key role
goals of testing: demonstrate functionality, reveal weaknesses, provide data for planning improvements
key issues for testing: integrity, authenticity (can the origin or genuineness be shown), usability (can migration/emulation be demonstrated), access, financial integrity
DrWho (drwho1com)
bit stream testing is the most important; authenticity and usability may be impaired
  o the type of storage media, the number of copies + the frequency of checking and replacement get us to the relevant results
  o no reliable metrics exist, however (what is an acceptable loss etc.)
without well documented, peer reviewed, publicly available test results, librarians are buying archiving systems on faith
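Bit stream testing across several copies can be sketched as a checksum comparison with a majority vote. The Python example below is only an illustration of that idea, assuming the copies are small enough to hold in memory; it is not the LOCKSS polling protocol or any tool named in the talk, and `audit_copies` and the site names used with it are hypothetical.

```python
import hashlib
from collections import Counter
from typing import Dict, List, Optional, Tuple


def audit_copies(copies: Dict[str, bytes]) -> Tuple[Optional[str], List[str]]:
    """Compare replicated copies of one bit stream by SHA-256 digest.

    Returns (reference_digest, suspect_sites). The reference is the digest
    shared by a strict majority of copies; sites that disagree with it are
    suspects. Without a strict majority no copy can be trusted, so the
    reference is None and every site is reported.
    """
    if not copies:
        raise ValueError("at least one copy is required")
    digests = {site: hashlib.sha256(data).hexdigest() for site, data in copies.items()}
    reference, votes = Counter(digests.values()).most_common(1)[0]
    if votes * 2 <= len(copies):
        return None, sorted(copies)
    suspects = sorted(site for site, digest in digests.items() if digest != reference)
    return reference, suspects
```

With three copies a single damaged replica can be identified and repaired from the other two; with only two copies a mismatch shows that something is wrong but not which copy is the good one, which is one reason the number of copies matters as much as the frequency of checking.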
Presentation without a title - Andy Rauber
evaluation vs testing vs benchmarking
DP: testing and testing evaluation rather than testing; far from benchmarking (few tests, but not near a definition of benchmarks)
  o existing evaluations are not repeatable
  o focus on the simple things
  o building the frameworks before having clear test scenarios
necessary to move towards comparative benchmarking
what is needed: commit that we want a culture of benchmarking and comparative evaluation; understanding of what we want to benchmark; benchmark data + ground truth; measurement scales and measures that remain constant; knowledge bases to collect these
Organizing digital preservation on an international level - Michelle Gallinger (NDIIPP)
focus on a national DP agenda
community driven, action oriented (National Digital Stewardship Alliance)
  o present a distributed national digital collection for the benefit of citizens
The European Research Arena - David Giaretta
technologies: GEANT, EGEE/EGI
EU research projects: TIMBUS, BLOG4EVER, SCAPE, ENSURE, APARSEN, ARCOMEM, WF4EVER
  o SCIDIP-ES (2011-2014)
Alliance for Permanent Access (APA): formed as a legal entity 3 years ago
  o opportunities for networking
ISO 16363 Audit and Certification of Trustworthy Digital Repositories
ISO 16919 Requirements for Bodies Providing Audit and Certification
Observation from the MetaArchive Cooperative program - Martin Halbert
distributed DP programs differ from other programs
  o replication of content, distribution of these replicated copies to distinct geographical locations, and network organization to connect these replicated copies
MetaArchive: established in 2003, funded by NDIIPP
  o seeks to foster broader awareness of digital preservation issues
IIPC: members are all institutions that focus on WA
  o 39 members: national and university libraries + other organizations (Internet Archive)
  o ISO standard WARC format for web archives + Heritrix and NutchWAX
  o growing membership (Africa & South America)
DAY 2
Keynote address
International and National Collaboration in the digital age - Gunnar Sahlin
2012: a new law for e-deposit
Samsök (search together) in 2005 (upgrade, new system)
Swepub and long term preservation
Consortium of the Swedish research libraries for licensing e-journals and databases (ICOLC)
Open Access and e-publishing (all universities have their repositories for e-pubs)
NL: aggregator for the Europeana; TEL, Apres, Athena, EU screen
how much does it cost to manage information what institutional financial strategies arerequired to facilitate effective preservation what general economic frameworks arerequired to enable information to persist and be accessible
JISC 2010 Infrastructure for Education and Research Programmearchival storage and preservation activities are constituting a very small proportion of theoverall costs 1531 access 55 outreach acquisition ingest
o approx 333 Euros for a set of 1000 recordsKRDS2 p 83 future tool development supporting automation of ingest
Sustainable Preservation in North America ADPNet amp Friends Aaron Trehub
solution distributed digital preservation (in at least 3 copies vs LOCKSS 6 copies)DPP + LOCKSS = PLN open SW developed at StanfordMetaArchive COPPUL Canada ADPNet wwwadpnorg
Co-financed by the EU Structural Funds (European Regional Development Fund) through the IOP
Memento: meta-search
Benefit for NK
Their web archiving model could be used at NK.
Cost models: Danish National Library + TU Wien
- Stephan Strodl (TU Wien): they have their own cost model, but only for small-scale automated preservation; the action cost seems ...
- Danish National Library: they built their own model, which should be universal and usable anywhere
- They measured the cost of submission according to the PAIMAS standard
- When calculating costs they use OAIS and PAIMAS: they map activities onto these models and then estimate process costs accordingly
- costmodelfordigitalpreservation.dk
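The mapping described above (activities assigned to OAIS/PAIMAS phases, then costed) can be illustrated with a minimal sketch. This is not the Danish model itself: the activity names, hours and hourly rates below are hypothetical, and the cost formula (hours times rate) is an assumption chosen only to show the structure.

```python
from dataclasses import dataclass


@dataclass
class Activity:
    name: str               # hypothetical activity label
    functional_entity: str  # OAIS functional entity or PAIMAS phase it is mapped to
    hours: float            # assumed staff effort
    hourly_rate: float      # assumed cost per hour


def cost_by_entity(activities):
    """Sum hours * rate for every functional entity the activities are mapped to."""
    totals = {}
    for a in activities:
        cost = a.hours * a.hourly_rate
        totals[a.functional_entity] = totals.get(a.functional_entity, 0.0) + cost
    return totals


# Illustrative figures only; none of them come from the presentations.
activities = [
    Activity("negotiate submission agreement", "Pre-Ingest (PAIMAS)", 10, 40.0),
    Activity("validate submission package", "Ingest", 5, 40.0),
    Activity("generate descriptive metadata", "Ingest", 3, 40.0),
    Activity("write to archival storage", "Archival Storage", 2, 40.0),
]

totals = cost_by_entity(activities)
grand_total = sum(totals.values())
```

Grouping by functional entity rather than by individual activity keeps the estimate comparable between institutions that organise their work differently but share the OAIS vocabulary.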
Benefit for NK
Relevant to project 0136, which addressed options for estimating the costs of long-term storage.
Meet RODA, a Full-Fledged Digital Repository for Long-Term Preservation
- Originally a project of the Portuguese National Archive; we have been following it for several years. RODA is now supported, and partly further developed, by an independent company. So far RODA supports only the archival metadata format (EAD), but further development should also cover library formats.
- RODA is now part of the SCAPE project, where the system can be developed further and scaled for use in mass production
- http://redmine.keep.pt/projects/roda-public
Benefit for NK
Follow further development, possibly also for the INCAD + KNAV projects; for LTP development at smaller institutions this could be an interesting alternative in the future.
Report on a foreign business trip
Name and surname of the participant: Mgr. Andrea Fojtů (AF)
Workplace according to the organizational structure: 1.5 Department of Long-Term Preservation of Digital Data (ODODD)
Workplace assignment: 1.5.2 Department of Digital Repository Content Management
Reason for the trip: participation in the conference Aligning National Approaches to Digital Preservation
Place (city): Tallinn
Place (country): Estonia
Dates (from-to): 22-26 May 2011
Detailed schedule:
Monday 23 May: conference registration, guided tour of the National Library, Keynote Address by Laura Campbell (Library of Congress, USA), Panel 1 Technical Alignment, Panel 2 Organizational Alignment
Tuesday 24 May: Keynote Address by Gunnar Sahlin (National Library of Sweden), Panel 3 Standards Alignment, Panel 4 Legal Alignment, Breakout Sessions for panels 3 & 4
Wednesday 25 May: Panel 5 Education Alignment, Panel 6 Economic Alignment, Breakout Sessions for panels 5 & 6, Synthesis, Closing remarks
Fellow travellers from NK: PhDr. Bohdana Stoklasová (BS), Ing. Tomáš Svoboda (TS)
Funding: IOP NDK
Goals of the trip: attendance at a conference with international participation; gaining contacts in the area of long-term preservation of digital documents and electronic legal deposit; a more detailed insight into long-term preservation of digital documents, (especially) in national libraries
Fulfilment of the trip goals (specifically): The conclusions of the conference Aligning National Approaches to Digital Preservation largely mirror the conclusions of the workshop The Future of the Past: Shaping new visions for EU research in digital preservation (report available at http://cordis.europa.eu/fp7/ict/telearn-digicult/future-of-the-past_en.pdf), for example regarding the missing economic model for the commercial sector, which would treat long-term preservation as an integral part of all its processes. Contact was established with staff of the national libraries of Estonia and Finland (staff working on web archiving and on long-term preservation in general).
Programme and further details: The main goal of the conference was to align national approaches to long-term preservation of digital documents across all areas, from technical, organizational and educational to standardization, economic and financial.
Materials brought back: conference programme, leaflets of exhibiting companies (Tessella, Equella, Guardtime) + further materials, notes
Date of submission of the report: 8 June 2011
Signature of the submitter
Signature of the supervisor
Posted on the Intranet
Received by the international department
Appendix to this report: conference notes in English
Appendix: Conference notes in English
Exploring What We Can Do Together: Strategic Alliance for International Collaboration (Laura Campbell)
- 185 digital preservation partners in more than 25 countries (education, research, LAM)
- strategic goals: National Content Stewardship Network (national digital collection, technical architecture, public policy, outreach)
- NDIIPP Content Domain Map: a mind map of geospatial, audiovisual, image & text, and web content
- then & now: cognitive surplus vs digital libraries / digital preservation
- solution framework: actively working together, special interest groups, establishing a common index, international digital collection (freely available)
PANEL 1: Technical Alignment (The role of testing), Prof. Dr. Michael Seadle (Panel Head)
- to collaborate on requiring and implementing rigorous and independent tests
DNB Contribution to the Tallinn Alignment (Sabine Schrimpf)
- key theme is infrastructure: a network of hardware and software that permits the operation of application software
- the question of interoperability is crucial (standards, technical specifications)
- the software elements / components of the DP infrastructure were compared to pallets on railroads
  o source: PARSE.Insight Roadmap 2020
- persistent ID resolvers, certification process
- kopal (ingest: koLibRI)
- nestor (German Network of Expertise)
- DP4Lib: Digital Preservation as a Service; reduce dependency between components
  o redundant storage at different locations
  o koLibRI modules
- LuKII: set up as an economical LOCKSS network in Germany
- SHAMAN
- APARSEN: wants to bring coherence and cohesion to digital preservation research
  o trends in DP research projects
  o modular DP systems
  o distributed as SOA
  o elimination of technology dependencies
EDINA: The UK LOCKSS Alliance (Adam Rusbridge)
- EDINA offers underlying technical support & coordination
- threats to digital stewardship
  o failure (media, HW, SW, network, format obsolescence, natural disaster, economic/organizational failure)
  o attack (insider/outsider)
  o operator error
- source: Requirement to Digital Preservation
- projects PEPRS and PECAN help identify coverage and requirements for DP
Public testing (Michael Seadle)
- traditional physical archiving relies heavily on trusted institutions
- distrust, not trust, needs to be the basis of digital archiving; testing therefore plays a key role
- goals of testing: demonstrate functionality, reveal weaknesses, provide data for planning improvements
- key issues for testing: integrity, authenticity (can the origin or genuineness be shown?), usability (can migration/emulation be demonstrated?), access, financial integrity
- DrWho (drwho1com)
- bit stream testing is the most important; authenticity and usability may be impaired
  o the type of storage media, the number of copies + the frequency of checking and replacement get us to the relevant results
  o no reliable metrics exist, however (what is an acceptable loss, etc.)
- without well documented, peer-reviewed, publicly available test results, librarians are buying archiving systems on faith
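Bit stream testing of the kind mentioned above is commonly done by recomputing a checksum for each stored copy and comparing it with the value recorded at ingest. The following is a minimal sketch under that assumption; the function names and the choice of SHA-256 are illustrative and were not part of the talk.

```python
import hashlib


def sha256_of(path, chunk_size=1 << 20):
    """Compute the SHA-256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def audit_copies(copies, expected_digest):
    """Return the paths of all copies whose current digest differs
    from the digest recorded at ingest (hypothetical audit routine)."""
    return [str(path) for path in copies if sha256_of(path) != expected_digest]
```

How often such an audit runs, and over how many copies, are exactly the parameters the notes name as decisive; the sketch leaves both to the caller.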
Presentation without a title (Andy Rauber)
- evaluation vs testing vs benchmarking
- DP is at the stage of evaluation rather than testing, and far from benchmarking (a few tests, but nowhere near a definition of benchmarks)
  o existing evaluations are not repeatable
  o focus on the simple things
  o building the frameworks before having clear test scenarios
- necessary to move towards comparative benchmarking
- what is needed: a commitment that we want a culture of benchmarking and comparative evaluation; an understanding of what we want to benchmark; benchmark data + ground truth; measurement scales and measures that remain constant; knowledge bases to collect these
Organizing digital preservation on an international level (Michelle Gallinger, NDIIPP)
- focus on a national DP agenda
- community-driven, action-oriented (National Digital Stewardship Alliance)
  o present a distributed national digital collection for the benefit of citizens
The European Research Arena (David Giaretta)
- technologies: GEANT, EGEE/EGI
- EU research projects: TIMBUS, BLOG4EVER, SCAPE, ENSURE, APARSEN, ARCOMEM, WF4EVER
  o SCIDIP-ES (2011-2014)
- Alliance for Permanent Access (APA) formed as a legal entity 3 years ago
  o opportunities for networking
- ISO 16363: Audit and Certification of Trustworthy Digital Repositories
- ISO 16919: Requirements for Bodies Providing Audit and Certification
Observation from the MetaArchive Cooperative program (Martin Halbert)
- distributed DP programs differ from other programs
  o replication of content, distribution of these replicated copies to distinct geographical locations, and a network organization to connect these replicated copies
- MetaArchive established in 2003, funded by NDIIPP
  o seeks to foster broader awareness of digital preservation issues
- IIPC: members are all institutions that focus on WA
  o 39 members: national and university libraries + other organizations (Internet Archive)
  o ISO standard WARC format for web archives + Heritrix and NutchWAX
  o growing membership (Africa & South America)
DAY 2
Keynote address: International and National Collaboration in the Digital Age (Gunnar Sahlin)
- 2012: a new law for e-deposit
- Samsök (search together) in 2005 (upgrade, new system)
- Swepub and long-term preservation
- consortium of the Swedish research libraries for licensing e-journals and databases (ICOLC)
- Open Access and e-publishing (all universities have their repositories for e-pubs)
- NL as aggregator for Europeana: TEL, Apres, Athena, EUscreen
  o common system for the preservation of digital materials
  o common search portal for materials from the Swedish National Library and the Swedish National Archive
Raivo Ruusalepp
- standards: RAC, DSA, CIDOC (CRM), PAIMAS, ISAD(G), DDI
- use of information security standards for digital preservation
- information security: administrative and technical (physical = data security vs IT = communication)
- company-implemented security measurement with typical cyber crime scenarios
- survey of security
  o provision for information security in national legislation and development plans (12 of the respondents: ISO 27000 series; only 2 formal audits, the rest are looking into it; 1/2 of the respondents do not use standards or formal measurements to control information security)
  o IT & disaster plan: 65 (data recovery from the off-site location tested: 0)
- alignment
  o better use of community standards for information security and preservation
  o agreement on security requirements
Standards-based approach to preservation planning (Matthew Woollard)
- ISO 27001: very expensive implementation (100,000 pounds)
- basic: Data Seal of Approval Guidelines (helps you understand your business better)
- Audit and Certification of Trustworthy Digital Repositories
- Memorandum of Understanding to Create a European Framework
- ISO 16363 external audit; DSA or ISO 16363 self-audit
Best Practices & Standards (Bram van der Werf)
- self-assessment: trust
- audits/certification: trust
- ISO 30300 (draft): Record Management
PANEL 4: Legal Alignment
Legal deposit & Web Archiving (Adrienne Muir)
- legal deposit provisions: purpose, scope, deposit mechanisms, roles & responsibilities & liability, access provisions, sanctions
- implementation: definitions, scope (offline/online, freely available/pay wall), technology-neutral, incremental
- legal deposit vs voluntary (interim/hybrid approach, model agreements and licenses, flexibility)
- other legal issues: intellectual property rights, preservation, access, unlawful material, privacy/data protection
- voluntary approaches have disadvantages but may be necessary and can be useful
Breakout Session
- standards bring along alignment by themselves, but only if you use them to the full, not halfway
- depends on community (users): enforceable or voluntary compliance (standards)
- next steps for standards alignment
  o corpora as benchmarks
  o export/import completeness
  o educational standards
  o validation tools
  o accredited training courses to accredit auditors
  o framework standards
DAY 3
PANEL 5: Educational Alignment
- key elements of the DCC Curation Lifecycle Model
- Framework for the Education Alignment (USA: grad programs for digital preservation, new models for grad programs, internship programs (diverse knowledge), workshops)
  o sharing tools
  o national programs
- issues related to digital preservation
  o nature of costs and business models
  o strategies for selection & appraisal
  o ground roles and responsibilities
  o effectiveness and demand for services
- focus of the panel: factors influencing the actual sustainability of a digital archive
- 2 considerations: collaboration + user demand
- challenges + gaps: span national boundaries, public + private funding, education, exportation, DP certification, competition, funding gaps, policy, selection criteria, roles & responsibilities, standards
- Magazzini Digitali and legal deposit in Italy
- PADI: a failed project (discontinued in 2010)
Presentation without a title (Neil Grindley)
- how much does it cost to manage information? what institutional financial strategies are required to facilitate effective preservation? what general economic frameworks are required to enable information to persist and be accessible?
- JISC 2010: Infrastructure for Education and Research Programme
- archival storage and preservation activities constitute a very small proportion of the overall costs (1531 access, 55 outreach, acquisition, ingest)
  o approx. 333 euros for a set of 1000 records
- KRDS2 p. 83: future tool development supporting automation of ingest
Sustainable Preservation in North America: ADPNet & Friends (Aaron Trehub)
- solution: distributed digital preservation (in at least 3 copies vs LOCKSS 6 copies)
- DDP + LOCKSS = PLN (Private LOCKSS Network); open SW developed at Stanford
- MetaArchive, COPPUL (Canada), ADPNet (www.adpn.org)
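The reason for keeping several replicated copies is that a damaged copy can be recognised by comparing it with the others. The sketch below shows a plain majority vote over per-node checksums. It is a simplified illustration only, not the actual LOCKSS polling protocol; the node names and the `quorum` parameter are assumptions.

```python
from collections import Counter


def majority_digest(digests, quorum):
    """Given a mapping node -> checksum, return (winning checksum, dissenting nodes).

    If no checksum is reported by at least `quorum` nodes, return
    (None, all nodes), because no copy can be trusted over the others.
    """
    if not digests:
        return None, []
    winner, votes = Counter(digests.values()).most_common(1)[0]
    if votes < quorum:
        return None, sorted(digests)
    return winner, sorted(node for node, d in digests.items() if d != winner)


# Hypothetical three-node network in which one copy has been damaged.
reported = {"node-a": "9f2c", "node-b": "9f2c", "node-c": "17aa"}
agreed, to_repair = majority_digest(reported, quorum=2)
```

With only three copies a single damaged replica can still be outvoted; more copies tolerate more simultaneous failures, which is one way to read the "3 copies vs 6 copies" comparison in the notes.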
Program a daliacute podrobn jiacute informace Hlavniacutem ciacutelem konference bylo sjednotit naacuterodniacute postupy v oblastidlouhodobeacute ochrany digitaacutelniacutech dokument nap iacute vemioblastmi od technickyacutech organiza niacutech vzd laacutevaciacutech a postandardiza niacute ekonomickeacute a finan niacute
P ivezeneacute materiaacutely konferen niacute program letaacuteky vystavujiacuteciacutech firem (Tessella EquellaGuardtime) + daliacute materiaacutely zaacutepisky
Datum p edloeniacute zpraacutevy 862011Podpis p edkladatele zpraacutevy
Podpis nad iacutezeneacuteho
Vloeno na IntranetP ijato v mezinaacuterodniacutem odd leniacute
P iacuteloha k teacuteto zpraacutev Poznaacutemky z konference v anglickeacutem jazyce
P iacuteloha Poznaacutemky z konference v anglickeacutem jazyce
Exploring What We Can Do Together Strategic Alliance for International Collaboration Laura Campbell
185 digital preservation partners in more than 25 countries (education research LAM)strategic goals National Content Stewardship Network (national digital collection technicalarchitecture public policy outreachNDIIPP Content Domain Map a mind map of geospatial audiovisual imageamptext and webcontentthen amp now cognitive surplus vs digital librariesdigital preservationsolution framework actively working together special interest groups establishing acommon index international digital collection (freely available)
PANEL 1Technical Alignment (The role of testing) Prof Dr Michael Seadle (Panel Head)
to collaborate on requiring and implementing rigorous and independent tests
DNB Contribution to the Tallinn Alignment Sabine Schrimpf
key theme is infrastructurenetwork of hard and software that permits operation of application of SWquestion of interoperability is crucial (standards technical specifications)SW elementsComponents of the DP infrastructure was compared to the pallets at railroads
o Source PARSEInsight Roadmap 2020PersistentID resolvers certification processkopal (ingest KoLIBri)nestor (German Network of Expertise)DP4Lib Digital Preservation as a Service reduce dependency between components
o redundant storage at different locationso KOLiBRI Modules
LUKII set up as an economical LOCKSS network in GermanySHAMANAPARSEN wants to bring coherence and cohesion to the digital preservation research
o trends in DP research projectso modular DP systemso distributed as SOAo elimination of technology dependencies
EDINA: The UK LOCKSS Alliance - Adam Rusbridge
- EDINA offers underlying technical support & coordination
- threats to digital stewardship:
  o failure (media, HW, SW, network, format obsolescence, natural disaster, economic/organizational failure)
  o attack (insider/outsider)
  o operator error
- source: Requirement to Digital Preservation
- projects PEPRS and PECAN help identify coverage and requirements for DP
Public testing - Michael Seadle
- traditional physical archiving relies heavily on trusted institutions
- distrust, not trust, needs to be the basis of digital archiving; testing therefore plays a key role
- goals of testing: demonstrate functionality, reveal weaknesses, provide data for planning improvements
- key issues for testing: integrity, authenticity (can the origin or genuineness be shown?), usability (can migration/emulation be demonstrated?), access, financial integrity
- DrWho (drwho1com)
- bit-stream testing is the most important; authenticity and usability may be impaired
  o the type of storage media, the number of copies + the frequency of checking and replacement get us to the relevant results
  o no reliable metrics exist, however (what is an acceptable loss, etc.)
- without well-documented, peer-reviewed, publicly available test results, librarians are buying archiving systems on faith
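The bit-stream testing noted above can be illustrated with a minimal fixity check. This is only a sketch under assumptions, not something presented in the talk: the function names `sha256_of` and `check_copies`, the status labels, and the choice of SHA-256 are all illustrative.

```python
import hashlib
from pathlib import Path


def sha256_of(path, chunk_size=1 << 20):
    """Return the SHA-256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def check_copies(expected_digest, copy_paths):
    """Map each stored copy to 'ok', 'corrupt' or 'missing' (hypothetical labels)."""
    report = {}
    for path in copy_paths:
        if not Path(path).is_file():
            report[str(path)] = "missing"
        elif sha256_of(path) == expected_digest:
            report[str(path)] = "ok"
        else:
            report[str(path)] = "corrupt"
    return report
```

Running such a check on a schedule over every copy is one way to obtain the "frequency of checking" figure mentioned in the notes; how often is enough remains the open metrics question.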
Presentation without a title - Andy Rauber
- evaluation vs. testing vs. benchmarking
- DP testing today is evaluation rather than testing, and far from benchmarking (few tests, but not near a definition of benchmarks)
  o existing evaluations are not repeatable
  o focus on the simple things
  o building the frameworks before having clear test scenarios
- necessary to move towards comparative benchmarking
- what is needed: a commitment that we want a culture of benchmarking and comparative evaluation; an understanding of what we want to benchmark; benchmark data + ground truth; measurement scales and measures that remain constant; knowledge bases to collect these
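One way to read the requirements above (benchmark data plus ground truth, and measures that remain constant) is as a single fixed scoring function applied to every tool. The following is a minimal sketch, assuming a hypothetical format-identification task; the file names, labels, tool names and the `accuracy` measure are illustrative and were not part of the talk.

```python
def accuracy(ground_truth, predictions):
    """Fraction of benchmark items for which a tool matches the ground truth.

    Items a tool did not answer count as wrong, so scores stay comparable.
    """
    if not ground_truth:
        raise ValueError("ground truth must not be empty")
    correct = sum(
        1 for item, label in ground_truth.items()
        if predictions.get(item) == label
    )
    return correct / len(ground_truth)


def compare_tools(ground_truth, results_by_tool):
    """Score every tool with the same measure and rank them, best first."""
    scores = {
        tool: accuracy(ground_truth, predictions)
        for tool, predictions in results_by_tool.items()
    }
    return sorted(scores.items(), key=lambda pair: (-pair[1], pair[0]))


# Hypothetical benchmark corpus: file name -> true format.
GROUND_TRUTH = {"a.pdf": "PDF", "b.tif": "TIFF", "c.warc": "WARC", "d.jp2": "JPEG2000"}

# Hypothetical outputs of two tools on that corpus.
RESULTS = {
    "tool_x": {"a.pdf": "PDF", "b.tif": "TIFF", "c.warc": "ARC", "d.jp2": "JPEG2000"},
    "tool_y": {"a.pdf": "PDF", "b.tif": "JPEG"},
}

ranking = compare_tools(GROUND_TRUTH, RESULTS)
```

Because the corpus, the ground truth and the measure are fixed, anyone can rerun the comparison and obtain the same ranking, which is the repeatability the notes say existing evaluations lack.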
Organizing digital preservation on an international level - Michelle Gallinger (NDIIPP)
- focus on a national DP agenda
- community-driven, action-oriented (National Digital Stewardship Alliance)
  o present a distributed national digital collection for the benefit of citizens
The European Research Arena - David Giaretta
- technologies: GEANT, EGEE/EGI
- EU research projects: TIMBUS, BLOG4EVER, SCAPE, ENSURE, APARSEN, ARCOMEM, WF4EVER
  o SCIDIP-ES (2011-2014)
- Alliance for Permanent Access (APA): formed as a legal entity 3 years ago
  o opportunities for networking
- ISO 16363: Audit and Certification of Trustworthy Digital Repositories
- ISO 16919: Requirements for Bodies Providing Audit and Certification
Observation from the MetaArchive Cooperative program - Martin Halbert
- distributed DP programs differ from other programs
  o replication of content, distribution of these replicated copies to distinct geographical locations, and network organization to connect these replicated copies
- MetaArchive: established in 2003, funded by NDIIPP
  o seeks to foster broader awareness of digital preservation issues
- IIPC: members are all institutions that focus on WA
  o 39 members: national and university libraries + other organizations (Internet Archive)
  o ISO standard WARC format for web archives + Heritrix and NutchWAX
  o growing membership (Africa & South America)
DAY 2
Keynote address: International and National Collaboration in the digital age - Gunnar Sahlin
- 2012: a new law for e-deposit
- Samsök (search together) in 2005 (upgrade, new system)
- Swepub and long-term preservation
- Consortium of the Swedish research libraries for licensing e-journals and databases (ICOLC)
- Open Access and e-publishing (all universities have their repositories for e-pubs)
- NL aggregator for the Europeana, TEL, Apres, Athena, EUscreen
  o common system for the preservation of digital materials
  o common search portal for materials from the Swedish National Library and Swedish National Archive
Raivo Ruusalepp
- standards: RAC, DSA, CIDOC (CRM), PAIMAS, ISAD(G), DDI
- use of information security standards for digital preservation
- information security: administrative and technical (physical = data security vs. IT = communication)
- company-implemented security measurement with typical cyber-crime scenarios
- survey of security:
  o provision for information security in national legislation and development plans (12 of the respondents: ISO 27000 series; only 2 formal audits, the rest are looking into it; ½ of the respondents do not use standards or formal measurements to control information security)
  o IT & disaster plan: 65 (data recovery from the off-site location tested: 0)
- alignment:
  o better use of community standards for information security and preservation
  o agreement on security requirements
Standards-based approach to preservation planning - Matthew Woollard
- ISO 27001: very expensive implementation (100 000 pounds)
- Basic: Data Seal of Approval Guidelines (helps understand your business better)
- Audit and Certification of Trustworthy Digital Repositories
- Memorandum of Understanding to Create a European Framework
- ISO 16363 external; DSA or ISO 16363 self-audit
Best Practices & Standards - Bram van der Werf
- self-assessment: trust
- audits/certification: trust
- ISO 30300 (draft): Record Management
PANEL 4: Legal Alignment
Legal deposit & Web Archiving - Adrienne Muir
- legal deposit provisions: purpose, scope, deposit mechanisms, roles & responsibilities & liability, access provisions, sanctions
- implementation: definitions, scope, offline/online, freely available/pay wall, technology-neutral, incremental
- legal deposit vs. voluntary (interim/hybrid approach, model agreements and licenses, flexibility)
- other legal issues: intellectual property rights, preservation, access, unlawful material, privacy/data protection
- voluntary approaches have disadvantages but may be necessary and can be useful
Breakout Session
- standards bring along alignment by themselves, but only if you use them to the full, not halfway
- depends on community (users): enforceable or voluntary compliance (standards)
- next steps for standard alignment:
  o corpora as benchmarks
  o export/import completeness
  o educational standards
  o validation tools
  o accredited training courses to accredit auditors
  o framework standards
DAY 3
PANEL 5: Educational Alignment
- key elements of the DCC Curation Lifecycle Model
- Framework for the Education Alignment (USA: grad programs for digital preservation, new models for grad programs, internship programs (diverse knowledge), workshops)
  o sharing tools
  o national programs
- issues related to digital preservation:
  o nature of costs and business models
  o strategies for selection & appraisal
  o ground roles and responsibilities
  o effectiveness and demand for services
- focus of the panel: factors influencing the actual sustainability of a digital archive
- 2 considerations: collaboration + user demand
- challenges + gaps: span national boundaries, public + private funding, education, exportation, DP certification, competition, funding gaps, policy, selection criteria, roles & responsibilities, standards
- Magazzini Digitali: e-legal deposit in Italy
- PADI: a failed project (discontinued in 2010)
Presentation without a title - Neil Grindley
- how much does it cost to manage information?
- what institutional financial strategies are required to facilitate effective preservation?
- what general economic frameworks are required to enable information to persist and be accessible?
- JISC 2010: Infrastructure for Education and Research Programme
- archival storage and preservation activities constitute a very small proportion of the overall costs: 15 %; 31 % access; 55 % outreach, acquisition, ingest
  o approx. 333 Euros for a set of 1000 records
- KRDS2 p. 83: future tool development supporting automation of ingest
Sustainable Preservation in North America: ADPNet & Friends - Aaron Trehub
- solution: distributed digital preservation (in at least 3 copies vs. LOCKSS 6 copies)
- DDP + LOCKSS = PLN (open SW developed at Stanford)
- MetaArchive, COPPUL (Canada), ADPNet (www.adpn.org)
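The reason for keeping several distributed copies can be sketched as a majority vote over the checksums of the replicas. This is a deliberately simplified illustration and not the actual LOCKSS polling protocol; the node names and the `audit_replicas` function are hypothetical.

```python
import hashlib
from collections import Counter


def audit_replicas(replicas):
    """Compare replica contents by SHA-256 and vote on the correct version.

    `replicas` maps a node name to the bytes that node holds.
    Returns (winning_digest, nodes_needing_repair); the digest is None
    when no version is held by a strict majority of the nodes.
    """
    digests = {
        node: hashlib.sha256(data).hexdigest()
        for node, data in replicas.items()
    }
    if not digests:
        return None, []
    winner, votes = Counter(digests.values()).most_common(1)[0]
    if votes * 2 <= len(digests):
        # No strict majority: every node is suspect.
        return None, sorted(digests)
    damaged = sorted(node for node, d in digests.items() if d != winner)
    return winner, damaged


replicas = {
    "node_a": b"archived object",
    "node_b": b"archived object",
    "node_c": b"archived 0bject",  # simulated bit rot
}
winner, damaged = audit_replicas(replicas)
```

With three copies a single damaged replica is outvoted and can be repaired from the others; with only two copies a disagreement cannot be settled by voting alone.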