Experiment, Document & Decide
a Collaborative Approach to Preservation Planning at the BnF
IPRES - November 3rd, 2015
2
The BnF
IPRES - November 3rd, 2015
3
Collections14M books30M prints and photographs250,000 manuscripts+ 900,000 sound documents, 50,000 multimedia
documentsFrench Web Legal Deposit (billions of files)And also music scores, medals and coins, maps, globes,
theater objects…
1M readers per year
300,000 exhibition visitors
Budget : 250 M€
Staff : 2 200 full-time equivalent
Some facts
IPRES - November 3rd, 2015
Digital archiving at BnF
SPAR - Infrastructure
SPAR - Realization
Ingest
SPAR
Storage Abstraction Service (SAS)
Administration
Data management
Storage
Access
Preservation planning
Prod
uctio
n ap
plic
atio
nsD
issemination applications
Preservation digitization
…
wayback
WEB Archiving
…
Records Management
Gallica (digital library)
Records Management
4IPRES - November 3rd, 2015
Different tracks• To deal with data variability and heterogeneity, tracks are
defined.• These are built on the relation between digital objects and the
archival system, independently of any given organization: – Heritage digitization;– Audiovisual legal deposit;– Negotiated legal deposit (e-books, large posters…);– Automatic legal deposit (surface Web);– Administrative production;– Third party archiving;– Acquisition / Donation;+ reference track
5IPRES - November 3rd, 2015
6
SPAR data model: Reference packages
IPRES - November 3rd, 2015
AIP DIPSIP
InformationpackageChannel
Track
Event
AgentManifest Data Object
Format
is described by
is described by
is described by
is described by
is member ofimplies
has event
format
describes
is described in
SLA
is applicable for
7IPRES - November 3rd, 2015
SPAR milestones
8IPRES - November 3rd, 2015
Functions of the Preservation Planning OAIS entity
9IPRES - November 3rd, 2015
SPAR implementation of the Preservation Planning OAIS Entity
10IPRES - November 3rd, 2015
Use case: introducing a new format
Heritage Digitization:
– tests– new format in existing channels
11SPAR
Preservation Planning GUI - Format
12IPRES - November 3rd, 2015
Use case: from MS Office to PDF
• Transforming Office Documents to PDF
13
Preservation Planning GUI - Agent
IPRES - November 3rd, 2015
14IPRES - November 3rd, 2015
Preservation Planning GUI - Tests
People involved in preservation planning
ADMINISTRATOR PRESERVATIONEXPERT
DEVELOPER COLLECTIONMANAGER
RISKS & EMERGENCY PLAN
TRACK MANAGER
15IPRES - November 3rd, 2015
16IPRES - November 3rd, 2015
A collaborative approach
• Skills are spread around the library and they are seldom
• First use in real life: positive feedback (though improvements needed)
• More visibility, not only admins have access to the system settings
• Providing a practical and operational framework
18IPRES - November 3rd, 2015
Reference package: Channel
Informationpackage
FIL_REF_CHANNEL FIL_REF
AQS-VSLA in machine
actionable format(XML transformed
in RDF within SPAR)
AQS-PAQS-D
Schematron tovalidate
specific METS profile of the channel
Human readabledocumentation
19IPRES - November 3rd, 2015
Reference package: Format
Informationpackage
FIL_REF_FORMAT FIL_REF
Format descriptionin machine actionable
format(XML transformed
in RDF within SPAR)
Machine actionable file (e.g. to validate like a XSD schema)
Human readable documentation
(standard or specifications)orFormat
sample
With all these descriptions, the
system has its own format registry.
20IPRES - November 3rd, 2015
Reference package: Agent
Informationpackage
FIL_REF_AGENT FIL_REF
Human readabledocumentationorTool
Source code
Agent descriptionin machine actionable
format(XML transformed
in RDF within SPAR)
SPAR is auto-documented and contains all information about software environment
to use each format
21IPRES - November 3rd, 2015
Reference package: Channel
• 3 SLAs: Ingest, Preservation, Access• Formalize in XML the ways of managing the
packages• Those 3 SLAs are recorded in a reference
package that describes the channel
SLA-I.xml, SLA-P.xml, SLA-A.xml
Mets.xml
Contract.pdf
22IPRES - November 3rd, 2015
Reference package: Format
Mets.xml: manifest
T000001.jp2: sample
format.xml: machine readable description
format.txt: human description
23IPRES - November 3rd, 2015
Channel reference package elaboration sequence
24IPRES - November 3rd, 2015
• autres captures d’écran