PeDALS Persistent Digital Archives & Library System GladysAnn Wells, Director and State Librarian Lisa Maxwell, Division Director, Records Management Division Arizona State Library, Archives and Public Records


Page 1

PeDALS: Persistent Digital Archives & Library System

GladysAnn Wells, Director and State Librarian
Lisa Maxwell, Division Director, Records Management Division
Arizona State Library, Archives and Public Records

Page 2

A Word from Our Sponsors

- Library of Congress: National Digital Information Infrastructure and Preservation Program (NDIIPP)
- Institute of Museum and Library Services: Library Services and Technology Act

Page 3

Project Partners

- Arizona
- Florida
- New York
- South Carolina
- Wisconsin
- Two states to be named

Kudos to the Washington State Archives

Page 4

Technical Goals

To develop an OAIS-compliant curatorial rationale that can be implemented in software to support an automated, integrated workflow to process collections of digital records and publications

- Best suited for collections of records that grow out of routine processes and have associated metadata
- May be adapted to publications
- Not immediately appropriate for ad hoc records

Page 5

Technical Goals

To build “digital stacks” using LOCKSS as the basis of an inexpensive storage network that can preserve the authenticity and integrity of the materials.

Page 6

Additional Goals

- To build a community of shared practice that meets the needs of a wide range of repositories
  - For best practices ~ what works, what’s practical
  - For resource sharing ~ avoid redundant work
- To remove barriers to preservation by keeping costs as low as possible

Page 7

Curatorial Rationale

Transformation of traditional, paper-based practices into the digital arena, with a focus on explaining why we do things in a particular manner

- Appraisal
- Acquisition
- Arrangement and description
- Housing and storage
- Reference and access
- Preservation

Page 8

Architecture

Page 9

Automated Processing

- Curators work with rules, not records
  - Describe business processes (rules)
  - Monitor the process for quality assurance
- Rules expressed in software
  - A “pipeline” that transforms records as they move through the system
  - Based on Microsoft BizTalk middleware
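The "rules, not records" idea can be sketched as a chain of small functions that each transform a record. This is only an illustration in Python; the project itself expresses its rules in Microsoft BizTalk, and the `Record` type, the two rules, and `run_pipeline` are hypothetical names invented for this sketch.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class Record:
    """Hypothetical in-memory stand-in for one record moving through the system."""
    series: str
    metadata: Dict[str, str] = field(default_factory=dict)
    log: List[str] = field(default_factory=list)


Rule = Callable[[Record], Record]


def normalize_names(record: Record) -> Record:
    """Hypothetical rule: trim and upper-case every value whose key ends in 'Name'."""
    for key, value in list(record.metadata.items()):
        if key.endswith("Name"):
            record.metadata[key] = value.strip().upper()
    record.log.append("normalize_names")
    return record


def require_license_number(record: Record) -> Record:
    """Hypothetical rule: stop the pipeline if the license number is missing."""
    if not record.metadata.get("License #"):
        raise ValueError("missing license number")
    record.log.append("require_license_number")
    return record


def run_pipeline(record: Record, rules: List[Rule]) -> Record:
    """Apply each rule in order; the log lets a curator monitor what ran."""
    for rule in rules:
        record = rule(record)
    return record
```

The curator's work in this picture is choosing and ordering the rules for a series; the per-record log is what would feed quality-assurance monitoring.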

Page 10

Preparatory Work with Provenance

- For each series of records selected for transfer
  - Negotiate metadata you will receive
  - Negotiate format of the records
  - Negotiate format of the submission information package (records, metadata, shipping manifest)
  - Negotiate frequency and manner of transfer
- Provenance develops procedures to create SIPs
  - Exploring the use of LC’s BagIt
- Archives describes business rules in middleware
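A shipping manifest can be as simple as a checksum for every file in the package. The sketch below is loosely modeled on BagIt's layout (a `data/` payload directory plus a `manifest-<algorithm>.txt` file). It does not produce a complete or validated bag, and the choice of SHA-256 and the function name `write_manifest` are assumptions made for illustration.

```python
import hashlib
from pathlib import Path


def write_manifest(sip_dir: Path) -> Path:
    """Write manifest-sha256.txt with one '<checksum>  <relative path>' line per payload file."""
    data_dir = sip_dir / "data"
    lines = []
    for path in sorted(p for p in data_dir.rglob("*") if p.is_file()):
        digest = hashlib.sha256(path.read_bytes()).hexdigest()
        # Paths are recorded relative to the package root, with forward slashes.
        lines.append(f"{digest}  {path.relative_to(sip_dir).as_posix()}")
    manifest = sip_dir / "manifest-sha256.txt"
    manifest.write_text("\n".join(lines) + "\n", encoding="utf-8")
    return manifest
```

On receipt, the archives can recompute the same checksums and compare them with the manifest to confirm that nothing was lost or altered in transit.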

Page 11

Preliminary Processing

- Describe provenance
- Describe series-level
  - Creator, Provenance, Source of Records
  - Series title, Date ranges
  - Scope note
  - Access points: activities, topics
- Describe accession
  - Assign accession number, date
  - Assign unique system number
  - Record source, transfer authority, restrictions

Page 12

Submission

- Provenance transfers submission information packages (SIPs) to a drop box
  - FTP, sFTP, scp
  - Tape
  - CD, DVD
- Each Provenance has a directory
  - Each series has a subdirectory
- Isolated for virus scanning
  - Option: simple Linux box
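The directory-per-Provenance, subdirectory-per-series layout might be computed as follows. The `/dropbox` root, the slug rule for directory names, and the function names are all assumptions for illustration; the slides do not specify a naming convention.

```python
import re
from pathlib import PurePosixPath

DROP_BOX_ROOT = PurePosixPath("/dropbox")  # hypothetical mount point on the isolated box


def slug(name: str) -> str:
    """Lower-case a name and collapse each run of non-alphanumerics to one hyphen."""
    return re.sub(r"[^a-z0-9]+", "-", name.lower()).strip("-")


def sip_drop_path(provenance: str, series: str, sip_name: str) -> PurePosixPath:
    """Return <root>/<provenance>/<series>/<sip_name> for an incoming SIP."""
    if "/" in sip_name or sip_name in {"", ".", ".."}:
        raise ValueError("SIP name must be a plain file name")
    return DROP_BOX_ROOT / slug(provenance) / slug(series) / sip_name
```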

Page 13

Data Wrangling

- Before business rules are launched, the curator will validate the SIPs to ensure they conform to the negotiated specification.
- Includes running the New Zealand Metadata Extractor to capture
  - Software format
  - Version
  - File size
  - MIME type
- Some problems we’ve encountered during this step
  - License number did not link to the file containing the record
  - XML contained invalid control characters
  - TIFFs were delivered instead of PDFs
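The three problems listed above lend themselves to simple automated checks. The following is a minimal sketch, not the project's validation code: it assumes each record file is named `<license number>.pdf`, and it identifies formats from the leading bytes of the file (`%PDF-` for PDF, `II*\x00` or `MM\x00*` for TIFF). The function names are hypothetical.

```python
import re
from pathlib import Path
from typing import List

# XML 1.0 forbids the C0 control characters other than tab, line feed, and carriage return.
INVALID_XML_CHARS = re.compile(r"[\x00-\x08\x0b\x0c\x0e-\x1f]")


def sniff_format(header: bytes) -> str:
    """Guess the file format from its first bytes."""
    if header.startswith(b"%PDF-"):
        return "pdf"
    if header.startswith((b"II*\x00", b"MM\x00*")):
        return "tiff"
    return "unknown"


def validate_item(license_number: str, xml_text: str, record_dir: Path) -> List[str]:
    """Return a list of problems found for one record; an empty list means it passed."""
    problems = []
    if INVALID_XML_CHARS.search(xml_text):
        problems.append("XML contains invalid control characters")
    record_file = record_dir / f"{license_number}.pdf"  # assumed naming convention
    if not record_file.is_file():
        problems.append(f"no record file for license {license_number}")
    else:
        with record_file.open("rb") as handle:
            found = sniff_format(handle.read(8))
        if found != "pdf":
            problems.append(f"expected PDF, found {found}")
    return problems
```

Returning a list of problems rather than raising on the first one lets the curator see everything wrong with a SIP before going back to the Provenance.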

Page 14

Automated Item-Level Description: PeDALS Core Metadata Schema

- Administration
  - Accession number
  - Transfer authority
  - Restrictions
- Discovery
  - Provenance
  - Series
  - Title
  - Date
  - Summary (abstract, scope)
  - Access points
- Preservation
  - File format, version
  - Hash

Single schema used for all records; contains elements common to most records

http://pedalspreservation.org/Metadata/Default.aspx

Version 2.0 out mid-August
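The three groups of elements can be pictured as nested data structures. This is a sketch based only on the element names listed on this slide, not the published schema: the Python field names, the types, and the use of SHA-256 for the hash are assumptions.

```python
import hashlib
from dataclasses import dataclass, field
from typing import List


@dataclass
class Administration:
    accession_number: str
    transfer_authority: str
    restrictions: str = ""


@dataclass
class Discovery:
    provenance: str
    series: str
    title: str
    dates: List[str] = field(default_factory=list)
    summary: str = ""
    access_points: List[str] = field(default_factory=list)


@dataclass
class Preservation:
    file_format: str
    format_version: str
    hash_value: str


@dataclass
class CoreMetadata:
    administration: Administration
    discovery: Discovery
    preservation: Preservation


def preservation_for(content: bytes, file_format: str, format_version: str) -> Preservation:
    """Build the preservation group, hashing the record's bytes (SHA-256 assumed)."""
    return Preservation(file_format, format_version, hashlib.sha256(content).hexdigest())
```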

Page 15

Mapping and Creating Metadata

<?xml version="1.0" ?>
<keywords>
  <keyword><name>Bride First Name</name><value>ROBIN</value></keyword>
  <keyword><name>Bride Last Name</name><value>STERNBERGER</value></keyword>
  <keyword><name>Groom First Name</name><value>MANUEL</value></keyword>
  <keyword><name>Groom Last Name</name><value>AVILES</value></keyword>
  <keyword><name>License #</name><value>411811</value></keyword>
  <keyword><name>Marriage Date</name><value>12-17-2005</value></keyword>
  <keyword><name>Recording Date</name><value>1-19-2006</value></keyword>
  <keyword><name>Batch Name</name><value>ML01 SAD 01/13/06</value></keyword>
</keywords>

Page 16

Mapping and Creating Metadata

Title
  MARY ALICE CICERALE and ROBERT PORTER : marriage certificate, 2008
Extent
  1 Adobe Acrobat PDF (34379 bytes)
Date
  3 Jan 2008 : Marriage date
  12 Jan 2008 : Recording date
Access
  CICERALE, MARY ALICE : Bride
  PORTER, ROBERT : Groom
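The mapping from received keyword/value pairs to normalized title, date, and access elements might look like the sketch below. It follows the output pattern shown on this slide but uses keyword names from the sample XML on the previous slide; the function names and the dictionary layout of the result are assumptions, and the project's actual mapping runs in middleware rather than Python.

```python
import xml.etree.ElementTree as ET

MONTHS = ("Jan", "Feb", "Mar", "Apr", "May", "Jun",
          "Jul", "Aug", "Sep", "Oct", "Nov", "Dec")

SAMPLE = """<?xml version="1.0" ?>
<keywords>
  <keyword><name>Bride First Name</name><value>ROBIN</value></keyword>
  <keyword><name>Bride Last Name</name><value>STERNBERGER</value></keyword>
  <keyword><name>Groom First Name</name><value>MANUEL</value></keyword>
  <keyword><name>Groom Last Name</name><value>AVILES</value></keyword>
  <keyword><name>Marriage Date</name><value>12-17-2005</value></keyword>
  <keyword><name>Recording Date</name><value>1-19-2006</value></keyword>
</keywords>"""


def read_keywords(xml_text: str) -> dict:
    """Flatten <keyword><name>/<value> pairs into a plain dictionary."""
    root = ET.fromstring(xml_text)
    return {kw.findtext("name"): kw.findtext("value") for kw in root.iter("keyword")}


def format_date(value: str) -> str:
    """Convert month-day-year (e.g. 12-17-2005) to day, month abbreviation, year."""
    month, day, year = (int(part) for part in value.split("-"))
    return f"{day} {MONTHS[month - 1]} {year}"


def map_record(kw: dict) -> dict:
    """Build title, date, and access-point elements from the received keywords."""
    bride = (kw["Bride First Name"], kw["Bride Last Name"])
    groom = (kw["Groom First Name"], kw["Groom Last Name"])
    year = kw["Marriage Date"].split("-")[2]
    return {
        "title": f"{' '.join(bride)} and {' '.join(groom)} : marriage certificate, {year}",
        "dates": [
            f"{format_date(kw['Marriage Date'])} : Marriage date",
            f"{format_date(kw['Recording Date'])} : Recording date",
        ],
        "access": [f"{bride[1]}, {bride[0]} : Bride", f"{groom[1]}, {groom[0]} : Groom"],
    }
```

Calling `map_record(read_keywords(SAMPLE))` yields a title, two labeled dates, and two inverted-name access points in the same pattern as the example record above.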

Page 17

Create Archival Information Package

- Update Accessions Register database
- Create AIP

<AIP>
  <Normalized Metadata> </Normalized Metadata>
  <Received Metadata> </Received Metadata>
  <Record> </Record>
  <Transformed record> </Transformed record>
</AIP>
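The schematic above could be serialized along these lines. Because XML element names cannot contain spaces, the sketch uses `NormalizedMetadata`, `ReceivedMetadata`, and `TransformedRecord`; those names, the base64 encoding of the record bytes, and the `element` child with a `name` attribute are all assumptions, not the project's actual AIP format.

```python
import base64
import xml.etree.ElementTree as ET
from typing import Dict, Optional


def build_aip(normalized: Dict[str, str], received_xml: str,
              record_bytes: bytes, transformed_bytes: Optional[bytes] = None) -> str:
    """Wrap normalized metadata, received metadata, and the record in one AIP document."""
    aip = ET.Element("AIP")
    norm = ET.SubElement(aip, "NormalizedMetadata")
    for name, value in normalized.items():
        ET.SubElement(norm, "element", name=name).text = value
    # The received metadata is kept verbatim as escaped text.
    ET.SubElement(aip, "ReceivedMetadata").text = received_xml
    ET.SubElement(aip, "Record").text = base64.b64encode(record_bytes).decode("ascii")
    if transformed_bytes is not None:
        ET.SubElement(aip, "TransformedRecord").text = (
            base64.b64encode(transformed_bytes).decode("ascii"))
    return ET.tostring(aip, encoding="unicode")
```

Keeping the received metadata alongside the normalized metadata means the original submission can always be re-examined if a mapping rule turns out to be wrong.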

Page 18

Ingest into the Digital Stacks

- AIPs transferred to a LOCKSS Cluster
- LOCKSS
  - Redundant Array of Inexpensive Servers
  - Automatic integrity checking
  - Automatic error-correction
  - Geographically dispersed copies
  - Bitstream preservation
  - See LOCKSS.ORG
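The idea behind integrity checking and error correction across redundant copies can be illustrated with a simple majority vote over content hashes. This is a deliberately simplified sketch and not the LOCKSS polling protocol; the function name, the in-memory `copies` dictionary, and the node names in the usage note are hypothetical.

```python
import hashlib
from collections import Counter
from typing import Dict, List


def audit_and_repair(copies: Dict[str, bytes]) -> List[str]:
    """Compare every node's copy by hash and overwrite minority copies with the majority one.

    `copies` maps a node name to that node's bytes for one AIP. Returns the
    names of the nodes that were repaired; `copies` is updated in place.
    """
    digests = {node: hashlib.sha256(data).hexdigest() for node, data in copies.items()}
    winner, votes = Counter(digests.values()).most_common(1)[0]
    if votes <= len(copies) / 2:
        raise RuntimeError("no majority among copies; manual review needed")
    good = next(data for node, data in copies.items() if digests[node] == winner)
    repaired = [node for node in copies if digests[node] != winner]
    for node in repaired:
        copies[node] = good
    return repaired
```

For example, with copies held at three sites where one has suffered bit rot, `audit_and_repair({"az": b"aip", "fl": b"aip", "ny": b"aiq"})` reports `ny` as repaired and restores its copy from the agreeing majority.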

Page 19

Public Access

- For records that are not confidential, the middleware creates dissemination information packages and puts them on a public web server
  - Doesn’t include administrative or preservation metadata that users are unlikely to want
  - In a format easily supported by common browsers
- Middleware updates public catalog for website
  - We assume most partners will integrate these search pages into their existing website.
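Deriving a dissemination package amounts to filtering the stored metadata and rendering what is left for the browser. The sketch below assumes metadata grouped under `administration`, `discovery`, and `preservation` keys and a boolean confidentiality flag; those names and the HTML definition-list output are illustrative choices, not the project's format.

```python
from html import escape
from typing import Dict, Optional

PUBLIC_SECTIONS = ("discovery",)  # administrative and preservation metadata stay behind


def make_dip(aip_metadata: Dict[str, Dict[str, str]],
             confidential: bool) -> Optional[Dict[str, Dict[str, str]]]:
    """Return only the public metadata sections, or None for confidential records."""
    if confidential:
        return None
    return {section: dict(values)
            for section, values in aip_metadata.items()
            if section in PUBLIC_SECTIONS}


def render_html(dip: Dict[str, Dict[str, str]]) -> str:
    """Render the discovery metadata as an HTML definition list."""
    rows = "".join(f"<dt>{escape(name)}</dt><dd>{escape(value)}</dd>"
                   for name, value in dip["discovery"].items())
    return f"<dl>{rows}</dl>"
```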

Page 20

For more information

http://www.pedalspreservation.org/

Principal Investigator: Richard Pearce-Moses, [email protected]
Project Coordinator: Sara Muth, [email protected]