14
Digital Preservation Panel Medusa at the University of Illinois at Urbana- Champaign: A Digital Preservation Service Based on PREMIS Kyle Rimkus, Preservation Librarian Tom Habing, Head of Software Development University of Illinois Urbana-Champaign Library Short Paper Presentation July 23, 2013 2013 Joint Conference on Digital Libraries Indianapolis, Indiana

Digital Preservation Panel Medusa at the University of Illinois at Urbana-Champaign: A Digital Preservation Service Based on PREMIS Kyle Rimkus, Preservation

Embed Size (px)

Citation preview

Page 1: Digital Preservation Panel Medusa at the University of Illinois at Urbana-Champaign: A Digital Preservation Service Based on PREMIS Kyle Rimkus, Preservation

Digital Preservation Panel

Medusa at the University of Illinois at Urbana-Champaign: A Digital Preservation Service Based on

PREMIS

Kyle Rimkus, Preservation LibrarianTom Habing, Head of Software Development

University of Illinois Urbana-Champaign Library

Short Paper Presentation

July 23, 20132013 Joint Conference on Digital Libraries

Indianapolis, Indiana

Page 2: Digital Preservation Panel Medusa at the University of Illinois at Urbana-Champaign: A Digital Preservation Service Based on PREMIS Kyle Rimkus, Preservation

OAIS Management

SIP: Submission Information PackageAIP: Archival Information PackageDIP: Dissemination Information Package

Page 3: Digital Preservation Panel Medusa at the University of Illinois at Urbana-Champaign: A Digital Preservation Service Based on PREMIS Kyle Rimkus, Preservation

Access Systems• ContentDM: digitized unique and rare materials from locally held

collections• DSpace: IDEALS institutional repository and electronic theses and

dissertations• ARTStor: image collections used for teaching and learning• Internet Archive: books digitized through participation in the Open

Content Alliance• HathiTrust: digitized books• Voyager catalog and locally-developed web pages: digitized books and

book-like content • Archon: digitized manuscripts and archival content • Olive ActivePaper: digitized newspapers• Ensemble Video: streaming video content, primarily digitized from the

University Archives• DLXS: XML-driven text-analysis research projects

Page 4: Digital Preservation Panel Medusa at the University of Illinois at Urbana-Champaign: A Digital Preservation Service Based on PREMIS Kyle Rimkus, Preservation

Medusa Digital Preservation Service

• https://medusa.library.illinois.edu/• Documentation:

https://wiki.cites.illinois.edu/wiki/display/LibraryDigitalPreservation/Home

• enduring storage and management environment for digital content at the University of Illinois at Urbana-Champaign Library

• Infrastructure based on the Open Archival Information System reference model (OAIS)

• policies and underlying technological architecture designed in keeping with the model of a Trustworthy Digital Repository (TDR)

Page 5: Digital Preservation Panel Medusa at the University of Illinois at Urbana-Champaign: A Digital Preservation Service Based on PREMIS Kyle Rimkus, Preservation

National Digital Infrastructure and Information Preservation(NDIIPP) Program grants•Phase I : 2004-2007•Phase II: 2007-2010

“Hub and Spoke” (HandS tool suite): http://dli.grainger.uiuc.edu/echodep/hands/index.html

Page 6: Digital Preservation Panel Medusa at the University of Illinois at Urbana-Champaign: A Digital Preservation Service Based on PREMIS Kyle Rimkus, Preservation

Metadata for Digital Preservation

Intellectual entities

Intellectual entities RightsRights

ObjectsObjects AgentsAgents

EventsEvents Metadata standards in Medusa:•PREMIS (digital preservation metadata)•MODS (descriptive metadata)

Page 7: Digital Preservation Panel Medusa at the University of Illinois at Urbana-Champaign: A Digital Preservation Service Based on PREMIS Kyle Rimkus, Preservation

• Quick Overview of Architecture

• Modeling Complex/Compound Objects in PREMIS

• Fedora Content Models for PREMIS

Part 2

Page 8: Digital Preservation Panel Medusa at the University of Illinois at Urbana-Champaign: A Digital Preservation Service Based on PREMIS Kyle Rimkus, Preservation

Medusa Technical Architecture

Disaster RecoveryPrimary

Bit-level Preservation Object-level Preservation

Collections Registry

Fedora

Blacklight

SOLR

Content Storage Content Storage

Hydra

DX6000 Storage Cluster

Storage Node

Storage Node

...

DX6000 Storage Cluster

Storage Node

Storage Node

...

Replication

Content File Server

SOLRizer

SQL

Dat

abas

e

File SpoolAkubra

Page 9: Digital Preservation Panel Medusa at the University of Illinois at Urbana-Champaign: A Digital Preservation Service Based on PREMIS Kyle Rimkus, Preservation

Modeling Complex/Compound Objects with PREMIS

Intellectual entities

Intellectual entities RightsRights

ObjectsObjects AgentsAgents

EventsEvents

Page 10: Digital Preservation Panel Medusa at the University of Illinois at Urbana-Champaign: A Digital Preservation Service Based on PREMIS Kyle Rimkus, Preservation

PREMIS Relationship Type & Subtype• Collection

• Is Member Of• Metadata

• MODS• MARC• …• Parent

• Basic Image Asset• Production Master• Archival Master• Screen Size• …• Parent

• Paged Text Asset• Pages• LoRes PDF• HiRes PDF• …• Parent

• Basic Compound Asset• First Child• Child• Parent• Next Sibling• Previous Sibling

• Derivation• Has Source• Is Source Of

• …

Page 11: Digital Preservation Panel Medusa at the University of Illinois at Urbana-Champaign: A Digital Preservation Service Based on PREMIS Kyle Rimkus, Preservation

Example

<relationship> <relationshipType>Paged_Text_Asset</relationshipType> <relationshipSubType>Pages</relationshipSubType> <relatedObjectIdentification> <relatedObjectIdentifierType>LOCAL</relatedObjectIdentifierType> <relatedObjectIdentifierValue>MEDUSA:xxx.2</relatedObjectId </relatedObjectIdentification> </relationship>

Page Images

RepresentationMEDUSA:xxx

CollectionIs Member Of

Filexxx_1.jpg

Paged Text AssetPages

RepresentationMEDUSA:xxx.2

Basic Compound AssetFirst Child / Parent

RepresentationMEDUSA:xxx.1

Filexxx_1.jp2

Basic Image AssetProduction Master

RepresentationMEDUSA:xxx.2

Basic Compound AssetNext Sibling / Previous Sibling

Filexxx_2.jpg

Basic Image AssetScreen Size

Filexxx_2.jp2

Basic Image AssetProduction Master

Basic Compound AssetNext Sibling / Previous Sibling

...

Basic Compound AssetParent

Basic Image AssetScreen Size

Page 12: Digital Preservation Panel Medusa at the University of Illinois at Urbana-Champaign: A Digital Preservation Service Based on PREMIS Kyle Rimkus, Preservation

• Separate Fedora Object and Content Model for Each PREMIS Entity

• All PREMIS relationships are converted to Fedora RELS_EXT

PREMIS to Fedora Content Model

-PREMIS_EVENT()

-DC-RELS_EXT

PID: EVENT_1234

-PREMIS_AGENT()

-DC-RELS_EXT

PID: AGENT_1235hasAgent

-PREMIS_REPRESENTATION_OBJECT()

-DC-RELS_EXT

PID: REPRESENTATION_OBJECT_1236

-PREMIS_FILE_OBJECT()-CONTENT()

-DC-RELS_EXT

PID: FILE_OBJECT_1237relationshipType.Subtype

hasObject

Page 13: Digital Preservation Panel Medusa at the University of Illinois at Urbana-Champaign: A Digital Preservation Service Based on PREMIS Kyle Rimkus, Preservation

• info:fedora/afmodel:MedusaPremis_FileObject

– DC– RELS-EXT • http://medusa.library.illinois.edu/ns#relationshipTypeRelationshipSubtype• http://medusa.library.illinois.edu/ns#relationshipTypeRelationshipSubtypeLinkingEvent• http://www.loc.gov/premis/rdf/v1#hasEvent• http://www.loc.gov/premis/rdf/v1#hasRightsStatement• http://www.loc.gov/premis/rdf/v1#hasIntellectualEntity

– PREMIS-FILE-OBJECT (inline PREMIS XML)– CONTENT (managed content)

Fedora CModel for PREMIS File Object

Page 14: Digital Preservation Panel Medusa at the University of Illinois at Urbana-Champaign: A Digital Preservation Service Based on PREMIS Kyle Rimkus, Preservation

Questions?

• Medusa documentation (more coming soon) :• https://medusa.library.illinois.edu • https://wiki.cites.uiuc.edu/wiki/display/

LibraryDigitalPreservation/HomeMore on digital preservation at UIUC:• http://www.library.illinois.edu/prescons/

services/digital_preservation/digital_preservation.html

• Contact: [email protected]