OCLC Digital Archive Overview
Judith CobbLIPA Meeting
July 2006
Agenda
• Functional Overview
• Availability of Services
OCLC Digital Archive: A Partnership
• OCLC – Technical management and operations– Developments and improvements– Preservation planning
• Collection Owners– Collection/selection policy– Ingest and collection management– Access and rights management– Preservation planning
Preservation
The Digital Archive User’s View
Capture and ingest tools
Access(Cataloging)
Collection Management Tools
Archival Storage for digital
objects and metadata
Web ToolsDigital Masters
WorldCatFirstSearch
Local catalogURL
Admin Module
Ingest
Digital Preservation Process
The Digital Archive Capture
tools
Access(Derivatives)
Collection Management Tools
Archival Storage for digital
objects and metadata
Six hour incremental tape back upDaily full back up – off siteWeekly off site back up –
all systems and dataPurpose built facility, generators,
security
Web ToolsDigital Masters
WorldCatFirstSearch
Local catalog
Fixity CheckVirus CheckObject Verify
Admin Module
URL
Google via OWC
PlanningAnalysisActions
Connexion
Fixity CheckVirus CheckObject Verify
Ingest
Digital Preservation Process
Capture Tools Capture
tools
Access(Derivatives)
Collection Management Tools
Archival Storage for digital
objects and metadata
Six hour incremental tape back upDaily full back up – off siteWeekly off site back up –
all systems and dataPurpose built facility, generators,
security
Web ResourcesDigital Objects
WorldCatFirstSearch
Local catalog
Fixity CheckVirus CheckObject Verify
Admin Module
URL
Google via OWC
PlanningAnalysisActions
“Oversize”
Fixity CheckVirus CheckObject Verify
Capturing Web Resources
• Why do we need different tools for web capture?
• Harvests web content
• Virtually rebuilds a web site so it is functional in its new environment
Capturing Digital Objects
• Content Cooperative Pilot
– July 2006 through December 2006
– Easy “upload” option within Connexion Browser and Connexion Client
– Size limitations
Capturing Digital Masters
• “Oversize”– Large files– Many files
• Transfer objects and metadata via media– CD, DVD, external hard drives
Ingest
Digital Preservation Process
Ingest Capture
tools
Access(Derivatives)
Collection Management Tools
Archival Storage for digital
objects and metadata
Six hour incremental tape back upDaily full back up – off siteWeekly off site back up –
all systems and dataPurpose built facility, generators,
security
Web ToolsDigital Masters
WorldCatFirstSearch
Local catalog
Fixity CheckVirus CheckObject Verify
Admin Module
URL
Google via OWC
PlanningAnalysisActionsFixity Check
Virus CheckObject Verify
Ingest to the Digital Archive• Preservation metadata created
• Technical metadata created
• Upon ingest and one day later– Virus check on all files– Fixity check on the digital object
• Back-up schedule begins immediately
Ingest
Digital Preservation Process
Storage, Disaster Preparedness, and Data Management
Capture tools
Access (Derivatives)
Collection Management Tools
Archival Storage for digital
objects and metadata
Six hour incremental tape back upDaily full back up – off siteWeekly off site back up –
all systems and dataPurpose built facility, generators,
security
Web ToolsDigital Masters
WorldCatFirstSearch
Local catalog
Fixity CheckVirus CheckObject Verify
Admin Module
URL
Google via OWC
PlanningAnalysisActionsFixity Check
Virus CheckObject Verify
Storage, Disaster Preparedness, and Data Management
• Disaster Preparedness– Ongoing back up onsite– Daily off site back up (off site)– Weekly data and software back up– Disaster preparedness plan
Storage, Disaster Preparedness, and Data Management
• Data Management– Quarterly– Fixity check performed, report sent– Virus check performed, report sent– Object verified
• All components exist, viable, accessible
Ingest
Digital Preservation Process
Collection Administration Capture
tools
Access(Derivatives)
Collection Management Tools
Archival Storage for digital
objects and metadata
Six hour incremental tape back upDaily full back up – off siteWeekly off site back up –
all systems and dataPurpose built facility, generators,
security
Web ToolsDigital Masters
WorldCatFirstSearch
Local catalog
Fixity CheckVirus CheckObject Verify
Admin Module
URL
Google via OWC
PlanningAnalysisActionsFixity Check
Virus CheckObject Verify
Collection Administration
• Content Groups
• Rights Statements
• Reports
• Access Groups
Ingest
Digital Preservation Process
Preservation Capture
tools
Access(Derivatives)
Collection Management Tools
Archival Storage for digital
objects and metadata
Six hour incremental tape back upDaily full back up – off siteWeekly off site back up –
all systems and dataPurpose built facility, generators,
security
Web ToolsDigital Masters
WorldCatFirstSearch
Local catalog
Fixity CheckVirus CheckObject Verify
Admin Module
URL
Google via OWC
PlanningAnalysisActionsFixity Check
Virus CheckObject Verify
Preservation Strategy Overview
Format Assessment
CyclicalFormat
Analysis
Identify ActionOptions
Take ActionFormat
migration to canonical
formatNot format migration
High risk
Best action determined
Ongoing Maintenance and Media Refreshing
Ingest
Digital Preservation Process
Access Capture
tools
Access(Derivatives)
Collection Management Tools
Archival Storage for digital
objects and metadata
Six hour incremental tape back upDaily full back up – off siteWeekly off site back up –
all systems and dataPurpose built facility, generators,
security
Web ToolsDigital Masters
WorldCatFirstSearch
Local catalog
Fixity CheckVirus CheckObject Verify
Admin Module
URL
Google via OWC
PlanningAnalysisActionsFixity Check
Virus CheckObject Verify
Access
• You control who can access your content, depending on your needs– Public– By OCLC authorizations
• Many, or few• Particularly useful for digital masters preservation
Access
• Open URL• WorldCat, FirstSearch• Local Catalog• Open WorldCat
• Thumbnails• Zoom and pan
OCLC Digital Archive Services
Web Capture
• Web Harvesting via Connexion Browser– Manual harvesting– Annual subscription– Bit preservation fee
From Connexion, Create Digital Archive Preservation Metadata Record
Digital Archive Preservation Metadata Record Created and Ready for Editing
Harvest Request
Review/Confirm/Revise Harvested Content
Harvest Completed; Ready to Ingest
Ingest complete; link to object in the Archive now active
Access
Web Capture
• Web Archives Workbench BETA– Web harvesting and collection builder– Ongoing development– Incremental releases– BETA become available December 2005– Replacement tool for current web harvesting
tool
Web Archives Workbench BETA
Digital Masters
• Digital Masters (“oversize”)– Batchloading via media– One time loading fee– Bit preservation fee
Connexion
• Content Cooperative– Pilot project beginning July 2006– Upload content via Connexion– 20 GB free storage– Free FirstSearch subscription for duration of
pilot • Holdings show in Open WorldCat
– Group catalogs of digital content available– Zoom and pan functionality added for images
Attach Digital Content
Access via Find In A Library
• http://www.oclc.org/digitalarchive– Documentation
• Web document harvesting• Connexion (CCP)• Digital masters service
– Preservation metadata set– Preservation policy– Ordering information
• Questions?