39

MyersTessella_Dec2013

Embed Size (px)

DESCRIPTION

A presentation that I have given at various points about KY's experience with a could-based digital preservation program.

Citation preview

Page 1: MyersTessella_Dec2013
Page 2: MyersTessella_Dec2013

Electronic Records Archivist

Public Records Division,

Kentucky Department for Libraries and Archives

E-mail: [email protected]

Phone: (502)564-8300 Ext. 244

4/30/2009 2

Page 3: MyersTessella_Dec2013

Kentucky Department for Libraries and Archives

State Library State Archives

State Records Management

Page 4: MyersTessella_Dec2013

Development of e-Archives

2003 – Began archiving Governor’s website (recreated Governor’s 1997 website)

2010 – Began evaluation of Archive-It – Became full member in 2012

2011 – Participated in ISO-16363 test audit

1996 – Began downloading publications off the Internet

2009 – e-Archives relaunched with Dspace

2002 – Electronic Records Working Group Formed

2001 – Electronic Records Archivist Hired

2006 – e-Archives is available to the public on the Internet

2008- Began evaluation of Dspace

2012 – Purchased the Preservica digital preservation system

1984 – KDLA receives a Machine Readable Records Grant from NHPRC

2011-2012 – Began evaluation and pilot of Tessella digital preservation system

Page 5: MyersTessella_Dec2013

Publications,

Minutes,

Geospatial datasets (maps, shapefiles),

Web sites,

Databases,

Digital images,

Video, and

Audio recordings.

Page 6: MyersTessella_Dec2013

Approximately 5 TB in e-Archives Includes multiple copies

1.6 TB of GIS data (vector data and imagery)

0.5 TB of high resolution Digital Photos

0.3 TB Includes textual material, digital photo, audio/video and website snapshots

300 GB of Governors databases (mail database)

100 GB of Scanned Images (historic documents)

Page 7: MyersTessella_Dec2013

DSpace is primary access tool <14,000 = files (objects) in DSpace

~160 GB in Asset Store

Some publications cross-linked to library catalog

But several things NOT in DSpace Snapshots of Governor’s Office Website

GIS Data Some shapefiles and map images in DSpace

Geodatabase snapshots on file server

Backlog

Page 8: MyersTessella_Dec2013
Page 9: MyersTessella_Dec2013
Page 10: MyersTessella_Dec2013

Pros:

Cost = $ Free

Open source

Out-of-box product

Great access tool

Remote deposit Templates

Single item uploads

Cons:

Cost = IT support

Open source – IT support

Out-of-the-box product

Mass Import hard

Compound Objects

Lack of “Robust” preservation tools*

* Newer versions of DSpace adding more preservation tools

Page 11: MyersTessella_Dec2013

Summer 2011 Participated in the PTAB audit Primary Trustworthy Digital Repository Authorization

Body

6 Test audits 3 in US & 3 in Europe

KY only government archives

Centered around: ISO 16363 – Requirements for Audit and Certification of

Trustworthy Digital Repositories

ISO 16919 – Requirements for bodies providing audit and certification of Trustworthy Digital Repositories

Page 12: MyersTessella_Dec2013

2 Parts: Self Assessment – sent in advance of audit team

On-Site visit – July 6-7, 2011

Focused on: Technology and Infrastructure

Policies and Procedures

Administration

Page 13: MyersTessella_Dec2013

“None

Shall

Pass!”

Page 14: MyersTessella_Dec2013

METRIC 3.1.2 – The repository shall have a preservation strategic plan that defines the approach the repository will take in the long-term support of its mission.

Move to a “Financially Sustainable” program

ID all the designated communities

Strategic Plan

ID the scope of the system – separate operational and developmental components

Page 15: MyersTessella_Dec2013

METRIC 4.3.1 – The Repository shall have documented preservation strategies relevant to its holdings.

Develop a Systematic Approach to digital preservation

All records integrated into a single preservation and access program

Define the AIP

System for validating the integrity of the records

Page 16: MyersTessella_Dec2013

METRIC 5.2.1 – The Repository shall maintain a systematic analysis of the security risk factors associated with data, systems, personnel, and physical plant.

Multiple-geographically dispersed copies of AIP

More defined security, backup, disaster plan (migration strategy)

Page 17: MyersTessella_Dec2013

Final recommendation: In keeping with the spirit and intent of ISO 17021 and

ISO 16919, this report makes no recommendations as to how KDLA might address the enumerated findings in the preceding section.

Page 18: MyersTessella_Dec2013

Picture of my son removed

Page 19: MyersTessella_Dec2013

“Financially Sustainable” program “Line item” funding better than “Ad-hoc” funding

Government archives serve at the whim of the government

Strategic planning underway All policies and procedures

Identifying operational and developmental systems

Watching out for the buses

Designated Communities – Everyone

Page 20: MyersTessella_Dec2013

All records preserved in a single preservation and access system DSpace – Publications, Photo, Audio, Video, Scanned

documents, Maps, Shape files

Website Snapshots – On our own, and Archive-It

GIS Databases and Imagery

Digitized Film/Audio

Other data on file server

Page 21: MyersTessella_Dec2013

Automate largely distributed process of ingest and preservation to lessen cost of labor and be more efficient

Import effectively from our access system

Establish connections from our network to preservation system and back again

Gain intellectual and physical control over growing number of electronic records

Manage the holdings through time

Page 22: MyersTessella_Dec2013

Demonstrations of the product Stand-alone product – Safety Deposit Box (SDB)

Examined studies by state archives

Established a list of functional requirements

Began a pilot with a cloud-based solution 6 weeks test of developing Preservica product spring of

2012

Negotiated a contract for the service

Purchased Preservica in November 2012

http://www.digital-preservation.com/

Page 23: MyersTessella_Dec2013

IT Consolidation

Page 24: MyersTessella_Dec2013
Page 25: MyersTessella_Dec2013
Page 26: MyersTessella_Dec2013
Page 27: MyersTessella_Dec2013
Page 28: MyersTessella_Dec2013
Page 29: MyersTessella_Dec2013

Cost – Cheaper than stand-alone system

Full range of services and support

Use of templates to automate descriptive and accession

metadata insertion

Automated technical data extraction from files

Format normalization and migration

Administrative control of the archive

Geographic dispersal of copies KDLA Goal – Local copies of AIP

Little or no on-site maintenance

Page 30: MyersTessella_Dec2013
Page 31: MyersTessella_Dec2013
Page 32: MyersTessella_Dec2013

Currently have about 0.25 TB in Preservica 135820 files, for a total 262 GB

Focusing on things not in DSpace

Working through metadata issues XIP – METS – DC – It’s all XML, right?

Integration with Dspace More metadata issues: Qualified DC vs DC

Not really designed for public access Still relying on DSpace for access

Problems with the “Pipe” File Size upload issues

Page 33: MyersTessella_Dec2013
Page 34: MyersTessella_Dec2013

Added our accession metadata.

Page 35: MyersTessella_Dec2013
Page 36: MyersTessella_Dec2013

Working on adding the DSpace Dublin Core metadata.

Page 37: MyersTessella_Dec2013
Page 38: MyersTessella_Dec2013

Figure out what you can do and work on it. Exploit your strengths

Micrographics/Digital Imaging

Publications

Partner with somebody Collaborative projects

Join groups or committees

Keep it simple Don’t reinvent the wheel

Look for practical/realistic solutions

Don’t be afraid to get dirty!

Pictures of my children removed

Page 39: MyersTessella_Dec2013

E-mail:

[email protected]

Phone:

(502)564-8300 Ext. 244

KDLA web site:

http://www.kdla.ky.gov

E-Archive website: http://kdla.ky.gov/records/e-

archives/Pages/default.aspx

Thank You!