NARA Report: NARA Persistent Archives Prototype

Preview:

DESCRIPTION

NARA Report: NARA Persistent Archives Prototype. Bill Underwood GTRI, Atlanta CCSDS, MOIMS DAI / IPR WGs Toulouse, 2 Nov-5 Nov 2004. Project Members. Reagan Moore, PI, SDSC SDSC - Richard Marciano, Wayne Schroeder Univ. of Maryland - Joseph Jaja SLAC - Jean Deken GTRI - Bill Underwood. - PowerPoint PPT Presentation

Citation preview

NARA Report: NARA Persistent Archives Prototype

Bill UnderwoodGTRI, Atlanta

CCSDS, MOIMS DAI / IPR WGsToulouse, 2 Nov-5 Nov 2004

Project Members

Reagan Moore, PI, SDSC

SDSC - Richard Marciano, Wayne Schroeder

Univ. of Maryland - Joseph Jaja

SLAC - Jean Deken

GTRI - Bill Underwood

Project Objectives

• Virtual Data Grid Services

• Ingestion Workflow Prototype

• XML Schema for SIP

• Data Description Languages

Virtual Data Grid Services

• Archival services provided by GTRI– File System Packaging with NARA Metadata in XML

DTD Manifest– File type identification– File conversion– File viewers and readers– Information Extraction

• Services described in WSDL• Register Services• Discover and request archival services• Demonstrate on SLAC science data

Ingestion Workflow Prototype

• Addressing issues similar to the Producer-Archive Interface– Provides data to NARA based on a prior agreement

with Records Creator– Consists of metadata server and an ingestion client– Provides initial arrangement, context and metadata

• NARA – validates digital objects and metadata, – stores objects in a digital repository and– stores metadata in a catalog

• Demonstrate on SLAC science data.

XML Schema for SIP

• Modifications of MET to meet NARA records management requirement.

• Client generates and receive METs documents.• Client contacts Metadata server using X.509

certificates.• Metadata server stores METS items in a MySQL

database. • Metadata server manages certificates.• NARA server verifies metadata integrity against

schema and specification document.

Data Description Languages for Data Grids

• BinXM. Westhead and M. Bull. Representing

Scientific Data Sets on the Grid, EPPC, University of Edinburgh, Jan 2003.

• DFDLM. Westhead. Data Format Description

Language – Primer, Global Grid Forum Data Format Description Language Working Group

Recommended