14
NGDA Architecture Update Greg Janée

NGDA Architecture Update Greg Janée. Greg Janée May 16, 20052 Three motivations Archival has to be cheap & easy –little incentive –no funding Need to

Embed Size (px)

Citation preview

Page 1: NGDA Architecture Update Greg Janée. Greg Janée May 16, 20052 Three motivations Archival has to be cheap & easy –little incentive –no funding Need to

NGDA Architecture Update

Greg Janée

Page 2: NGDA Architecture Update Greg Janée. Greg Janée May 16, 20052 Three motivations Archival has to be cheap & easy –little incentive –no funding Need to

Greg Janée • May 16, 2005 2

Three motivations

• Archival has to be cheap & easy– little incentive– no funding

• Need to archive data semantics– key differentiator from text, audio, video

• Focus on long-term preservation– need to migrate whole systems

Page 3: NGDA Architecture Update Greg Janée. Greg Janée May 16, 20052 Three motivations Archival has to be cheap & easy –little incentive –no funding Need to

Greg Janée • May 16, 2005 3

system

databasestorage

handleresolver

database

Typical repository architecture

database

handleresolver

database

fragile

Page 4: NGDA Architecture Update Greg Janée. Greg Janée May 16, 20052 Three motivations Archival has to be cheap & easy –little incentive –no funding Need to

Greg Janée • May 16, 2005 4

NGDA architecture

storage subsystem

standard, public data model

archival system

ADL OAIbulk

loader

databases,caches,

etc.

Web

access ingest

Page 5: NGDA Architecture Update Greg Janée. Greg Janée May 16, 20052 Three motivations Archival has to be cheap & easy –little incentive –no funding Need to

Greg Janée • May 16, 2005 5

Post-NGDA architecture

storage subsystem

standard, public data model

Web

Page 6: NGDA Architecture Update Greg Janée. Greg Janée May 16, 20052 Three motivations Archival has to be cheap & easy –little incentive –no funding Need to

Greg Janée • May 16, 2005 6

Storage system requirements

• Req’s:– associate UUIDs/RIDs with bitstreams– retrieve global/local bitstream by UUID/RID– determine (parent) UUID of any bitstream– list all UUIDs

• Satisfied by:– any filesystem– tag URIs for UUIDs

• tag:library.ucsb.edu,2005:identifier

Page 7: NGDA Architecture Update Greg Janée. Greg Janée May 16, 20052 Three motivations Archival has to be cheap & easy –little incentive –no funding Need to

Greg Janée • May 16, 2005 7

Archival objects

directoryUUID

componentRID

UUID

Page 8: NGDA Architecture Update Greg Janée. Greg Janée May 16, 20052 Three motivations Archival has to be cheap & easy –little incentive –no funding Need to

Greg Janée • May 16, 2005 8

Archival objects

• Directory info per component– named relationship/position– format & semantics

• by UUID references to definitions

– fixity: checksum– provenance: isDerivative– policy: mutability– rights

• Components may be provided by archive itself

Page 9: NGDA Architecture Update Greg Janée. Greg Janée May 16, 20052 Three motivations Archival has to be cheap & easy –little incentive –no funding Need to

Greg Janée • May 16, 2005 9

Example

USGS

DOQQ

GeoTIFFFGDC

Object x

x.tiffx.fgdc x.gif

met

adat

ad

ata

derived

TIFFsubtypeOf

Page 10: NGDA Architecture Update Greg Janée. Greg Janée May 16, 20052 Three motivations Archival has to be cheap & easy –little incentive –no funding Need to

Greg Janée • May 16, 2005 10

Archives

• Archive = set of archival objects– no structure– no free-floating bitstreams

• In anticipation of federation:– associations may cross archive boundaries– archival objects may not

Page 11: NGDA Architecture Update Greg Janée. Greg Janée May 16, 20052 Three motivations Archival has to be cheap & easy –little incentive –no funding Need to

Greg Janée • May 16, 2005 11

Object types

• Content• Format definition• Semantic definition• Provider• Organizational structures

– collection– series– ingest session

Page 12: NGDA Architecture Update Greg Janée. Greg Janée May 16, 20052 Three motivations Archival has to be cheap & easy –little incentive –no funding Need to

Greg Janée • May 16, 2005 12

Archive-provider agreement

• Defines– common structure of objects to be ingested– necessary validations– associations to other objects– policies, rights, etc.

• Represents choke point– requires human evaluation

Page 13: NGDA Architecture Update Greg Janée. Greg Janée May 16, 20052 Three motivations Archival has to be cheap & easy –little incentive –no funding Need to

Greg Janée • May 16, 2005 13

Deferred functionality

• Incremental ingest• Object revisions• Rights• 3rd-party access• Federation

Page 14: NGDA Architecture Update Greg Janée. Greg Janée May 16, 20052 Three motivations Archival has to be cheap & easy –little incentive –no funding Need to

Greg Janée • May 16, 2005 14

Status

• Starting development now

• Approach: iterative refinement