View
214
Download
0
Tags:
Embed Size (px)
Citation preview
The Mellon-Funded The Mellon-Funded Fedora ProjectFedora Project
Technical Specifications ReviewTechnical Specifications ReviewAugust 26, 2002August 26, 2002
Sandy Payette
Cornell Information Science
Fedora Access Sub-SystemFedora Access Sub-System
D isse m in a tio n
H T T P
A PI -A Fedora -API-A .w sd l
S O A PH T T P
A PI -M Fedora -API-M .w sd l
S O A P
http
smtp
othe
r
http
smtp
othe
r
http
http
E x ter n a lC o n ten tS o u rc e
E x ter n a lC o n ten tS o u rc e
http
ftp
E x ter n a l C o n ten tR etr iev er
X M L F ile s
R elat io n al D B
Batc h I n g es t C lien t W eb Br o w s erM an ag em en tC lien t
C o m p o n e n tM a n a g e m e n t
O b j e c tR e f le c tio n
S O A P
http
S O A P
http
H T T P
http
S es s io n M an ag em en t S u b s y s tem- U s er A u th en t icat io n
P o l icies
O b j e c tV a lid a tio n s v c1
s v c2
T ra ns port P ro toc ol
M e s s a ge P rotoc ol
O b j e c tM a n a g e m e n t
P I DG e n e r a tio n
P o lic yM a n a g e m e n t
U s ers /G ro u p s
h t tp
ftp
L o calS erv ices
D atas tr eam S to r ag e
D ig ita l O b jec t S to r ag eS to rag e S u b s ys te m
M an ag e m e n tS u b s ys te m
S e c u rityS u b s ys te m
A c c e s sS u b s ys te m
F e do r aW e b Se r vi c eE xpo s ur eL aye r
C l i e nts
M an ag edC o n ten t
HT
TP
h t tp
SO
AP
h t tp
s m tp
o th er
R em o teBeh av io r
M ec h an is mS er v ic e
httphttp
SO
AP
R em o teBeh av io r
M ec h an is mS er v ic e
HT
TP
FEDORA FEDORA Implementation TechnologiesImplementation Technologies
Fedora Web Services Layer– Apache Axis for SOAP over HTTP– Apache Tomcat 4.1
Core Repository System– Sun JDK1.4– Xerces 2.0.2 for XML parsing
Deployment Platforms– Windows– Solaris– Linux
Fedora Management Sub-SystemFedora Management Sub-System “API-M”“API-M”
Object ManagementObject Component ManagementObject ValidationPID GenerationInteracts with Storage Subsystem
Object Management OperationsObject Management Operations(in API-M)(in API-M)
Ingest Object Get Object XML Export Object Withdraw Object Delete Object Purge Object Obtain Lock on Object Release Lock
Get Object State Get Object Audit Trail
Object StateObject StateState DefinitionA Active
L Locked
R Replicating
N Incomplete
W Withdrawn
C Marked for Deletion
D Pending Deletion
Component Management OperationsComponent Management Operations(in API-M)(in API-M)
Add (Datastream or Disseminator) Modify (Datastream or Disseminator) Withdraw (Datastream or Disseminator) Delete (Datastream or Disseminator) Purge (Datastream or Disseminator) Get (Datastream or Disseminator) Get History (Datastream or Disseminator)
Component StateComponent State
State DefinitionA Active
W Withdrawn
C Marked for Deletion
D Pending Deletion
B Broken Link
Object ValidationObject Validation
Runs before an XML object is accepted into the repository for storage– During ingest of XML object created outside of repository– Before final commit of object built up via API-M
XML Schema Validation (to METS Schema) Fedora Rules Validation (via Schematron)
– Referential integrity– Required values for attributes– Allowable state transitions (object state and component state)
Fedora Storage Sub-SystemFedora Storage Sub-System
E x ter n a lC o n ten tS o u rc e
E x ter n a lC o n ten tS o u rc e
http
ftp
E x ter n a l C o n ten tR etr iev er
X M L F ile s
R elat io n al D B
h t tp
ftp
D atas tr eam S to r ag e
D ig ita l O b jec t S to r ag eS to rag e S u b s ys te m
M an ag edC o n ten t
HT
TP
h t tp
SO
AP
h t tp
s m tp
o th er
R em o teBeh av io r
M ec h an is mS er v ic e
httphttp
SO
AP
R em o teBeh av io r
M ec h an is mS er v ic e
HT
TP
D isse m in a tio n
C o m p o n e n tM a n a g e m e n t
O b j e c tR e f le c tio n
P o l icies
O b j e c tV a lid a tio n s v c1
s v c2
O b j e c tM a n a g e m e n t
P I DG e n e r a tio n
P o lic yM a n a g e m e n t
U s ers /G ro u p sL o cal
S erv ices
M an ag e m e n tS u b s ys te m
S e c u rityS u b s ys te m
A c c e s sS u b s ys te m
Storage Sub-system:Storage Sub-system: Object Reader and Writer Modules Object Reader and Writer Modules
DOWriter DOReader
DisseminatingDOReader
DefinitiveDOReaderDefinitiveDOWriter FastDOReader
XMLObjects
DisseminationCache DB
External Content
Retriever
BehaviorService Resolver
DataStore
DataStore-FS
ManagedContent
Datastreams
Digital Object XML Digital Object XML The Inner ViewThe Inner View
PIDSystem Metadata: Object Audit TrailDatastreams : 3 kindsDisseminators: pointing to behavior objects
PID SyntaxPID Syntax
Conforms to URN Syntax (RFC2141) Syntax
– Namespace prefix (hyphen separators)– Delimiter (:)– Object identifier string (sequential number)
Examples:– uva-lib:1225– cornell-cs:101– uva-bdef:10– uva-bmech:12
System Metadata: System Metadata: Object Audit TrailObject Audit Trail
Audit trail record put in object for all API-M transactions
Fedora XML Schema for Audit trail metadata:
<fedoraAudit:record><fedoraAudit:process type=“”/><fedoraAudit:action> </fedoraAudit:action><fedoraAudit:responsibility> </fedoraAudit:responsibility><fedoraAudit:date> </fedoraAudit:date><fedoraAudit:justification> </fedoraAudit:justification>
</fedoraAudit:record>
Datastreams: 3 KindsDatastreams: 3 Kinds
Referenced External Content– Outside custodianship of repository– Referenced in digital object XML via a URI
Repository Managed Content– Under custodianship of repository– Referenced in digital object XML via internal
datastream identifier Implementer-defined XML Metadata
– Bound to the digital object– Stored in-line as part of digital object XML
Digital Object XML StorageDigital Object XML StorageSample ObjectSample Object
Click here for test digital object
Digital Object XML Storage Digital Object XML Storage Sample Behavior Definition ObjectSample Behavior Definition Object
Click here for Behavior Definition object for Image-W
Digital Object XML Storage Digital Object XML Storage Sample Behavior Mechanism ObjectSample Behavior Mechanism Object
Click here for Behavior Mechanism object for UVA_Image_-W
D a ta s tre a m B in din g S pe c
¤ D S B in din g K e y _ D B I D
B M EC H _ D B I D
D S Bin d in g Key _ N am e
D S Bin d in g Key _ O r d in a lity _ F lag
D S Bin d in g Key _ C ar d in a lity
D S Bin d in g Key _ M I M E
D S Bin d in g Key _ L ab e l
Pa ra m e te r
¤ M ETH _ D B I D
¤ B D EF_ D B I D
P AR M _ N am e
P AR M _ D ef au lt_ Valu e
P AR M _ R eq u ir ed _ F lag
P AR M _ L ab el
B e h a v io rD e f in it io n
¤ B D EF _ D B I D
BD E F _ P I D
BD E F _ L ab e l
M e th o d
¤ M ETH _ D B I D
¤ B D EF _ D B I D
M E T H_ N am e
M E T H_ L ab el
D ig ita lO bje ct
¤ D O _ D B I D
D O _ P I D
D O _ L ab el
F E D O R A D issem in ation D atab ase S ch em a
B e h a v io rM e ch a n is m
¤ B M EC H _ D B I D
B D EF _ D B I D
BM E C H_ P I D
BM E C H_ L ab el
M e ch a n is m I m pl
¤ B M EC H _ D B I D
¤ B D EF_ D B I D
¤ M ETH _ D B I D
D S B in din g K e y _ D B I D
M E C HI m p l_ P r o to c o l_ T y p e
M E C HI m p l_ M I M E
M E C HI m p l_ Ad d r es s _ L o c a tio n
M E C HI m p l_ O p er a t io n _ L o c a tio n
PO L I C Y _ D B I D
D ig ita lO bje ctD is s A s s o c
¤ D O _ D B I D
¤ D I S S _ D B I D
D is s e m in a to r
¤ D I S S _ D B I D
B D EF _ D B I D
B M EC H _ D B I D
D I S S _ I D
D I S S _ L ab e l
Po licy
¤ PO L I C Y _ D B I D
P O L I C Y_ N am e
P O L I C Y_ R u le
P O L I C Y_ L ab e l
D a ta s tre a m B in din g M a p
¤ B in din g M a p_ D B I D
B M EC H _ D B I D
Bin d in g M ap _ I D
Bin d in g M ap _ L ab e l
D a ta S tre a m B in din g s
¤ D O _ D B I D
¤ D S B in din g K e y _ D B I D
Bin d in g M ap _ D BI D
D O Bin d in g M ap _ D S Bin d in g Key _ S eq
D O Bin d in g M ap _ D S _ I D
D O Bin d in g M ap _ D S _ L ab e l
D O Bin d in g M ap _ D S _ M I M E
D O Bin d in g M ap _ D S _ L o c a tio n
PO L I C Y _ D B I D