Upload
moses-freeman
View
216
Download
0
Tags:
Embed Size (px)
Citation preview
Nate TrailNetwork Development & MARC Standards Office8/1/2006
With help from Sydney Olive
How to Build, Display and Find METS Objects
•A little about the technology
•How we use it
•Searching and Lucene
•Cocoon Tools
•Future Directions
•Conclusions
Outline
Technology Used• XML documents (METS, MODS, etc.)
• METS Object
• XSL processes • Transformation or display
• Cocoon framework (and a little Java)
XSL for Transformation
XSL for Display
• User input via URL matches
Cocoon Concepts (1)
–<map:match pattern="navigation/*">
<map:generate src="cocoon:/{1}/mets.xml"/>
<map:transform src="navigation.xsl"/>
<map:serialize type="xml"/>–</map:match>
• Combine small matches to create
a pipeline
• Separate Data from Action and Design
Cocoon Concepts (2)
• Database SQL processing
MySQL extract
MODS (SRU)(Three masters LC publication)
How we use it (1)
• METS making pipeline:
o MySQL extract MODS
o ObjectType, Files, MODS, rights datao METSMaker stylesheet uses ObjectType to
create links among files and descriptive metadata, based on METS profile: METS
How we use it (2)
• Cocoon for Display o Navigation, Description,
Pageturning
o xml data for display
o add header, footer,
breadcrumbs
o VHP presentations
How we use it (3)
Searching and Lucene
• We have XML objects, so we need an XML based search system.
• Cocoon comes with Lucene.
• Lucene indexes are easy to configure and modify: METS In x t mde ed i e
Searching
• Search page – limit search to certain object types,
– search certain fields only, or
– search all (default)
• Canned search as collection presentation tool (LC Publications, baseball)
• New browsing capability
Cocoon Tools (1)
• METS Profile Validation (fs01)
• SQL : query any Oracle, MySQL tables from the browser to get back XML
Cocoon Tools (2)
• Site administration, enhancement of processes
– Get IDs for some set of the database for extraction, cleanup
– Convert from MARC to MODS (input) or MODS to MARC: (extract)
– Convert HTML to XML
– Detect deletions of objects
– Link checking
– Search for MARC records using SRU
Future Directions
• New Behaviors, Profiles (Article OCR) (Multivolume Monograph / rarebook source)
• RSS• Integrate profile validation with METS building• Integrate JHOVE file inspection tool (MIX)• Investigate using JPEG2000 file format for image
manipulation enhancement• Submit objects to a repository• Exchange objects with another entity
Conclusions
• What are we learning from building this application?
Conclusions
• The METS standard is flexible enough to describe multiple kinds of complex objects
Conclusions
• METS Profiles help define an object and it’s range of behaviors
Conclusions
• METS and MODS play well together.
Conclusions
• METS files can be built automatically.
Conclusions
• It may look hard, but METS objects can be parsed, validated and used without difficulty
Conclusions
• MODS is extremely powerful as a structural tool, not just a bibliographic tool
• (but good, consistent, authoritative cataloging makes search and display
much more reliable)
Conclusions
• Display is for today only; well structured data lasts.
Conclusions
Without standardized structures, we end up with egg on our face.
Get back to work!!
Photos © 2006 Nate Trail