Upload
maximillian-cox
View
212
Download
0
Tags:
Embed Size (px)
Citation preview
Getting Started with CONTENTdm
Corey Harper, University of Oregon
Terry Reese, Oregon State University
OLA - April 8, 2005
Terry Reese - Corey Harper Oregon Library Association - April 8, 2005
Road Map
Introduction to CONTENTdm's field properties and search interfaces Determining important access points Discussion of Dublin Core mapping CONTENTdm's controlled vocabulary structure Setting up controlled vocabularies Use of home grown vocabularies Metadata interoperability Shared standards and best practices Western States Best Practices Working with Western Waters Managing control vocabularies between projects Demo some of OSU and UO's collections Q&A
Terry Reese - Corey Harper Oregon Library Association - April 8, 2005
Introducing CONTENTdm
Search Interfaces 3+-3.8 provides three primary search interfaces
1. Browse Search2. Advance Search3. Custom Search
4 display methods Grid view Bibiographic view Thumbnail view Title view
Terry Reese - Corey Harper Oregon Library Association - April 8, 2005
Introducing CONTENTdm: Browse Interface
Browses all items in the collection Browse ordered alphebetically
No skip characters (the, a, I, an used to determine order)
The pages of CONTENTdm Compound objects are not show in the browse interface.
Terry Reese - Corey Harper Oregon Library Association - April 8, 2005
Introducing CONTENTdm: Browse Interface (Grid view)
Terry Reese - Corey Harper Oregon Library Association - April 8, 2005
Introducing CONTENTdm: Browse Interface (Thumbnail view)
Terry Reese - Corey Harper Oregon Library Association - April 8, 2005
Introducing CONTENTdm: Browse Interface (Bibliographic view)
Terry Reese - Corey Harper Oregon Library Association - April 8, 2005
Introducing CONTENTdm: Browse Interface (Title view)
Terry Reese - Corey Harper Oregon Library Association - April 8, 2005
Introducing CONTENTdm: Browse Interface (Custom view)
Hyperlink Example
Terry Reese - Corey Harper Oregon Library Association - April 8, 2005
Introducing CONTENTdm: Advanced Interface
Two types of searches Searching across all fields Searching on a particular field
Terry Reese - Corey Harper Oregon Library Association - April 8, 2005
Introducing CONTENTdm: Advanced Interface
Terry Reese - Corey Harper Oregon Library Association - April 8, 2005
Introducing CONTENTdm: Advanced Interface
Terry Reese - Corey Harper Oregon Library Association - April 8, 2005
Introducing CONTENTdm: Setting up field properties
Field properties set in the administration area
Terry Reese - Corey Harper Oregon Library Association - April 8, 2005
Introducing CONTENTdm: Setting up field properties
Terry Reese - Corey Harper Oregon Library Association - April 8, 2005
Introducing CONTENTdm: Setting up field properties
Dublin Core Mappings
Data Types:Text
Date (format: dd/mm/yyyy)
Full Text Search (OCR’d data)
TitleSubjectDescriptionCreatorPublisherContributorsDateTypeFormatIdentifierSourceLanguageRelationCoverageRightsAudienceTitle-AlternativeDescription-Table Of ContentsDescription-AbstractDate-CreatedDate-ValidDate-AvailableDate-IssuedDate-ModifiedFormat-ExtentFormat-MediumRelation-Is Version OfRelation-Has VersionRelation-Is Replaced ByRelation-ReplacesRelation-Is Required ByRelation-RequiresRelation-Is Part OfRelation-Has PartRelation-Is Referenced ByRelation-ReferencesRelation-Is Format OfRelation-Has Format OfRelation-Conforms ToCoverage-SpatialCoverage-TemporalAudience-MediatorNone
Terry Reese - Corey Harper Oregon Library Association - April 8, 2005
Introducing CONTENTdm: Starting a project
What do we scan? The first and most important part of the
collection building process. Every institution has great stuff to digitize but…. Digital collections need to be treated like
traditional materials, i.e., are vetted or proposed by an organization’s subject specialist.
Terry Reese - Corey Harper Oregon Library Association - April 8, 2005
Introducing CONTENTdm: Starting a project
Working with stakeholders: Working with your subject specialists can help
you to identify:1. Organizational stakeholders (departments, groups,
etc.)
2. Outside stakeholders (both public and private)
3. Field specific thesauri and classification terms that may be able to be incorporated into the project.
Terry Reese - Corey Harper Oregon Library Association - April 8, 2005
Introducing CONTENTdm: Starting a project
Determining access points: What metadata will be present? How will it be entered? What best practices or standards will be used in
generating the metadata? What metadata will your stakeholders expect to be
present? What metadata elements will be searchable? Which
will use controlled vocabulary? How will your administrative metadata be stored?
Terry Reese - Corey Harper Oregon Library Association - April 8, 2005
Introducing CONTENTdm: Starting a project
Access points into the collection: Once the collection has been built – how will it
be accessed? Search types? Example:
http://digitalcollections.library.oregonstate.edu/archives/
http://digitalcollections.library.oregonstate.edu/dna/
Terry Reese - Corey Harper Oregon Library Association - April 8, 2005
Mapping to Dublin Core
15 Dublin Core Elements 16th element – Audience 26 Qualified DC Elements (from dcterms
namespace) None – Special value – field is not mapped
Terry Reese - Corey Harper Oregon Library Association - April 8, 2005
OAI-PMH
Open Archives Initiative Protocol for Metadata Harvesting
I’ll be talking about this some, as will Terry OAI-PMH is a protocol, layered over HTTP Response format encoded as XML Defines format for requests and responses used for
harvesting metadata from collections Used to build federated search interfaces http://www.openarchives.org/
Terry Reese - Corey Harper Oregon Library Association - April 8, 2005
Some notes on mapping
Effect on searching – across collections, search based on DC Mapping
Effect on OAI output 15 elements and nothing more. Qualified elements “dumb down” to Simple DC “none” and “Audience” aren’t included in OAI
results
Terry Reese - Corey Harper Oregon Library Association - April 8, 2005
More on Mapping
Collaborative projects should determine what information to map to Dublin Core fields
Western States documentation provides guidance on mapping
Effect on search results at centralized search interfaces, e.g. Western Waters
Terry Reese - Corey Harper Oregon Library Association - April 8, 2005
Effects of Bad Mapping
Poor mapping decisions can cause problems Cluttered results in cross collection searches Cluttered results in federated searching Inconsistency in where pieces of information are
found Important information not harvested
Terry Reese - Corey Harper Oregon Library Association - April 8, 2005
Decisions are rarely final
CONTENTdm is extremely flexible Can easily change mappings, indexing, display,
vocabularies Can add and delete fields at any time
This can be both good and bad
Terry Reese - Corey Harper Oregon Library Association - April 8, 2005
Controlled Vocabularies in CONTENTdm
Supports SEE FROM type cross-references No support for SEE ALSO or hierarchical
(BT, NT, RT) One term per line in a text file Cross-references: x-ref USE heading
Terry Reese - Corey Harper Oregon Library Association - April 8, 2005
Vocabularies on the Server
Stored as text files in vocab folder: [nickname].txt; e.g. subjec.txt
Additional vocabulary files stored in the text_search folder: voc.[nickname] – Vocabulary terms used in instance data use.[nickname] – X-refs to terms used in instance data vocuse.[nickname] – Both terms and X-refs
Terry Reese - Corey Harper Oregon Library Association - April 8, 2005
Vocabulary Index Generator
Terry’s PHP Script to create hyperlinked lists of vocabulary terms: http://oregonstate.edu/~reeset/contentdm
/downloads.html
Excellent for “Browse by Subject” pages Configurable to include x-refs Other useful tools available at this URL
Terry Reese - Corey Harper Oregon Library Association - April 8, 2005
Available Vocabularies
Software comes with TGM-I LCSH and MeSH available from User Support
Center (requires login) MeSH – 29,000 entries – headings only LCSH – 156,411 entries – 64,959 headings &
91,452 x-refs. X-Refs include 400, 410, 411 and 450 references from
authority records coded as 150 with 008 bit 09 “a”
Terry Reese - Corey Harper Oregon Library Association - April 8, 2005
Other Vocabularies
Adding terms to existing vocabularies Be careful: update when new versions available
Creating CONTENTdm formated versions of other vocabularies: DC Type MIME Type GSFAD
Creating home-grown vocabularies from scratch
Terry Reese - Corey Harper Oregon Library Association - April 8, 2005
Home Grown Vocabularies
May be useful to combine vocabularies or create new ones. Example – Combining terms from GSFAD &
from TGM-II (GMGPC) Controlling a list of Collection Names,
Collection Identifiers, Source Conditions, etc.
Generating Vocabs from a fields contents
Terry Reese - Corey Harper Oregon Library Association - April 8, 2005
Temporary Vocabularies
Instance data for fields using vocabularies Term A; Term B; Term C
Use same syntax for non-controlled fields that contain multiple entries repeatable fields in the DC sense
Create vocabs from field contents for normalization and quality control Browse the defined vocabulary to locate problems
Terry Reese - Corey Harper Oregon Library Association - April 8, 2005
CONTENTdm and metadata interoperability
Issues to consider: Interoperability between metadata formats
Dublin Core => MARC, etc. Interacting with federated searching
Interacting with federated search tools Understanding how your metadata could be
harvested via OAI.
Terry Reese - Corey Harper Oregon Library Association - April 8, 2005
CONTENTdm and metadata interoperability
Building federated collections: By considering metadata interoperability you
can build federated tools based on OAI: http://fluffy.library.oregonstate.edu/contentdm/searc
h/index.php Build tools that integrate with other federated
search tools: http://fluffy.library.oregonstate.edu/a9/search.php
Terry Reese - Corey Harper Oregon Library Association - April 8, 2005
Shared standards and best practices
Metadata interoperability and shared standards go hand in hand.
Shared standards are essentially a “trust” contract between groups of users that their metadata will conform to a specific set of rules.
Examples: MARC & AACR2
Terry Reese - Corey Harper Oregon Library Association - April 8, 2005
Shared standards: western waters best practices
Shared digital library standards: Western states Dublin Core best practices
http://www.cdpheritage.org/resource/metadata/wsdcmbp/
Terry Reese - Corey Harper Oregon Library Association - April 8, 2005
Working with Western Waters
Western Waters Digital Library – Federated CONTENTdm catalog of 12 academic libraries.
Projects contributed to the WWDL require metadata that meets both local and consortial standards.
As with any consortial arrangement – compromise is sometimes going to be necessary.
Terry Reese - Corey Harper Oregon Library Association - April 8, 2005
Working with Western Waters
More stuff
Terry Reese - Corey Harper Oregon Library Association - April 8, 2005
Managing controlled vocabularies between projects
CONTENTdm has no built-in facility to share controlled vocabularies between projects.
Two methods: 1) use Unix diff function to locate differences
between in use vocab. between projects and then manually adding missing elements or correcting conficts between projects.
Terry Reese - Corey Harper Oregon Library Association - April 8, 2005
Managing controlled vocabularies between projects
2. Build your own management:1. Example:
http://fluffy.library.oregonstate.edu/contentdm/builder.html
Terry Reese - Corey Harper Oregon Library Association - April 8, 2005
Contact Us
Terry Reese - [email protected] Corey Harper – [email protected]