Harvesting Using the Open Archives Initiative Protocol: What Your OAI Stream Can Tell You

Preview:

DESCRIPTION

Webinar from the Mountain West Digital Library Sandra McIntyre, MWDL Director Anna Neatrour, MWDL Digital Metadata Librarian Want to understand what happens behind the scenes with the MWDL harvesting? In this webinar, Sandra McIntyre and Anna Neatrour will explain the Open Archives Initiative Protocol for Metadata Harvestiong (OAI-PMH) and how it makes metadata aggregation possible in the MWDL. They will explain the process of harvesting and how MWDL normalizes your metadata. They will also show you how you can learn from your collections' OAI stream by using the six query verbs (requests) defined in the OAI-PMH.

Citation preview

Harvesting Using the Open Archives Initiative Protocol:

What Can Your OAI Stream Tell You?

Sandra McIntyre, MWDL DirectorAnna Neatrour, MWDL Digital Metadata Librarian

WHY OAI?The basics

Open Archives Initiative

Open Archives Initiative http://openarchives.org“Standards for Web Content Interoperability”

• Facilitate the efficient dissemination of content contained in archives/repositories

• Low-barrier framework and standards

Why is a protocol necessary?

OAI ProviderOAI Harvester

“I want it.” “I have it.”

?

?“Give me...”

“Here is what you requested.”

OAI-PMH

Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH)http://www.openarchives.org/pmh/

OAI Providers

OAI Providers

OAI Harvesters

OAIsterhttp://oaister.worldcat.org

and included in WorldCat

Mountain West Digital Libraryhttp://mwdl.org

Digital Public Library of Americahttp://dp.la/

Institute of Museum & Library Services

Digital Collections and Content http://imlsdcc.grainger.uiuc.edu

...and thousands more

Harvesting at MWDL

Mountain West

Digital Library

Univ of Utah

BYU

Southern Utah Univ

Utah Valley Univ

Utah State Univ

Weber State Univ

Salt Lake Comm. College

Utah State

Archives

Utah State

Library

Univ of Nevada

Las Vegas

Univ of Nevada

Reno

Utah Dvsn Arts

& Museums

Idaho State

ArchivesArizona Memory Project

Snow College

Northern Arizona

Univ

Univ of Idaho

Family Search

LDS Church History

Montana Memory Project

Stacks (Idaho)

Boise State Univ.

Why understand OAI?

• Predict what will happen with your metadata when it is harvested

• Do self-auditing and/or peer auditing of metadata: See patterns and find errors

Other metadata harvesting options

• Handing over a hard drive• Uploading/downloading via file

transfer protocol (FTP)• Other requests of XML (typically

application programming interfaces, APIs):– Web Services– X-Services

Advantages of OAI

• Update at a distance, anytime • Specify desired records

– By collection– By date range of last change to record

• Packets, one at a time• Works fast• Repeatable

THE PROTOCOLQueries and responses

Queries and Responses

OAI Provider

OAI Harvester

OAI respons

e

OAI query

Testing an OAI Provider

http://re.cs.uct.ac.za/

Queries:OAI BaseURL

BaseURL = OAI provider root address(Doesn’t work alone)

Examples:• http://aura.abdn.ac.uk/dspace-oai/reques

t• http://absronline.org/journals/index.php/i

ndex/oai• http://cyberleninka.ru/oai• http://digitalcommons.usu.edu/cgi/oai2.c

gi• http://www.avhumboldt.net/oai/oai.php

OAI query

Queries:6 Verbs

Verb = type of requestInitial capitals; no spaces

Examples:• Identify• ListMetadataFormats• ListSets• ListIdentifiers• ListRecords• GetRecord

OAI query

Queries:Parameters & Values

Parameters & values = details about request

Format: parameter=valueExamples:• metadataPrefix=oai_dc• metadataPrefix=qdc• set=awhof• identifier=oai:content.lib.utah.edu:etd3/48

2

OAI query

EXAMPLESQueries you can use

Identify

OAI Provider

OAI Harvester

OAI respons

e

OAI query

http://contentdm.li.suu.edu/oai/oai.php?verb=Identify“Who are you?”

“I am the SUU CONTENTdm Server

Repository.”

“I am the SUU CONTENTdm Repository.”

Identify

ListSets

OAI Provider

OAI Harvester

http://contentdm.li.suu.edu/oai/oai.php?verb=ListSets

“What sets do you have available?”

“Here is the list of sets.”

OAI respons

e

OAI query

“Here’s the list of sets.”ListSets

ListMetadataFormats

OAI Provider

OAI Harvester

http://contentdm.li.suu.edu/oai/oai.php?verb=ListMetadataFormats“What metadata formats are available?”

“Here’s the list of metadata formats.”

OAI respons

e

OAI query

“Here’s the list of metadata formats.”

ListMetadataFormats

“Here are the records.”

ListRecords

ListRecords

• One set only:http://contentdm.li.suu.edu/oai/oai.php?verb=ListRecords&metadataPrefix=oai_qdc&set=hist_photos

• If more than one screen of records, use a resumption token to get the additional lists (200 at a time in this example):http://contentdm.li.suu.edu/oai/oai.php?verb=ListRecords&resumptionToken=hist_photos:200:hist_photos:0000-00-00:9999-99-99:oai_qdc

• Turning on OAI: Administrative interface in the “Server” tab

• Choosing which collections to share• Sharing compound object level metadata only

Image from CONTENTdm OAI guide: http://contentdm.org/help6/server-admin/oai.asp

CONTENTdm’s OAI Provider

Record -> OAILocal Record with Labels OAI

OAI -> MWDL

OAI MWDL

MWDL -> DPLA

MWDL DPLA

Some Final Things to Remember

• Check your own OAI stream and see what it looks like!– Mapped to none – not in OAI stream– Hidden set to yes – not in OAI stream– CONTENTdm field properties template and guide

available at: http://mwdl.org/getinvolved/getinvolved.php

– Login to collection admin, click on tab, go to fields to check and edit properties

Field Mappings in CONTENTdm

Field Mapping example from the Western Soundscape Archive

Try it yourself!

Resources available athttp://mwdl.org/getinvolved/getinvolved.php

We’re here to help!

• For additional questions about self-auditing your OAI contact Anna Neatrour:– anna.neatrour@utah.edu– 801-587-8883

• Any Questions?

Recommended