37
Harvesting Using the Open Archives Initiative Protocol: What Can Your OAI Stream Tell You? Sandra McIntyre, MWDL Director Anna Neatrour, MWDL Digital Metadata Librarian

Harvesting Using the Open Archives Initiative Protocol: What Your OAI Stream Can Tell You

Embed Size (px)

DESCRIPTION

Webinar from the Mountain West Digital Library Sandra McIntyre, MWDL Director Anna Neatrour, MWDL Digital Metadata Librarian Want to understand what happens behind the scenes with the MWDL harvesting? In this webinar, Sandra McIntyre and Anna Neatrour will explain the Open Archives Initiative Protocol for Metadata Harvestiong (OAI-PMH) and how it makes metadata aggregation possible in the MWDL. They will explain the process of harvesting and how MWDL normalizes your metadata. They will also show you how you can learn from your collections' OAI stream by using the six query verbs (requests) defined in the OAI-PMH.

Citation preview

Page 1: Harvesting Using the Open Archives Initiative Protocol: What Your OAI Stream Can Tell You

Harvesting Using the Open Archives Initiative Protocol:

What Can Your OAI Stream Tell You?

Sandra McIntyre, MWDL DirectorAnna Neatrour, MWDL Digital Metadata Librarian

Page 2: Harvesting Using the Open Archives Initiative Protocol: What Your OAI Stream Can Tell You

WHY OAI?The basics

Page 3: Harvesting Using the Open Archives Initiative Protocol: What Your OAI Stream Can Tell You

Open Archives Initiative

Open Archives Initiative http://openarchives.org“Standards for Web Content Interoperability”

• Facilitate the efficient dissemination of content contained in archives/repositories

• Low-barrier framework and standards

Page 4: Harvesting Using the Open Archives Initiative Protocol: What Your OAI Stream Can Tell You

Why is a protocol necessary?

OAI ProviderOAI Harvester

“I want it.” “I have it.”

?

?“Give me...”

“Here is what you requested.”

Page 5: Harvesting Using the Open Archives Initiative Protocol: What Your OAI Stream Can Tell You

OAI-PMH

Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH)http://www.openarchives.org/pmh/

Page 6: Harvesting Using the Open Archives Initiative Protocol: What Your OAI Stream Can Tell You

OAI Providers

Page 7: Harvesting Using the Open Archives Initiative Protocol: What Your OAI Stream Can Tell You

OAI Providers

Page 8: Harvesting Using the Open Archives Initiative Protocol: What Your OAI Stream Can Tell You

OAI Harvesters

OAIsterhttp://oaister.worldcat.org

and included in WorldCat

Mountain West Digital Libraryhttp://mwdl.org

Digital Public Library of Americahttp://dp.la/

Institute of Museum & Library Services

Digital Collections and Content http://imlsdcc.grainger.uiuc.edu

...and thousands more

Page 9: Harvesting Using the Open Archives Initiative Protocol: What Your OAI Stream Can Tell You

Harvesting at MWDL

Mountain West

Digital Library

Univ of Utah

BYU

Southern Utah Univ

Utah Valley Univ

Utah State Univ

Weber State Univ

Salt Lake Comm. College

Utah State

Archives

Utah State

Library

Univ of Nevada

Las Vegas

Univ of Nevada

Reno

Utah Dvsn Arts

& Museums

Idaho State

ArchivesArizona Memory Project

Snow College

Northern Arizona

Univ

Univ of Idaho

Family Search

LDS Church History

Montana Memory Project

Stacks (Idaho)

Boise State Univ.

Page 10: Harvesting Using the Open Archives Initiative Protocol: What Your OAI Stream Can Tell You

Why understand OAI?

• Predict what will happen with your metadata when it is harvested

• Do self-auditing and/or peer auditing of metadata: See patterns and find errors

Page 11: Harvesting Using the Open Archives Initiative Protocol: What Your OAI Stream Can Tell You

Other metadata harvesting options

• Handing over a hard drive• Uploading/downloading via file

transfer protocol (FTP)• Other requests of XML (typically

application programming interfaces, APIs):– Web Services– X-Services

Page 12: Harvesting Using the Open Archives Initiative Protocol: What Your OAI Stream Can Tell You

Advantages of OAI

• Update at a distance, anytime • Specify desired records

– By collection– By date range of last change to record

• Packets, one at a time• Works fast• Repeatable

Page 13: Harvesting Using the Open Archives Initiative Protocol: What Your OAI Stream Can Tell You

THE PROTOCOLQueries and responses

Page 14: Harvesting Using the Open Archives Initiative Protocol: What Your OAI Stream Can Tell You

Queries and Responses

OAI Provider

OAI Harvester

OAI respons

e

OAI query

Page 15: Harvesting Using the Open Archives Initiative Protocol: What Your OAI Stream Can Tell You

Testing an OAI Provider

http://re.cs.uct.ac.za/

Page 16: Harvesting Using the Open Archives Initiative Protocol: What Your OAI Stream Can Tell You

Queries:OAI BaseURL

BaseURL = OAI provider root address(Doesn’t work alone)

Examples:• http://aura.abdn.ac.uk/dspace-oai/reques

t• http://absronline.org/journals/index.php/i

ndex/oai• http://cyberleninka.ru/oai• http://digitalcommons.usu.edu/cgi/oai2.c

gi• http://www.avhumboldt.net/oai/oai.php

OAI query

Page 17: Harvesting Using the Open Archives Initiative Protocol: What Your OAI Stream Can Tell You

Queries:6 Verbs

Verb = type of requestInitial capitals; no spaces

Examples:• Identify• ListMetadataFormats• ListSets• ListIdentifiers• ListRecords• GetRecord

OAI query

Page 18: Harvesting Using the Open Archives Initiative Protocol: What Your OAI Stream Can Tell You

Queries:Parameters & Values

Parameters & values = details about request

Format: parameter=valueExamples:• metadataPrefix=oai_dc• metadataPrefix=qdc• set=awhof• identifier=oai:content.lib.utah.edu:etd3/48

2

OAI query

Page 19: Harvesting Using the Open Archives Initiative Protocol: What Your OAI Stream Can Tell You

EXAMPLESQueries you can use

Page 20: Harvesting Using the Open Archives Initiative Protocol: What Your OAI Stream Can Tell You

Identify

OAI Provider

OAI Harvester

OAI respons

e

OAI query

http://contentdm.li.suu.edu/oai/oai.php?verb=Identify“Who are you?”

“I am the SUU CONTENTdm Server

Repository.”

Page 21: Harvesting Using the Open Archives Initiative Protocol: What Your OAI Stream Can Tell You

“I am the SUU CONTENTdm Repository.”

Identify

Page 22: Harvesting Using the Open Archives Initiative Protocol: What Your OAI Stream Can Tell You

ListSets

OAI Provider

OAI Harvester

http://contentdm.li.suu.edu/oai/oai.php?verb=ListSets

“What sets do you have available?”

“Here is the list of sets.”

OAI respons

e

OAI query

Page 23: Harvesting Using the Open Archives Initiative Protocol: What Your OAI Stream Can Tell You

“Here’s the list of sets.”ListSets

Page 24: Harvesting Using the Open Archives Initiative Protocol: What Your OAI Stream Can Tell You

ListMetadataFormats

OAI Provider

OAI Harvester

http://contentdm.li.suu.edu/oai/oai.php?verb=ListMetadataFormats“What metadata formats are available?”

“Here’s the list of metadata formats.”

OAI respons

e

OAI query

Page 25: Harvesting Using the Open Archives Initiative Protocol: What Your OAI Stream Can Tell You

“Here’s the list of metadata formats.”

ListMetadataFormats

Page 27: Harvesting Using the Open Archives Initiative Protocol: What Your OAI Stream Can Tell You

“Here are the records.”

ListRecords

Page 28: Harvesting Using the Open Archives Initiative Protocol: What Your OAI Stream Can Tell You

ListRecords

• One set only:http://contentdm.li.suu.edu/oai/oai.php?verb=ListRecords&metadataPrefix=oai_qdc&set=hist_photos

• If more than one screen of records, use a resumption token to get the additional lists (200 at a time in this example):http://contentdm.li.suu.edu/oai/oai.php?verb=ListRecords&resumptionToken=hist_photos:200:hist_photos:0000-00-00:9999-99-99:oai_qdc

Page 30: Harvesting Using the Open Archives Initiative Protocol: What Your OAI Stream Can Tell You

• Turning on OAI: Administrative interface in the “Server” tab

• Choosing which collections to share• Sharing compound object level metadata only

Image from CONTENTdm OAI guide: http://contentdm.org/help6/server-admin/oai.asp

CONTENTdm’s OAI Provider

Page 31: Harvesting Using the Open Archives Initiative Protocol: What Your OAI Stream Can Tell You

Record -> OAILocal Record with Labels OAI

Page 32: Harvesting Using the Open Archives Initiative Protocol: What Your OAI Stream Can Tell You

OAI -> MWDL

OAI MWDL

Page 33: Harvesting Using the Open Archives Initiative Protocol: What Your OAI Stream Can Tell You

MWDL -> DPLA

MWDL DPLA

Page 34: Harvesting Using the Open Archives Initiative Protocol: What Your OAI Stream Can Tell You

Some Final Things to Remember

• Check your own OAI stream and see what it looks like!– Mapped to none – not in OAI stream– Hidden set to yes – not in OAI stream– CONTENTdm field properties template and guide

available at: http://mwdl.org/getinvolved/getinvolved.php

– Login to collection admin, click on tab, go to fields to check and edit properties

Page 35: Harvesting Using the Open Archives Initiative Protocol: What Your OAI Stream Can Tell You

Field Mappings in CONTENTdm

Field Mapping example from the Western Soundscape Archive

Page 36: Harvesting Using the Open Archives Initiative Protocol: What Your OAI Stream Can Tell You

Try it yourself!

Resources available athttp://mwdl.org/getinvolved/getinvolved.php

Page 37: Harvesting Using the Open Archives Initiative Protocol: What Your OAI Stream Can Tell You

We’re here to help!

• For additional questions about self-auditing your OAI contact Anna Neatrour:– [email protected]– 801-587-8883

• Any Questions?