24
LibraryCloud July 28, 2014 - ABCD

Library cloud abcd

Embed Size (px)

DESCRIPTION

LibraryCloud API at Harvard

Citation preview

Page 1: Library cloud   abcd

LibraryCloudJuly 28, 2014 - ABCD

Page 2: Library cloud   abcd

Agenda

● What is LibraryCloud?● The API● The architecture● Q&A

Page 3: Library cloud   abcd

What is LibraryCloud?

Metadata platformConnect information flows underneathProvide programmatic access on top

Page 4: Library cloud   abcd

Who is it for?

LTS toolInnovators/developersOther sites & platforms that want to integrate with library metadata

Page 5: Library cloud   abcd

Example use casesCustom OPAC (e.g., Stacklife)Library analyticsSource of metadata for digital collections and exhibitsSpecialized search engineIntegrate with Teaching & Learning TechnologiesRecommendation engine (eventually)Integrate with Web sources (e.g. NPR, Wikipedia)

Page 6: Library cloud   abcd

Two versions

v1: Library Innovation Lab prototypev2: Harvard Library

Workflow-basedRobust

Page 7: Library cloud   abcd

The Item API

Page 8: Library cloud   abcd

The Basics

Location: http://api.lib.harvard.edu/v2/items

Documentation: https://wiki.harvard.edu/confluence/display/LibraryStaffDoc/Library+Cloud

13 million Aleph open access metadata records in MODS format (XML or JSON)

Page 9: Library cloud   abcd

Design Goals

● Easy to use● Retain the richness of existing library

data

Page 10: Library cloud   abcd

Things you can do

Keyword searchhttp://api.lib.harvard.edu/v2/items?q=peanuts

Search scoped to specific fieldshttp://api.lib.harvard.edu/v2/items?title=peanutshttp://api.lib.harvard.edu/v2/items?title=peanuts&name=Schulz

Search with exact-match, wildcards, stemming.http://api.lib.harvard.edu/v2/items?title_exact=Peanutshttp://api.lib.harvard.edu/v2/items?title=p*nut

Page 11: Library cloud   abcd

More things you can doFacetshttp://api.lib.harvard.edu/v2/items.json?title=peanuts&facets=resourceType

http://api.lib.harvard.edu/v2/items.json?title=peanuts&facets=resourceType,subject.name

Paging and sortinghttp://api.lib.harvard.edu/v2/items.json?title=peanuts&sort.asc=resourceType

http://api.lib.harvard.edu/v2/items.json?title=peanuts&start=100&limit=50

Page 12: Library cloud   abcd

Things you can’t do (yet)

● Retrieve data in Dublin Core format● Search by date● Retrieve metadata about collections of

items● Create your own collections (tagging)● Retrieve metadata about usage of

items● View non-open access records

Page 13: Library cloud   abcd

What lies beneathor, life beyond the API

Page 14: Library cloud   abcd

LibraryCloud is more than the API

Metadata platformConnect information flows underneathProvide programmatic access on top

(this should be familiar)

Page 15: Library cloud   abcd

Design goals

● Enable access to data for use cases that aren’t well supported by an API

● Remove bottlenecks to implementing new features

● Supportable by Library Technology Services (LTS)● Align with Harvard University IT technology

strategy● Extendable to new data sources (internal and

external to Harvard) and new applications

Page 16: Library cloud   abcd

Logical Architecture

Page 17: Library cloud   abcd
Page 18: Library cloud   abcd

Sample Ingest Workflow

Page 19: Library cloud   abcd
Page 20: Library cloud   abcd

What does this get us?

Supportable technology stack: AWS (EC2, SQS), Java servicesScalable ingest processWell-defined interfaces throughout the ingest process

Page 21: Library cloud   abcd

What’s next

● More data sources (VIA, OASIS finding aids, DASH, HGL, usage data etc.)

● Collections API (read/write)● OAI-PMH endpoint

Page 22: Library cloud   abcd

How you can help

Use the API. Let us know what should be added, and what doesn’t work. Submit bug reports. Build cool stuff.

Contact: [email protected]

Page 24: Library cloud   abcd

Questions?