Acquisitions, Everywhere Modeling an acquisitions data standard to connect a distributed environment Eric Hanson, North Carolina State University Libraries Paul Lightcap, Florida State University Libraries Matthew Miguez, Florida State University Libraries Charleston Conference, 2015-11-07


Acquisitions, Everywhere

Modeling an acquisitions data standard to connect a distributed environment

Eric Hanson, North Carolina State University Libraries

Paul Lightcap, Florida State University Libraries

Matthew Miguez, Florida State University Libraries

Charleston Conference, 2015-11-07


Genesis of an acquisitions standard

▪ Substantial work on standards development for:

• resource description

• e-resource management

• discovery

…but where is acquisitions in the mix?*

▪ Is acquisitions too complex? Too locally customized? Too ad hoc?

* We’ll get to the NISO Cost of Resource Exchange (CORE) in just a moment!

Solution: a lightweight standard composed of elements already in use in legacy acquisitions systems, modeled in multiple schema and systems


The legacy challenge, compounded

▪ Expectation for legacy system migration: acquisitions data loss

▪ New challenges:

• new systems and models (library services platforms)

• multiple systems and data maintenance (ILS, ERMS, DAMS)

• data-driven culture

• increasing scrutiny of expenditures (especially of public funds)

• new content that just doesn’t fit within the ILS


A lightweight standard

▪ Proposed: a core set of elements, already used within legacy systems to manage acquisitions

Acquisitions Core Element | Authority/Source | Example
CustomerID | VIAF | http://viaf.org/viaf/125490841
AccountNumber | (vendor) | 123456
VendorID | VIAF or NCSU ONLD | http://www.lib.ncsu.edu/ld/onld/00000728
InvoiceID | (vendor) | 001A
InvoiceDate | ISO 8601 | 2015-08-01
Invoiceline | (vendor) | 001
OrderID | (ILS/system or vendor) | 8675309-1
OrderFiscalCycle | ISO 8601 | 2015
MaterialType | IANA Media Types | application/http
OrderType | (ILS/system) | One-time
FundID | (ILS/system) | FAKABTL
InvoiceFiscalCycle | ISO 8601 | 2016
TotalPaid | ISO 4217 | 127.50
ListPrice | ISO 4217 | 150.00
Currency | ISO 4217 | USD

Note: all examples are fabricated, excepting URIs; formatting adheres to indicated standard
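
To make the element set concrete, here is a minimal Python sketch of one record built from the fabricated example values in the table, with a few shape checks. The `validate` function, its messages, and the decision to hold every value as a string are assumptions made for illustration; the slides do not define a validation procedure, and the currency check only tests for a three-letter code rather than membership in ISO 4217.

```python
import json
import re
from datetime import date
from decimal import Decimal, InvalidOperation

# Element names copied from the table above.
CORE_ELEMENTS = (
    "CustomerID", "AccountNumber", "VendorID", "InvoiceID", "InvoiceDate",
    "Invoiceline", "OrderID", "OrderFiscalCycle", "MaterialType", "OrderType",
    "FundID", "InvoiceFiscalCycle", "TotalPaid", "ListPrice", "Currency",
)

# The fabricated example values from the table, kept as strings.
EXAMPLE = {
    "CustomerID": "http://viaf.org/viaf/125490841",
    "AccountNumber": "123456",
    "VendorID": "http://www.lib.ncsu.edu/ld/onld/00000728",
    "InvoiceID": "001A",
    "InvoiceDate": "2015-08-01",
    "Invoiceline": "001",
    "OrderID": "8675309-1",
    "OrderFiscalCycle": "2015",
    "MaterialType": "application/http",
    "OrderType": "One-time",
    "FundID": "FAKABTL",
    "InvoiceFiscalCycle": "2016",
    "TotalPaid": "127.50",
    "ListPrice": "150.00",
    "Currency": "USD",
}


def validate(record):
    """Hypothetical helper: return a list of problems (empty means all checks passed)."""
    problems = [f"missing element: {name}" for name in CORE_ELEMENTS
                if name not in record]
    if problems:
        return problems
    try:
        date.fromisoformat(record["InvoiceDate"])  # ISO 8601 calendar date
    except ValueError:
        problems.append("InvoiceDate is not an ISO 8601 calendar date")
    # Shape check only: this does not consult the ISO 4217 code list.
    if not re.fullmatch(r"[A-Z]{3}", record["Currency"]):
        problems.append("Currency is not a three-letter code")
    for name in ("TotalPaid", "ListPrice"):
        try:
            Decimal(record[name])
        except InvalidOperation:
            problems.append(f"{name} is not a decimal amount")
    return problems


if __name__ == "__main__":
    print(validate(EXAMPLE))
    print(json.dumps(EXAMPLE, indent=2))  # one system-agnostic serialization
```

Keeping amounts as decimal strings avoids floating-point rounding when the record moves between systems.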


A standard solution: solving problems

▪ Solves problems of duplication in multiple systems and data drift

▪ Allows for non-duplicative application of acquisitions data across all systems

▪ Facilitates complex reporting within all of a library’s systems, as well as between multiple libraries (as within consortia)

▪ Offers a tentative step toward rethinking acquisitions workflows within a distributed environment


A standard solution to duplication and data drift

Or, what if this (our current situation)…

System | Title | Publisher | Vendor | Fund | Invoice (year) | TotalPaid
ILS | E-Journal A | Wiley-Blackwell | Wiley/Blackwell | FAKEJTL | 2015 | 1500.00
ILS | Book A | Wiley-Blackwell | Wiley/Blackwell | FAKABTL | 2015 | 150.00
ERMS | E-Journal A(2) (title change) | Wiley-Blackwell | Wiley-Blackwell | FAKEJTL | 2015 | 1500.00
Spreadsheet | E-Journal A(3) (another title change) | Wiley-Blackwell | Wiley | FAKEJTL | 2015 | 1500.00

Note: all examples are fabricated


…became this:


System | Title | Publisher | Vendor | Fund | Invoice (year) | TotalPaid
ILS | E-Journal A | Wiley-Blackwell | Wiley/Blackwell | FAKEJTL | 2015 | 1500.00
ILS | Book A | http://viaf.org/viaf/125715895 | http://viaf.org/viaf/125715895 | FAKABTL | 2015 | 150.00
ERMS | E-Journal A2 (title change) | http://viaf.org/viaf/125715895 | http://viaf.org/viaf/125715895 | FAKEJTL | 2015 | 1500.00
Spreadsheet | E-Journal A3 (another title change) | Wiley-Blackwell | Wiley | FAKEJTL | 2015 | 1500.00

Note: all examples are fabricated, except URIs

…or even this:


A standard solution for new content types


A standard solution for complex reporting

System | Title | Publisher* | CustomerID | VendorID | FundID | InvoiceFiscalCycle | TotalPaid
ILS | Print Journal A | Wiley-Blackwell | http://viaf.org/viaf/125490841 | http://www.lib.ncsu.edu/ld/onld/00000728 | FAKEJTL | 2015 | 800.00
ILS | Book A | Wiley-Blackwell | http://viaf.org/viaf/125490841 | http://www.lib.ncsu.edu/ld/onld/00000728 | FAKABTL | 2015 | 150.00
ERMS | E-Journal A | Wiley-Blackwell | http://viaf.org/viaf/125490841 | http://www.lib.ncsu.edu/ld/onld/00000728 | FAKEJTL | 2015 | 1600.00
ILS | Print Journal A | Wiley-Blackwell | http://viaf.org/viaf/131675600 | http://www.lib.ncsu.edu/ld/onld/00000682 | LDEJL-GHSS | 2015 | 700.00
ERMS | E-Journal A | Wiley-Blackwell | http://viaf.org/viaf/131675600 | http://www.lib.ncsu.edu/ld/onld/00000728 | LDEJL-GHSS | 2015 | 1400.00
Digital Library | Data set A | AfriGIS | http://viaf.org/viaf/125490841 | http://www.lib.ncsu.edu/ld/onld/00000728 | FAKEJTL | 2015 | 500.00
Institutional Repository | Scanned newspaper A | SEP S.p.A | http://viaf.org/viaf/125490841 | http://www.lib.ncsu.edu/ld/onld/00000728 | FAKEJTL | 2015 | 1200.00
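
Once every system records the same CustomerID and VendorID URIs, a cross-system report reduces to grouping on those columns. The following Python sketch totals TotalPaid over the rows of the table above. The function `total_paid_by` and the constant names are hypothetical, chosen for this illustration, and the rows are hard-coded rather than harvested from real systems.

```python
from collections import defaultdict
from decimal import Decimal

CUSTOMER_A = "http://viaf.org/viaf/125490841"
CUSTOMER_B = "http://viaf.org/viaf/131675600"
VENDOR_728 = "http://www.lib.ncsu.edu/ld/onld/00000728"
VENDOR_682 = "http://www.lib.ncsu.edu/ld/onld/00000682"

# The seven fabricated rows from the table, reduced to the columns used here.
ROWS = [
    {"System": "ILS", "CustomerID": CUSTOMER_A, "VendorID": VENDOR_728, "TotalPaid": "800.00"},
    {"System": "ILS", "CustomerID": CUSTOMER_A, "VendorID": VENDOR_728, "TotalPaid": "150.00"},
    {"System": "ERMS", "CustomerID": CUSTOMER_A, "VendorID": VENDOR_728, "TotalPaid": "1600.00"},
    {"System": "ILS", "CustomerID": CUSTOMER_B, "VendorID": VENDOR_682, "TotalPaid": "700.00"},
    {"System": "ERMS", "CustomerID": CUSTOMER_B, "VendorID": VENDOR_728, "TotalPaid": "1400.00"},
    {"System": "Digital Library", "CustomerID": CUSTOMER_A, "VendorID": VENDOR_728, "TotalPaid": "500.00"},
    {"System": "Institutional Repository", "CustomerID": CUSTOMER_A, "VendorID": VENDOR_728, "TotalPaid": "1200.00"},
]


def total_paid_by(rows, *keys):
    """Sum TotalPaid for each distinct combination of the given columns."""
    totals = defaultdict(Decimal)  # Decimal() is zero, so missing groups start at 0
    for row in rows:
        group = tuple(row[key] for key in keys)
        totals[group] += Decimal(row["TotalPaid"])
    return dict(totals)


if __name__ == "__main__":
    # Spending per library, regardless of which system holds the record.
    for group, total in total_paid_by(ROWS, "CustomerID").items():
        print(group, total)
    # Spending per library and vendor, as a consortium report might need.
    for group, total in total_paid_by(ROWS, "CustomerID", "VendorID").items():
        print(group, total)
```

Grouping on labels instead of URIs would split these totals wherever a system spells the vendor name differently.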


A tentative step forward

▪ Acquisitions data bundled to resource description across varying schemas and within various systems

▪ System-agnostic serialization technologies: XML, JSON

▪ Possibility to utilize OAI-PMH to harvest data across multiple systems, so that this request:

OAI-PMH_root_URL?verb=GetRecord&metadataPrefix=mods&identifier=oai.fsu.digital.flvc.org:fsu_7970


Returns this:
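
The request side of this exchange can be sketched in a few lines of Python. The sketch only assembles a GetRecord URL using the standard OAI-PMH arguments `verb`, `metadataPrefix`, and `identifier`; it does not contact a repository. The base URL `https://example.org/oai` is a placeholder, since the slide leaves the root URL unspecified, and `get_record_url` is a hypothetical helper name.

```python
from urllib.parse import urlencode

# Placeholder only: the slide does not give the repository's OAI-PMH root URL.
BASE_URL = "https://example.org/oai"


def get_record_url(base_url, identifier, metadata_prefix="mods"):
    """Build an OAI-PMH GetRecord request URL for one item."""
    query = urlencode({
        "verb": "GetRecord",
        "metadataPrefix": metadata_prefix,
        "identifier": identifier,
    })
    return f"{base_url}?{query}"


if __name__ == "__main__":
    # Identifier copied from the request shown on the slide.
    print(get_record_url(BASE_URL, "oai.fsu.digital.flvc.org:fsu_7970"))
```

`urlencode` percent-encodes the colon in the identifier, which a conforming server decodes back to the original value.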


Identity Management With Linked Data


● Identity management becomes important as we seek interoperability between systems through this acquisitions metadata schema

● Different systems use different labels for the same organization

○ Royal Society of Chemistry

○ The Royal Society of Chemistry

○ Royal Society of Chemistry Publishing

○ Royal Society of Chemistry Pub

○ Royal Society of Chemistry (RSC)


Uniform Resource Identifiers (URIs)

● Moving beyond labels and using Uniform Resource Identifiers (URIs) allows us to unambiguously identify organizations in various data sets

URIs for the Royal Society of Chemistry

○ Virtual International Authority File (VIAF) URI: http://viaf.org/viaf/128803576

○ Library of Congress Name Authority File (LCNAF) URI: http://id.loc.gov/authorities/names/n81122723

○ North Carolina State University Organization Name Linked Data (ONLD) URI: http://www.lib.ncsu.edu/ld/onld/00000010


Overview of NCSU ONLD

● NCSU Organization Name Linked Data (ONLD) is based on the NCSU Organization Name Authority, a tool used to manage the variant forms of name for serial and e-resource publishers, providers, and vendors in E-Matrix, our locally developed electronic resource management system.

● Includes links to other data sets, including VIAF, LCNAF, ISNI, DBpedia, and Freebase.

● ONLD served as the seed data for organizations in the Global Open Knowledgebase (GOKb): http://gokb.org/

● Also used by AgriProfiles (formerly AgriVIVO) to help manage organization names: http://www.agriprofiles.net/page/what-new-agriprofiles


Crosswalking Labels to URIs

● The NCSU ONLD can be used to crosswalk multiple labels to a single URI

URI: http://www.lib.ncsu.edu/ld/onld/00000010

Preferred Label: Royal Society of Chemistry

Alternate Labels:

○ The Royal Society of Chemistry

○ Royal Society of Chemistry Publishing

○ Royal Society of Chemistry Pub

○ Royal Society of Chemistry (RSC)

Links to URIs From Other Data Sets:

○ http://viaf.org/viaf/128803576

○ http://id.loc.gov/authorities/names/n81122723

[Diagram: the labels used in the ILS, Title List, and ERMS all map to the single ONLD URI]

Label: The Royal Society of Chemistry

Label: Royal Society of Chemistry Publishing

Label: Royal Society of Chemistry
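
The crosswalk described above amounts to a lookup from every known label to one URI. The Python sketch below builds that lookup from the Royal Society of Chemistry entry shown on this slide. The dictionary layout, the function names, and the case- and whitespace-insensitive matching are assumptions made for illustration; they do not reflect how ONLD itself is stored or queried.

```python
# One ONLD entry, transcribed from the slide into a plain dictionary.
# The key names are hypothetical; ONLD is published as linked data.
RSC_ENTRY = {
    "uri": "http://www.lib.ncsu.edu/ld/onld/00000010",
    "preferred_label": "Royal Society of Chemistry",
    "alternate_labels": [
        "The Royal Society of Chemistry",
        "Royal Society of Chemistry Publishing",
        "Royal Society of Chemistry Pub",
        "Royal Society of Chemistry (RSC)",
    ],
    "same_as": [
        "http://viaf.org/viaf/128803576",
        "http://id.loc.gov/authorities/names/n81122723",
    ],
}


def normalize(label):
    """Assumed matching rule: ignore case and collapse runs of whitespace."""
    return " ".join(label.casefold().split())


def build_crosswalk(entries):
    """Map every preferred and alternate label to its entry's URI."""
    crosswalk = {}
    for entry in entries:
        for label in [entry["preferred_label"], *entry["alternate_labels"]]:
            crosswalk[normalize(label)] = entry["uri"]
    return crosswalk


def label_to_uri(crosswalk, label):
    """Return the URI for a local label, or None if the label is unknown."""
    return crosswalk.get(normalize(label))


if __name__ == "__main__":
    crosswalk = build_crosswalk([RSC_ENTRY])
    for local_label in ("The Royal Society of Chemistry",
                        "Royal Society of Chemistry Publishing",
                        "Royal Society of Chemistry"):
        print(local_label, "->", label_to_uri(crosswalk, local_label))
```

Returning `None` for an unknown label leaves room for manual review instead of guessing at a match.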


Crosswalking Labels to URIs

● The preferred and alternate labels in the NCSU ONLD can also be used to crosswalk local labels for a vendor to VIAF and LCNAF URIs

[Diagram: labels from the ILS, Vendor Title List, and ERMS are matched against NCSU ONLD alternate labels; NCSU ONLD, VIAF, and LCNAF each supply URIs]

Label: The Royal Society of Chemistry

Label: Royal Society of Chemistry Publishing

Label: Royal Society of Chemistry


Using URIs to link data sources

● Using URIs rather than labels allows for easier integration between multiple data sets

Each source (ILS, Vendor Title List, ERMS) carries the same URI alongside its own local label:

○ URI: http://viaf.org/viaf/128803576, Label: The Royal Society of Chemistry

○ URI: http://viaf.org/viaf/128803576, Label: Royal Society of Chemistry Publishing

○ URI: http://viaf.org/viaf/128803576, Label: Royal Society of Chemistry

Data elements by source:

○ ILS: Title, Call Number, Purchase Date, Amount Paid

○ ERMS: Title, License Information, Usage

○ Vendor Title List: Title, List Price, Package, Vendor Subject Classification


Using URIs to link data sources

● Use case: building vendor-specific reports based on elements from multiple data sources

○ Comparing vendor list price against the ILS payment data to calculate savings

○ Calculating the amount paid for titles covered by a particular vendor license

○ Analyzing usage by subject area based on both ILS call number and vendor subject classification

[Diagram: ILS, Vendor Title List, and ERMS records joined on the shared URI http://viaf.org/viaf/128803576, whatever local label each source uses]

○ ILS: Title, Call Number, Purchase Date, Amount Paid

○ ERMS: Title, License Information, Usage

○ Vendor Title List: Title, List Price, Package, Vendor Subject Classification
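
The first use case, comparing vendor list price with ILS payment data, can be sketched as a join on the shared vendor URI and title. Everything in this Python example is an illustrative assumption: the row layout, the field names, the function `savings_report`, the `http://example.org/vendor/other` URI, and the made-up "Book A" and "Book B" rows, whose amounts reuse the fabricated 150.00 and 127.50 values from the core element table.

```python
from decimal import Decimal

RSC = "http://viaf.org/viaf/128803576"

# Made-up rows for illustration only; field names are hypothetical.
ILS_PAYMENTS = [
    {"vendor_uri": RSC, "title": "Book A", "amount_paid": "127.50"},
    {"vendor_uri": "http://example.org/vendor/other", "title": "Book B",
     "amount_paid": "80.00"},
]
VENDOR_TITLE_LIST = [
    {"vendor_uri": RSC, "title": "Book A", "list_price": "150.00"},
]


def savings_report(payments, title_list, vendor_uri):
    """Join ILS payments to vendor list prices for a single vendor URI."""
    list_prices = {
        row["title"]: Decimal(row["list_price"])
        for row in title_list
        if row["vendor_uri"] == vendor_uri
    }
    report = []
    for row in payments:
        # Skip other vendors and titles that have no list price to compare.
        if row["vendor_uri"] != vendor_uri or row["title"] not in list_prices:
            continue
        paid = Decimal(row["amount_paid"])
        price = list_prices[row["title"]]
        report.append({
            "title": row["title"],
            "list_price": price,
            "amount_paid": paid,
            "savings": price - paid,
        })
    return report


if __name__ == "__main__":
    for line in savings_report(ILS_PAYMENTS, VENDOR_TITLE_LIST, RSC):
        print(line)
```

Matching on title is a simplification; a production report would more likely join on a stable title identifier.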


Acquisitions standards: next steps

▪ Solicit feedback from the community of practice on a refined element set

▪ Continue to develop proofs of concept and use cases

▪ Consider moving ahead with actual standard development


Standards!

Thank you!

Paul Lightcap, [email protected]

Matthew Miguez, [email protected]

Eric Hanson, [email protected]