62
Research Scene setting: New directions for metadata workflows across libraries, archives, museums Günter Waibel Karen Smith- Yoshimura Merrilee Proffitt Thom Hickey University of Calgary February 10 th 2010

OCLC Research @ U of Calgary: New directions for metadata workflows across libraries, archives, museums

Embed Size (px)

DESCRIPTION

Presentation used as scene setting for 2 days worth of discussion around library, archive & museum convergence, metadata workflows and single search at the University of Calgary.

Citation preview

Page 1: OCLC Research @ U of Calgary: New directions for metadata workflows across libraries, archives, museums

Research

Scene setting: New directions for metadata workflows across libraries, archives, museums 

Günter WaibelKaren Smith-YoshimuraMerrilee ProffittThom Hickey

University of CalgaryFebruary 10th 2010

Page 2: OCLC Research @ U of Calgary: New directions for metadata workflows across libraries, archives, museums

ResearchUniversity of Calgary – February 10th 2010

2

Taylor Family Digital Library

Page 3: OCLC Research @ U of Calgary: New directions for metadata workflows across libraries, archives, museums

ResearchUniversity of Calgary – February 10th 2010

3

Sheila CannellDirector, Library ServicesUniversity of Edinburgh

LA

M:

Cam

pu

s

Con

text

Page 4: OCLC Research @ U of Calgary: New directions for metadata workflows across libraries, archives, museums

ResearchUniversity of Calgary – February 10th 2010

4

Sheila CannellDirector, Library ServicesUniversity of Edinburgh

“I'm really quite convinced that what we have to do is to boost up the whole area of what would be called the intellectual capital of the university, which I think

happens within our special collections, within our

archives (both those that are older and those that are

being created now), and in our museums. I'm really

interested in how we reposition the traditional

library into that more general area.”

LA

M:

Cam

pu

s

Con

text

Page 5: OCLC Research @ U of Calgary: New directions for metadata workflows across libraries, archives, museums

ResearchUniversity of Calgary – February 10th 2010

5

Anne Van CampDirector, Smithsonian Institution Archives

LA

M:

Op

portu

nity

&

Ch

alle

ng

e

Page 6: OCLC Research @ U of Calgary: New directions for metadata workflows across libraries, archives, museums

ResearchUniversity of Calgary – February 10th 2010

6

Anne Van CampDirector, Smithsonian Institution Archives

LA

M:

Op

portu

nity

&

Ch

alle

ng

e

We are 19 museums, 9 research centers that are

scattered across the world, we have 18 different archives, we have 1 library, but that library has 20 branches, and we have

1 zoo. So it's a rather complicated place, and you can imagine the challenge we face

in trying to bring all of this disparate information

together.

Page 7: OCLC Research @ U of Calgary: New directions for metadata workflows across libraries, archives, museums

ResearchUniversity of Calgary – February 10th 2010

7

Princeton

Smithsonian

Victoria & Albert

U of Edinburgh Yale

OCLC Research LAM Workshops

Page 8: OCLC Research @ U of Calgary: New directions for metadata workflows across libraries, archives, museums

ResearchUniversity of Calgary – February 10th 2010

8

Beth McKillopKeeper of Asia, Victoria & Albert Museum

On

e

Searc

h

Page 9: OCLC Research @ U of Calgary: New directions for metadata workflows across libraries, archives, museums

ResearchUniversity of Calgary – February 10th 2010

9

Beth McKillopKeeper of Asia, Victoria & Albert Museum

“The William Morris question remains with us. How do you show what the V&A has to offer to students interested in William Morris: designs by William Morris, archival materials by William Morris, and library books about William Morris.”

Sin

gle

S

earc

h

Page 10: OCLC Research @ U of Calgary: New directions for metadata workflows across libraries, archives, museums

ResearchUniversity of Calgary – February 10th 2010

10

V&A Core Systems Integration Project (CSIP)

Strategy• Data harvesting for

archival materials• Metasearch for library

and museum content

Page 11: OCLC Research @ U of Calgary: New directions for metadata workflows across libraries, archives, museums

ResearchUniversity of Calgary – February 10th 2010

11

Yale Office of Digital Assets and Infrastructure (ODAI)

slide courtesy of Meg Bellinger,Ann Green, Louis E. KingYale University's Model for Campus-wide Digital Content Strategy and Implementation (CNI 2009)

www.cni.org/tfms/2009b.fall/Presentations/cni_yale_bellinger.pdf

Page 12: OCLC Research @ U of Calgary: New directions for metadata workflows across libraries, archives, museums

ResearchUniversity of Calgary – February 10th 2010

12

Strategy• LAMs as OAI data

content providers• ODAI as harvester

Page 13: OCLC Research @ U of Calgary: New directions for metadata workflows across libraries, archives, museums

ResearchUniversity of Calgary – February 10th 2010

13

Outcome: SI Collections Search Center

Strategy• Central Index• Webservices

Page 14: OCLC Research @ U of Calgary: New directions for metadata workflows across libraries, archives, museums

ResearchUniversity of Calgary – February 10th 2010

14

Page 15: OCLC Research @ U of Calgary: New directions for metadata workflows across libraries, archives, museums

Research

New Directions for Metadata Workflows

Karen [email protected] Research

University of CalgaryFebruary 10, 2010

Page 16: OCLC Research @ U of Calgary: New directions for metadata workflows across libraries, archives, museums

ResearchUniversity of Calgary – February 10th 2010

16

Current Descriptive Metadata Practices2007 survey of 18 RLG Partners in the US and the UK

that: • had “multiple metadata creation centers” on campus• had some interaction among them

Brigham Young U.Center for Jewish HistoryChemical Heritage FoundationEmory Getty Research InstituteHarvardMinnesota Historical SocietyPrinceton Smithsonian Institution

Syracuse UniversityU. AberdeenUC Los AngelesU. CambridgeU. EdinburghU. MichiganU. MinnesotaU. WashingtonYale

Page 17: OCLC Research @ U of Calgary: New directions for metadata workflows across libraries, archives, museums

ResearchUniversity of Calgary – February 10th 2010

17

Workplace environment

Page 18: OCLC Research @ U of Calgary: New directions for metadata workflows across libraries, archives, museums

ResearchUniversity of Calgary – February 10th 2010

18

Materials handled

78% handle both published and unpublished items

Page 19: OCLC Research @ U of Calgary: New directions for metadata workflows across libraries, archives, museums

ResearchUniversity of Calgary – February 10th 2010

19

Types of materials described

Page 20: OCLC Research @ U of Calgary: New directions for metadata workflows across libraries, archives, museums

ResearchUniversity of Calgary – February 10th 2010

20

Metadata description tools

• 65% Integrated Library System (Innovative Interfaces Millennium, Exlibris/Aleph or Voyager, VTLS/Virtua, SirsiDynix/Unicorn or Horizon, etc.)

• 41% Digital Collections software (CONTENTDM, Luna Insight, MDID, etc.)

• 31% Institutional Repository software (DSpace, Digital Commons, ePrints, etc.)

• 31% Collections Management system (TMS, KE Emu, Willoughby, etc.)

• 25% Digital Asset Management system (Portfolio, TEAMS, MediaBin, ClearStory, etc.)

• 17% Archival Management system (Archivists’ Toolkit, Archon, etc.

Page 21: OCLC Research @ U of Calgary: New directions for metadata workflows across libraries, archives, museums

ResearchUniversity of Calgary – February 10th 2010

21

Data structures used

Page 22: OCLC Research @ U of Calgary: New directions for metadata workflows across libraries, archives, museums

ResearchUniversity of Calgary – February 10th 2010

22

Thesauri and controlled vocabularies

• Two thirds or more use:o Library of Congress Subject Headings, LC/NACO Name Authority File,

Art & Architecture Thesaurus• More than 30% use:

o Getty Thesaurus of Geographic Names, Thesaurus of Graphic Materials I: Subject Terms, Thesaurus of Graphic Materials II: Genre and Physical Characteristic Terms, Union List of Artist Names

• More than 10% use:o RBMS Controlled Vocabularies for Use in Rare Book and Special

Collections Cataloging, Geographic Names Information Services, Local

• Others used: o ICONCLASS, MeSH, LC Moving Image Materials, NCA Rules for

construction of personal and place names, UK Archival Thesaurus for subjects, Chemical Markup Language Dictionaries, Chicago Thesaurus, Universal Decimal Classification for Polar Libraries, Linnean names, UNESCO Thesaurus, International Astronomical Union’s Astronomy Thesaurus, British Educational Thesaurus, Society of American Archivists, Glossary of Archives and Records Terminology, Integrated Taxonomic Information Systems for archeological faunal collections

Page 23: OCLC Research @ U of Calgary: New directions for metadata workflows across libraries, archives, museums

ResearchUniversity of Calgary – February 10th 2010

23

45% build and maintain one or morelocal thesaurus

(especially archives, museums, institutional repositories, digital libraries)

Page 24: OCLC Research @ U of Calgary: New directions for metadata workflows across libraries, archives, museums

ResearchUniversity of Calgary – February 10th 2010

24

Image: from the end of “Raiders of the Lost Ark”

Who knows what’s hidden in our collections?

Page 25: OCLC Research @ U of Calgary: New directions for metadata workflows across libraries, archives, museums

ResearchUniversity of Calgary – February 10th 2010

25

What percentage of your collection do you estimate has not been adequately described – and is unlikely to be described without additional resources, funding, or both?

More than 50%31% to 50%11% to 30%0% - 10%

RLG Programs Descriptive Metadata Practices Survey Results: Data Supplement http://www.oclc.org/programs/publications/reports/2007-04.pdf

18%

35% 24%

22%

Page 26: OCLC Research @ U of Calgary: New directions for metadata workflows across libraries, archives, museums

ResearchUniversity of Calgary – February 10th 2010

26

Sharing guidelines and strategies with other units

Page 27: OCLC Research @ U of Calgary: New directions for metadata workflows across libraries, archives, museums

ResearchUniversity of Calgary – February 10th 2010

27

RLG Partners Metadata Creation Workflows Survey

Working Group conducted 2008 survey of RLG Partner heads or directors of units responsible for creating non-MARC metadata, either solely or in addition to MARC metadata.

• 134 responses from 67 RLG Partner institutions. • Two-thirds used the same staff to create both MARC and non-MARC metadata.• 80% reported that creating non-MARC metadata was part of their

“routine workflows”.

Page 28: OCLC Research @ U of Calgary: New directions for metadata workflows across libraries, archives, museums

ResearchUniversity of Calgary – February 10th 2010

28

Crosswalks between schemas used(70% have conversion tools)

Page 29: OCLC Research @ U of Calgary: New directions for metadata workflows across libraries, archives, museums

ResearchUniversity of Calgary – February 10th 2010

29

Page 30: OCLC Research @ U of Calgary: New directions for metadata workflows across libraries, archives, museums

ResearchUniversity of Calgary – February 10th 2010

30

Page 31: OCLC Research @ U of Calgary: New directions for metadata workflows across libraries, archives, museums

ResearchUniversity of Calgary – February 10th 2010

31

Streamlining metadata creation workflows• Descriptive metadata elements• Procedures• Standards• Tools• Authority control• Rights management• Repurposing data• Capturing data from other systems

Page 32: OCLC Research @ U of Calgary: New directions for metadata workflows across libraries, archives, museums

Research

Harnessing the power of terminologies

Merrilee [email protected] Research

University of CalgaryFebruary 10, 2010

Page 33: OCLC Research @ U of Calgary: New directions for metadata workflows across libraries, archives, museums

ResearchUniversity of Calgary – February 10th 2010

33

Making soup from the alphabet

GSAFD FAST LCSH LCTGM LCNAF GMGPC MeSH AAT ULAN TGN

CSH

Page 34: OCLC Research @ U of Calgary: New directions for metadata workflows across libraries, archives, museums

ResearchUniversity of Calgary – February 10th 2010

34

Terminologies “summit,” September 2007• Establish priorities based on a “strawman”

• Search optimization• Support terminologies management & sharing (including

local terms)• Support “social interactions” that add value• Value added intelligence (creating relationships between

terminologies)

• 15 participants (45+ views!)• Lively discussion and prioritization

Page 35: OCLC Research @ U of Calgary: New directions for metadata workflows across libraries, archives, museums

ResearchUniversity of Calgary – February 10th 2010

35

Key findings

Enhancing search terms“End users should be our primary concern – if they

don’t find anything, they won’t be back” – Amy Lucker, New York University

Taking “local” global“No published terminologies are going to meet all

needs. The reason we have local terminologies is because contributing to published terminologies is so difficult.” Jenn Riley, Indiana University

Page 36: OCLC Research @ U of Calgary: New directions for metadata workflows across libraries, archives, museums

ResearchUniversity of Calgary – February 10th 2010

36

Enhancing search

• Using web services to…• Suggesting search terms• Adding related terms on the fly

• Depends on implementation (it’s up to you)

Query from institution in XML, passed to OCLC, XML passed back to institution, XML parsed by institution, presented in interface.

Page 37: OCLC Research @ U of Calgary: New directions for metadata workflows across libraries, archives, museums

ResearchUniversity of Calgary – February 10th 2010

37

Page 38: OCLC Research @ U of Calgary: New directions for metadata workflows across libraries, archives, museums

ResearchUniversity of Calgary – February 10th 2010

38

Page 39: OCLC Research @ U of Calgary: New directions for metadata workflows across libraries, archives, museums

ResearchUniversity of Calgary – February 10th 2010

39

Page 40: OCLC Research @ U of Calgary: New directions for metadata workflows across libraries, archives, museums

ResearchUniversity of Calgary – February 10th 2010

40

Page 41: OCLC Research @ U of Calgary: New directions for metadata workflows across libraries, archives, museums

ResearchUniversity of Calgary – February 10th 2010

41

Page 42: OCLC Research @ U of Calgary: New directions for metadata workflows across libraries, archives, museums

ResearchUniversity of Calgary – February 10th 2010

42

“Local” terminologies

• Additional interviews conducted with institutions who maintained “local” terminologies

• Largely unstructured, some based on existing structures (primarily AAT)

• Maintaining local terms because…• Barriers to contributing to existing structures• Perception that terms were truly local

• All interested in a cloud environment • Ease creation and maintenance of terms• Allow terms to flow into local systems • Facilitate contribution to established terminologies

Page 43: OCLC Research @ U of Calgary: New directions for metadata workflows across libraries, archives, museums

Research

Names, Identities Hub,VIAF, WorldCat Identities

Thom [email protected] Research

University of CalgaryFebruary 10, 2010

Page 44: OCLC Research @ U of Calgary: New directions for metadata workflows across libraries, archives, museums

ResearchUniversity of Calgary – February 10th 2010

44

Names touch everything….

AuthorsPublishersInstitutions

Musicians

Families

Government agencies

ArtistsFictional characters

Actors

Plant discoverers

Corporations

Pseudonyms

Illustrators

Editors

Translators

Correspondents

Historical figures

Directors

Scientists

Architects

Animals

Politicians

Inventors

Associations

Cartographers

Page 45: OCLC Research @ U of Calgary: New directions for metadata workflows across libraries, archives, museums

ResearchUniversity of Calgary – February 10th 2010

45

Names can be ambiguous…

“John Adams”… the US president? … the US composer?… the British

mathematician & astronomer?

… the British nuclear physicist?

… or someone else?

Page 46: OCLC Research @ U of Calgary: New directions for metadata workflows across libraries, archives, museums

ResearchUniversity of Calgary – February 10th 2010

46

Page 47: OCLC Research @ U of Calgary: New directions for metadata workflows across libraries, archives, museums

ResearchUniversity of Calgary – February 10th 2010

47

Networking NamesAdvisory Group

Page 48: OCLC Research @ U of Calgary: New directions for metadata workflows across libraries, archives, museums

ResearchUniversity of Calgary – February 10th 2010

48

Cooperative Identities Hub• Framework to concatenate and merge

authoritative information• Gateway to all forms of names without

preferring one form over another

• Use social networking model• Provide a switch to extract relevant information for re-use in own contexts• Create federated trust environment to authenticate and authorize contributors

Page 49: OCLC Research @ U of Calgary: New directions for metadata workflows across libraries, archives, museums

ResearchUniversity of Calgary – February 10th 2010

49

Hub objectives• Increase metadata creation efficiency• Easier to identify identity regardless of

language or discipline

• Determine preferred form within own context• Enable contributing agencies to augment own data resources

• Expose information about personal and corporate bodies beyond original contexts

Page 50: OCLC Research @ U of Calgary: New directions for metadata workflows across libraries, archives, museums

ResearchUniversity of Calgary – February 10th 2010

50

Hub functions• Searches by both people and software

applications

• Edit: Add information, merge, split, flag for deletion

• Create new entities

• Batch update

• Discussion

• Audit trail and rollback

Page 51: OCLC Research @ U of Calgary: New directions for metadata workflows across libraries, archives, museums

ResearchUniversity of Calgary – February 10th 2010

51

Virtual International Authority File

• Matches names across 20 authority files• 12.5 million records• 10 million names

Page 52: OCLC Research @ U of Calgary: New directions for metadata workflows across libraries, archives, museums

ResearchUniversity of Calgary – February 10th 2010

52

One persona, many representations

http://viaf.org/viaf/95216565

Page 53: OCLC Research @ U of Calgary: New directions for metadata workflows across libraries, archives, museums

ResearchUniversity of Calgary – February 10th 2010

53

Page 54: OCLC Research @ U of Calgary: New directions for metadata workflows across libraries, archives, museums

ResearchUniversity of Calgary – February 10th 2010

54

Other efforts

• ISNI• International Standard Name Identifier

• ORCID• Open Researcher and Contributor ID

• WorldCat Identities• A page for every name in WorldCat

Page 55: OCLC Research @ U of Calgary: New directions for metadata workflows across libraries, archives, museums

ResearchUniversity of Calgary – February 10th 2010

55

International StandardName Identifier• Driven by ‘rights’ holders• Need standard way to exchange information• OCLC

• Participating in standards effort• Matching of test files• VIAF may be used as base file

Page 56: OCLC Research @ U of Calgary: New directions for metadata workflows across libraries, archives, museums

ResearchUniversity of Calgary – February 10th 2010

56

Open Researcher and Contributor ID• Driven by scientific publishers• Need to exchange information, make systems

more usable• Mostly interested authors of journal literature• OCLC

• Participating in group• Possibly sharing metadata• Possibly working on matching

Page 57: OCLC Research @ U of Calgary: New directions for metadata workflows across libraries, archives, museums

ResearchUniversity of Calgary – February 10th 2010

57

WorldCat Identities

• A page for every name• 25 million personal names• 30 million names total• Includes imaginary characters, horses, etc.

Page 58: OCLC Research @ U of Calgary: New directions for metadata workflows across libraries, archives, museums

ResearchUniversity of Calgary – February 10th 2010

58

Page 59: OCLC Research @ U of Calgary: New directions for metadata workflows across libraries, archives, museums

ResearchUniversity of Calgary – February 10th 2010

59

Page 60: OCLC Research @ U of Calgary: New directions for metadata workflows across libraries, archives, museums

ResearchUniversity of Calgary – February 10th 2010

60

Page 61: OCLC Research @ U of Calgary: New directions for metadata workflows across libraries, archives, museums

ResearchUniversity of Calgary – February 10th 2010

61

Page 62: OCLC Research @ U of Calgary: New directions for metadata workflows across libraries, archives, museums

ResearchUniversity of Calgary – February 10th 2010

62

Collaboration Continuum