UNC visit

Preview:

DESCRIPTION

Slides prepared for a guest appearance at Jane Greenberg's metadata class at the University of North Carolina, Chapel Hill. Delivered Monday, Dec. 6, 2010.

Citation preview

Libraries, Standards, and the Web

Where to Aim Our Eyes and Attention

What We’ll Talk About

Part 1:What’s up with Library Standards? Why is there so much confusion? How will we move forward? Where’s the leadership? The

participation? Part 2: The RDA Vocabs, a case study

What can we learn from this? Can we replicate this model? How? Stepping up to the plate …UNC Visit, 12/6/10 2

Describing Library Standards

Library standards can be categorized to help us understand how they fit together

In general, functional categories are the most useful

Sometimes what the standards-makers assert isn’t the whole story …

UNC Visit, 12/6/10 3

What kind of standards?

Data Structure Standards (element sets; schemes, schemas, or schemata) (examples: Dublin Core, MODS, CDWA, VRA, IEEE-LOM, RDA Vocabularies)

Data Content Standards (cataloging rules, input standards, best practice guides) (examples: AACR2, CCO, CDPDCMBP, RDA Text)

Data Value Standards (controlled vocabularies, encoding schemes) (examples: LCSH, AAT, TGN, LCTGM, ULAN, W3CDTF, DCMIType)

Data Format / Technical Interchange Standards (encoding standards for machine processing and interchange) (examples: XML, SGML, MARC)

Data Presentation Standards (examples: ISBD punctuation, CSS and/or XSLT for display, OPAC display settings) 

UNC Visit, 12/6/10 4

Questions to Ponder

How can MODS (a derivative of MARC) be a Data Structure Standard if MARC isn’t?

Should MARC be categorized along with SGML (largely used for text markup)? MARC’s data model started out as the

catalog card What are the most important

criteria for this categorization? UNC Visit, 12/6/10 5

Tectonic Shifts Ahead …

LITA surveyed its membership about their thinking about standards

LITA as a TF working on some recommendations aimed at increasing participation by librarians in standards activities NISO/ISO has traditionally been LITA’s focus in

the standards world—that is likely to broaden ISO increasingly seen as problematic because

of their business model Many web standards coming out of W3C

UNC Visit, 12/6/10 6

Librarians & Standards

• The LITA survey confirms that lack of participation by librarians has to do with: ▪ institutional support (time, money) ▪ individual feelings of competence

The changes in the environment challenge us all to re-think how we relate to standards▪ … as well as who we think should be in charge

of building them

UNC Visit, 12/6/10 7

RDA Vocabularies

The real story

Where is this taking us?

Join the party!

UNC Visit, 12/6/10 8

A Brief History

It all started in London, the last day of April 2007 …

UNC Visit, 12/6/109

What Was Accomplished

The participants agreed that DCMI and the Joint Steering Committee for the Development of RDA should work together to: Develop an RDA Element Vocabulary Expose RDA Value Vocabularies Develop an RDA Application Profile, based

on FRBR and FRAD The first two are largely complete; the

third is startedUNC Visit, 12/6/10 10

The General Strategy

We used the Semantic Web as our “mental model”

Wanted to create a “bridge” between XML and RDF to support innovation in the library community as a whole, not just those at the cutting edge or the trailing edge

We registered the FRBR entities as classes in a ‘FRBR in RDA’ vocabulary, to enable specific relationships between RDA properties and FRBR

IFLA has followed suit using the Open Metadata Registry to add the ‘official’ FRBR entities, FRAD, and ISBD

This provides exciting opportunities to relate all the vocabularies together

UNC Visit, 12/6/10 11

Structure: Rationale & Decisions

Property and value vocabularies registered on the Open Metadata Registry (formerly the NSDL Registry): http://metadataregistry.org/rdabrowse.htm

Used RDF Schema (RDFS), Simple Knowledge Organisation System (SKOS) and Web Ontology Language (OWL) Registry provides human and machine usable

interfaces All vocabularies have change history and

versioning capabilitiesUNC Visit, 12/6/10 12

Our Methodology

Started with the Entity Relationship Diagrams produced by ALA Publishing The latest iterations are available on the RDA

Toolkit Site (http://www.rdatoolkit.org/background)

ERDs are organized in three groups: core, enhanced, special

These were developed and iterated with no change management strategy, so each new iteration had to be checked carefully to spot changes

ERDs built with a very XML view of the worldUNC Visit, 12/6/10 13

The Basic ‘WHY’?

We think of the ‘generalized’ RDA properties as the real RDA vocabulary The ‘bounded’ properties should be seen as the

first pass at an Application Profile Extensions can be built more usefully from the

generalized properties Mapping will be cleaner using the generalized

properties (since most properties mapped to or mapped from will not be based on FRBR)

Generalized properties much more acceptable to non-library implementers (not often using FRBR)

UNC Visit, 12/6/10 14

Taking a Look …

FRBR in RDA Vocabulary declared as classes

RDA Properties declared as a ‘generalized’ vocabulary, with no explicit relationship to FRBR entities

Subproperties for the generalized elements may be explicitly related to FRBR entities (using ‘domain’) Label/Name includes (Work) or other class to

provide unique name (unless the entity name already appears in the name of the property)UNC Visit, 12/6/10 15

Property (Generalized, no FRBR relationship)

Subproperty (with relationship to one FRBR entity)

FRBR Entity

SemanticWeb

Library ApplicationsThe Simple Case:

One Property-- One FRBR Entity

UNC Visit, 12/6/10 16

Book format

Book format (Manifestation)

Manifestation

SemanticWeb

Library ApplicationsThe Simple Case:

One Property-- One FRBR Entity

UNC Visit, 12/6/10 17

UNC Visit, 12/6/10 18

http://RDVocab.info/Elements/bookFormatManifestation

Property (Generalized, no FRBR relationship)

Subproperty (with relationship to one FRBR entity)

Subproperty (with relationship to one FRBR entity)

FRBR Entity

FRBR Entity

SemanticWeb

Library ApplicationsThe Not-So-Simple Case: One Property—more than

One FRBR Entity

UNC Visit, 12/6/10 19

Extent

Extent (Item)

Extent (Manifestation)

FRBR Item

FRBR Manifestation

SemanticWeb

Library ApplicationsThe Not-So-Simple Case: One Property—more than

One FRBR Entity

UNC Visit, 12/6/10 20

UNC Visit, 12/6/10 21

Roles: Attributes or Properties? In 2005, the DC Usage Board worked with

LC to build a formal representation of the MARC Relators so that these terms could be used with DC

This work provided a template for the registration of the role terms in RDA (in Appendix I) and, by extension, the other RDA relationships Role and relationship properties are registered

at the same level as elements, rather than as attributes (as MARC does with relators, and RDA does in its XML)

UNC Visit, 12/6/10 22

“Super” Property

Subproperty (with relationship to one FRBR entity)

Subproperty (Generalized, no FRBR relationship)

FRBR Entity

SemanticWeb

Library Applications The Roles Case: Properties, Subproperties

and FRBR Entities

Mapping,Etc.

UNC Visit, 12/6/10 23

RDA:Creator

RDArole:Composer (Work)

RDArole:Composer

Work

SemanticWeb

Library Applications The Roles Case: Properties, Subproperties

and FRBR Entities

Mapping,Etc.

UNC Visit, 12/6/10 24

Aggregated Statements RDA sets up Publication, Distribution, Manufacture

and Production statements very much the way they have been done since catalog card days:Assumed aggregation of Place, Name and Date are

obvious leftovers from catalog cards, and are not necessary to enable indexing or display of those elements together if libraries want to do that

We viewed those aggregations as ‘Syntax Encoding Schemes’ and built in ways to accommodate them within the bounded propertiesThose using the generalized properties (outside

libraries, usually) need not be constrained by these traditional aggregations of properties

UNC Visit, 12/6/10 25

What Does This Structure Buy Us?

Release from the tyranny of records Potential for use with a variety of

encodings Opportunity to re-think how we build

and share data Potential for sharing data beyond the

library silo A challenge to our old notions of

what library data can do and should be doingUNC Visit, 12/6/10 26

UNC Visit, 12/6/10 27

ID=23456 hasAuthor “Kurt Vonnegut”

ID=23456 hasPreferredTitle “Bluebeard,

a novel”

ID=23456 isFormOfWork “Novel”

ID=23456 hasOriginalLanguage

“English”

ID=23456

hasPlaceOfPublication “New

York”

ID=23456

hasStatementOfEdition “1 st

trade edition”

ID=23456 hasL

anguageOfExp

ress

ion

“Englis

h”

ID=23456 hasPublisher “Delacorte

Press”ID=23456 hasPublicationDate

“1987”

Statements on the Floor?

UNC Visit, 12/6/10 28

ID=23456 hasAuthor “Kurt Vonnegut”

ID=23456 hasPreferredTitle “Bluebeard,

a novel”

ID=23456 isFormOfWork “Novel”

ID=23456 hasOriginalLanguage

“English”

ID=23456

hasPlaceOfPublication “New

York”

ID=23456

hasStatementOfEdition “1 st

trade edition”

ID=23456 hasLanguageOfExpression

“English”

ID=23456 hasPublisher “Delacorte

Press”

ID=23456 hasPublicationDate

“1987”

Is This Really Chaos?

UNC Visit, 12/6/10 29

ID=23456 hasAuthor “Kurt Vonnegut”

ID=23456 hasPreferredTitle “Bluebeard,

a novel”

ID=23456 isFormOfWork “Novel”

ID=23456 hasOriginalLanguage

“English”

ID=23456

hasPlaceOfPublication “New

York”

ID=23456

hasStatementOfEdition “1 st

trade edition”

ID=23456 hasLanguageOfExpression

“English”

ID=23456 hasPublisher “Delacorte

Press”

ID=23456 hasPublicationDate

“1987”

Or Just an AggregationIn the Making?

UNC Visit, 12/6/10 30

ID=23456 hasAuthor “Kurt Vonnegut”

ID=23456 hasPreferredTitle “Bluebeard, a novel”

ID=23456 isFormOfWork “Novel”

ID=23456 hasOriginalLanguage

“English”

ID=23456

hasPlaceOfPublication “New

York”

ID=23456

hasStatementOfEdition “1 st

trade edition”

ID=23456 hasLanguageOfExpression

“English”

ID=23456 hasPublisher “Delacorte

Press”

ID=23456 hasPublicationDate

“1987”

UNC Visit, 12/6/10 31

ID=23456 hasAuthor “Kurt Vonnegut”

ID=23456 hasPreferredTitle “Bluebeard, a novel”

ID=23456 isFormOfWork “Novel”

ID=23456 hasOriginalLanguage “English”

ID=23456

hasPlaceOfPublication “New

York”

ID=23456

hasStatementOfEdition “1 st

trade edition”

ID=23456 hasLanguageOfExpression

“English”

ID=23456 hasPublisher “Delacorte

Press”

ID=23456 hasPublicationDate

“1987”

UNC Visit, 12/6/10 32

ID=23456 hasAuthor “Kurt Vonnegut”

ID=23456 hasPreferredTitle “Bluebeard, a novel”

ID=23456 isFormOfWork “Novel”

ID=23456 hasOriginalLanguage “English”

ID=23456

hasPlaceOfPublication “New

York”

ID=23456

hasStatementOfEdition “1 st

trade edition”ID=23456 hasLanguageOfExpression

“English”

ID=23456 hasPublisher “Delacorte

Press”

ID=23456 hasPublicationDate

“1987”

UNC Visit, 12/6/10 33

ID=23456 hasAuthor “Kurt Vonnegut”

ID=23456 hasPreferredTitle “Bluebeard, a novel”

ID=23456 isFormOfWork “Novel”

ID=23456 hasOriginalLanguage “English”

ID=23456

hasPlaceOfPublication “New

York”

ID=23456

hasStatementOfEdition “1 st

trade edition”ID=23456 hasLanguageOfExpression

“English”

ID=23456 hasPublisher “Delacorte

Press”

ID=23456 hasPublicationDate

“1987”

Work

UNC Visit, 12/6/10 34

ID=23456

hasPlaceOfPublication “New

York”

ID=23456

hasStatementOfEdition “1 st

trade edition”

ID=23456 hasLanguageOfExpression “English”

ID=23456 hasPublisher “Delacorte

Press”

ID=23456 hasPublicationDate

“1987”

UNC Visit, 12/6/10 35

ID=23456

hasPlaceOfPublication “New

York”

ID=23456 hasStatementOfEdition “1st

trade edition”

ID=23456 hasLanguageOfExpression “English”

ID=23456 hasPublisher “Delacorte

Press”

ID=23456 hasPublicationDate

“1987”

UNC Visit, 12/6/10 36

ID=23456

hasPlaceOfPublication “New

York”

ID=23456 hasStatementOfEdition “1st

trade edition”

ID=23456 hasLanguageOfExpression “English”

ID=23456 hasPublisher “Delacorte

Press”

ID=23456 hasPublicationDate

“1987”

Expression

UNC Visit, 12/6/10 37

ID=23456 hasPlaceOfPublication “New

York”

ID=23456 hasPublisher “Delacorte

Press”

ID=23456 hasPublicationDate

“1987”

UNC Visit, 12/6/10 38

ID=23456 hasPlaceOfPublication “New

York”

ID=23456 hasPublisher “Delacorte Press”

ID=23456 hasPublicationDate

“1987”

UNC Visit, 12/6/10 39

ID=23456 hasPlaceOfPublication “New

York”

ID=23456 hasPublisher “Delacorte Press”

ID=23456 hasPublicationDate “1987”

UNC Visit, 12/6/10 40

ID=23456 hasPlaceOfPublication “New

York”

ID=23456 hasPublisher “Delacorte Press”

ID=23456 hasPublicationDate “1987”

Manifestation

Extension

The inclusion of generalized properties provides a path for extension of RDA into specialized library communities and non-library communities They may have a different notion of how FRBR

‘aggregates’; for example, a colorized version of a film may be viewed as a separate work

They may not wish to use FRBR at all They may have additional properties to include,

that have a relationship to the RDA properties

UNC Visit, 12/6/10 41

RDA:adaptedAs

RDA:adaptedAsARadioScript

hasSubprope

rty

UNC Visit, 12/6/10 42

RDA:adaptedAs

RDA:adaptedAsARadioScript

KidLit:adaptedAsAPictureBook

hasSubproperty

hasSubprope

rtyUNC Visit, 12/6/10 43

RDA:adaptedAs

RDA:adaptedAsARadioScript

KidLit:adaptedAsAPictureBook

hasSubproperty

hasSubprope

rty

KidLit:adaptedAsAChapterBook

hasS

ubprop

e

rty

UNC Visit, 12/6/10 44

Completing the Vocabularies Completing the hierarchies, both

generalized and FRBR-bounded Elements and Relationships need to have

bounded hierarchies built (generalized hierarchies complete)

Roles need generalized properties created JSC review incomplete, for both

properties and vocabularies Status designations need to be updated

from ‘New—proposed’ to ‘Published’UNC Visit, 12/6/10 45

ExtentExtent

(I)Extent

(M)

Extent of Text (M)

Extent of Still Image (M)

Extent of Text

Extent of Still Image

Extent of Text (I)

Extent of Still Image (I)

Subproperty

Subproperty

Subproperty

Subproperty

Subproperty

Current Registered RelationshipsUNC Visit, 12/6/10 46

ExtentExtent

(I)Extent

(M)

Extent of Text (M)

Extent of Still Image (M)

Extent of Text

Extent of Still Image

Extent of Text (I)

Extent of Still Image (I)

Subproperty

Subproperty

Subproperty

Subproperty

Subproperty

Added Registered Relationships

SubpropertySubproperty

UNC Visit, 12/6/10 47

Remaining Issues

How do these relate to the RDA guidance text? Who will maintain these? How will they be

kept in sync with the text? Will the governance model for these be

the same as the text? Who decides? How can we use these vocabularies

effectively? What else do we need to identify? How should we continue this work?

UNC Visit, 12/6/10 48

What We’ve Learned

We wrote about the decisions we made for RDA in DLib:http://dlib.org/dlib/january10/hillmann/01hillmann.html

Need to continue to disclose what we’ve learned and work on building best practices documentation in this environment

UNC Visit, 12/6/10 49

Issues in Limbo: Identification Traditional identifiers, like ISBN and ISSN, may

not be suitable for a different environment An ISBN is a publisher’s identifier for a product, may

not be precise enough for a manifestation (publishers sometimes reuse ISBNs for different versions)

OCLC numbers and LCCNs, because they’re optimized for a MARC environment, may not be what we need going forward (though useful for a transition?)

If we move from a centralized to a decentralized sharing environment, what kind of identifier system will we need?

UNC Visit, 12/6/10 50

RDA Identification Issues

Resource identifiers Record level identifiers

What is a record in this context? ▪ Is it a description of an entity (work,

manifestation, person, event)?▪ Is it an aggregation of all these that is

designed to be shared?▪ Is it both? And more?

Identifiers between resources What about the ones we already use?

UNC Visit, 12/6/10 51

IDENTIFICATION

UNC Visit, 12/6/10 52

The W3C Library Linked Data Effort

http://www.w3.org/2005/Incubator/lld/ “The group will explore how existing building

blocks of librarianship, such as metadata models, metadata schemas, standards and protocols for building interoperability and library systems and networked environments, encourage libraries to bring their content, and generally re-orient their approaches to data interoperability towards the Web, also reaching to other communities. It will also envision these communities as a potential major provider of authoritative datasets (persons, topics...) for the Linked Data Web.”

UNC Visit, 12/6/10 53

Exhortations

Engage: be part of the change we need! Participate: find the area you’re most

excited about, and find a way to get involved Mailing lists, blog comments, conferences,

etc., are all available Learn: take responsibility for your own

education Local study groups are a great option when

institutions aren’t providing trainingUNC Visit, 12/6/10 54

Thank you! Questions?

Contact info: metadata.maven@gmail.com

Metadata Matters: http://managemetadata.com/blog

UNC Visit, 12/6/10 55

Recommended