ALA2006 1 Faceted Application of Subject Terminology A Joint Research and Development Project by OCLC and the Library of Congress A Faceted LCSH Based Subject Vocabulary Ed O’Neill, OCLC Lois Mai Chan, University of Kentucky ALA Annual Conference New Orleans, June 24, 2006

ALA2006 1

Faceted

Application of

Subject

Terminology

A Joint Research and Development Project by OCLC and the Library of Congress

A Faceted LCSH Based Subject Vocabulary

Ed O’Neill, OCLC

Lois Mai Chan, University of Kentucky

ALA Annual Conference

New Orleans, June 24, 2006

ALA2006 2

Need for New Approach

Phenomenal growth of electronic resources

Emergence of numerous metadata schemes

Need for a new approach to subject access

Lack of skilled subject catalogers

ALA2006 3

Subject Vocabulary for the Web

Optimal access points

Simple in structure and syntax

Usable by non-catalogers and in non-library environments

Semantic interoperability

Compatible with MARC, Dublin Core, and other popular metadata schemas

Easy maintainability

Amenable to computer-assisted authority control

ALA2006 4

Options

The ALCTS/SAC/Subcommittee on Metadata and Subject Analysis (1997-2001) identified three basic approaches to selecting an indexing/subject heading schema for Internet resources:

Develop a new schema

Use an existing schema(s)

Adapt or modify an existing schema

ALA2006 5

Subject Representation in Metadata

Issues considered:

Vocabulary (Semantics): Terminology and term relationships

Application (Syntax): How words are put together to form subject terms

ALA2006 6

LCSH Vocabulary

Largest in English language

Rich vocabulary covering all subject areas

Synonym and homograph control

Extensive hierarchical and associative references among terms

De facto standard controlled vocabulary: extensively used by libraries, translated into many languages, and contained in millions of MARC records

Long and well-documented history

Strong institutional support of the Library of Congress

ALA2006 7

LCSH Application Rules

The full-string approach to complex subjects is designed:

To ensure precision in retrieval

To facilitate browsing of multiple-concept or multi-faceted subjects in the online catalog

ALA2006 8

Application of LCSH on the Web

LCSH is not compatible in syntax with most other controlled vocabularies

LCSH is not amenable to search engines outside of the OPAC environment

Few LCSH headings are established

Complex subject heading strings in bibliographic or metadata records are costly to maintain

LCSH does not lend itself to automatic indexing or authority control

The use of LCSH requires highly trained personnel

ALA2006 9

What is FAST?

A rich controlled vocabulary based on the terminology of Library of Congress Subject Headings (LCSH)

A simplified application syntax

ALA2006 10

Principles of FAST

Takes a faceted approach, categorizing headings according to their functions

Retains the richness of the LCSH vocabulary in a simpler application syntax

Provides a tiered approach to allow different levels of subject representation

ALA2006 11

Characteristics of FAST

Vocabulary: Enumerative vs. Faceting

Terms in same facet – enumerated

Terms in different facets – listed separately

Retrieval: Precoordination and Postcoordination

Terms in same facet – precoordinated

Terms in different facets – postcoordinated
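A minimal Python sketch of this distinction, using hypothetical records (the record IDs and the `search` helper are illustrative and not part of FAST): terms within one facet arrive precoordinated as a single heading string, while terms from different facets are combined only at query time.

```python
from typing import Dict, List, Set

SEP = "\u2014"  # dash that joins a main heading to its subdivisions

# Hypothetical records: each maps a facet name to the headings assigned in it.
RECORDS: Dict[str, Dict[str, Set[str]]] = {
    "rec1": {
        "Topical": {SEP.join(["Hospitals", "Administration", "Data processing"])},
        "Geographic": {SEP.join(["Ohio", "Columbus"])},
    },
    "rec2": {
        "Topical": {SEP.join(["Hospitals", "Administration", "Data processing"])},
        "Geographic": {"Spain"},
    },
    "rec3": {
        "Topical": {"Urbanization"},
        "Geographic": {SEP.join(["Ohio", "Columbus"])},
    },
}


def search(records: Dict[str, Dict[str, Set[str]]], wanted: Dict[str, str]) -> List[str]:
    """Return IDs of records that carry every wanted heading in its facet."""
    return sorted(
        rec_id
        for rec_id, facets in records.items()
        if all(heading in facets.get(facet, set()) for facet, heading in wanted.items())
    )


# Postcoordination: one topical heading and one geographic heading are
# combined at search time rather than stored as a single string.
hits = search(
    RECORDS,
    {
        "Topical": SEP.join(["Hospitals", "Administration", "Data processing"]),
        "Geographic": SEP.join(["Ohio", "Columbus"]),
    },
)
print(hits)  # ['rec1']
```

Only `rec1` carries both headings, so it is the only hit; each facet can also be searched on its own.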

ALA2006 12

Vocabulary: Enumeration and Faceting

Headings in the FAST database include single-concept as well as multiple-concept headings.

Each FAST heading or heading-string belongs to a single facet

ALA2006 13

Subject Analysis - FAST

Vocabulary construction – fully established headings maintained in FAST database

Cataloging/indexing – selecting appropriate headings from FAST database

Retrieval – supporting faceted searching

ALA2006 14

Sources of FAST Headings

Library of Congress Subject Headings

Headings Assigned to Bibliographic Records in WorldCat

Created Headings

ALA2006 15

Faceting

Reduces the number of possible headings and heading strings

Permits independent use of headings

Headings are less volatile

~9,000,000 different LCSH topical headings in bibliographic records

~400,000 FAST topical headings

Fewer infrequently assigned headings

Supports faceted searches

ALA2006 16

Eight Facets

Topical

Personal Names

Form (Genre)

Chronological

Corporate Names

Conference/Meetings

Uniform Titles

Geographic

ALA2006 17

Main headings

A FAST main heading contains a word or phrase representing a concept or entity that falls into one—and only one—of the eight FAST facets.

Banks and banking

Bibliography

California

Catalogs

1914 - 1918

Chemistry, Organic

Emigration and immigration

Self-esteem

Spain
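The one-facet-per-heading rule can be modeled as a small data structure. The following is an illustrative Python sketch, not FAST's actual data model: the `Facet` and `Heading` names are hypothetical, and the facet assigned to each example heading is inferred from the slide rather than taken from an authority record.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Dict, List


class Facet(Enum):
    TOPICAL = "Topical"
    GEOGRAPHIC = "Geographic"
    FORM = "Form (Genre)"
    CHRONOLOGICAL = "Chronological"
    PERSONAL_NAME = "Personal Names"
    CORPORATE_NAME = "Corporate Names"
    CONFERENCE = "Conference/Meetings"
    UNIFORM_TITLE = "Uniform Titles"


@dataclass(frozen=True)
class Heading:
    text: str
    facet: Facet  # a single field, so a heading cannot sit in two facets


# Facet assignments here are assumptions inferred from the examples.
EXAMPLES = [
    Heading("Banks and banking", Facet.TOPICAL),
    Heading("California", Facet.GEOGRAPHIC),
    Heading("1914 - 1918", Facet.CHRONOLOGICAL),
    Heading("Chemistry, Organic", Facet.TOPICAL),
    Heading("Spain", Facet.GEOGRAPHIC),
]


def group_by_facet(headings: List[Heading]) -> Dict[Facet, List[str]]:
    """Collect heading texts under the one facet each belongs to."""
    grouped: Dict[Facet, List[str]] = {}
    for heading in headings:
        grouped.setdefault(heading.facet, []).append(heading.text)
    return grouped


grouped = group_by_facet(EXAMPLES)
print(grouped[Facet.GEOGRAPHIC])  # ['California', 'Spain']
```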

ALA2006 18

Subdivisions

A heading string may contain one or more subdivisions belonging to the same facet as the main heading

Abortion—Law and legislation—Criminal provisions

Alcoholics—Services for—Planning

Americans—Travel—Historiography

Asians—Legal status, laws, etc.

Bibliography—Union lists

Brain—Cancer—Patients—Family relationships

California—San Francisco—Chinatown

Michigan—Lake Charlevoix

Ohio—Columbus
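A heading string of this kind can be taken apart and reassembled mechanically. The sketch below is a hedged illustration in Python; `split_heading` and `join_heading` are hypothetical helper names, and the only assumption about the notation is that the long dash separates the main heading from its subdivisions, as in the examples.

```python
from typing import List, Tuple

SEP = "\u2014"  # the dash used between a main heading and its subdivisions


def split_heading(heading: str) -> Tuple[str, List[str]]:
    """Split a heading string into (main heading, list of subdivisions)."""
    parts = [part.strip() for part in heading.split(SEP)]
    return parts[0], parts[1:]


def join_heading(main: str, subdivisions: List[str]) -> str:
    """Rebuild the heading string from its parts."""
    return SEP.join([main, *subdivisions])


example = join_heading("Brain", ["Cancer", "Patients", "Family relationships"])
main, subdivisions = split_heading(example)
print(main)          # Brain
print(subdivisions)  # ['Cancer', 'Patients', 'Family relationships']
```

Because every part of the string belongs to the same facet as the main heading, no facet information has to be stored per subdivision.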

ALA2006 19

Modular Approach

Each facet forms a distinct and discrete list of headings in a separate file.

These lists may be used together or separately. In a particular application, not all facets are required. For example, in indexing a collection of naturally occurring objects, the chronological and personal name headings may not be applicable.

One or more of the facets may be used with other standard lists, for instance, using topical headings from FAST and geographic headings from the Getty Thesaurus of Geographic Names (TGN)

ALA2006 20

All Headings Are Established

FAST uses the MARC 21 authority format

The MARC 21 bibliographic and authority formats were revised to accommodate FAST by authorizing the x48 (Chronological) fields

Assigning FAST headings doesn’t require an understanding of the rules for constructing headings

Authorities can serve as indexes

Automatic and/or machine assisted assignment possible
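Because every heading is established in a MARC 21 field, the facet can be read from the field tag. The following Python sketch shows one possible lookup; the tag meanings follow the MARC 21 formats (1XX heading fields in authority records, 6XX subject fields in bibliographic records), while the `facet_for_tag` helper itself is hypothetical.

```python
# Last two digits of the tag identify the kind of heading; "48" is the
# chronological field (148 in authority records, 648 in bibliographic records).
FACET_BY_SUFFIX = {
    "00": "Personal Names",
    "10": "Corporate Names",
    "11": "Conference/Meetings",
    "30": "Uniform Titles",
    "48": "Chronological",
    "50": "Topical",
    "51": "Geographic",
    "55": "Form (Genre)",
}


def facet_for_tag(tag: str) -> str:
    """Map a 1XX authority tag or a 6XX bibliographic tag to a facet name."""
    if len(tag) != 3 or tag[0] not in "16" or tag[1:] not in FACET_BY_SUFFIX:
        raise ValueError(f"not a recognized heading tag: {tag!r}")
    return FACET_BY_SUFFIX[tag[1:]]


print(facet_for_tag("648"))  # Chronological
print(facet_for_tag("151"))  # Geographic
```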

ALA2006 21

Topical Headings

Secret service

Urbanization

Hospitals—Administration—Data processing

Cataloging—Analytical entry

Photoconductivity—Measurement

Woodwind trios (English horn, oboes (2))

Sailing—Safety measures

ALA2006 22

Topical Authority Record

ALA2006 23

Geographic Facet

Geographic names will be established and applied in indirect order [Louisiana—New Orleans, not New Orleans—Louisiana]

First level geographic names will be limited to names from the Geographic Area Codes table (e.g., Ohio, Victoria, Great Lakes). Other names will be entered as subdivisions under the smallest first level name in which they are fully contained [Europe—Curzon Line]

Bodies of water (bays, gulfs, etc.) that are part of oceans are established under the larger body of water [Atlantic Ocean—Chesapeake Bay, not Chesapeake Bay (Md. and Va.)]

Geographic Area Codes are included in all authority records for geographic names
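The indirect-order rule can be sketched as a small builder that requires the first level to come from an approved list. This is an illustrative Python sketch only: `FIRST_LEVEL` holds just the first-level names mentioned in the examples, standing in for the full Geographic Area Codes table, and `geographic_heading` is a hypothetical helper.

```python
SEP = "\u2014"  # dash used between levels of a geographic heading

# Stand-in for the Geographic Area Codes table: only names taken from the
# examples on this slide, not the real table.
FIRST_LEVEL = {"Louisiana", "Ohio", "Victoria", "Great Lakes", "Europe", "Atlantic Ocean"}


def geographic_heading(*levels: str) -> str:
    """Build a heading from levels ordered largest to smallest (indirect order)."""
    if not levels or levels[0] not in FIRST_LEVEL:
        raise ValueError("first level must come from the first-level name list")
    return SEP.join(levels)


print(geographic_heading("Louisiana", "New Orleans"))
print(geographic_heading("Atlantic Ocean", "Chesapeake Bay"))
```

Passing the levels in direct order, such as `geographic_heading("New Orleans", "Louisiana")`, raises `ValueError` because "New Orleans" is not a first-level name.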

ALA2006 24

Geographic Headings

Queensland [u-at-qn]

Mars [zma]

Maryland—Worcester County [n-us-md]

Slovenia—Maribor [e-xv]

Norway—Oslo Metropolitan Area [e-no]

England—Chilton (Oxfordshire) [e-uk-en]

India—Limbdi (Princely State) [a-ii]

New South Wales—Sydney—Bondi [u-at-ne]

Pacific Ocean—Rowan Bay [p]
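Each example pairs a heading with its Geographic Area Code in square brackets. A hedged Python sketch for separating the two follows; the bracket layout is the slide's display convention rather than a MARC encoding, and `parse_geographic` is a hypothetical helper.

```python
import re
from typing import Tuple

# Heading text, optional whitespace, then a code in square brackets at the end.
_PATTERN = re.compile(r"^(?P<heading>.+?)\s*\[(?P<gac>[a-z0-9-]+)\]$")


def parse_geographic(line: str) -> Tuple[str, str]:
    """Split 'Heading [code]' into (heading, geographic area code)."""
    match = _PATTERN.match(line.strip())
    if match is None:
        raise ValueError(f"no bracketed area code found in {line!r}")
    return match.group("heading"), match.group("gac")


heading, gac = parse_geographic("England\u2014Chilton (Oxfordshire) [e-uk-en]")
print(heading)
print(gac)  # e-uk-en
```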

ALA2006 25

Geographic Authority Record

ALA2006 26

Form (Genre) Headings

Case studies

Abstracts

Census

Rules

Dictionaries

Folklore

Bibliography—Catalogs

Periodicals

Guidebooks

ALA2006 27

Personal and Corporate Names

Headings for persons:

Woodward, Bob

Dewey, Melvil, 1851-1931

Kennedy family

Charles II, King of France, 823-877

Headings for corporate bodies:

OCLC

Ford Motor Company

United States. National Security Agency

Dixie Chicks (Musical group)

ALA2006 28

Chronological (Period)

FAST chronological headings consist of only a single date or a date range

Limited to a single chronological heading per bibliographic record

Authority records will only be established when needed for references or linkages

Headings consist of either a single date or a starting and ending date but will be formatted for display:

1945

1942 – 1945

Since 1987

221 B.C. - 220 A.D.
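The display formats listed here can be reproduced with a short formatter. This is a minimal Python sketch under stated assumptions: years are stored as integers, B.C. years are represented as negative numbers (a convention chosen for this example, not one the slides specify), and a plain hyphen is used as the range separator throughout.

```python
from typing import Optional


def _year(year: int, with_era: bool) -> str:
    """Render one year; negative integers stand for B.C. years."""
    if year < 0:
        return f"{-year} B.C."
    return f"{year} A.D." if with_era else str(year)


def format_period(start: int, end: Optional[int] = None, since: bool = False) -> str:
    """Format a single date, a date range, or an open-ended 'Since' period."""
    if since:
        return f"Since {_year(start, False)}"
    if end is None or end == start:
        return _year(start, False)
    if end < start:
        raise ValueError("end date precedes start date")
    with_era = start < 0  # label both ends when the range begins in B.C.
    return f"{_year(start, with_era)} - {_year(end, with_era)}"


print(format_period(1945))              # 1945
print(format_period(1942, 1945))        # 1942 - 1945
print(format_period(1987, since=True))  # Since 1987
print(format_period(-221, 220))         # 221 B.C. - 220 A.D.
```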

ALA2006 29

LCSH to FAST Conversion

LCSH

600 Lincoln, Abraham, $d 1809-1865

650 Political leadership $z United States $v Case studies

650 Genius $v Case studies

600 Lincoln, Abraham, $d 1809-1865 $x Friends and associates

650 Presidents $z United States $v Biography

651 United States $x Politics and government $y 1861-1865

FAST

600 Lincoln, Abraham, $d 1809-1865

648 1861 - 1865

650 Political leadership

650 Genius

650 Friendship

650 Presidents

650 Political science

651 United States

655 Case studies

655 Biography
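The faceting step of this conversion can be approximated by splitting each LCSH string at its subdivision subfields and routing every piece to the field for its facet. The Python sketch below is a deliberately naive illustration, not the project's conversion algorithm: the function names are hypothetical, and it only splits and deduplicates. It therefore yields "Friends and associates" and "Politics and government" where the slide shows the established FAST headings "Friendship" and "Political science", a mapping step this sketch does not attempt.

```python
import re
from typing import Dict, List, Tuple

SEP = "\u2014"  # joins parts that stay together within one facet

# MARC subdivision subfields and the field that holds the matching facet.
SUBFIELD_TAG = {"v": "655", "x": "650", "y": "648", "z": "651"}

LCSH_FIELDS = [
    "600 Lincoln, Abraham, $d 1809-1865",
    "650 Political leadership $z United States $v Case studies",
    "650 Genius $v Case studies",
    "600 Lincoln, Abraham, $d 1809-1865 $x Friends and associates",
    "650 Presidents $z United States $v Biography",
    "651 United States $x Politics and government $y 1861-1865",
]


def facet_field(field: str) -> List[Tuple[str, str]]:
    """Split one LCSH field into (tag, heading) pairs, one per facet."""
    tag, rest = field.split(" ", 1)
    # Split only at $v/$x/$y/$z so name subfields such as $d stay in the heading.
    pieces = re.split(r"\s*\$([vxyz])\s*", rest)
    by_tag: Dict[str, List[str]] = {tag: [pieces[0].strip()]}
    for code, text in zip(pieces[1::2], pieces[2::2]):
        by_tag.setdefault(SUBFIELD_TAG[code], []).append(text.strip())
    return [(t, SEP.join(parts)) for t, parts in by_tag.items()]


def facet_record(fields: List[str]) -> List[Tuple[str, str]]:
    """Facet every field, drop duplicates, and order the result by tag."""
    seen: List[Tuple[str, str]] = []
    for field in fields:
        for heading in facet_field(field):
            if heading not in seen:
                seen.append(heading)
    return sorted(seen, key=lambda heading: heading[0])


for tag, text in facet_record(LCSH_FIELDS):
    print(tag, text)
```

The six LCSH strings reduce to ten distinct faceted headings, the same count as in the FAST list, because repeated pieces such as "United States" and "Case studies" are stored once.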

ALA2006 30

Databases

The FAST database is available as an OCLC SiteSearch database at http://fast.oclc.org

The database may be unavailable for extended periods

This version of FAST is being applied and evaluated in a few applications

The Subject Analysis Committee has established a Subcommittee on FAST to provide guidance and evaluation

ALA2006 31

Current FAST Database

Personal name headings 510,095

Corporate name headings 283,581

Topical headings 412,709

Geographic name headings 148,960

Form headings 694

Total FAST authorities 1,356,039
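As a quick arithmetic check, the per-facet counts can be summed and compared with the stated total. The short Python sketch below uses only the figures on this slide; the variable names are illustrative.

```python
# Authority record counts per facet, as listed on the slide.
FACET_COUNTS = {
    "Personal name headings": 510_095,
    "Corporate name headings": 283_581,
    "Topical headings": 412_709,
    "Geographic name headings": 148_960,
    "Form headings": 694,
}

STATED_TOTAL = 1_356_039

total = sum(FACET_COUNTS.values())
print(total == STATED_TOTAL)  # True
```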

ALA2006 32

Future Development Plans

Update and resynchronize all FAST headings with LCSH

Develop the conference/meetings facet

Develop the uniform titles facet

Expand the geographic names based on usage data and add information from the Geographic Names Information System (GNIS)

Revise and expand the form (genre) facet

Complete the FAST manual

ALA2006 33

Advantages of FAST

Reduces elaborate heading construction rules for catalogers and indexers; heading construction is at vocabulary rather than application level

Is able to accommodate both precoordinate and postcoordinate indexing and retrieval

Is more amenable to computer-assisted indexing and authority control

Is easier and more economical to maintain than a highly enumerative vocabulary

Facilitates mapping of subject data and cross-domain searching

Accommodates different retrieval models

ALA2006 34

Summary

LCSH Vocabulary

Faceted

Hierarchical

Fully established

Compatible with LCSH

ALA2006 35

Questions?

[email protected]

[email protected]

http://fast.oclc.org