Upload
aaron-parrish
View
213
Download
0
Tags:
Embed Size (px)
Citation preview
FAST (Faceted Application of Subject Terminology): A Vocabulary to Facilitate Faceted Browsing
FAST (Faceted Application of Subject Terminology): A Vocabulary to Facilitate Faceted Browsing
Eric ChildressConsulting Project ManagerOCLC Programs & Research
Everything Need Not Be Miscellaneous: Controlled Vocabularies And Classification In A Web World
Sponsored by OCLC, ISKO-NA and Université de Montréal
Université de Montréal
5 August 2008
OCLC / ISKO-NA Preconference
OutlineOutline
• A Changing World
• About FAST
• Illustrative uses of FAST
A Changing WorldA Changing World
• Search & Retrieval evolves
• Interface expectations are set by others
• Presentation and navigation patterns flow…
• Data/Metadata production sources expand
• Machine services are increasingly important
Search & Retrieval evolvesSearch & Retrieval evolves
19th Century
Author, Title, Subject
Card Catalogs
Manual
Complex, Non-intuitive
Controlled Vocabulary
Library Library catalogscatalogs
21st Century
Key Word, Relevance Weighted
Computers/Web
Automated & Social
Simpler, User-Friendly
Uncontrolled Vocabulary
low unit costshigh unit costs
Interface expectations are set by othersInterface expectations are set by others
Top Web properties by visitor June 2008 (SEW)
Presentation and navigation patterns flow…Presentation and navigation patterns flow…
Losing favor:
•Text-heavy interfaces
•All options, all the time
•Jargon-laden labels
Gaining favor:
•Single-search box simplicity
•White-space and eye-catching graphics
•Side-bar navigation
•Tagclouds
Closer to, but not libraries…
Traditional library interfaces
Newer library system interfaces
Tagclouds
Data/Metadata production sources expandData/Metadata production sources expand
Institution-based
Sophisticated rules
Expert-built
Authority
Formal distribution channels
Institutional networks
Chiefly individuals
No/lightweight rules
UGC (User-Generated Content)
Wisdom of the crowds
“Metadata in the wild”
The Network
cost diffusioncost accounting
Machine services are increasingly importantMachine services are increasingly important
About FASTAbout FAST
• Arose from expert recommendations
• Project of OCLC Research in consultation with LC
• Complement of faceted vocabularies based on LCSH
• Suitable for non-expert application
• Machine-to-machine friendly
A New Frontier for Controlled VocabulariesA New Frontier for Controlled Vocabularies
Expert group studied issues related to controlled vocabularies and the Web environment
• American Library Association’s ALCTS/SAC/Subcommittee on Metadata and Subject Analysis (1997-2001)
Conclusion: For certain circumstances a controlled vocabulary was needed which was:
• Web-friendly
• Low-learning curve/non-expert user-friendly
Existing major vocabularies were not ready as-is
Adapting an existing vocabulary might be a pragmatic option
Requirements for a New VocabularyRequirements for a New Vocabulary
• Simple in structure and syntax
• Usable by non-catalogers and in non-library environments
• Compatible with MARC, Dublin Core, and other popular metadata schemas
• Easy maintainability
• Machine-compatible
Launching FASTLaunching FAST
OCLC Research project launched in 1998
Advisory group: ALA/ALCTS/SAC Subcommittee on FAST (Faceted Application of Subject Terminology)
Team:
OCLC: Eric Childress, Kay Clapton, Becky Dean, Anya Dyer, Kerre Kammerer, Ed O’Neill (Lead), Diane Vizine-Goetz
LC CSPO: Lynn El-Hoshy (now retired), Janice Young
Consultant: Lois Mai Chan (University of Kentucky)
Why Adapt LCSH?Why Adapt LCSH?
• Rich vocabulary covering all subject areas
• Synonym and homograph control
• Extensive hierarchical and associative references among terms
• De facto standard controlled vocabulary, extensively used by libraries, contained in millions of bibliographic records
• Long and well-documented history
• Strong institutional support of the Library of Congress
What is FAST?What is FAST?
• OCLC FAST (Faceted Application of Subject Terminology)
• A faceted vocabulary based on LCSH
• Modular – each facet may be used independently
• Supports post-coordinate search & retrieval
• Designed for use by non-expert assigners
• Machine-friendly controlled vocabulary
Authority Control: LCSH vs. FAST Authority Control: LCSH vs. FAST
LCSH FAST
Very large number (billions plus) of possible headings
Faceting limits the number of possible headings to a few million
Common headings are established; most assigned headings are synthesized by catalogers based on rules
All headings (except chronological) are established
Most headings are distinct (based on NACO normalization rules*); some conflicts occur particularly with $x & $v
All normalized headings are distinct; tagging and subfield coding provides no unique information (with the exception of forms)
*http:\\www.loc.gov/catdir/pcc/naco/normrule.html
FAST - Eight FacetsFAST - Eight Facets
Topical
Subject headings ―Evaluation
Form (Genre)
Guidebooks
Chronological
1939 - 1945
Geographic
New York (State) ―New York
Personal Names Kilgour, Frederick G.
Corporate Names
Oregon Library Association
Events
Olympic Games
Uniform Titles Dead Sea scrolls
How is FAST built?How is FAST built?
• OCLC Research-built software processes LCSH authority file and LCSH present in WorldCat bibliographic records to automatically build:
• FAST MARC authority file covering 8 facets
• FAST MARC authority reference records to assist with LCSH conversion
Sample Authority Record - GeographicSample Authority Record - Geographic
001 2130675003 OCoLC 005 20040512160245.0008 040512nneanz||babn n ana d 040 OCoLC $b eng $c OCoLC $f fast043 n-us-ak151 Pacific Ocean $z Rowan Bay670 GNIS, Feb. 10, 2004 $b (Rowan Bay; bay;7 mi. N of Tebenkof Bay, on W coast of Kuiu I., Alex. Arch.; Wrangell-Petersburg Census Area, Alaska;56º40'02" N, 134º14'34" W; another Rowan Bay, pop. place in Wrangell-Petersburg Census Area)688 LC subject usage: 0 (2006)688 WC subject usage: 2 (2006)751 0 Rowan Bay (Alaska : Bay)$0 (DLC)sh2004005090
Sources of FAST HeadingsSources of FAST Headings
Library of Congress Subject Authority File
LC headings that combine different facets are deconstructed into discrete headings, each containing only one facet.
Headings assigned to bibliographic records in OCLC’s WorldCat
Many complex headings, i.e., those containing more than one element in the heading string, are based on literary warrant. They are derived from subject fields in the records in OCLC’s WorldCat.
Headings created for FAST
In some cases, faceting has required FAST headings to be created when no LCSH equivalents exists.
Example of a FAST-only heading – Events FacetExample of a FAST-only heading – Events Facet
In LCSH, it is common to established events as a combination of a geographic heading and a chronological ($y) subdivision:
Buffalo (N.Y.) $x History $y Civil War, 1861-1865
Grenada $x History $y American Invasion, 1983
For these subdivisions, a FAST topical heading is also created:
American Civil War, 1861-1865American Invasion of Grenada, 1983
LCSH to FAST ComparisonLCSH to FAST Comparison
600 Lincoln, Abraham, $d 1809-1865
648 1861 - 1865
650 Political leadership
650 Genius
650 Friendship
650 Presidents
650 Political science
651 United States
655 Case studies
655 Biography
FA
ST
600 Lincoln, Abraham, $d 1809-1865
650 Political leadership $z United States $v Case studies
650 Genius $v Case studies
600 Lincoln, Abraham, $d 1809-1865 $x Friends
and associates
650 Presidents $z United States $v Biography
651 United States $x Politics and government $y 1861-1865
LCS
H
Current FAST Authority FileCurrent FAST Authority File
Personal name headings 699,200
Corporate name headings 351,494
Topical headings 407,772
Geographic name headings 148,952
Chronological headings 676
Event headings 12,225
Title headings 48,245
Form headings 711
Total FAST authorities 1,669,375
July 2008
Future Development PlansFuture Development Plans
• Update and resynchronize all FAST headings with LCSH (In process)
• Improve the LCSH to FAST conversion (In process)
• Complete the FAST manual (In process)
• Expand the geographic names based on usage data and add information from the Geographic Names Information System (GNIS)
• Revise and expand the form (genre) facet
Sample OCLC applications of FASTSample OCLC applications of FAST
WorldCat Identities
FictionFinder
FA
ST h
ead
ing
s
FA
ST h
ead
ing
s
http://fictionfinder.oclc.org/
More informationMore information
OCLC FAST
• Project page: http://www.oclc.org/research/projects/fast/
• Search interface: http://fast.oclc.org/
OCLC WorldCat Identities
• http://orlabs.oclc.org/Identities/
OCLC FictionFinder
• http://fictionfinder.oclc.org/