Building Blocks for the Future: Making Controlled Vocabularies Available for the Semantic Web

Dr. Barbara B. Tillett
Chief, Policy & Standards Division
Library of Congress
For ELAG, May 2011


Page 2

DBpedia

National Library of Sweden

Linked Data LCSH

VIAF

Page 3

Internet “Cloud”

Databases, Repositories

Web frontend

Services

Page 4

Internet “Cloud”

Web frontend

Services

VIAF

Databases, Repositories

LCSH

Page 5

VIAF Objectives

• Facilitate exposure of authority data
• Reduce cataloging costs
• Simplify authority control (creation and maintenance) internationally
• Provide authority data in form, language, and script users want

Page 6

VIAF

歌川, 広重 2世 1826-1869

Utagawa, Hiroshige, 1826?-1869

Page 7

VIAF: The Virtual International Authority File

Original VIAF partners:
• Library of Congress (LC)
• Deutsche Nationalbibliothek (DNB)
• Bibliothèque nationale de France (BnF)
• OCLC (host)

Virtually combining the name authority files of all institutions into a single name authority service.

http://viaf.org/

Page 8

Virtual International Authority File

Matches names across 21 authority files of 18 institutions
• 18.4 million name records
• 14.5 million clusters

Based on KSY Cooperative Identities Hub, CEAL 2010-03

Page 9

• Library of Congress/NACO
• Deutsche Nationalbibliothek
• Bibliothèque nationale de France
• National Library of Australia
• National Library of the Czech Republic
• Bibliotheca Alexandrina (Egypt)
• Getty Research Institute
• National Library of Israel
• Istituto Centrale per il Catalogo Unico (Italy)
• Biblioteca Nacional de Portugal
• Biblioteca Nacional de España
• National Library of Sweden
• Swiss National Library
• Vatican Library
• NUKAT Center (Poland)
• Library and Archives Canada
• National Széchényi Library (Hungary)
• RERO (Switzerland)

Page 10

Current Status

• Available as linked data with URIs (Uniform Resource Identifiers)
• Unicode throughout
• MARC 21, UNIMARC, and RDF supported
• Usage tripled in the last year
• Thousands of visits daily

Page 11

Enhancing the Authorities

[Diagram labels: Bibliographic Record, Derived Authority, Authority Record, Enhanced Authority]

Page 12

Mining the Bibliographic Record

LDR 00638ncm a22002057a 450
1 5773347
5 19960820101947.4
8 960815s1965 oruuua n eng
10 $a 96753638
040 $a DLC $c DLC
019 $a 17706440
020 $c $2.95
028 22 $a 48418 $b Matrix Publ. Co.
045 2 $b d198006 $b d198007
048 $b va01 $b ve01 $a ka01
050 00 $a M1258 $b .L
100 1 $a Leigh, Mitch, $d 1928-
245 14 $a The man of La Mancha / $c by Mitch Leigh & Joe Darion; arr. By Roland Barrett & Alan Keown.
260 $a Springfield, OR : $b Matrix Publ. Co., $c c1965.
300 $a 1 score (16 p.) ; $c 18 x 27 cm.
500 $a Brief record.
650 0 $a Musicals $x Excerpts.
600 10 $a Leigh, Mitch $x Musical settings.
700 1 $a Darion, Joe.

Elements called out on the slide:
• Authors
• LC Control Number
• LC Classification
• Title
• Material Type
• Publisher
• Place of Publication
• Language
• Date of Publication
• Usage

Page 13

Derived Authority Record

00505cz a2200157n 450
1 xlc
3 OCoLC
5 19880921165012.4
8 880831n|acannaab|n aaa c
040 $a OCoLC $b eng $c OCoLC $f viaf
100 1 $a Leigh, Mitch.
903 $a 88030979
910 14 $a the man of la mancha
921 $a matrix publ co
922 $a oru
930 $a mitch leigh
940 $a eng
942 $a 234
943 $a 196x
944 $a cm
950 1 $a darian, joe $d 1928-

Notes called out on the slide:
• All text is normalized
• Subjects are grouped into broad subject areas
• Material type is coded
• Publication date is by decade
• Coauthor
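The normalization and decade coding noted on this slide can be sketched roughly as follows. This is a minimal illustration, not VIAF's actual processing: the function names `normalize_text` and `decade_code` are hypothetical, and the rules (lowercase, strip accents and punctuation, keep only Latin letters and digits) are assumptions inferred from the sample values such as "matrix publ co" and "196x".

```python
import re
import unicodedata
from typing import Optional


def normalize_text(value: str) -> str:
    """Lowercase, drop accents and punctuation, and collapse whitespace.

    Assumption: Latin-script input only; other scripts would be stripped.
    """
    decomposed = unicodedata.normalize("NFKD", value)
    without_marks = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    # Anything that is not a digit, an ASCII letter, or whitespace becomes a space.
    cleaned = re.sub(r"[^0-9a-z\s]", " ", without_marks.lower())
    return " ".join(cleaned.split())


def decade_code(date_field: str) -> Optional[str]:
    """Reduce the first four-digit year found to a decade code such as '196x'."""
    match = re.search(r"\d{4}", date_field)
    return f"{match.group()[:3]}x" if match else None


title = normalize_text("The man of La Mancha /")
publisher = normalize_text("Matrix Publ. Co.,")
decade = decade_code("c1965.")
```

Applied to the 245, 260 $b, and 260 $c values of the bibliographic record, this yields strings of the same shape as the 910, 921, and 943 fields in the derived record.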

Page 14

Enhanced Authority Record

00505cz a2200157n 450
1 oca01144962
5 19880921165012.4
8 840702n| acannaab| |n aaa |||
10 $a n 88090379
40 $a DLC $c DLC $d DLC
100 1 $a Leigh, Mitch, $d 1928-
670 $a the man of la mancha, c1966: $b t.p. (Mitch Leigh)
903 $a 84758340 $9 1
903 $a 93710923 $9 1
910 11 $a impossible dream $9 1
910 11 $a century library of music and sound by mitch leigh $9 1
921 $a matrix publ co $9 1
921 $a kapp $9 2
922 $a oru $9 2
930 $a mitch leigh $9 1
940 $a eng $9 2
942 $a 234 $9 2
943 $a 196x $9 1
943 $a 197x $9 1
944 $a cm $9 2
950 11 $a darian, joe $d 1928- $9 1
950 11 $a wasserman, dale $9 1
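One way to picture how several derived records could be folded into an enhanced record is a simple tally per field value. The sketch below is an assumption-laden illustration: the slides do not say what $9 means, so treating it as a count of contributing records is a guess, and `merge_derived`, `to_field_lines`, and the sample input are hypothetical.

```python
from collections import Counter


def merge_derived(records):
    """Tally how many derived records contribute each (tag, value) pair."""
    counts = Counter()
    for record in records:
        # Use a set so a value repeated inside one record is counted once.
        counts.update(set(record))
    return counts


def to_field_lines(counts):
    """Render tallies as 'tag $a value $9 count', sorted by tag and then value."""
    return [f"{tag} $a {value} $9 {n}" for (tag, value), n in sorted(counts.items())]


# Illustrative input only: two derived records built from values seen on the slides.
derived = [
    [("921", "matrix publ co"), ("943", "196x"), ("950", "darian, joe")],
    [("921", "kapp"), ("943", "196x"), ("950", "wasserman, dale")],
]
enhanced_lines = to_field_lines(merge_derived(derived))
```

With this input, the shared decade "196x" ends up with a tally of 2 while each publisher and coauthor appears once.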

Page 15

Information in Bibliographic Records

• He writes music
• His primary subject area is music
• He was published in the 1960s and 1970s by Matrix Publ. Co. in Oregon and Kapp in New York
• Worked with Joe Darion and Dale Wasserman
• Mitch Leigh is the only name he has used on his publications
• Etc.

Page 16

http://www.viaf.org

Hosted by OCLC

Page 17

viaf.org

Page 18

• Cervantes Saavedra, Miguel de 1547
• Cervantes de Salazar, Francisco, ca. 1514
• Cervantes, 1823-1898
• Cervantes Juan, 1395-1458
• Cervantes, Ignacio, 1847-1905
• Cervantes, Juan de, 1382-1453
• Cervantès, François, 1959-
• Cervani, Giulio, 1919-
• Cervantes, María Antonieta
• Cervantes de Haro, fl. 1908-193-

As viewed Nov. 1, 2010

cer

Pages 19-27

[Slides labeled “Cervantes”; page 21 also carries the label “Preferred Forms”]

Page 28

MARC 21

Cervantes

Page 29

RDF

Cervantes

Page 30

VIAF and Catalogers

Use as a reference tool:
• To resolve conflicts, questionable dates, forms of name, etc.

Cite as source in 670 $a, for example:
• BNF in VIAF, date searched
• Nat. Lib. of Australia in VIAF, date searched
• LAC in VIAF, date searched
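The citation pattern above ("source in VIAF, date searched") can be assembled with a small helper. This is only a sketch: `viaf_source_citation` is a hypothetical name, and the ISO date format is an assumption, since the slide does not prescribe how the search date should be written.

```python
from datetime import date


def viaf_source_citation(source_label: str, searched: date) -> str:
    """Build a 670 $a string of the form '<source> in VIAF, <date searched>'."""
    # Assumption: ISO 8601 date; local cataloging practice may call for another form.
    return f"670 $a {source_label} in VIAF, {searched.isoformat()}"


citation = viaf_source_citation("BNF", date(2010, 11, 1))
```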

Page 31

Next steps for VIAF

• Better searching
• More “Linked data”: related persons as in WorldCat Identities, Wikipedia, etc.
• Participants beyond libraries: rights management agencies, publishers, museums, archives
• More name types: corporate and family names, uniform titles, geographic names… not topical terms

Page 32

SKOS

Simple Knowledge Organization System

“Provides a model for expressing the basic structure and content of concept schemes such as thesauri, classification schemes, subject heading lists, taxonomies, folksonomies, and other similar types of controlled vocabulary” (SKOS Primer)

Page 33

SKOS

• Based on the Resource Description Framework (RDF)
• Resources can be exchanged between software applications and published on the Web
• Interconnects data on the Web, helping create the Semantic Web
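To make the RDF basis of SKOS concrete, the sketch below writes one concept as N-Triples lines using only the standard library. The SKOS and RDF namespace URIs and the properties `skos:prefLabel`, `skos:altLabel`, and `skos:broader` are standard; the `example.org` URIs, the alternate label, and the helper names are hypothetical and chosen only for illustration.

```python
SKOS = "http://www.w3.org/2004/02/skos/core#"
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"


def literal(text, lang):
    """Format a language-tagged literal (minimal escaping: backslash and quote only)."""
    escaped = text.replace("\\", "\\\\").replace('"', '\\"')
    return f'"{escaped}"@{lang}'


def concept_triples(uri, pref_label, alt_labels=(), broader=(), lang="en"):
    """Return N-Triples lines describing one skos:Concept."""
    lines = [f"<{uri}> <{RDF_TYPE}> <{SKOS}Concept> ."]
    lines.append(f"<{uri}> <{SKOS}prefLabel> {literal(pref_label, lang)} .")
    for label in alt_labels:
        lines.append(f"<{uri}> <{SKOS}altLabel> {literal(label, lang)} .")
    for parent in broader:
        lines.append(f"<{uri}> <{SKOS}broader> <{parent}> .")
    return lines


# Hypothetical URIs; a real vocabulary would supply its own identifiers.
triples = concept_triples(
    "http://example.org/vocab/musicals",
    "Musicals",
    alt_labels=["Musical theater"],
    broader=["http://example.org/vocab/music"],
)
```

Because every statement is a subject, predicate, and object built from URIs, another application can consume the lines without knowing anything about the system that produced them.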

Page 34

id.loc.gov/authorities

“Authorities & Vocabularies” from the Library of Congress

Intent: To provide human and programmatic access to commonly found standards and vocabularies developed by LC

Page 35

“Authorities & Vocabularies”

LCSH was the first offering:
• Subject headings
• Genre/form headings
• Children’s subject headings
• Subdivision records
• Validation records

Provides links from LCSH headings to RAMEAU headings

Exploring Répertoire de vedettes-matière (RVM) and others

Page 36

“Authorities & Vocabularies”

Also includes:
• Thesaurus for Graphic Materials (TGM)
• MARC geographic area codes
• MARC language codes
• MARC relator codes
• Preservation Events
• … etc.

Page 37

“Authorities & Vocabularies”

Benefits:
• Servers can download entire controlled vocabularies and the values within them, in multiple formats
• Available for free on the Web

Page 38

“Authorities & Vocabularies”

Human end-users can:
• Search and view individual headings and data elements: details of the record, visualization
• Suggest additions, changes


Page 42

“Authorities & Vocabularies”

URI for specific LCSH records/concepts:
id.loc.gov/authorities/[LCCN]
id.loc.gov/authorities/sh8508803
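The URI pattern on this slide can be expressed as a one-line template. The sketch follows the pattern exactly as printed, id.loc.gov/authorities/[LCCN], and the service's present-day URL layout may differ. The function name `lcsh_uri`, the `http://` scheme, and the whitespace and case normalization are assumptions added for illustration.

```python
def lcsh_uri(lccn: str) -> str:
    """Build a URI of the form id.loc.gov/authorities/[LCCN], as printed on the slide."""
    # Assumption: LCCNs written with internal spaces are collapsed and lowercased.
    normalized = "".join(lccn.split()).lower()
    if not normalized:
        raise ValueError("LCCN must not be empty")
    return f"http://id.loc.gov/authorities/{normalized}"


# The identifier below is copied from the slide as shown.
example_uri = lcsh_uri("sh8508803")
```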


Page 45

“Authorities & Vocabularies”

Contact information
• Content of site: Libby Dechman, [email protected]
• Questions: Larry Dixson, [email protected]

Page 46

“Authorities & Vocabularies”

A comment form and discussion list are available at http://id.loc.gov/authorities/contact.html

Page 47

RDA Controlled Vocabularies - Registries

Free on the Web at Open Metadata Registry

http://metadataregistry.org/schema/list.html

Page 48

http://metadataregistry.org/rdabrowse.htm

Page 49

Carrier type

Page 50

URI

Page 51

RDA Carrier Types

URI

Page 52

RDA Linked Data

[Diagram: entities joined by relationship links]

Entity labels:
• Don Quixote
• Madrid, 1979
• English
• Spanish
• French
• German
• Cervantes
• Library of Congress, Copy 1, Green leather binding
• Exemplary novels
• Wasserman
• The Man of La Mancha
• Text
• Movies…

Relationship labels: Derivative works, Subject, created (on three links)

Page 53

RDA Linked Terms for Languages

[Diagram: the same linked entities displayed with Spanish-language labels]

Entity labels:
• Don Quijote
• Madrid, 1979
• Inglés
• Español
• Francés
• Alemán
• Cervantes
• Library of Congress, Copia 1, Encuadernación en piel color verde
• Novelas Ejemplares
• Wasserman
• The Man of La Mancha
• Texto
• Películas …

Relationship labels: Obras derivadas, Materias
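The idea behind the two diagrams, one set of identifiers displayed with labels in the user's language, can be sketched with a small lookup table. The English and Spanish labels are taken from the slides; the `example.org` URIs and the function name `labels_for` are hypothetical stand-ins, not identifiers from the RDA registry.

```python
# Hypothetical URIs standing in for registered vocabulary terms.
LABELS = {
    "http://example.org/lang/eng": {"en": "English", "es": "Inglés"},
    "http://example.org/lang/spa": {"en": "Spanish", "es": "Español"},
    "http://example.org/lang/fre": {"en": "French", "es": "Francés"},
    "http://example.org/lang/ger": {"en": "German", "es": "Alemán"},
}


def labels_for(uris, lang, fallback="en"):
    """Return one display label per URI, falling back to another language, then the URI."""
    result = []
    for uri in uris:
        by_lang = LABELS.get(uri, {})
        result.append(by_lang.get(lang) or by_lang.get(fallback) or uri)
    return result


stored = list(LABELS)  # the record stores URIs, not display strings
spanish_view = labels_for(stored, "es")
english_view = labels_for(stored, "en")
```

The stored data is identical in both views; only the label chosen at display time changes.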

Page 54

Internet “Cloud”

Web frontend

Services

VIAF

Databases, Repositories

LCSH