Dewey in Sweden, Sweden in Dewey: Classification … · in Dewey: Classification in a ... LCSH -...

Preview:

Citation preview

Dewey in Sweden, Sweden in Dewey: Classification in a Local/Global Context

Seminarium om Dewey och klassifikationens roll nationellt och internationelltStockholm5 February 2009

Joan S. MitchellEditor in Chief, DDCOCLC

Outline

• Dewey’s benefits

• What we are doing to keep (and increase) Dewey’s usefulness

• Some interesting applications

• What can Swedish librarians do right now?

• Discussion

1. What are Dewey’s benefits?

• Language-independent representation

• Large amount of categorized content

• Interoperable translations

• Mappings and crosswalks

• Organizational support

• Worldwide user community

Language-independent representation

VietnameseLập trình005.1

Swedish & NorwegianProgrammering005.1

SpanishProgramación005.1

RussianПрограммирование005.1

ItalianProgrammazione005.1

GreekΠρογραμματισμός005.1

GermanProgrammierung005.1

FrenchProgrammation005.1

EnglishProgramming005.1

Arabic005.1َبْرَمَجة

Large amount of categorized content

• DDC used by 200,000+ libraries in 138 countries

• ~25% of WorldCat records include explicit Dewey numbers (more about derived numbers later)

• Number is growing, e.g., Deutsche Nationalbibliothek is now adding Dewey numbers to WorldCat nearly at the same rate as the Library of Congress

Translations

• Translations published in following languages since 1998: Arabic, French, German, Greek, Hebrew, Icelandic, Italian, Norwegian, Russian, Spanish, Turkish, and Vietnamese

• Updated top levels available in: Arabic, Chinese, Czech, French, German, Hebrew, Italian, Norwegian, Portuguese, Russian, Spanish, Swedish, and Vietnamese

• Discussions under way: Indonesian abridged edition, French and Greek full web versions, approaches to web versions in Norway and Sweden

Translations: Localization

DDK 5

781.621-781.729

O2 Stilistisk innflytelse fra andre musikktradisjoner

Til 02 legges sifrene som følger etter 781.6 in 781.63-781.69, f.eks. jazzens innflytelse på skandinavisk folkemusikk 781.62395025

DDC 22

781.621-781.729

02 Stylistic influence of other traditions of music

Add to 02 the numbers following 781.6 in 781.63-781.69, e.g., influence of jazz on Spanish folk music 781.6261025 . . .

Translations: Interoperable Expansions

—43551 Regierungsbezirk Köln—435511 Aachen—435512 Kreise Aachen, Heinsberg, Düren, Euskirchen

—4355122 Kreis Aachen

—4355124 Kreis Heinsberg

—4355126 Kreis Düren

—4355128 Kreis Euskirchen

—435513 Rhein-Erft-Kreis

—435514 Köln—435515 Leverkusen

—435516 Rheinisch-Bergischer-Kreis, Oberbergischer Kreis

—4355163 Rheinisch-Bergischer-Kreis

—4355167 Oberbergischer Kreis

—435518 Bonn—435519 Rhein-Sieg-Kreis

Mappings to subject headings (1)

Library of Congress Subject Headings (LCSH)

Medical Subject Headings (MeSH)

Canadian Subject Headings (CSH)

Sears List of Subject Headings (Sears)

Book Industry Standards and Communications (BISAC) Subject Headings

Mappings to subject headings (2)

RAMEAU [French]

Schlagwortnormdatei (SWD) [German]

Nuovo Soggettario [Italian]

Sears Lista de Encabezamientos de Materia [Spanish]

(more about derived mappings later)

Mappings in WebDewey

LCSH - DDC

mappings

Mappings in Abridged WebDewey

Mappings in MelvilClass

SWD - DDC mappings

Nuovo Soggettario - DDC

RAMEAU - DDC (broad level only)

620 $aAnesthesia$vLCSH (en ligne), 2005-02-21622 1 $aAnesthesia$vMeSH (en ligne), 2005-02-21624 $a610

Crosswalks between schemes

LCC – DDC (ClassWeb, Classify)

UDC – DDC (IZUM, Czech National Library)

SAB – DDC (Electronic updated version at National Library of Sweden)

Organizational Support

Permanent editorial staff at LC and OCLC

International advisory board (EPC)

International user community

Research (OCLC + partners around the world)

Dewey Community

ACOC National Libraries

ALA Translation Teams

CILIP EDUG

NKKI Research Partners

SABINET

. . .

Editorial Policy Committee

LC & OCLC

DDC Editors

Dewey Users around the World

2. What are we doing to Dewey?

• Content (updates and transformations)

• New forms of representation

Content

Translations

New topics (Semantic web), expansions (blogs / social networks), events (elections), boundaries (Italian provinces), views (abortion), etc.

(short-term)

Education

Religion Law

Foods/Meals Music

Groups of people

(long-term)

Continuous updating Transformations

Content: Annual Additions

Schedule and table numbers: 120/year

Built numbers: 450/year

Mapped headings: 1800/year*

*as of July 2008

Content: Full Edition Database(December 2008)

Schedule numbers: 26,715

New Schedule Number: 006.752 Blogs

Content: Full Edition Database(December 2008)

Schedule numbers: 26,715

Tables 1-6: 9,356

New Table Number: T2—45674 Fermo province

Content: Full Edition Database(December 2008)

Schedule numbers: 26,715

Tables 1-6: 9,356

Built schedule numbers: 13,310

New Built Schedule Number: 782.42162916

Content: Full Edition Database(December 2008)

Schedule numbers: 26,715

Tables 1-6: 9,356

Built schedule numbers: 13,310

Built table numbers: 609

New Built Table Number: T5—9276264

Content: Abridged Edition Database(December 2008)

Schedule numbers: 4,937

Tables 1-4: 522

Built schedule numbers: 401

Built table numbers: 9

Transformations

In many areas, the standard sequence assumes an underlying “universal”viewpoint that is not universal, e.g., food, religion, education, music

Food and meals

Rethink food and meals in a global context

What is a sandwich?

Smörgåsar

DDC 22:

641.84 Sandwiches

Including burritos, tacos, wraps; submarine sandwiches

Sandwiches: Proposed Update

641.84 Sandwiches and related dishes

Standard subdivisions are added for either or both topics in heading

Class here sandwiches and related dishes of any type, e.g., open-faced sandwiches, grilled sandwiches, wraps

Meals: Current outline

641.52 Breakfasts

641.53 Luncheons, lunches, brunches, teas,suppers, snacks

641.54 Dinners

Meals: Proposed outline

641.52 First meal of the day

641.53 Light meals and snacks

641.54 Main meal of the day

200 Religion

200 Religion

210 Philosophy and theory of religion

220 Bible

230-280 Christianity

290 Other religions

Class 2 in UDC: Chronological/Regional Development

21 Prehistoric religions

22 Religions of Far East origin

23 Religions originating in Indian subcontinent

24 Buddhism

25 Religions of antiquity. Minor cults and religions

26 Judaism

27 Christianity

28 Islam

29 Modern spiritual movements

New View of 200 Religion (excerpt)

Taoism (299.514)Confucianism (299.512)Hinduism (294.5)Jainism (294.4)Buddhism (294.3)Wicca (299.94)Zulu (African people)—religion (299.683986)Voodoo (299.675)Ras Tafari (299.676)Bible (220)Judaism (296)Christianity (230)Islam (297)Scientology (299.936)

370 Education

Can we provide a global framework that addresses local and global needs (e.g., levels of education, curricula, policies)?

Levels of primary education

DDK 5

372.241 Småskoletrinnet (1.-4. klasse)

372.242 Mellomtrinnet (5.-7. klasse)

372.243 Ungdomstrinnet (8.-10. klasse)

DDC 22

372.241 Lower level (grades 1-3)

372.242 Upper level (grades 4-6)

Class middle schools (grades 5-8), junior high schools in 373.236

780 Music

Evolution of musical styles brings:

compression of styles

expansion of styles

hybridization of styles

Approaches

•Shallow developments with deep indexing and mappings

•Expansions

•Synthesis (number building) for hybrid styles

Example: Shallow developments

Current entry

781.66 Rock (Rock ‘n’ roll)

Including acid, folk, hard, punk, soft rock

Proposed entry

781.66 Rock (Rock ‘n’ roll)

Class here specific rock styles

. . .

Example: Indexing

Relative Index entries at 781.66 to include:

New wave music

Soft rockKrautrock

Rockabilly musicHard rock

Punk rockAlternative rock music

Psychedelic rockAcid rock

Example: Current Mappings at 781.66

Example: ExpansionExample: Expansion

781.648 Electronica

Class here specific electronica styles

Class comprehensive works on electronic music in 786.7

Example: Hybrid music styles

In add table under 781.63—781.69:17 Hybrid styles

Fusion of two or more styles from different traditions of music to create a new style

Add to 17 the numbers following 781.6 in 781.62–781.69, e.g., fusion with folk music 172, folk rock 781.66172

See Manual at 781.6: Hybrid styles

. . .

Representation

• Use of and extensions to MARC 21 formats for representation of the DDC

• Development of a Uniform Resource Identifier (URI) structure for the DDC (Michael Panzer)

• Experimentation with DDC in SKOS (Michael Panzer)

• Investigation of formal specification of relationships in the DDC (Rebecca Green and Michael Panzer)

Representation: Dewey in MARC 21 Formats

• Decision to use MARC 21 formats for classification and authority data in new Editorial Support System (standard, detailed representation, flexibility to drive other representations)

• Development of proposed extensions to support representation and access in cooperation with DNB, LC, and OCLC

Dewey Class Record in MARC Classification Format

Dewey Class Record in XML (Partial)

Dewey Class Record (Formatted View)

Relative Index Record in MARC Authority Format

Some Extensions to MARC 21

• Identification of notation in internal add tables

• Representation of component parts of numbers

• Accommodation of full and partial “access” numbers

Notation in Internal Add Tables: New $y Subfield

Example from MARC Classification format:

153 ## $a 930 $c 990 $y 1 $a 004 $j Ethnic and national groups

[Notation 004 in the internal add table located at 930-990]

Component Parts of Numbers

Inclusion of component parts of numbers in bibliographic records using a new 085 field, based on the 765 field in the classification format

Component Parts Example:Feminist Criticism of Television

082 01 $8 1 $a 791.45082 $2 22

085 ## $8 1.1 $b 791.45 $z 1 $s 082

Television Feminist

Access Numbers

Provision for assignment of access numbers (additional DDC numbers, notation from Tables 1-6, internal table notation) in bibliographic records

Access Numbers: Examples (1)

Tunnels in the Swiss Alps

082 00 $a 388.13 $2 22

083 0# $z 2 $a 4947 $2 22/ger $q DE-101b

T2—4947 (Swiss Alps)

German DDC 22

Assigned by Deutsche Nationalbibliothek

Access Numbers: Examples (2)

History of Norway, Sweden, and Denmark

082 00 $a 948 $2 22 (Scandinavia)

083 0# $a 948.1 $2 22 (Norway)

083 0# $a 948.5 $2 22 (Sweden)

083 0# $a 948.9 $2 22 (Denmark)

Access Numbers: Examples (3)

Lyng, Selma Therese, 1972-Være eller lære? : om elevroller, identitet og læring i

ungdomsskolen/ Selma Therese Lyng. - Oslo : Universitetsforl., cop. 2004. -

215 s.- (370.153)(372.243)ISBN 82-15-00597-7 (h.) : Nkr 249.00

082 14 $a 372.243 $2 DDK5

082 04 $a 373.236 $2 22

083 1# $a 370.153 $2 DDK5

Representation: Dewey URIs

Design goals for Dewey URI structure:

• Common locator for Dewey concepts and associated resources for use in web services and web applications

• Retraceable path to concept rather than abstract identification

• Classes as center of identification for DDC concepts

Dewey URI Examples

Generic URI

http://dewey.info/class/338.4

Specific time

http://dewey.info/class/338.4/2007/05/25

http://dewey.info/class/338.4/e22

Specific time & language

http://dewey.info/class/338.4/2007/05/25/about.en

Specific time, language & format

http://dewey.info/class/338.4/2007/05/25/about.en.skos

Dewey in SKOS/RDF

SKOS (Simple Knowledge Organization System) provides a standard way to represent knowledge organization systems (KOS) using the Resource Description Framework (RDF)

Problem: Dewey is more complex than many KOS (e.g., thesauri)

• Schedules, auxiliary tables, internal tables, Relative Index

• Standard numbers, optional numbers; number spans, centered entries

• Elaborate note structure in tables and schedules + lengthy notes in the Manual

Initial Design Driven by Linked Data Needs

Linked Data:

Use URIs as names for things

Use HTTP URIs so that people can look up those names

When someone looks up a URI, provide useful information

Include links to other URIs so that they can discover more things

Tim Berners-Lee http://www.w3.org/DesignIssues/LinkedData.html

Analyzing DDC for modeling in RDF/SKOS (1)

Singled out as skos:Concepts right now:

• Listed schedule numbers (including synthesized numbers)

• Number spans

• Centered entries

• Relative Index terms (in different namespace)

Analyzing DDC for modeling in RDF/SKOS (2)

370.11 Education for specific objectives

370.113 Vocational education

370.113085 Parents--vocational education

370.1130941 Vocational education--Great Britain

370.1130973 Vocational education--United States

Career development

Career education

Education of employees

Employee development

Human resource development …

Career education

Career education--United States

Career education--United States--Curricula

Core competencies

Vocational education

Vocational training centers

Relative Index

Mapped LCSH

ddc:topic

skos:closeMatch

Analyzing DDC for modeling in RDF/SKOS (3)

370.113 Vocational education

Class here career education, occupational training, vocational schools

Class on-the-job training, vocational training provided by industry in 331.2592

For vocational education at secondary level, see 373.246; for adult vocational education, see 374.013

See also 331.702 for choice of vocation; also 371.425 for vocational guidance in schools

skos:notation

skos:prefLabel

skos:related

RDF model: Class

<class/370.113/2007/12> a skos:Concept ;

skos:inScheme <scheme/2007/12> ;

dct:created "1996-06-01T00:00:00.0-05:00"^^<http://purl.org/dc/terms/W3CDTF> ;

dct:modified "2003-03-26T00:00:00.0-05:00"^^<http://purl.org/dc/terms/W3CDTF> ;

skos:notation "370.113"^^<schema-terms/Notation> ;

skos:prefLabel "Vocational education"@en ;

skos:broader <class/370.11/2007/12> ;

skos:narrower <class/370.113085/2007/12> ,

<class/370.1130941/2007/12> ,

<class/370.1130973/2007/12> ;

skos:narrowerStructural <class/373.246/2007/12> ,

<class/374.013/2007/12> ;

skos:related <class/331.2592/2007/12> .

RDF model: Relative Index terms

<class/370.113/2007/12> ddc:topic <index/Career%20development> ,

<index/Career%20education> ,

<index/Education%20of%20employees> ,

<index/Employee%20development> ,

<index/Human%20resource%20development> ,

<index/Job%20training> ,

<index/Occupational%20training> ,

<index/Retraining%E2%80%94vocational%20education> ,

<index/Staff%20development> ,

<index/Training%E2%80%94employee%20education> ,

<index/Vocational%20education> ,

<index/Vocational%20schools> ,

<index/Vocational%20training> ,

<index/Work%20training> .

RDF model: Mapped LCSH

<class/370.113/2007/12> skos:closeMatch

<http://tspilot.oclc.org/lcsh/sh%2085020255%20> ,

<http://tspilot.oclc.org/lcsh/sh%2000002431%20> ,

<http://tspilot.oclc.org/lcsh/sh%2085144178%20> ,

<http://tspilot.oclc.org/lcsh/sh%2096002453%20> .

The Big Question

How can we make Dewey in its various representations plus mapped terminologies and associated content work harder?

3. Some interesting applications

• Dewey.info and history-of-concepts Dewey web services (Michael Panzer, OCLC)

• MelvilSearch and Multilingual MelvilClass(Lars Svensson, DNB)

• DeweyBrowser, Classify, Shelfview(Diane Vizine-Goetz, OCLC)

Dewey.info

Putting the RDF/SKOS representation to work for humans and machines

370.113: Class + Upward/Downward Hierarchies + Mapped LCSH

Dewey.info: http://dewey.info/615.4/about

Generic view in HTML of class across all editions/versions in all languages

Generic view in HTML of class across all editions/versions in all languageshttp://dewey.info/615.4/about

View of all English-language versions of that classhttp://dewey.info/615.4/about.en

HTML of a specific version of a class in a specific language http://dewey.info/615.58/2007/02/about.fr(.html)

HTML format is obtained via content negotiation: The server determines that HTML is the appropriate format for this user agent (i.e., a web browser)

HTML is annotated with RDFa! Clicking the RDF logo produces an RDF version of the HTML view

<span class="notation" property="skos:notation" datatype="ddc:Notation">615.58</span>

<a id="class“ resource="http://dewey.info/class/615.58/2007/02/about.fr" property="skos:prefLabel" xml:lang="fr" href="http://dewey.info/class/615.58/2007/02/about.fr">Pharmacothérapie</a>…

History-of-concepts web service

History of changes in the DDC:

DDC changes are exposed to users record-by-record in notes from one edition to the next

DDC changes from one edition to the next are also summarized in Lists of Changes in the print edition and as a downloadable table

Hidden from users (human and machine) is a rich set of information on changes in the underlying data file

685 MARC History Note

006.7 Multimedia systems

685 01 @t Multimedia systems, interactive video, comprehensive works on computer graphics and computer sound synthesis @i all formerly located in @b 006.6 @d1996 @221

(this information is no longer exposed in the print DDC 22 or WebDewey record)

Tracking changes

• of the scheme as a whole (snapshots/editions)

• of individual classes (contents of a class)

• of individual topics associated with a class

for

linking/updating class numbers, updating translations, maintenance of mappings, query expansion . . .

Change in knowledge organization systems

How to expose history information for machine access (1)

Standard identifier (URL):<http://dewey.info/class/004.165/>

Type of history note:<http://dewey.info/class/004.165/> ddc:relocationNote []

Normalized date:dcterms:issued “2008-08-01”^^xs:date

Relationships of note to scheme:dcterms:isPartOf <http://dewey.info/scheme/e22/>

Result of changedcterms:description "Partially changed number“@en

How to expose history information for machine access (2)

DDC numbers of affected classesddc:oldNumber "004.165"^^<schema-terms/Notation>

ddc:newNumber "004.1675"^^<schema-terms/Notation>

Affected topicddc:hasTopic "Specific handheld devices"@en

Complete note in human-readable formrdf:value "Specific handheld devices relocated to 004.1675"@en

Use: Update Mappings

Mapping relationship: “BlackBerry” to “004.165”

Timestamp: 2007-02-04

Using history information685 20 $Specific handheld devices $irelocated to

$b004.1675 $d200808 $222

Mapping Update: 004.165 [<2008-08]004.1675 [>= 2008-08]

Use: Query Expansion

Search term: “Information theory”

Resulting DDC number: 003.54

Using history information“Relocations and Discontinuations” (Ed. 20):

Ed. 19: [001.539] Ed. 20: 003.54

685 01 $tInformation theory $iformerly located in $b001.539 $d19890306 $220

Expanded query:

{001.539 19; 003.54 20; 003.54 21; 003.54 22}

Titles under “Alpiner Skilauf (Abfahrtslauf)” (796.935)

Titles under “Alpiner Skilauf (Abfahrtslauf)” + (796.935*)

Record from 796.935* Search

Multilingual MelvilClass: English

Multilingual MelvilClass: German

DeweyBrowser

DeweyBrowser (Svenska)

Dewey in WorldCat.org?

Classifyhttp://deweyresearch.oclc.org/classify2/

• DDC/LCC/NLM classifier

• Developed by OCLC Office of Research

• Based on FRBR cluster data

• Human interface + web service

Virtual reshelving

006.74

4. What can Swedish librarians do right now?

• Experiment with contributing Dewey numbers to WorldCat

• Create mappings for access vocabulary

• Load Dewey numbers into SAO authority records

• Begin planning for translation

Subject Heading/DDC Mappings

• Based on likelihood of use of heading with number

• No explicit definition of relationship beyond concurrent use

Mappings for Access

Study derivation of mappings from SAB - DDC, LCSH - DDC, SAO - LCSH

Tools:

SAB - DDC

SAO – LCSH - DDC

DDC - DDC terminology files (RI terms, LCSH, MeSH) - DDC

Dewey Numbers in Subject Heading Authority Records

• Subject entity represented by heading equals or approximates the whole of the DDC class or is in standing room

• Definition of relationship between heading and number is found in Dewey number record

Draft Guidelines for Adding Dewey Numbers to Authority Files

• The subject entity represented by the LCSH equals or approximates the whole of the Dewey class

• The subject entity represented by the LCSH is explicitly in standing room at the number

• The geographic entity represented by the LCSH has an implicit relationship to the Dewey class

• The genus/species represented by the LCSH has an implicit relationship to the Dewey class

• If the subject entity represented by the LCSH matches more than one Dewey number according to the aforementioned rules, multiple Dewey numbers may be added to the authority record

Nuovo Soggettario - DDC

BISAC - DDC

003 OCoLC¶

005 20090120222758.0¶

008 090120n| anznnbabn |n ana d¶

040 .. ‡aOCoLC-O‡beng‡cOCoLC-O¶

039 .. ‡a(OCoLC-O)MED-006000¶

039 .. ‡a(OCoLC-O)MED-6000¶

072 .7 ‡aMED‡x006000‡2bisacsh‡92198¶

083 04 ‡a617.96‡222‡5OCoLC-D¶

150 .. ‡aMEDICAL‡xAnesthesiology‡9medical anesthesiology¶

667 .. ‡aUsually do not map *ology to 362.1‡9Conversion note¶

667 .. ‡aBISAC Subject Code: MED006000, Sequence Number: 002198¶

Some Translation Planning Activities

• Undertake pilot study to test mixed Swedish-English approach

• Decide on Swedish terms for standard Dewey instructions (Including, Class here, etc.)

• Develop interoperable expansions in geography, history, etc.

• Continue to participate in EDUG working groups (education, law, archaeology)

• Plan technical environment for translation support and web version

• Study end-user tools (Swedish MelvilSearch?)

• Create an advisory board

Discussion

Let’s do it!

Tack för uppmärksamheten!

mitchelj@oclc.org

Recommended