Study report on SMM process

2007. 12. 6
Tae-Hoon Lim and Tae-Sul Seo
taehoon@dpc.or.kr, tsseo@kisti.re.kr

ISO/IEC JTC1/SC32 WG2 Interim Meeting
Seoul, Korea

Background

In accordance with the resolution of the SC32 New York meeting (SC32N1604a), a study on Semantic Harmonization of Metadata was performed.

Reference: SC32N1658

Summary

The title was changed.

The procedure was modified.
  The name of each step was changed.
  The 2nd and 3rd steps are interchangeable.

A description system for mapping was established.

Title change

From: “semantic harmonization of metadata”
To: “semantic metadata mapping (SMM) process”

The latter is a more specific expression than the former.

Procedure modification

Before:
  1st Surveying metadata sets
  2nd Constructing common DECs based on 11179
  3rd Grouping data elements by the DECs
  4th Completing crosswalks

After:
  1st Collecting metadata schema
  2nd Grouping attributes
  3rd Finding common DECs
  4th Mapping into a table

Semantic Metadata Mapping Process

Overall Process

  1st Collecting metadata schema
  2nd Grouping attributes
  3rd Finding common DECs
  4th Mapping into a table

The 2nd and 3rd steps are interchangeable.

1st. Collecting metadata schema

Survey and identify candidate metadata schemas in a domain.

The survey form includes:
  Domain name, service DB name, or another equivalent name
  Number of fields
  Sample data
  Value domains

2nd. Grouping attributes

Select one metadata set as the primary metadata set. The simplest or the highest-level metadata set is preferable as the primary one.

For all available metadata schemas, attributes should be aggregated under the attributes of the primary metadata set.
  Some attributes may not fit any of them.
  Attributes that are not important may be removed.
  The remaining attributes are grouped separately.

Metadata experts should perform this work together with domain experts.
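The grouping step can be sketched in code. The following is only a minimal sketch, assuming the experts' judgments are recorded by hand as a lookup from (schema, attribute) to an attribute of the primary metadata set. The function name `group_attributes`, the `judgments` structure, and the two `example...` attributes are hypothetical; the other attribute names are borrowed from the e-Book application in this report.

```python
def group_attributes(primary_attrs, judgments, unimportant=frozenset()):
    """Aggregate attributes under the attributes of the primary metadata set.

    judgments maps (schema, attribute) -> primary attribute, or None when the
    experts decide that the attribute fits no primary attribute.
    """
    groups = {attr: [] for attr in primary_attrs}
    remaining = []  # attributes that fit nowhere are grouped separately
    for key, primary in judgments.items():
        if key in unimportant:
            continue  # unimportant attributes may be removed
        if primary in groups:
            groups[primary].append(key)
        else:
            remaining.append(key)
    return groups, remaining


# Hand-made expert judgments (illustrative subset).
judgments = {
    ("MODS", "titleInfo:title"): "title",
    ("TEI", "fileDesc:titleStmt:title"): "title",
    ("MODS", "name:namePart"): "creator(file-as)",
    ("TEI", "fileDesc:titleStmt:author"): "creator(file-as)",
    ("MODS", "exampleUnmatched"): None,   # hypothetical attribute
    ("MODS", "exampleMinor"): None,       # hypothetical attribute
}

groups, remaining = group_attributes(
    primary_attrs=["title", "creator(file-as)"],
    judgments=judgments,
    unimportant={("MODS", "exampleMinor")},
)
print(groups)
print(remaining)
```

The sketch deliberately leaves the semantic decision to people: the code only records and aggregates what the metadata and domain experts have decided.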

3rd. Finding common DECs

Analyze each attribute of the primary metadata set and identify the object class and the property implied by, or related to, that attribute.

Construct common DECs (data element concepts) based on the ISO/IEC 11179 standard, using these object classes and properties.

If an attribute does not fit any of the DECs, a new DEC may be constructed for it.

4th. Mapping into a table

Finally, arrange all attributes into a table by the common DECs.

Comments on the type of mapping can be included in the table, as below:
  Same, no difference: no description
  Level difference: upper/lower terms
  Domain difference: generic/specific (book, technical report, article, …)
  Term difference: synonym, antonym, or preferred term
  Naming rule difference: order or representation rules

A recommended set of metadata can be provided to guide future standardization.

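Application to e-Book

Domain: e-Book

(1st) Available metadata sets: OpenEBPS, MODS, and TEI
Primary metadata set: OpenEBPS

|                  | OpenEBPS                       | MODS                             | TEI header                                  |
|------------------|--------------------------------|----------------------------------|---------------------------------------------|
| Domain name      | Description of Electronic Book | Description of Library resources | Encoding methods for machine-readable texts |
| Number of fields | 15                             | About 60 (top level: 20)         | Over 20                                     |
| Sample data      | yes                            | no                               | yes                                         |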
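The survey results above can be held in a small record type. This is a minimal sketch under stated assumptions: the `SurveyRecord` class and `pick_primary` function are hypothetical names, the field counts are the approximate figures from the table ("About 60" stored as 60, "Over 20" stored as its lower bound 20), and "fewest fields" is used as a stand-in for "simplest". In the process itself the choice is an expert judgment, not a computed one. Value domains are omitted because the table does not list them.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SurveyRecord:
    name: str
    domain_name: str
    approx_fields: int     # approximate field count taken from the survey table
    has_sample_data: bool


records = [
    SurveyRecord("OpenEBPS", "Description of Electronic Book", 15, True),
    SurveyRecord("MODS", "Description of Library resources", 60, False),
    SurveyRecord("TEI header", "Encoding methods for machine-readable texts", 20, True),
]


def pick_primary(candidates):
    """Assumed heuristic: treat the schema with the fewest fields as the simplest."""
    return min(candidates, key=lambda r: r.approx_fields)


primary = pick_primary(records)
print(primary.name)
```

With these figures the heuristic agrees with the report's choice of OpenEBPS as the primary metadata set.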

(2nd) Grouping attributes

| OpenEBPS         | MODS                  | TEI                             |
|------------------|-----------------------|---------------------------------|
| title            | titleInfo:title       | fileDesc:titleStmt:title        |
|                  | titleInfo:subTitle    | fileDesc:seriesStmt:title       |
|                  | titleInfo:partNumber  | fileDesc:seriesStmt:idno        |
|                  | titleInfo:partName    |                                 |
|                  | titleInfo:nonSort     |                                 |
| creator(role)    | name:role             |                                 |
| creator(file-as) | name:namePart         | fileDesc:titleStmt:author       |
|                  | name:displayForm      |                                 |
|                  | name:affiliation      |                                 |
|                  | name:description      |                                 |
| subject          | subject:topic         | profileDesc:textClass:keyword   |
|                  | classification        | profileDesc:textClass:classCode |
|                  | subject:cartographics | profileDesc:textClass:catRef    |
|                  | subject:occupation    |                                 |

(3rd) Constructing common DECs based on 11179

Object class: e-Book

Properties: title, author, subject, abstract, publisher, distributor, authority, contributor, publication-date, genre, format, extent, identifier, language, coverage-geographic, coverage-temporal, right, location, edition

DECs: ebookTitle, ebookAuthor, ebookSubject, ebookAbstract, ebookPublisher, ebookDistributor, ebookAuthority, ebookContributor, ebookPublication-date, ebookGenre, ebookFormat, ebookExtent, ebookIdentifier, ebookLanguage, ebookCoverage-geographic, ebookCoverage-temporal, ebookRight, ebookLocation, ebookEdition
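The DEC names listed for the e-Book example follow a visible pattern: the object class, lowercased and without its hyphen, followed by the property with its first letter capitalized. The sketch below reproduces that pattern; the rule is inferred from the list and is not stated in the report, and the function name `dec_name` is hypothetical.

```python
def dec_name(object_class: str, prop: str) -> str:
    """Combine an object class and a property into a DEC name.

    Naming rule inferred from the e-Book list, e.g. ("e-Book", "title") -> "ebookTitle".
    """
    prefix = object_class.replace("-", "").lower()   # "e-Book" -> "ebook"
    return prefix + prop[:1].upper() + prop[1:]      # capitalize the first letter only


PROPERTIES = [
    "title", "author", "subject", "abstract", "publisher", "distributor",
    "authority", "contributor", "publication-date", "genre", "format",
    "extent", "identifier", "language", "coverage-geographic",
    "coverage-temporal", "right", "location", "edition",
]

decs = [dec_name("e-Book", p) for p in PROPERTIES]
print(decs)
```

Hyphens inside a property are kept as they are, which matches names such as ebookPublication-date in the list.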

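(4th) Mapping into a table

| DEC          | OpenEBPS         | Type  | MODS               | Type  | TEI                 | Type  | Recommended DE         |
|--------------|------------------|-------|--------------------|-------|---------------------|-------|------------------------|
| ebookTitle   | title            |       | titleInfo:title    |       | titleStmt:title     |       | ebookTitle             |
|              |                  |       | titleInfo:subTitle |       | seriesStmt:title    | T:pre | ebookSubtitle          |
| ebookAuthor  | creator(role)    |       | name:role          |       |                     |       |                        |
|              | creator(file-as) | T:pre | name:namePart      | D:gen | titleStmt:author    | N:rep | ebookAuthorName        |
| ebookSubject | subject          | N:rep | subject:topic      | T:pre | textClass:keyword   | N:rep | ebookSubjectWord       |
|              |                  |       | classification     | N:rep | textClass:classCode | N:rep | ebookSubject-classCode |
|              |                  |       |                    |       | textClass:catRef    | T:pre |                        |

Mapping type codes:
  L (level difference): up = upper term, lo = lower term
  D (domain difference): gen = generic, …
  T (term difference): syn = synonym, ant = antonym, pre = preferred term
  N (naming rule difference): ord = order, rep = representation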
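The mapping type codes used in the table can be decoded mechanically. The following is a minimal sketch, assuming each code has the form `KIND:detail` and that an empty cell means "same, no difference". The names `MAPPING_TYPES` and `decode` are hypothetical. Only the codes spelled out in the legend are included; for D, only `gen` is listed because the remaining codes are elided in the source.

```python
MAPPING_TYPES = {
    "L": ("level difference", {"up": "upper term", "lo": "lower term"}),
    "D": ("domain difference", {"gen": "generic"}),  # other D codes are elided in the source
    "T": ("term difference", {"syn": "synonym", "ant": "antonym", "pre": "preferred term"}),
    "N": ("naming rule difference", {"ord": "order", "rep": "representation"}),
}


def decode(code: str):
    """Expand a code such as "T:pre" into (kind, detail)."""
    if not code:
        return ("no difference", "")  # empty cell: same, no description
    kind, _, detail = code.partition(":")
    label, details = MAPPING_TYPES[kind]  # raises KeyError for an unknown kind
    return (label, details.get(detail, detail))  # unknown detail codes pass through


# One row of the e-Book table (recommended DE: ebookAuthorName).
row = {
    "OpenEBPS": ("creator(file-as)", "T:pre"),
    "MODS": ("name:namePart", "D:gen"),
    "TEI": ("titleStmt:author", "N:rep"),
}

decoded = {schema: decode(code) for schema, (_attribute, code) in row.items()}
print(decoded)
```

Keeping the codes in one lookup table means that a new mapping type only needs a new entry, not a change to the decoding logic.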

Future plan

The SMM process will be elaborated further so that it can be proposed as a new work item in ISO/IEC JTC1/SC32 next year.

Thank you!