ISO/IEC 11179, Part 2: Classification Schemes

Jim Carpenter
Bureau of Labor Statistics

Nancy Lawler
Department of Defense

Open Forum on Metadata Registries 2003
January 20 – 24, 2003
Santa Fe, New Mexico

Disclaimer

• Part 2 is a working draft
  – Significant deficiencies
• This presentation proposes a new model
  – Not compliant with Part 3 (International Standard)
  – Strategy is to decide what is needed & then write a compromise compliant version of Part 2
  – New version to be backwards compatible with compliant version

Thanks to many folks in the following communities that contributed ideas at this Open Forum

• Terminology Standards (TC 37, SC4)

• Statistical Community (UN Standards)

• Others in JTC1, SC 32, WG2 & WG4

Apologies if we missed some.

Classification Scheme
ISO standard definition

the descriptive information for an arrangement or division of objects into groups based on characteristics, which the objects have in common.

Classification Scheme

• the descriptive information for an arrangement or division of objects into groups
  – information for
    • a process – classifying objects
    • a result – characterization of objects
• based on characteristics, which the objects have in common
  – Classification methods are out of scope
  – Counter examples
    • 19th Century RR ticket fee classification (a turtle is an insect) – The Platypus and the Mermaid: And Other Figments of the Classifying Imagination by Harriet Ritvo
    • Aborigine category for “women, fire, and dangerous things” (book by same title by G. Lakoff)
    • Fuzzy categories, e.g., bald men (Fuzzy set theory, Lotfi Zadeh)

utility: find & understand

Benefits of Part 2

• Classifying administered items promotes
  – Understanding
  – Finding
• Registering classification schemes promotes
  – Sharing
  – Improving
    • Classification schemes
    • Systems

Classification Region
Conformant Version

[UML class diagram]

Classification_Scheme
  administration_record [1..1] : Administration_Record
  type_name [1..1] : String

Classification_Scheme_Item
  value [1..n] : String
  type_name [0..1] : String

Classification_Scheme_Item_Relationship
  type_description [1..1] : String

Administered_Item
  administration_record [1..1] : Administration_Record

Associations shown: membership, classification (ends marked 1..* or 0..*)
Callouts: Category concept; Item to classify

Where is the information for classifying?

Classification Region
Proposed Version (tentative)

[UML class diagram]

Classification_Scheme
  administration_record [1..1] : Administration_Record
  source_artifact_type [0..1] : Artifact_Type
  structure_type [1..1] : C_S_Structure_Type
  notation_system [0..n] : Notation_System
  variance_estimate_reference [0..n] : Reference Document

Category_Concept
  identifier [1..1] : String
  term_entry [0..1] : Terminological_Entry
  sign_entry [0..n] : Sign_Entry
  type_name [0..1] : String

Category_Concept_Relationship
  type_description [1..1] : String
  facet [0..1] : String
  (More attributes)

Administered_Item
  administered_item-administration_record [1..1] : Administration_Record

Associations shown: membership, classification (ends marked 1..* or 0..*)
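
The proposed classes can be read as plain data structures. The following is a minimal sketch, not the authors' model: attribute names follow the diagram, but the Python class names, the use of strings as stand-ins for Administration_Record, Artifact_Type, C_S_Structure_Type, Notation_System, Terminological_Entry and Sign_Entry, the directed `source`/`target` fields, and all identifier values are assumptions made for illustration.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class CategoryConcept:
    identifier: str                       # [1..1], unique within its scheme
    term_entry: Optional[str] = None      # [0..1]; string stands in for Terminological_Entry
    sign_entries: List[str] = field(default_factory=list)  # [0..n]; stand-in for Sign_Entry
    type_name: Optional[str] = None       # [0..1]


@dataclass
class CategoryConceptRelationship:
    type_description: str                 # [1..1]
    source: CategoryConcept               # assumption: relationships are binary and directed
    target: CategoryConcept
    facet: Optional[str] = None           # [0..1]


@dataclass
class ClassificationScheme:
    administration_record: str            # [1..1]; stand-in for Administration_Record
    structure_type: str                   # [1..1]; stand-in for C_S_Structure_Type
    source_artifact_type: Optional[str] = None                             # [0..1]
    notation_systems: List[str] = field(default_factory=list)              # [0..n]
    variance_estimate_references: List[str] = field(default_factory=list)  # [0..n]
    concepts: List[CategoryConcept] = field(default_factory=list)          # "membership"
    relationships: List[CategoryConceptRelationship] = field(default_factory=list)

    def add_concept(self, concept: CategoryConcept) -> CategoryConcept:
        # Enforce identifier uniqueness within the scheme.
        if any(c.identifier == concept.identifier for c in self.concepts):
            raise ValueError(f"duplicate identifier: {concept.identifier}")
        self.concepts.append(concept)
        return concept


@dataclass
class AdministeredItem:
    administration_record: str            # [1..1]
    classified_by: List[CategoryConcept] = field(default_factory=list)     # "classification"


# Illustrative data; identifiers such as "AR-001" are invented.
scheme = ClassificationScheme(administration_record="AR-001", structure_type="hierarchy")
airport = scheme.add_concept(CategoryConcept("airport", term_entry="Airport"))
transport = scheme.add_concept(
    CategoryConcept("airport-transportation", term_entry="Airport transportation")
)
scheme.relationships.append(
    CategoryConceptRelationship("Business subtype", source=transport, target=airport)
)
item = AdministeredItem(administration_record="AR-002", classified_by=[transport])
```

Keeping the relationship as its own class, rather than a parent pointer on the concept, leaves room for the type description and facet attributes that the diagram attaches to it.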

Administration Record

Attributes Inherited Through Administration Record

(No changes.)

• Terminology Attributes

• Administration Attributes

• Reference Document Attributes

• Submission Attributes

• Stewardship Attributes

• Registration Authority Attributes

Terminology Attributes

• name
• preferred name (yes or no)
• language identifier of name
• definition
• preferred definition (yes or no)
• language identifier of definition
• source reference
• context administration record
• context description
• context language identifier

Callouts: Designation; Definition

Administration Attributes

• item identifier
• registration status
• administration status
• creation date
• last change date
• effective date
• until date
• change description
• note & explanatory comment
• unresolved issue
• origin

Reference Document Attributes

• identifier

• type description

• language identifier

• title

• organization name

• organization mail address

Submission Attributes

• organization name

• organization mail address

• contact name

• contact title

• contact information

Stewardship Attributes

• organization name

• organization mail address

• contact name

• contact title

• contact information

Registration Authority Attributes

• identifier

• organization name

• organization mail address

• documentation language identifier

• registrar identifier

• registrar contact name

• registrar contact title

• registrar contact information

Other attributes in the Classification Region

Source artifact type

Classification Scheme
Source Artifact Type

• The type of artifact that contains the names and/or definitions of each category
• Suggests features of the classification scheme, for example:
  – subject matter
  – intended use
  – structure
  – content

Classification Scheme

Source Artifact Type

• Class type attributes:
  – Generic artifact type
  – Standard or Product name
  – XML Schema reference

Examples of Generic Artifact Types

• key word list
• thesaurus
• glossary
• project charter
• yellow pages
• product exhibit
• organization chart
• flow chart
• E-R model
• class diagram
• state diagram
• activity diagram
• parts diagram
• taxonomy
• ontology

Classification Scheme
Source Artifact Type Examples

• UML Class Diagram
• IDEF0 Process Model
• Microsoft Project activity diagram
• Rational Rose class diagram
• XML schema URL of tool that produced the artifact
• XML schema URL of artifact

?

Structure type

Classification Scheme
Structure Type

Classification scheme for classification schemes

• Relationships exist (yes or no)
• Predominant relationship type
• Hierarchy
  – Single vs. multiple inheritance
  – Levels
• Graph (network)
  – Cyclic
  – Acyclic
• Other: category order, overlap, completeness

?
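
Several of the structure features listed above can be derived mechanically from a scheme's category relationships. The sketch below is one possible way to do so, not a method from the presentation: it assumes relationships are given as hypothetical (child, parent) pairs, and the function name `describe_structure` and the keys of its result are invented for illustration.

```python
def describe_structure(categories, edges):
    """Summarise structure features from (child, parent) relationship pairs."""
    edges = list(edges)
    parents = {}
    nodes = set(categories)
    for child, parent in edges:
        parents.setdefault(child, set()).add(parent)
        nodes.update((child, parent))

    state = {}  # node -> "visiting" while on the current path, "done" afterwards

    def has_cycle(node):
        # Depth-first search along child -> parent links.
        state[node] = "visiting"
        for parent in parents.get(node, ()):
            if state.get(parent) == "visiting":
                return True
            if parent not in state and has_cycle(parent):
                return True
        state[node] = "done"
        return False

    cyclic = any(node not in state and has_cycle(node) for node in nodes)

    levels = None
    if not cyclic:
        depth = {}

        def level(node):
            # A category with no parent is at level 1.
            if node not in depth:
                depth[node] = 1 + max(
                    (level(p) for p in parents.get(node, ())), default=0
                )
            return depth[node]

        levels = max((level(n) for n in nodes), default=0)

    return {
        "relationships_exist": bool(edges),
        "cyclic": cyclic,
        "multiple_inheritance": any(len(p) > 1 for p in parents.values()),
        "levels": levels,
    }


tree = describe_structure(
    ["writing instrument", "chalk", "ink pen", "lead pencil"],
    [
        ("chalk", "writing instrument"),
        ("ink pen", "writing instrument"),
        ("lead pencil", "writing instrument"),
    ],
)
```

For the three-child example, `tree` reports that relationships exist, the graph is acyclic, inheritance is single, and there are two levels. Category order, overlap and completeness are not covered by this sketch.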

Classification Scheme
Structure Type: Self-referential alternative

• Assume MDR has a registered classification scheme for classification scheme structures.
• Classify the structure as
  – Classification scheme administered item identifier
  – Category concept identifier
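
Under the self-referential alternative, a scheme's structure type becomes a pair of identifiers that point into another registered scheme. A rough sketch of that idea follows; the registry contents, the identifier `CS-STRUCTURE-TYPES`, the category names and the function `is_registered` are all hypothetical.

```python
from typing import Dict, NamedTuple, Set


class StructureTypeRef(NamedTuple):
    scheme_identifier: str            # administered item identifier of the registered scheme
    category_concept_identifier: str  # category concept within that scheme


# Hypothetical registered scheme of structure types; every identifier is invented.
REGISTERED_SCHEMES: Dict[str, Set[str]] = {
    "CS-STRUCTURE-TYPES": {"flat-list", "hierarchy", "acyclic-graph", "cyclic-graph"},
}


def is_registered(ref: StructureTypeRef,
                  schemes: Dict[str, Set[str]] = REGISTERED_SCHEMES) -> bool:
    """Return True if the referenced category concept exists in the referenced scheme."""
    return ref.category_concept_identifier in schemes.get(ref.scheme_identifier, set())


structure_type = StructureTypeRef("CS-STRUCTURE-TYPES", "hierarchy")
```

The appeal of this design is that new structure types can be added by registering a category concept rather than by changing the metamodel.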

Notation system

Classification Scheme

Notation System (Non-term signs)

• Icon

• Sound

• Animation sequence

• Formula

• others

Standards may apply

Variance estimate

Classification Scheme
Variance Estimate Reference

Reference Document class type
  – identifier
  – type description
  – language identifier
  – title
  – organization name
  – organization mail address

Identifier

Category Concept

Identifier

• Unique within classification scheme

• Global identifier (within MDR)
  – Classification scheme identifier plus
  – Category concept identifier
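
The two-part global identifier can be illustrated with a composite key. This is only a sketch under assumptions: the class `GlobalConceptId`, the in-memory `registry`, the `register` helper and the scheme identifiers `SCHEME-A` and `SCHEME-B` are invented, and the presentation does not prescribe a storage format.

```python
from dataclasses import dataclass
from typing import Dict


@dataclass(frozen=True)
class GlobalConceptId:
    scheme_identifier: str   # identifies the classification scheme within the MDR
    concept_identifier: str  # unique only within that scheme


registry: Dict[GlobalConceptId, str] = {}


def register(scheme_identifier: str, concept_identifier: str, label: str) -> GlobalConceptId:
    """Register a category concept under its scheme-qualified identifier."""
    key = GlobalConceptId(scheme_identifier, concept_identifier)
    if key in registry:
        raise ValueError(f"already registered: {key}")
    registry[key] = label
    return key


# The same local identifier may appear in two schemes without colliding.
first = register("SCHEME-A", "001", "Subject")
second = register("SCHEME-B", "001", "Business category")
```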

Term Entry

[UML class diagram: Terminological_Entry]

Terminological_Entry

Language_Section
  language_identifier [1..1] : Language_Identification

Designation
  name [1..1] : String
  preferred_designation [0..1] : True_False

Definition
  definition_text [1..1] : String
  preferred_definition [0..1] : True_False
  source_reference [0..1] : Reference_Document

Associations shown:
  – terminological_entry_languages (1..1, 0..*)
  – name_entry (1..1, 0..*)
  – definition_entry (1..1, 0..*)
  – term_definition_pairing (0..1, 0..1)

• May replace with ISO 16642 model: Terminological Markup Framework
• Cardinalities are controversial.
• In Part 3, Terminological Entity is a relationship between admin. item & context.
• Need more than definition for classification.
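
The terminological entry classes might be sketched as nested records. The nesting below is an assumption about how the associations connect the classes (an entry holds language sections, a section holds designations and definitions, and a designation may be paired with one definition); the slide itself flags the cardinalities as controversial. The method `preferred_name`, the sample terms and the French designation are illustrative only.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Definition:
    definition_text: str                          # [1..1]
    preferred_definition: Optional[bool] = None   # [0..1]
    source_reference: Optional[str] = None        # [0..1]; stand-in for Reference_Document


@dataclass
class Designation:
    name: str                                     # [1..1]
    preferred_designation: Optional[bool] = None  # [0..1]
    definition: Optional[Definition] = None       # term_definition_pairing (assumed 0..1)


@dataclass
class LanguageSection:
    language_identifier: str                      # [1..1]
    designations: List[Designation] = field(default_factory=list)  # name_entry
    definitions: List[Definition] = field(default_factory=list)    # definition_entry


@dataclass
class TerminologicalEntry:
    language_sections: List[LanguageSection] = field(default_factory=list)

    def preferred_name(self, language_identifier: str) -> Optional[str]:
        """Return the first preferred designation in the given language, if any."""
        for section in self.language_sections:
            if section.language_identifier != language_identifier:
                continue
            for designation in section.designations:
                if designation.preferred_designation:
                    return designation.name
        return None


# Illustrative entry; the terms and definition text are invented.
pencil_definition = Definition("writing instrument that marks with graphite", True)
entry = TerminologicalEntry([
    LanguageSection(
        "en",
        designations=[
            Designation("pencil", False),
            Designation("lead pencil", True, definition=pencil_definition),
        ],
        definitions=[pencil_definition],
    ),
    LanguageSection("fr", designations=[Designation("crayon", True)]),
])
```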

Sign Entry

Category Concept
Sign Entry

• Sign Entry class type
  – Notation system identifier
  – Reference to the sign in the notation system
• Example
  – Category concept = White Owl
  – Notation system = Peterson’s Bird Calls
  – Sign Reference = Song of White Owl WAV
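
The White Owl example can be written out as data. In this sketch the `SignEntry` field names mirror the two class attributes; the dictionary layout, the `signs_in` helper, and the second (icon) entry are assumptions added for illustration.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass(frozen=True)
class SignEntry:
    notation_system_identifier: str  # which notation system the sign belongs to
    sign_reference: str              # reference to the sign within that system


# Category concept -> its sign entries ([0..n] per concept).
sign_entries: Dict[str, List[SignEntry]] = {
    "White Owl": [
        SignEntry("Peterson's Bird Calls", "Song of White Owl WAV"),
        SignEntry("Field guide icons", "white-owl.png"),  # hypothetical second sign
    ],
}


def signs_in(concept: str, notation_system_identifier: str) -> List[str]:
    """Return the sign references a concept has in one notation system."""
    return [
        entry.sign_reference
        for entry in sign_entries.get(concept, [])
        if entry.notation_system_identifier == notation_system_identifier
    ]
```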

Type Name

Category Concept

Type Name

Generic term for the type of category associated with the classification scheme type

Category Concept
Type Name Examples

Classification Scheme Name | Classification Scheme Type | Category Concept Type Names
Dewey Decimal System | hierarchical | Subject, Discipline
International Biological Taxonomical System | Taxonomy | Taxon
ISO/IEC 11179 Registry UML Model | UML Model | Class
Washington DC Yellow Pages | Yellow Pages | Business category, Professional category

type description

Category Concept Relationship

Type Description

• Text describing relationship type of a category concept

• Description varies by
  – Type of classification scheme
  – Application
  – Each pair of category concepts

Category Concept Relationship
Type Description Examples

Classification Scheme Name | Category Concept Pair | Category Concept Relationship Type Description
Dewey Decimal System | Politics & Religion | None
International Biological Taxonomical System | Homo & sapiens | Subtype
ISO/IEC 11179 Registry UML Model | Classif. Scheme & Category concept | Membership
Washington DC Yellow Pages | Airport & Airport transportation | Business subtype

Facet

Category Concept Relationship
Facet

• Characteristic used to separate categories within a category
• Other terms for facet: dimension, slot, …
• Example
  – Parent category: writing instrument
  – Facet: marking material
  – Child categories: chalk, ink pen, lead pencil, …
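
The writing-instrument example suggests how a facet partitions the children of a category. The sketch below groups child categories by the facet recorded on each relationship; the `Relationship` tuple, the `children_by_facet` function and the second facet ("permanence") with its children are hypothetical additions, not part of the slide.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, NamedTuple


class Relationship(NamedTuple):
    parent: str
    child: str
    facet: str


RELATIONSHIPS = [
    Relationship("writing instrument", "chalk", "marking material"),
    Relationship("writing instrument", "ink pen", "marking material"),
    Relationship("writing instrument", "lead pencil", "marking material"),
    # Hypothetical second facet, not from the slide:
    Relationship("writing instrument", "erasable", "permanence"),
    Relationship("writing instrument", "permanent", "permanence"),
]


def children_by_facet(relationships: Iterable[Relationship],
                      parent: str) -> Dict[str, List[str]]:
    """Group a parent's child categories by the facet that separates them."""
    grouped = defaultdict(list)
    for rel in relationships:
        if rel.parent == parent:
            grouped[rel.facet].append(rel.child)
    return dict(grouped)


facets = children_by_facet(RELATIONSHIPS, "writing instrument")
```

Storing the facet on the relationship, as the proposed model does, lets one parent category be subdivided along several independent characteristics at once.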

Questions

• Jim Carpenter
  – CarpenterJ@BLS.GOV
  – Bureau of Labor Statistics
• Nancy Lawler
  – NKLawler@earthlink.net
  – Department of Defense
