Taxonomy Strategies LLC. 28 August 2007. Copyright 2007 Taxonomy Strategies LLC. All rights reserved. Metadata and Controlled Vocabularies: Global Corporate Circle Working Session. Joseph Busch


Taxonomy Strategies LLC

28 August 2007 Copyright 2007 Taxonomy Strategies LLC. All rights reserved.

Metadata and Controlled Vocabularies

Global Corporate Circle Working Session

Joseph Busch

Taxonomy Strategies LLC: The business of organized information

Focus of this session

Best practices for specifying and using controlled vocabularies in DC-compliant information management applications.

Tradeoffs and best practices around organization-dependent vs. sharable common controlled vocabularies.

Tagging content for internal vs. external audiences using the same metadata and controlled vocabularies.

When and how to map different taxonomies to each other.


For us, taxonomy work includes:

Metadata specification defines the properties needed to describe content so that it can be found & used.

Vocabularies are collections of terms that are used to specify some of the metadata properties.

Some vocabularies are big and hierarchical, some are small and flat.

An application profile specifies what metadata & vocabularies are required, and then represents them formally.


Best practices (1)

Intranet and public taxonomies should be based on a common metadata specification and shared value vocabularies.

Some metadata attributes are directly mappable to DC, some will be local (locally declared).

Use qualified Dublin Core attributes.

Some vocabularies are sharable industry standards, while others will be organization-dependent.

Some value vocabularies will be particularly relevant to intranet content.


FDA Metadata specification (excerpt)

Element         Data Type  Length    Req./Repeat  Source              Purpose

Asset Metadata
Title           String     Variable  1            User supplied       Text search & results display.
Content Type    String     Variable  1            Local Value Voc     Group & filter search results.
Center          String     Variable  1            Local Value Voc
Date            Date       Fixed     1            System supplied     Publish, feature, review content.

Subject Metadata
Activity        String     Variable  *            Local Value Voc     Search for, browse, group & filter search results.
Law             String     Variable  *            Standard Value Voc
Product         String     Variable  *            Standard Value Voc
Brand           String     Variable  *            Standard Value Voc
Company         String     Variable  *            Standard Value Voc
Condition       String     Variable  *            Standard Value Voc
Topic           String     Variable  *            Local Value Voc

Link Metadata
Relation        String     Variable  *            Validate by lookup  Reference related resources.

Use Metadata
Audience        String     Variable  *            Local Value Voc     Target, personalize content.
Geography       String     Variable  *            Standard Value Voc

Legend: ? = 1 or more; * = 0 or more
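As a rough illustration of how a specification like the one above could be checked by software, the sketch below models a few of its rows and tests a record against the Req./Repeat column. The dictionary layout, the function name `check_record`, the sample title, and the reading of "1" as "exactly one value" are all assumptions made for this example, not part of the FDA specification.

```python
# Hypothetical, simplified model of a few rows from the specification excerpt.
# Assumption: "1" means exactly one value, "*" means zero or more values.
SPEC = {
    "Title": "1",
    "Content Type": "1",
    "Center": "1",
    "Date": "1",
    "Activity": "*",
    "Topic": "*",
    "Audience": "*",
}


def check_record(record):
    """Return a list of problems found in an {element: [values]} record."""
    problems = []
    for element, rule in SPEC.items():
        count = len(record.get(element, []))
        if rule == "1" and count != 1:
            problems.append(f"{element}: expected exactly 1 value, found {count}")
    for element in record:
        if element not in SPEC:
            problems.append(f"{element}: not defined in the specification")
    return problems


# Illustrative record; the title text is made up for this example.
record = {
    "Title": ["What to do about bad spinach"],
    "Content Type": ["Recalls"],
    "Center": ["CFSAN"],
    "Audience": ["Consumers"],
}

for problem in check_record(record):
    print(problem)
```

Here the record is missing its system-supplied Date, so the check reports that one element. Repeatable elements marked "*" may be absent without raising a problem.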


FDA Metadata specification (excerpt): Dublin Core mapping and standard vocabularies

Element         Dublin Core mapping  Standard vocabulary

Asset Metadata
Title           DC.Title
Content Type    DC.Type
Center          DC.Publisher
Date            DC.Date

Subject Metadata
Activity        Local
Law             Local                Blue Book
Product         Local                Orange Book
Brand           Local                Orange Book
Company         Local                Yellow Book
Condition       Local                ICD9
Topic           DC.Subject

Link Metadata
Relation        DC.Relation

Use Metadata
Audience        DCterms.Audience
Geography       DC.Coverage          USGS

DC.Format=“text/html”, DC.Language=“en”
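One common way to expose Dublin Core properties on a web page is as HTML `<meta>` elements. The sketch below shows how the mapping on this slide might drive that output. Pairing each element with a Dublin Core callout by order of appearance is an assumption, as are the function name `to_meta_tags` and the decision to skip locally declared elements.

```python
from html import escape

# Assumed pairing of specification elements with the Dublin Core callouts.
DC_MAPPING = {
    "Title": "DC.Title",
    "Content Type": "DC.Type",
    "Center": "DC.Publisher",
    "Date": "DC.Date",
    "Topic": "DC.Subject",
    "Relation": "DC.Relation",
    "Audience": "DCterms.Audience",
    "Geography": "DC.Coverage",
}

# Values given on the slide as fixed for this content.
CONSTANTS = {"DC.Format": "text/html", "DC.Language": "en"}


def to_meta_tags(record):
    """Render an {element: [values]} record as a list of HTML meta tags."""
    tags = []
    for element, values in record.items():
        name = DC_MAPPING.get(element)
        if name is None:
            continue  # locally declared elements are skipped in this sketch
        for value in values:
            tags.append(f'<meta name="{escape(name)}" content="{escape(value)}">')
    for name, value in CONSTANTS.items():
        tags.append(f'<meta name="{escape(name)}" content="{escape(value)}">')
    return tags


for tag in to_meta_tags({"Content Type": ["Recalls"], "Audience": ["Consumers"]}):
    print(tag)
```

Repeatable elements simply produce one `<meta>` element per value, which keeps the output flat and easy for a search engine or harvester to read.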


FDA* Taxonomy: all facets and sub-facets

Audience
Type
Geography
Center
Subject
– Activity
– Product
– Condition
– Law
– Brand
– Company
– Topic

* U.S. Food and Drug Administration


FDA* Taxonomy: intranet facets, a taxonomy subset

Facets: Audience, Type, Geography, Center, Subject (Activity, Product, Condition, Law, Brand, Company, Topic)

Audience
– Consumers
– Employees
– Healthcare
– Industry

Activity
– Administration
– Application & Approval
– Grant-Making & Sponsorship
– Investigation & Enforcement
– Public Awareness
– Research
– Rule-Making
– Training & Education

Type
– Directories
– Dockets
– Forms
– Instructions & How-To
– Job Information
– News
– Policies & Procedures
– Product Alerts
– Product Information
– Product Lists
– Publications
– Recalls
– Subject Indexes
– Tools & Databases
– Transcripts & Statements
– Warning Letters

* U.S. Food and Drug Administration


FDA.gov tagging example: Information about what to do about bad spinach.

Taxonomy Facet         Tag Values
DC.Type                Recalls
DC.Publisher           CFSAN
DC.Subject.Activity    Public Awareness
DC.Subject.Law         n/a
DC.Subject.Product     Food: Produce
DC.Subject.Brand       n/a
DC.Subject.Company     n/a
DC.Subject.Condition   Gastroenteritis
DC.Subject.Topic       Food Safety
DCterms.Audience       Consumers
DC.Coverage            n/a
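A tagged item like the spinach example can be held as a simple mapping from facet to values and checked against the controlled vocabularies. The sketch below is illustrative only: the vocabularies are partial copies of the value lists in this deck, the function name `invalid_tags` is made up, and facets tagged "n/a" are represented by leaving them out.

```python
# Partial, illustrative vocabularies; a real system would load the full lists.
VOCABULARIES = {
    "DC.Type": {"Recalls", "Product Information", "Forms", "Instructions & How-To"},
    "DC.Subject.Activity": {"Administration", "Public Awareness", "Research"},
    "DCterms.Audience": {"Consumers", "Employees", "Healthcare", "Industry"},
}


def invalid_tags(tags):
    """Return (facet, value) pairs whose value is not in the facet's vocabulary."""
    bad = []
    for facet, values in tags.items():
        allowed = VOCABULARIES.get(facet)
        if allowed is None:
            continue  # no vocabulary loaded for this facet in the sketch
        bad.extend((facet, value) for value in values if value not in allowed)
    return bad


# The spinach example; facets marked "n/a" are omitted.
spinach = {
    "DC.Type": ["Recalls"],
    "DC.Publisher": ["CFSAN"],
    "DC.Subject.Activity": ["Public Awareness"],
    "DC.Subject.Product": ["Food: Produce"],
    "DC.Subject.Condition": ["Gastroenteritis"],
    "DC.Subject.Topic": ["Food Safety"],
    "DCterms.Audience": ["Consumers"],
}

print(invalid_tags(spinach))
print(invalid_tags({"DCterms.Audience": ["Patients"]}))
```

The first call finds nothing to report. The second flags "Patients", which is not one of the Audience values listed in this deck.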


FDA.gov tagging example: Information on “Accutane” for patients.

Taxonomy Facet         Tag Values
DC.Type                Product Information
DC.Publisher           CDER
DC.Subject.Activity    Public Awareness
DC.Subject.Law         n/a
DC.Subject.Product     Drugs: Prescription Drugs
DC.Subject.Brand       Accutane; isotretinoin
DC.Subject.Company     n/a
DC.Subject.Condition   Disease: Acne
DC.Subject.Topic       Drug Information; Consumer Education
DCterms.Audience       Healthcare; Consumers
DC.Coverage            n/a


Inside.FDA tagging example: Instructions on how to replace a security badge.

Taxonomy Facet         Tag Values
DC.Type                Forms; Instructions & How-To
DC.Publisher           [applicable organizational dept]
DC.Subject.Activity    Administration
DC.Subject.Law         n/a
DC.Subject.Product     n/a
DC.Subject.Brand       n/a
DC.Subject.Company     n/a
DC.Subject.Condition   n/a
DC.Subject.Topic       n/a
DCterms.Audience       Employees
DC.Coverage            n/a


Best practices (2)

Intranet and internet content should share a common repository, but not replicate the same content in two places.

Tag content for appropriate audiences, e.g., Public, Internal, Confidential.

[Diagram labels: Content; Public, Internal, Conf.; Intranet, Internet]
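The idea of one repository serving both sites can be sketched as a filter over access tags. Everything specific below is an assumption for illustration: the `SITE_ACCESS` rule (the intranet shows Public and Internal items, the internet shows Public items only, and Confidential items appear on neither), the function name `visible_items`, and the sample item titles.

```python
# Hypothetical rule: which access tags each site is allowed to show.
SITE_ACCESS = {
    "internet": {"Public"},
    "intranet": {"Public", "Internal"},
}

# One shared repository; each item is stored once and tagged for its audience.
REPOSITORY = [
    {"title": "Spinach recall notice", "access": "Public"},
    {"title": "Security badge replacement", "access": "Internal"},
    {"title": "Draft enforcement memo", "access": "Confidential"},
]


def visible_items(repository, site):
    """Return the titles a given site may show, in repository order."""
    allowed = SITE_ACCESS[site]
    return [item["title"] for item in repository if item["access"] in allowed]


print(visible_items(REPOSITORY, "internet"))
print(visible_items(REPOSITORY, "intranet"))
```

Because each site is just a view over the same tagged items, nothing has to be copied from one site to the other.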


Mapping taxonomies

More complicated approach than multiple attributes with multiple value vocabularies.

Cases:
– One-to-one.
– One-to-many.
– Parallel, independent hierarchies.

If mapping is done, then business rules can be used to:
– Automatically add attribute values.
– Improve search.
– Create multiple views into the same content.

An ontology specifies typed associative relationships.
– Typically “Is a” relationships.


Taxonomy mapping

Case                               Level   Benefit                                     Example
One-to-one                         Easy    Automatic switching                         Ivory Coast = Côte d’Ivoire
One-to-many                        Medium  Automatic hedging (broadening/narrowing)    Czechoslovakia = Czech Republic; Slovakia
Parallel, independent hierarchies  Hard    Multiple views of same information space    Geographic vs. Political
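The first two cases in the table can be sketched with plain lookup tables, using the table's own examples. The function names `switch_term` and `expand_query` are invented for this illustration, and treating "hedging" as expanding a search term to include its mapped terms is one possible reading rather than a prescribed method.

```python
# One-to-one: an entry term is switched to its equivalent in the other taxonomy.
ONE_TO_ONE = {"Ivory Coast": "Côte d’Ivoire"}

# One-to-many: a term in one taxonomy corresponds to several terms in the other.
ONE_TO_MANY = {"Czechoslovakia": ["Czech Republic", "Slovakia"]}


def switch_term(term):
    """Return the mapped equivalent of a term, or the term itself if unmapped."""
    return ONE_TO_ONE.get(term, term)


def expand_query(term):
    """Return the term plus any terms it maps to, for a broader search."""
    term = switch_term(term)
    return [term] + ONE_TO_MANY.get(term, [])


print(switch_term("Ivory Coast"))
print(expand_query("Czechoslovakia"))
```

Parallel, independent hierarchies do not reduce to a lookup table like this, which fits the table's rating of that case as hard.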


Taxonomy: advanced relations

Audience
Type
Location
Organization
Products
Person

“Is a” Groups of Products:
– Product Line
– Application
– Technology
– Industry


Product relationships provide tagging rules for product groupings

Product                                     Product Line              Technology / Application / Industry
Oracle Business Activity Monitoring         Oracle Fusion Middleware  Application Server; Middleware; SOA
PeopleSoft Collaborative Supply Management  PeopleSoft Enterprise     Supplier Relationship Management
Siebel Clinical                             Siebel Clinical           Life Sciences & Pharma

Product names are consistent labels

Generic labels
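Product relationships like these can act as tagging rules: once an item is tagged with a product, the related values can be added automatically. The sketch below assumes a rule table keyed by product name. The function name `add_product_tags` and the catch-all facet name "Generic Labels" are hypothetical, chosen because the slide text does not make clear which generic label belongs to Technology, Application, or Industry.

```python
# Hypothetical rule table built from the product examples on this slide.
PRODUCT_RULES = {
    "Oracle Business Activity Monitoring": {
        "Product Line": ["Oracle Fusion Middleware"],
        "Generic Labels": ["Application Server", "Middleware", "SOA"],
    },
    "PeopleSoft Collaborative Supply Management": {
        "Product Line": ["PeopleSoft Enterprise"],
        "Generic Labels": ["Supplier Relationship Management"],
    },
    "Siebel Clinical": {
        "Product Line": ["Siebel Clinical"],
        "Generic Labels": ["Life Sciences & Pharma"],
    },
}


def add_product_tags(tags):
    """Return a copy of tags with the values implied by each Product added."""
    result = {facet: list(values) for facet, values in tags.items()}
    for product in tags.get("Product", []):
        for facet, implied in PRODUCT_RULES.get(product, {}).items():
            current = result.setdefault(facet, [])
            for value in implied:
                if value not in current:
                    current.append(value)
    return result


print(add_product_tags({"Product": ["Siebel Clinical"]}))
```

The person tagging only has to pick the product. The rule table supplies the rest, and the input is left unmodified so the manual and derived tags can still be told apart.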


Press room application: http://pressroom.oracle.com/prNavigator.jsp

“Is a” Groups of Products


Events application: http://events.oracle.com/

“Is a” Groups of Products

“Is located” powers Google Maps mash-up


Questions

[email protected]

+1-415-377-7912


GCC (Global Corporate Circle) Topics

Change focus to large organizations including governments & government agencies.
– Enterprise-Wide Metadata Applications Community (EnMAC)
– Is this agreeable?

2007-2008 activities.
– Best practices case studies.
– Identify and describe projects that are using DC.
– What is the best way to do this?
– Other activities?