26
Standards and Standards and Interoperability Interoperability Kurt Molholm Kurt Molholm [email protected] or [email protected] or [email protected] [email protected]

Standards and Interoperability Kurt Molholm [email protected] or [email protected]

Embed Size (px)

Citation preview

Standards and Standards and InteroperabilityInteroperability

Kurt MolholmKurt Molholm

[email protected] [email protected] or

[email protected]@molholm.com

StandardsStandards

Historically, the range of hardware, software, Historically, the range of hardware, software, and platform products from different vendors and platform products from different vendors made it difficult or impossible for to share made it difficult or impossible for to share data effectively. data effectively.

Software standards help enable software to Software standards help enable software to interoperate.interoperate.

Interoperability is not the same as standardsInteroperability is not the same as standards.. IInteroperabilitynteroperability does not assure access does not assure access

Types of IT StandardsTypes of IT Standards

Proprietary (de facto) Proprietary (de facto) standardsstandards • Evolve from a product

line or specific vendor• De facto standards are

widely accepted and used, but lack formal approval by a recognized standards organization.

PDF/Adobe Acrobat andPostScript are examples ofde facto standards.

Open (de jure) standardsOpen (de jure) standards • Publicly available

specifications

• Developed and adopted by some authorized standardization body, e.g.

– ISO/NISO/ANSI

– WWW Consortium

– Organization for the Advancement of Structured Information Standards (OASIS)

– The Internet Engineering Task Force (IETF)

InteroperabilityInteroperability

The ability of software and hardware on different machines from different vendors to share data.

– Source:www.webopedia.com

The ability of two or more systems, or components to exchange informationto exchange information, and to to use the informationuse the information that has been exchanged

– Source: www.globalvoice.com/index.asp

The Information The Information ContinuumContinuum

In this new information agenew information age, , knowledge is changing too fast. Not only is new knowledgenew knowledge and information being produced, but old knowledgeold knowledge is being reinterpreted and repackaged for more and varied purposes. more and varied purposes.

Accessing, evaluating and using information from a variety of sourcesvariety of sources has become a skill in itself for the user

This has put a heavy pressure on providers of heavy pressure on providers of informationinformation to deliver the right information at the right time and the right scale to the right person.

– THE GLOBAL DEVELOPMENT RESEARCH CENTERTHE GLOBAL DEVELOPMENT RESEARCH CENTER

The Raison d'être - Putting The Raison d'être - Putting

Knowledge to WorkKnowledge to Work - - Scientific Scientific

Data and InformationData and Information

Factual inputs, data, models, analyses, Factual inputs, data, models, analyses, technical information, or scientific technical information, or scientific assessmentsassessments related to such scientific and engineering disciplines .

This includes any communication or any communication or representation of knowledgerepresentation of knowledge such as facts or data, in in any medium or formany medium or form, , including textual, numerical, graphic, cartographic, narrative, or audiovisual forms.

ConsiderationsConsiderations

Metadata and Information Metadata and Information DiscoveryDiscovery

MetadataMetadata - Commonly divided into - Commonly divided into – descriptive metadata – structural metadata – administrative metadata

DiscoveryDiscovery - General term covering all - General term covering all strategies and methods of finding information strategies and methods of finding information in a digital library.in a digital library.

– Source:www.cs.cornell.edu/wya/DigLib/MS1999/glossary.html

Metadata and Information Metadata and Information DiscoveryDiscovery

Forms of Digital ObjectsForms of Digital Objects• AudioAudio:: Recorded or broadcast sound, including speech, music,

animal calls, nature sounds, synthesized sounds, and noise.

• TextText:: Printed or handwritten material accessible to the naked eye.

• Visual Material - MotionVisual Material - Motion:: Moving visual images, possibly accompanied by a sound track. Includes motion video and animation.

• Visual Material - Non-motionVisual Material - Non-motion:: Any static visual material, including still photographs, prints, drawings, posters, atlases, and raster maps.

• Numeric MaterialNumeric Material:: Significant numeric content, either statistical, coded, or graphically displayed

Persistence and Persistence and PresentationPresentation

Persistence Persistence • There is a need for Bits and bytes to be labeled or

referenced in such a way that they can be reliably found over time.

– Global uniqueness.Global uniqueness. The same identifier will never be assigned to two different resources.

– Support distributed naming and resolutiondistributed naming and resolution.. – Must support both tangible and intangible objects.both tangible and intangible objects.

PresentationPresentation• Traditional Paper and Ink• Digital Formats• Multimedia

Maintenance and Maintenance and PreservationPreservation

Maintenance - Maintenance - • Permit changes (e.g. errata)• Version Control

Preservation - Preservation - • Attempt to ensure long-term presentation

• Attempt to ensure long-term discovery and access

Standards, Guidelines, and Standards, Guidelines, and so forthso forth

MetadataMetadata

ISO 10646 and UnicodeISO 10646 and Unicode - In 1991, the ISO Working Group responsible for ISO/IEC 10646 (JTC 1/SC 2/WG 2) and the Unicode Consortium decided to create one universal standard for coding multilingual text.

Unicode Standard identifies the corresponding version of ISO/IEC 10646. Start with http://www.unicode.org/versions/

The Dublin CoreDublin Core Metadata Initiative - an open forum engaged in the development of interoperable online metadata standards

MetadataMetadata

A scientific data repository should allow A scientific data repository should allow metadata searchingmetadata searching. . Examples• US Federal Geographic Data Committee approved

the Content Standard for Digital Geospatial Content Standard for Digital Geospatial MetadataMetadata www.fgdc.gov/metadata/metadata.html

• indecs>indecs> Data Dictionary (Digital Object Indentifier - DOI Related). (Scientific data sets may be identified by DOIs, and several efforts are now underway in this area)

Search, Discovery, & Search, Discovery, & AccessAccess

Search the WebSearch the Web on your own using: on your own using: • Local Sources and Personal BookmarksLocal Sources and Personal Bookmarks • Search EnginesSearch Engines such as Google, Yahoo

Search the WebSearch the Web using using Portals and Meta Search Portals and Meta Search EnginesEngines (content through one single (content through one single interface)examples:interface)examples:• MetaSearch enginesMetaSearch engines such as Dogpile, Vivísimo, CSA

Illumina Multisearch, Elsevier’s Scirus• Directory of Open Access JournalsDirectory of Open Access Journals - A service that

covers free, full text, quality controlled scientific and scholarly journals http://www.doaj.org/

Search, Discovery, & Search, Discovery, & AccessAccess

Portals and Meta Search EnginesPortals and Meta Search Engines examples examples cont.:cont.:• CODATA/ICSTI PrototypeCODATA/ICSTI Prototype Portal on Permanent

Access to Scientific Data and Information http://www.nap.edu/shelves/codata/index.html

• Science.govScience.gov - a gateway to authoritative selected science information of U.S. Government agencies. http://www.science.gov/

• Online JOurnals Search EngineOnline JOurnals Search Engine www.ojose.com

Note: , SearchEngineWatch,Science Search Engines is a Note: , SearchEngineWatch,Science Search Engines is a good reference sourcegood reference source

Search, Discovery, & Search, Discovery, & AccessAccess

Community of ScienceCommunity of Science search engine to find home pages and contact addresses for scientists.

ISI Web of KnowledgeISI Web of Knowledge ElsevierElsevier

• ScienceDirect• Engineering Information

– Engineering Village 2• ChemVillage

CAS SciFinderCAS SciFinder FAOSTATFAOSTAT an on-line and multilingual database currently

containing time-series records covering international statistics

Search, Discovery, & Search, Discovery, & AccessAccess

Enterprise Servers Enterprise Servers - - collectively serve the needs of an enterpriseenterprise rather than a single user.• MarkLogic’s XML Content Server

• IXIASOFT’s TEXTML Server XML Content Server

PersistencePersistence

Uniform Resource NameUniform Resource Name - URNs are - URNs are persistent and unique identifiers of Internet persistent and unique identifiers of Internet documents. URNs are globally unique, documents. URNs are globally unique, persistent, and accessible over the network.persistent, and accessible over the network.

Uniform Resource IdentifierUniform Resource Identifier (URI) a compact (URI) a compact sequence of characters that identifies an sequence of characters that identifies an abstract or physical resource. abstract or physical resource.

PersistencePersistence

ARK (Archival Resource Key)ARK (Archival Resource Key) is a system primarily is a system primarily designed for custodians of archived digital objectsdesigned for custodians of archived digital objects

The The Handle SystemHandle System generally conforms with the URN generally conforms with the URN framework. The DOI system of unique identifiers is framework. The DOI system of unique identifiers is based on the Handle Systembased on the Handle System

Persistent Uniform Resource LocatorPersistent Uniform Resource Locator (PURL) was (PURL) was developed and implemented by OCLC. Intended as an developed and implemented by OCLC. Intended as an interim system to be used until the URN framework is interim system to be used until the URN framework is

well established.well established. • Source: The National Library of Australia's Preserving Access

to Digital Information (PADI)

PresentationPresentation

Traditional Paper and Ink (Meta-) Markup Traditional Paper and Ink (Meta-) Markup LanguagesLanguages - -"meta" languages because they are used for defining markup languages. • SGML (Standard Generalized Markup Language)

ISO 8879:1986

• HTMLHTML (HyperText Markup Language)

• XMLXML (eXtensible Markup Language - ISO 15022). XML has been designed for ease of implementation and for interoperability with both SGML and HTML. Flynn, P (Ed.), The XML FAQ v.4.41, Cork, 2006-01-07, http://xml.silmaril.ie/

PresentationPresentation

PaperPaper - - PDF/Adobe Acrobat, Word Digital FormatDigital Format

See ISO Committee JTC 1/SC 29 “Coding of audio,

picture, multimedia and hypermedia information”• Moving Picture Experts Group (MPEG)Moving Picture Experts Group (MPEG) ISO 11172

• Joint Photographic Experts Group (JPEG) and JPEG File Joint Photographic Experts Group (JPEG) and JPEG File Interchange Format (JFIF)Interchange Format (JFIF) ISO/IEC 15444-12:2005

http://www.jpeg.org/index.html

• Graphic Interchange Format (GIF)Graphic Interchange Format (GIF) can be displayed on almost all web browsers

• Real AudioReal Audio - a proprietary audio format

Maintenance & Maintenance & PreservationPreservation

Access ControlAccess Control• Intellectual Property Notices• Handles System

Open Archival Information SystemOpen Archival Information System (OAIS) (OAIS) Reference ModelReference Model ( (ISO 14721:2003)ISO 14721:2003)

XML XML (ISO 15022)(ISO 15022) URL/URI systemsURL/URI systems

• Version Control Z39.50 Z39.50 (ISO 23950:1998) (ISO 23950:1998) Presentation softwarePresentation software

ConclusionConclusion

““the beauty of standards is that there are so the beauty of standards is that there are so many to choose from”many to choose from”

Interoperability is not the same as standards.Interoperability is not the same as standards. Interoperability does not assure accessInteroperability does not assure access The Information ContinuumInformation Continuum means

considering both the near-term and the long-near-term and the long-termterm AND multiple presentation formsmultiple presentation forms

““A little knowledge is a dangerous thing, but it A little knowledge is a dangerous thing, but it is not half so bad as a lot of ignorance.”is not half so bad as a lot of ignorance.”

Terry Pratchett, “Equal Rites”

Information and Information and Knowledge Transfer Is the Knowledge Transfer Is the Raison d'êtreRaison d'être

WisdomWisdom is the result of learning and using is the result of learning and using knowledge for a strategic advantage. After knowledge for a strategic advantage. After gaining knowledge, wisdom is used to meet gaining knowledge, wisdom is used to meet new situations. Wisdom resides in the minds new situations. Wisdom resides in the minds of the users.of the users.

» Stuhlman Management Consultants, Knowledge Management Terms

KnowledgeKnowledge can be conveyed, but not wisdom. can be conveyed, but not wisdom.» Siddhartha by Hermann Hesse