Upload
francis-ford
View
226
Download
6
Tags:
Embed Size (px)
Citation preview
Standards and Standards and InteroperabilityInteroperability
Kurt MolholmKurt Molholm
[email protected] [email protected] or
[email protected]@molholm.com
StandardsStandards
Historically, the range of hardware, software, Historically, the range of hardware, software, and platform products from different vendors and platform products from different vendors made it difficult or impossible for to share made it difficult or impossible for to share data effectively. data effectively.
Software standards help enable software to Software standards help enable software to interoperate.interoperate.
Interoperability is not the same as standardsInteroperability is not the same as standards.. IInteroperabilitynteroperability does not assure access does not assure access
Types of IT StandardsTypes of IT Standards
Proprietary (de facto) Proprietary (de facto) standardsstandards • Evolve from a product
line or specific vendor• De facto standards are
widely accepted and used, but lack formal approval by a recognized standards organization.
PDF/Adobe Acrobat andPostScript are examples ofde facto standards.
Open (de jure) standardsOpen (de jure) standards • Publicly available
specifications
• Developed and adopted by some authorized standardization body, e.g.
– ISO/NISO/ANSI
– WWW Consortium
– Organization for the Advancement of Structured Information Standards (OASIS)
– The Internet Engineering Task Force (IETF)
InteroperabilityInteroperability
The ability of software and hardware on different machines from different vendors to share data.
– Source:www.webopedia.com
The ability of two or more systems, or components to exchange informationto exchange information, and to to use the informationuse the information that has been exchanged
– Source: www.globalvoice.com/index.asp
The Information The Information ContinuumContinuum
In this new information agenew information age, , knowledge is changing too fast. Not only is new knowledgenew knowledge and information being produced, but old knowledgeold knowledge is being reinterpreted and repackaged for more and varied purposes. more and varied purposes.
Accessing, evaluating and using information from a variety of sourcesvariety of sources has become a skill in itself for the user
This has put a heavy pressure on providers of heavy pressure on providers of informationinformation to deliver the right information at the right time and the right scale to the right person.
– THE GLOBAL DEVELOPMENT RESEARCH CENTERTHE GLOBAL DEVELOPMENT RESEARCH CENTER
The Raison d'être - Putting The Raison d'être - Putting
Knowledge to WorkKnowledge to Work - - Scientific Scientific
Data and InformationData and Information
Factual inputs, data, models, analyses, Factual inputs, data, models, analyses, technical information, or scientific technical information, or scientific assessmentsassessments related to such scientific and engineering disciplines .
This includes any communication or any communication or representation of knowledgerepresentation of knowledge such as facts or data, in in any medium or formany medium or form, , including textual, numerical, graphic, cartographic, narrative, or audiovisual forms.
Metadata and Information Metadata and Information DiscoveryDiscovery
MetadataMetadata - Commonly divided into - Commonly divided into – descriptive metadata – structural metadata – administrative metadata
DiscoveryDiscovery - General term covering all - General term covering all strategies and methods of finding information strategies and methods of finding information in a digital library.in a digital library.
– Source:www.cs.cornell.edu/wya/DigLib/MS1999/glossary.html
Metadata and Information Metadata and Information DiscoveryDiscovery
Forms of Digital ObjectsForms of Digital Objects• AudioAudio:: Recorded or broadcast sound, including speech, music,
animal calls, nature sounds, synthesized sounds, and noise.
• TextText:: Printed or handwritten material accessible to the naked eye.
• Visual Material - MotionVisual Material - Motion:: Moving visual images, possibly accompanied by a sound track. Includes motion video and animation.
• Visual Material - Non-motionVisual Material - Non-motion:: Any static visual material, including still photographs, prints, drawings, posters, atlases, and raster maps.
• Numeric MaterialNumeric Material:: Significant numeric content, either statistical, coded, or graphically displayed
Persistence and Persistence and PresentationPresentation
Persistence Persistence • There is a need for Bits and bytes to be labeled or
referenced in such a way that they can be reliably found over time.
– Global uniqueness.Global uniqueness. The same identifier will never be assigned to two different resources.
– Support distributed naming and resolutiondistributed naming and resolution.. – Must support both tangible and intangible objects.both tangible and intangible objects.
PresentationPresentation• Traditional Paper and Ink• Digital Formats• Multimedia
Maintenance and Maintenance and PreservationPreservation
Maintenance - Maintenance - • Permit changes (e.g. errata)• Version Control
Preservation - Preservation - • Attempt to ensure long-term presentation
• Attempt to ensure long-term discovery and access
MetadataMetadata
ISO 10646 and UnicodeISO 10646 and Unicode - In 1991, the ISO Working Group responsible for ISO/IEC 10646 (JTC 1/SC 2/WG 2) and the Unicode Consortium decided to create one universal standard for coding multilingual text.
Unicode Standard identifies the corresponding version of ISO/IEC 10646. Start with http://www.unicode.org/versions/
The Dublin CoreDublin Core Metadata Initiative - an open forum engaged in the development of interoperable online metadata standards
MetadataMetadata
A scientific data repository should allow A scientific data repository should allow metadata searchingmetadata searching. . Examples• US Federal Geographic Data Committee approved
the Content Standard for Digital Geospatial Content Standard for Digital Geospatial MetadataMetadata www.fgdc.gov/metadata/metadata.html
• indecs>indecs> Data Dictionary (Digital Object Indentifier - DOI Related). (Scientific data sets may be identified by DOIs, and several efforts are now underway in this area)
Search, Discovery, & Search, Discovery, & AccessAccess
Search the WebSearch the Web on your own using: on your own using: • Local Sources and Personal BookmarksLocal Sources and Personal Bookmarks • Search EnginesSearch Engines such as Google, Yahoo
Search the WebSearch the Web using using Portals and Meta Search Portals and Meta Search EnginesEngines (content through one single (content through one single interface)examples:interface)examples:• MetaSearch enginesMetaSearch engines such as Dogpile, Vivísimo, CSA
Illumina Multisearch, Elsevier’s Scirus• Directory of Open Access JournalsDirectory of Open Access Journals - A service that
covers free, full text, quality controlled scientific and scholarly journals http://www.doaj.org/
Search, Discovery, & Search, Discovery, & AccessAccess
Portals and Meta Search EnginesPortals and Meta Search Engines examples examples cont.:cont.:• CODATA/ICSTI PrototypeCODATA/ICSTI Prototype Portal on Permanent
Access to Scientific Data and Information http://www.nap.edu/shelves/codata/index.html
• Science.govScience.gov - a gateway to authoritative selected science information of U.S. Government agencies. http://www.science.gov/
• Online JOurnals Search EngineOnline JOurnals Search Engine www.ojose.com
Note: , SearchEngineWatch,Science Search Engines is a Note: , SearchEngineWatch,Science Search Engines is a good reference sourcegood reference source
Search, Discovery, & Search, Discovery, & AccessAccess
Community of ScienceCommunity of Science search engine to find home pages and contact addresses for scientists.
ISI Web of KnowledgeISI Web of Knowledge ElsevierElsevier
• ScienceDirect• Engineering Information
– Engineering Village 2• ChemVillage
CAS SciFinderCAS SciFinder FAOSTATFAOSTAT an on-line and multilingual database currently
containing time-series records covering international statistics
Search, Discovery, & Search, Discovery, & AccessAccess
Enterprise Servers Enterprise Servers - - collectively serve the needs of an enterpriseenterprise rather than a single user.• MarkLogic’s XML Content Server
• IXIASOFT’s TEXTML Server XML Content Server
PersistencePersistence
Uniform Resource NameUniform Resource Name - URNs are - URNs are persistent and unique identifiers of Internet persistent and unique identifiers of Internet documents. URNs are globally unique, documents. URNs are globally unique, persistent, and accessible over the network.persistent, and accessible over the network.
Uniform Resource IdentifierUniform Resource Identifier (URI) a compact (URI) a compact sequence of characters that identifies an sequence of characters that identifies an abstract or physical resource. abstract or physical resource.
PersistencePersistence
ARK (Archival Resource Key)ARK (Archival Resource Key) is a system primarily is a system primarily designed for custodians of archived digital objectsdesigned for custodians of archived digital objects
The The Handle SystemHandle System generally conforms with the URN generally conforms with the URN framework. The DOI system of unique identifiers is framework. The DOI system of unique identifiers is based on the Handle Systembased on the Handle System
Persistent Uniform Resource LocatorPersistent Uniform Resource Locator (PURL) was (PURL) was developed and implemented by OCLC. Intended as an developed and implemented by OCLC. Intended as an interim system to be used until the URN framework is interim system to be used until the URN framework is
well established.well established. • Source: The National Library of Australia's Preserving Access
to Digital Information (PADI)
PresentationPresentation
Traditional Paper and Ink (Meta-) Markup Traditional Paper and Ink (Meta-) Markup LanguagesLanguages - -"meta" languages because they are used for defining markup languages. • SGML (Standard Generalized Markup Language)
ISO 8879:1986
• HTMLHTML (HyperText Markup Language)
• XMLXML (eXtensible Markup Language - ISO 15022). XML has been designed for ease of implementation and for interoperability with both SGML and HTML. Flynn, P (Ed.), The XML FAQ v.4.41, Cork, 2006-01-07, http://xml.silmaril.ie/
PresentationPresentation
PaperPaper - - PDF/Adobe Acrobat, Word Digital FormatDigital Format
See ISO Committee JTC 1/SC 29 “Coding of audio,
picture, multimedia and hypermedia information”• Moving Picture Experts Group (MPEG)Moving Picture Experts Group (MPEG) ISO 11172
• Joint Photographic Experts Group (JPEG) and JPEG File Joint Photographic Experts Group (JPEG) and JPEG File Interchange Format (JFIF)Interchange Format (JFIF) ISO/IEC 15444-12:2005
http://www.jpeg.org/index.html
• Graphic Interchange Format (GIF)Graphic Interchange Format (GIF) can be displayed on almost all web browsers
• Real AudioReal Audio - a proprietary audio format
Maintenance & Maintenance & PreservationPreservation
Access ControlAccess Control• Intellectual Property Notices• Handles System
Open Archival Information SystemOpen Archival Information System (OAIS) (OAIS) Reference ModelReference Model ( (ISO 14721:2003)ISO 14721:2003)
XML XML (ISO 15022)(ISO 15022) URL/URI systemsURL/URI systems
• Version Control Z39.50 Z39.50 (ISO 23950:1998) (ISO 23950:1998) Presentation softwarePresentation software
ConclusionConclusion
““the beauty of standards is that there are so the beauty of standards is that there are so many to choose from”many to choose from”
Interoperability is not the same as standards.Interoperability is not the same as standards. Interoperability does not assure accessInteroperability does not assure access The Information ContinuumInformation Continuum means
considering both the near-term and the long-near-term and the long-termterm AND multiple presentation formsmultiple presentation forms
““A little knowledge is a dangerous thing, but it A little knowledge is a dangerous thing, but it is not half so bad as a lot of ignorance.”is not half so bad as a lot of ignorance.”
Terry Pratchett, “Equal Rites”
Information and Information and Knowledge Transfer Is the Knowledge Transfer Is the Raison d'êtreRaison d'être
WisdomWisdom is the result of learning and using is the result of learning and using knowledge for a strategic advantage. After knowledge for a strategic advantage. After gaining knowledge, wisdom is used to meet gaining knowledge, wisdom is used to meet new situations. Wisdom resides in the minds new situations. Wisdom resides in the minds of the users.of the users.
» Stuhlman Management Consultants, Knowledge Management Terms
KnowledgeKnowledge can be conveyed, but not wisdom. can be conveyed, but not wisdom.» Siddhartha by Hermann Hesse