2

Click here to load reader

NIC and the semantic web briefing paper

Embed Size (px)

DESCRIPTION

This briefing paper provides a high-level overview of the main elements of the semantic web and the NHS National Innovation Centre's approach to developing its LInked Data initiative

Citation preview

Page 1: NIC and the semantic web briefing paper

CERTUS TECHNOLOGY NHS NATIONAL INNOVATION CENTRE

THE SEMANTIC WEB 23 R D SEPTEMBER 2009

1

NHS NATIONAL INNOVATION CENTRE

NIC AND THE SEMANTIC WEB

Briefing Paper – September 2009

1 INTRODUCTION

New knowledge is being produced at an increasing speed. The proper management of knowledge within the NHS is a key enabler of innovation and improvement. Techniques and technologies for the management of knowledge are therefore becoming fundamental.

In this briefing we first outline the relationship between data, information and knowledge. We then describe how the Semantic Web technologies allow knowledge to be managed with the aid of information and communication technology (ICT).

2 DATA, INFORMATION AND KNOWLEDGE

The use of ICT to manage Government information came into prominence thirty years ago with the advent of large scale databases. Data of itself has no meaning; it requires context to convey information. This has historically been provided within the structures of databases and by its presentation in the systems they support. Therefore, if data is removed from its ICT context, then its information content is compromised. To become knowledge, data must carry with it, its unambiguous and universal interpretation. Semantic technologies make this possible.

In an information society the preservation of increasing amounts of information is a major cost, and its potential loss an increasing risk. Loss can occur simply because software applications are no longer maintained, or data formats no longer supported. Without action we would eventually face an ‘information black hole’.

In a modern knowledge-based economy the effective management of knowledge offers a competitive advantage. The Semantic Web can enable this management.

3 LINKING OF DATA

Linked Data is about moving towards a Semantic Web in which related data is connected by virtue of its meaning. This requires no a priori structuring, as in portals; only embedded meaning. It can be achieved simply by adding tags from an agreed vocabulary. For example, a vocabulary for infectious diseases at www.biotags.org may unambiguously define the tag virus. Any web page about viruses can embed this tag, semantically linking it to all similarly tagged pages.

Page 2: NIC and the semantic web briefing paper

CERTUS TECHNOLOGY NHS NATIONAL INNOVATION CENTRE

THE SEMANTIC WEB 23 R D SEPTEMBER 2009

2

Each such knowledge resource is given a unique web identifier, or URI (uniform resource identifier). This allows different data sets to be created in full confidence that two independent sources are referring to exactly the same resource (for example the same virus) irrespective of their own terminology. In more advanced forms of linked data, knowledge is expressed in standard languages and conforms to standards for accessing information about those resources.

Linking Open Data is a government initiative for opening government knowledge resources in this linked and standardised way, encouraging a marketplace in web applications to exploit the resources and offer new services to fuel the knowledge economy and support innovation.

4 ONTOLOGIES

Shared knowledge requires more powerful representation mechanisms than simple vocabularies: adding semantics to vocabularies is the purpose of an ontology.

The term ontology was originally used in a branch of philosophy (metaphysics). In this discipline, entities are ordered according to their similarities and differences in categories (also known as concepts) with the resulting conceptual arrangements named ontologies. However, in philosophy categorisations are a prized end point; in Computer Science they are used to add meaning to both tags and to the relationships between tags.

For example an ontology might define a virus and an infectious disease, and relate these by asserting that a virus is a special kind of infectious disease.

5 REASONING

Ontologies can support reasoning. For example, if the description of an innovation that controls infectious disease includes tags from a suitable ontology, it is possible to automatically infer that it might control viruses.

Ontologies that support reasoning are primarily being developed within the natural sciences. Their development is complex and demanding, and only where real advantage can be gained through reasoning can the effort be justified.

6 JOINING THE SEMANTIC WEB

Moving from an infrastructure driven by software applications to one driven by semantic data offers substantial advantages. This is potentially of huge importance within the innovation sphere where, through shared knowledge, stakeholders with needs can more readily connect with innovators.

If Government data is tagged according to available ontologies the latent knowledge is freed for use across government and beyond. This is the vision for the Semantic Web: where the internet becomes true knowledge resource.