1
Standards Update Linked Data: What Is It and What Can It Do? Mark H. Needleman, Column Editor There has been a lot of talk, discussion (and even buzz) on the Internet and in the library world recently on the whole issue of linked data and what it may mean, accomplish, and even change in the Internet and library worlds. This article will attempt to clarify what linked data is, some of the issues surrounding it, and explore some possible implications. Serials Review 2011; 37:234. © 2011 Elsevier Inc. All rights reserved. Linked Data In a paper entitled Linked DataThe Story So Far,linked data is described as follows: The term Linked Data refers to a set of best practices for publishing and connecting structured data on the Web.1 The paper also outlines the principles of linked data design: Use URI to indentify things; Use HTTP URIs (uniform resource identifier) so that these things can be referred to and looked up (dereferenced) by people and user agents; Provide useful information about the thing when its URI is dereferenced, using standard formats such as RDF/XML; and Include links to related URIs in the exposed data to improve discovery of other related information on the Web. To put it simply, linked data is about using the Web to create typed links between data from different sources. Linked data refers to data published on the Web in such a way that it is machine- readable, its meaning is explicitly defined, it is linked to other external datasets, and can in turn be linked to from external datasets. There are detailed technical issues underpinning linked data that are too involved to go over in the space allowed here. However, descriptions of the technologies involved in linked data can be found in Linked DataThe Story So Far.To examine linked data from a library perspective, it should be noted that most (if not all) modern online public access catalogs (OPAC) provide some aspect of linked data in that record displays create links from such fields as author, subject headings and many others that the user can click on. However the links (usually) are only to searches in the OPAC and not to external resources. With a linked data implementation those links would provide the ability to access resources beyond what is available in the OPAC itself. In addition, the links tend to be uni-directionaloutbound from the OPAC, but basically to other data residing in the OPAC itself. A true linked data environment would provide the ability for external resources to provide inbound links to those fields in the OPAC record. Many interesting issues surround linked data, some of which are still research questions. Among them are: Link maintenanceThe content of linked data sources, like other URIs on the Web, can change. Keeping links valid is currently a challenge and will become more of one as the use of linked data grows. Mechanisms will be needed that can automatically check and update links. Such mechanisms exist today, but the question of whether they will be robust enough or whether new mechanisms will be needed is very much an open question; LicensingInevitably some data accessed through linked data mechanisms will have license restrictions attached to it. How the user interface will deal with data that are not accessible by a particular user is an issue for user interfaces like Web browsers; Trust, quality, and relevanceA significant consideration for linked data application is how to ensure the data most relevant or appropriate to the user's needs are identified and made available. For example, in scenarios where data quality and trustworthiness are paramount, how can this be determined heuristically, partic- ularly where the data set may not have been encountered previously?; and User interfaces and interface designOne of the key benefits of linked data from the user perspective is the provision of integrated access to data from a wide range of distributed and heterogeneous data sources. This may involve integration of data from sources not explicitly selected by the user. Challenges remain in understanding the user interaction paradigm for applications built on data assembled dynamically in this fashion. While many, if not all, of these issues and changes exist in today's Web environment, the development of linked data and applications that make use of it will increase these challenges in the future. Notes [1] Christian Bizer, Tom Heath and Tim Berners-Lee, Linked DataThe Story So Far,http://eprints.ecs.soton.ac.uk/21285/1/bizer-heath-berners-lee-ijswis- linked-data.pdf (accessed May 22, 2011). Resources Berners-Lee, Tim. Tim Berners-Lee on the Next Web.Presentation given at the 2009 TED Conference. http://www.ted.com/talks/tim_berners_lee_on_the_next_web.html (accessed May 22, 2011). Wikipedia, The Free Encyclopedia. Linked Data.http://en.wikipedia.org/wiki/ Linked_Data (accessed May 22, 2011). Needleman is SFX/Metalib Support Specialist, Florida Center for Library Automation, Gainesville, FL 32606, USA; e-mail: [email protected]. 0098-7913/$ see front matter © 2011 Elsevier Inc. All rights reserved. doi:10.1016/j.serrev.2011.06.005 234

Linked Data: What Is It and What Can It Do?

Embed Size (px)

Citation preview

Standards Update

Linked Data: What Is It and What Can It Do?

Mark H. Needleman, Column Editor

Needleman is SFX/MetalGainesville, FL 32606, US

0098-7913/$ – see frontdoi:10.1016/j.serrev.201

There has been a lot of talk, discussion (andevenbuzz) on the Internet and in the libraryworld recentlyon the whole issue of linked data and what it may mean, accomplish, and even change in the Internetand library worlds. This article will attempt to clarify what linked data is, some of the issuessurrounding it, and explore some possible implications. Serials Review 2011; 37:234.© 2011 Elsevier Inc. All rights reserved.

Linked Data

In a paper entitled “Linked Data—The Story So Far,” linked data isdescribed as follows: “The term Linked Data refers to a set of bestpractices for publishing and connecting structured data on theWeb.”1 The paper also outlines the principles of linked data design:

• Use URI to indentify things;• Use HTTP URIs (uniform resource identifier) so that these thingscan be referred to and looked up (“dereferenced”) by people anduser agents;

• Provide useful information about the thing when its URI isdereferenced, using standard formats such as RDF/XML; and

• Include links to related URIs in the exposed data to improvediscovery of other related information on the Web.

To put it simply, linked data is about using the Web to createtyped links between data from different sources. Linked data refersto data published on the Web in such a way that it is machine-readable, its meaning is explicitly defined, it is linked to otherexternal datasets, and can in turn be linked to from externaldatasets. There are detailed technical issues underpinning linkeddata that are too involved to go over in the space allowed here.However, descriptions of the technologies involved in linked datacan be found in “Linked Data—The Story So Far.”

To examine linked data from a library perspective, it should benoted that most (if not all) modern online public access catalogs(OPAC) provide some aspect of linked data in that record displayscreate links from such fields as author, subject headings and manyothers that theuser can clickon.However the links (usually) are onlyto searches in the OPAC and not to external resources. With a linkeddata implementation those links would provide the ability to accessresources beyond what is available in the OPAC itself. In addition,the links tend to be uni-directional—outbound from the OPAC, butbasically to other data residing in the OPAC itself. A true linked dataenvironment would provide the ability for external resources toprovide inbound links to those fields in the OPAC record.

ib Support Specialist, Florida Center for Library Automation,A; e-mail: [email protected].

matter © 2011 Elsevier Inc. All rights reserved.1.06.005

234

Many interesting issues surround linked data, some of whichare still research questions. Among them are:

• Link maintenance—The content of linked data sources, like otherURIs on the Web, can change. Keeping links valid is currently achallenge and will become more of one as the use of linked datagrows. Mechanisms will be needed that can automatically checkand update links. Suchmechanisms exist today, but the question ofwhether they will be robust enough or whether newmechanismswill be needed is very much an open question;

• Licensing—Inevitably some data accessed through linked datamechanisms will have license restrictions attached to it. How theuser interface will deal with data that are not accessible by aparticular user is an issue for user interfaces like Web browsers;

• Trust, quality, and relevance—A significant consideration for linkeddata application is how to ensure the data most relevant orappropriate to the user's needs are identified and made available.For example, in scenarios where data quality and trustworthinessare paramount, how can this be determined heuristically, partic-ularly where the data set may not have been encounteredpreviously?; and

• User interfaces and interface design—One of the key benefits oflinked data from the user perspective is the provision of integratedaccess to data from awide range of distributed and heterogeneousdata sources. Thismay involve integration of data from sources notexplicitly selected by theuser. Challenges remain inunderstandingthe user interaction paradigm for applications built on dataassembled dynamically in this fashion.

Whilemany, if not all, of these issues and changes exist in today'sWeb environment, the development of linked data and applicationsthat make use of it will increase these challenges in the future.

Notes

[1] Christian Bizer, Tom Heath and Tim Berners-Lee, “Linked Data—The Story SoFar,” http://eprints.ecs.soton.ac.uk/21285/1/bizer-heath-berners-lee-ijswis-linked-data.pdf (accessed May 22, 2011).

Resources

Berners-Lee, Tim. “Tim Berners-Lee on the NextWeb.” Presentation given at the 2009TED Conference. http://www.ted.com/talks/tim_berners_lee_on_the_next_web.html(accessed May 22, 2011).Wikipedia, The Free Encyclopedia. “Linked Data.” http://en.wikipedia.org/wiki/Linked_Data (accessed May 22, 2011).