
Linked Data and Public Administration


An English translation of the slide set about Linked Data and Public Administration


Page 1: Linked Data and Public Administration

Linked Data: Opportunities and Risks

Work distributed under the license Creative Commons Attribution-Noncommercial-Share Alike 3.0

Oscar Corcho, Boris Villazón-Terrazas, Asunción Gómez-Pérez

Facultad de Informática, Universidad Politécnica de Madrid

Campus de Montegancedo sn, 28660 Boadilla del Monte, Madrid

http://www.oeg-upm.net

{ocorcho,bvillazon,asun}@fi.upm.es

@ocorcho, @linkeddataspain

Acknowledgements: Luis M. Vilches, Victor Saquicela, Guillermo Alvaro Rey, Olaf Hartig, Juan Sequeda, and many others that we may have omitted.

Available at: http://www.slideshare.net/ocorcho/

Page 2: Linked Data and Public Administration

Public Administration and Linked Data:

Opportunities and Risks


Page 3: Linked Data and Public Administration

Content

• Setting the context of Open Government Data
  • Legal framework
  • Challenges, opportunities and limitations
• Linked Data
  • Background
  • Principles and technologies
  • Linked Open Data
• Linked Open Government Data
  • In the world and in Spain
• RD 1495/2011 revisited
• Take home message

Page 4: Linked Data and Public Administration

Setting the context of this talk…

• The dates for this workshop couldn’t be better…
  • BOE, Tuesday, November 8th 2011 (2 days ago!!)
  • http://boe.es/boe/dias/2011/11/08/pdfs/BOE-A-2011-17560.pdf
  • Real Decreto 1495/2011, de 24 de octubre, por el que se desarrolla la Ley 37/2007, de 16 de noviembre, sobre reutilización de la información del sector público, para el ámbito del sector público estatal (Royal Decree 1495/2011 of 24 October, implementing Law 37/2007 of 16 November on the reuse of public sector information, for the state public sector)
• Some questions that I would like to explore today…
  • What does it mean, in terms of cost and effort, for a public administration?
  • What are the societal and technological challenges associated with this?
  • What are the main opportunities for public administrations, companies and researchers?
  • How do I ensure that my data are used properly, and properly acknowledged?

Page 5: Linked Data and Public Administration

Legal framework and Open Data initiatives

• Open Access Initiative (2001)
  • Scientific information on the Web; > 510 organisations
• Aarhus Convention (1998)
  • Right of participation and access; 41 countries and the EU
• PSI Directive
  • Reuse of public sector information (PSI)
• Convention on Access to Official Documents (2009)
  • Signed by 12 countries
  • Belgium, Finland, Norway, Sweden, Hungary, Estonia, Lithuania, Slovenia, Georgia, Montenegro, Serbia and Macedonia
• Ley 37/2007. Reuse of PSI
• Ley 11/2007. Citizens' access to public services, and the right to quality of services
• RD 4/2010 Esquema Nacional de Interoperabilidad (National Interoperability Framework)
  • Open standards
  • Principle of technological neutrality
  • Open-source software
• RD 1495/2011 Implements Ley 37/2007

Adapted from: Antonio Rodríguez Pascual (IGN)

Page 6: Linked Data and Public Administration

Ley 37/2007 and RD 1495/2011: Reuse of Public Data

Page 7: Linked Data and Public Administration

Open Data and Open Government

• 8/11/2011 - http://www.deri.ie/about/open-data

Source: Antonio Rodríguez Pascual (IGN)

Page 8: Linked Data and Public Administration

How to publish data (on the Web)?

• 1) On a notice board
  • For those with a lot of free time
• 2) On a web page
  • For human users
• 3) In a file
  • To be loaded into an information system (XML, HTML, CSV, etc.)
  • With luck, it is not a scanned PDF
• 4) Through a web service
  • To be queried by information systems and by people
  • Makes it possible to build value-added services
  • Can be integrated into the logic of the user's application

Adapted from: Antonio Rodríguez Pascual (IGN)

Page 9: Linked Data and Public Administration

Classic Web

[Figure: the AEMET weather stations and INE production databases, with data exposed to the Web via HTML, PDF, etc.]

© Slide adapted from “5min Introduction to Linked Data”- Olaf Hartig

Page 10: Linked Data and Public Administration

Classic Web

Information from single pages can be found via search engines

© Slide adapted from “5min Introduction to Linked Data”- Olaf Hartig

Page 11: Linked Data and Public Administration

Classic Web. Limitations

• Data can be published in HTML pages, files (CSV, HTML, XML, etc.), or services
• Limitations
  • The data are not linked and are not always ready for the Web
  • The data must be downloaded before they can be consumed (which causes problems when they are very large)
  • The data are difficult to integrate, whether they come from the same institution or from different ones
    • Example: "Rioja, La" versus "La Rioja" in some fields
    • Example: INE code versus IGN code versus Catastro (land registry) codes
• A knowledge worker (journalist, politician, analyst, etc.):
  • Is there a correlation between how much it rained this year in Adeje, the number of tourists received and the evolution of the unemployment rate?
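The "Rioja, La" versus "La Rioja" mismatch is the kind of problem that a small normalisation step can reduce before two datasets are joined. The following is a minimal Python sketch, assuming names follow the catalogue convention "Name, Article"; the function name and the list of articles are illustrative and do not come from any official tool.

```python
# Hypothetical helper: turn catalogue-style names ("Rioja, La") into
# natural order ("La Rioja") so that records from different sources match.
ARTICLES = {"el", "la", "los", "las"}  # assumed list of Spanish articles


def normalise_name(name: str) -> str:
    """Move a trailing ", Article" to the front and collapse whitespace."""
    cleaned = " ".join(name.split())
    head, sep, tail = cleaned.rpartition(", ")
    if sep and tail.lower() in ARTICLES:
        return f"{tail} {head}"
    return cleaned


if __name__ == "__main__":
    print(normalise_name("Rioja, La"))
    print(normalise_name("La Rioja"))
```

String clean-up of this kind only works case by case. Publishing one shared identifier per region, which every dataset reuses, avoids the need for it.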

Page 12: Linked Data and Public Administration

Classic Web

Complex queries over multiple pages / data sources?

© Slide adapted from “5min Introduction to Linked Data”- Olaf Hartig

Page 13: Linked Data and Public Administration

What do we actually want?

• Use the Web like a single global database
  • Move from a Web of documents to a Web of Data
• Move from our data catalogues and “virtual offices” to a Web of Data

[Figure: INE Production database and AEMET Weather Stations]

© Slide adapted from “5min Introduction to Linked Data”- Olaf Hartig

Page 14: Linked Data and Public Administration

Examples with real data…

• How many reservoirs are there in Spain?

  SELECT (COUNT(DISTINCT ?x) AS ?count)
  WHERE { ?x a <http://geo.linkeddata.es/ontology/Embalse> }

  Result: 1644

• What was unemployment in the province of Madrid in 2003?

  PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
  PREFIX geoes: <http://geo.linkeddata.es/ontology/>
  PREFIX geoesres: <http://geo.linkeddata.es/resource/Provincia/>
  PREFIX scv: <http://purl.org/NET/scovo#>
  SELECT ?value
  WHERE {
    ?i a scv:Item .
    ?i scv:dimension geoesres:Madrid .
    ?i rdf:value ?value .
    ?i scv:dimension <http://geo.linkeddata.es/resource/A%C3%B1o/2003> .
  }

  Result: 40765514
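Queries like these are usually sent to a SPARQL endpoint over HTTP. The SPARQL Protocol allows a query to travel in the `query` parameter of a GET request. The sketch below only builds that URL and sends nothing. The endpoint address is an assumption made for illustration, since the slides do not name one.

```python
from urllib.parse import urlencode

# Assumed endpoint URL, for illustration only; the slides do not name one.
ENDPOINT = "http://geo.linkeddata.es/sparql"

QUERY = """
SELECT (COUNT(DISTINCT ?x) AS ?count)
WHERE { ?x a <http://geo.linkeddata.es/ontology/Embalse> }
"""


def build_query_url(endpoint: str, query: str) -> str:
    """Return a SPARQL Protocol GET URL carrying the query in ?query=."""
    return endpoint + "?" + urlencode({"query": query.strip()})


if __name__ == "__main__":
    print(build_query_url(ENDPOINT, QUERY))
```

To run the query, the URL could be opened with `urllib.request.urlopen`, typically with an `Accept` header such as `application/sparql-results+json`, provided the endpoint supports that format.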

Page 15: Linked Data and Public Administration

Content

• Setting the context of Open Government Data
  • Legal framework
  • Challenges, opportunities and limitations
• Linked Data
  • Background
  • Principles and technologies
  • Linked Open Data
• Linked Open Government Data
  • In the world and in Spain
• RD 1495/2011 revisited
• Take home message

Page 16: Linked Data and Public Administration

Linked Data enables such Web of Data

• Global Identifier: URI (Uniform Resource Identifier), which is a string of characters used to identify a name or a resource on the Internet
  • http://datos.aemet.../Adeje
  • http://datos.ine.es/…./Adeje
• Data Model: RDF (Resource Description Framework), which is a standard model for data interchange on the Web
• Access Mechanism: HTTP
• Connection: Typed Links

[Figure: the INE Production Database and the AEMET Weather Stations connected as one graph. Edge labels: http://.../hasMeasurement, http://.../name, http://.../hasWeatherStation. Literal values: 34 and “Desempleo en Adeje” (unemployment in Adeje)]

© Slide adapted from “5min Introduction to Linked Data”- Olaf Hartig

Page 17: Linked Data and Public Administration

What is the Web of Linked Data?

• An extension of the current Web in which data are published according to four principles (as a best practice)
  • http://www.w3.org/DesignIssues/LinkedData.html
• URIs are used to refer to things (weather station, observation, point of interest, reservoir, etc.)
  • http://aemet.linkeddata.es/resource/WeatherStation/id08363
  • http://geo.linkeddata.es/resource/Embalse/Burguillo%2C%20Embalse%20del
• The HTTP protocol is used to access the information behind the URIs
• When data are obtained from a URI, or through a query language (SPARQL), they come in a standard format (RDF)
• Links to other URIs are included

http://www.ted.com/talks/tim_berners_lee_on_the_next_web.html
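The second and third principles (HTTP access, RDF as the returned format) are commonly combined through content negotiation: the client asks for an RDF media type in the `Accept` header. The sketch below prepares such a request with Python's standard library without sending it. Whether this particular server offers `text/turtle` is an assumption; `application/rdf+xml` is another widely used choice.

```python
from urllib.request import Request

# URI taken from the talk; the media type below is an assumed preference.
STATION_URI = "http://aemet.linkeddata.es/resource/WeatherStation/id08363"


def rdf_request(uri: str, media_type: str = "text/turtle") -> Request:
    """Prepare (but do not send) a GET request that asks for RDF."""
    return Request(uri, headers={"Accept": media_type})


if __name__ == "__main__":
    req = rdf_request(STATION_URI)
    print(req.get_method(), req.full_url, req.get_header("Accept"))
```

Passing the prepared request to `urllib.request.urlopen` would perform the lookup. A browser sending `Accept: text/html` to the same URI would typically receive a human-readable page instead.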

Page 18: Linked Data and Public Administration

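RDF and RDF Schema

• W3C recommendations

[Table: Database, XML and RDF(S) compared at the schema level and the data level. In the RDF(S) column, the schema level is RDF Schema and the data level is RDF.]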

Page 19: Linked Data and Public Administration

RDF – Resource Description Framework

• RDF is a basic language to express data and metadata
• Statements are represented as triples, consisting of a subject, predicate, and object [S,P,O]

[Figure: a statement drawn as Subject, property, Object, followed by an example RDF graph. Nodes: ign:LaLaguna, ign:Adeje, ign:SantaCruzdeTenerife and the literals 1.027.914 and “San Cristobal de la Laguna”. Edge labels: geo:formaParteDe (twice), dbpedia:población, rdfs:label]
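The triple model can be sketched in a few lines of Python. The two geo:formaParteDe statements below follow from the query result given on the SPARQL slide. Attaching the rdfs:label to ign:LaLaguna is an assumption read off the figure, and the `Triple` class and `to_statement` function are illustrative names only.

```python
from typing import NamedTuple


class Triple(NamedTuple):
    """One RDF statement: subject, predicate, object."""
    subject: str
    predicate: str
    obj: str


# Edges from the slide's example; the rdfs:label subject is an assumption.
GRAPH = [
    Triple("ign:LaLaguna", "geo:formaParteDe", "ign:SantaCruzdeTenerife"),
    Triple("ign:Adeje", "geo:formaParteDe", "ign:SantaCruzdeTenerife"),
    Triple("ign:LaLaguna", "rdfs:label", '"San Cristobal de la Laguna"'),
]


def to_statement(triple: Triple) -> str:
    """Render a triple as one Turtle-style line: subject predicate object ."""
    return f"{triple.subject} {triple.predicate} {triple.obj} ."


if __name__ == "__main__":
    for triple in GRAPH:
        print(to_statement(triple))
```

A real application would more likely use an RDF library than plain tuples; the point here is only that every statement has the same three-part shape.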

Page 20: Linked Data and Public Administration

SPARQL

• Query: “Tell me the municipalities that belong to the province of Santa Cruz de Tenerife”

  SELECT ?s
  WHERE { ?s geo:formaParteDe ign:SantaCruzdeTenerife . }

• Result: ign:LaLaguna and ign:Adeje

[Figure: the query pattern ?s, geo:formaParteDe, ign:SantaCruzdeTenerife matched against the example RDF graph]
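Conceptually, the query is a triple with one position left open, matched against every statement in the graph. The toy matcher below illustrates that idea in Python. It is a sketch rather than a SPARQL engine, the `match` function is a hypothetical name, and the rdfs:label statement is an assumption about the figure.

```python
# Toy graph mirroring the slide's example (prefixed names kept as strings).
GRAPH = [
    ("ign:LaLaguna", "geo:formaParteDe", "ign:SantaCruzdeTenerife"),
    ("ign:Adeje", "geo:formaParteDe", "ign:SantaCruzdeTenerife"),
    ("ign:LaLaguna", "rdfs:label", "San Cristobal de la Laguna"),
]


def match(graph, s=None, p=None, o=None):
    """Yield triples matching the pattern; None plays the role of a variable."""
    for triple in graph:
        if all(want is None or want == got
               for want, got in zip((s, p, o), triple)):
            yield triple


if __name__ == "__main__":
    # SELECT ?s WHERE { ?s geo:formaParteDe ign:SantaCruzdeTenerife . }
    hits = match(GRAPH, p="geo:formaParteDe", o="ign:SantaCruzdeTenerife")
    print([subject for subject, _, _ in hits])
```

Leaving a different position as `None` answers a different question, for example `match(GRAPH, s="ign:LaLaguna")` lists everything stated about La Laguna.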

Page 21: Linked Data and Public Administration

Linked Open Data evolution

[Figure: Linked Open Data snapshots labelled 2007, 2008, 2009 and 2010]

Page 22: Linked Data and Public Administration

Linked Open Data

[Figure: Linking Open Data cloud diagram, 2011]

Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/

Page 23: Linked Data and Public Administration

Linked Open Data – Some Spanish Datasets


Page 24: Linked Data and Public Administration

So does that mean I have to publish my data as Linked Data, now?

• But, why?
• What was your incentive to publish an HTML page in 1990?
  • Share data in documents and because your neighbor was doing it
• So, why should we publish Linked Data in 2011?
  • Share data as data and because your neighbor is doing it
  • Because my Government tells me to do it

© Slide adapted from “Introduction to Linked Data”- Juan Sequeda

Page 25: Linked Data and Public Administration

Content

• Setting the context of Open Government Data
  • Legal framework
  • Challenges, opportunities and limitations
• Linked Data
  • Background
  • Principles and technologies
  • Linked Open Data
• Linked Open Government Data
  • In the world and in Spain
• RD 1495/2011 revisited
• Take home message

Page 26: Linked Data and Public Administration

Open Government Initiatives

• W3C eGovernment Activity
  • Improving Access to Government through Better Use of the Web
  • Publishing Open Government Data
  • W3C Government Linked Data WG
• Open Knowledge Foundation
  • Open Data Manual
• 5-star deployment scheme for Linked Open Government Data

Page 27: Linked Data and Public Administration

Open Government. USA and UK

[Figure: two approaches, labelled TOP-DOWN and BOTTOM-UP]

Page 28: Linked Data and Public Administration

Linked Data Mashup (data.gov)

• Clean Air Status and Trends (CASTNET)
  • http://data-gov.tw.rpi.edu/demo/exhibit/demo-8-castnet.php

Page 29: Linked Data and Public Administration

Linked Data Mashup (data.gov.uk)

• Research Funding Explorer
  • http://bis.clients.talis.com/

Page 30: Linked Data and Public Administration

Linked Data in the UK

• Education
  • http://education.data.gov.uk/id/school/106661
• Parliament
  • http://parliament.psi.enakting.org/id/member/1227
• Maps
  • E.g., London: http://data.ordnancesurvey.co.uk/id/7000000000041428
  • http://map.psi.enakting.org
• Transport
  • http://www.dft.gov.uk/naptan/
• SameAs service
  • http://www.sameas.org
• Challenges
  • http://gov.tso.co.uk/openup/sparql/gov-transport

Page 31: Linked Data and Public Administration

[Linked] Open Data in Spain

Source: Carlos de la Fuente (CTIC)

Page 32: Linked Data and Public Administration

[Linked] Open Data in Spain. Transparency

Source: Carlos de la Fuente (CTIC)

Page 33: Linked Data and Public Administration

http://geo.linkeddata.es

1. Specification
2. Modelling
3. Generation
4. Publication & Exploitation

Page 34: Linked Data and Public Administration

http://cultura.linkeddata.es/visualizer

1. Specification
2. Modelling
3. Generation
4. Publication & Exploitation

MARC 21 XML records

Page 35: Linked Data and Public Administration

http://aemet.linkeddata.es/visualizer

1. Specification
2. Modelling
3. Generation
4. Publication & Exploitation

Python scripts

250 weather stations (pressure, humidity, etc.)

Data from the stations in CSV files on an FTP server
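The generation step for this dataset turns station CSV files into RDF. The sketch below shows what such a conversion could look like. It is not the project's actual script: the column names, the sample row and the property namespace are all made up for illustration, and only the station URI prefix comes from the talk.

```python
import csv
import io

STATION_BASE = "http://aemet.linkeddata.es/resource/WeatherStation/"
# Hypothetical property namespace; the real AEMET ontology terms may differ.
PROP_BASE = "http://example.org/aemet#"

# Made-up sample standing in for one CSV file from the FTP server.
SAMPLE_CSV = "station_id,pressure,humidity\nid08363,1013.2,45\n"


def csv_to_ntriples(text: str) -> list:
    """Emit one N-Triples line per (station, measurement column) pair."""
    lines = []
    for row in csv.DictReader(io.StringIO(text)):
        subject = f"<{STATION_BASE}{row['station_id']}>"
        for column, value in row.items():
            if column == "station_id":
                continue  # the identifier becomes the subject, not a value
            lines.append(f'{subject} <{PROP_BASE}{column}> "{value}" .')
    return lines


if __name__ == "__main__":
    print("\n".join(csv_to_ntriples(SAMPLE_CSV)))
```

A production pipeline would also attach datatypes, units and timestamps to each observation, which this sketch leaves out.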

Page 36: Linked Data and Public Administration

http://webenemasuno.linkeddata.es

1. Specification
2. Modelling
3. Generation
4. Publication & Exploitation

Scenario in the context of tourism and travelling, where the content is aggregated from different platforms.

Heterogeneous content (images, travel guides, posts, videos, news)

Page 37: Linked Data and Public Administration

Content

• Setting the context of Open Government Data
  • Legal framework
  • Challenges, opportunities and limitations
• Linked Data
  • Background
  • Principles and technologies
  • Linked Open Data
• Linked Open Government Data
  • In the world and in Spain
• RD 1495/2011 revisited
• Take home message

Page 38: Linked Data and Public Administration

Let’s analyse the RD 1495/2011. Documents

General metadata of the document
• dc:title
• dc:author
• dc:description
• …

Data of the document

Page 39: Linked Data and Public Administration

Identifiers. Addendum to RD 4/2010

http://www.cabinetoffice.gov.uk/media/301253/public_sector_uri.pdf
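The resource URIs shown earlier in the talk appear to follow a base/type/label pattern with percent-encoded labels. The sketch below reproduces that pattern in Python. The pattern is inferred from those example URIs, not taken from the decree or from the guide linked above, and `mint_uri` is a hypothetical helper name.

```python
from urllib.parse import quote

BASE = "http://geo.linkeddata.es/resource"


def mint_uri(resource_type: str, label: str) -> str:
    """Build base/type/label, percent-encoding every reserved character."""
    return f"{BASE}/{quote(resource_type, safe='')}/{quote(label, safe='')}"


if __name__ == "__main__":
    print(mint_uri("Embalse", "Burguillo, Embalse del"))
    print(mint_uri("Año", "2003"))
```

Encoding with `safe=''` keeps commas, spaces and non-ASCII letters out of the raw URI, so the same label always yields the same identifier.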

Page 40: Linked Data and Public Administration

Licences

Page 41: Linked Data and Public Administration

Other elements


Page 42: Linked Data and Public Administration

Content

• Setting the context of Open Government Data
  • Legal framework
  • Challenges, opportunities and limitations
• Linked Data
  • Background
  • Principles and technologies
  • Linked Open Data
• Linked Open Government Data
  • In the world and in Spain
• RD 1495/2011 revisited
• Take home message

Page 43: Linked Data and Public Administration

Take home message

• Opening data is an opportunity…
  • To increase interoperability inside and outside your organisation
  • To increase transparency
  • To increase productivity, avoiding efforts and costs for your companies to make use of your data
  • To boost creativity among your citizens and companies
• Open data is a must…
  • Laws are now enforcing it and more and more organisations are joining this club
• Linked Open Government Data is one of the best options to open your data
  • Standardised formats
  • Ease of use for developers (infomediaries)
  • It does not replace what you have. It adds to it

Page 44: Linked Data and Public Administration

Take home message

• Open data has important risks…
  • Especially if you don’t do it ;-)
  • Your administration will continue to be expensive
  • Your citizens will start requesting open data
  • Your businesses will not grow and be competitive

Page 45: Linked Data and Public Administration

Public Administration and Linked Data:

Opportunities and Risks


Page 46: Linked Data and Public Administration

(@linkeddataspain, http://red.linkeddata.es/)

• Facilitate the exchange and transfer of knowledge
• Increase the international visibility of Spanish research
• Increase internal cohesion and explore synergies (more than 300 people)
  • Apply for new projects
  • Join efforts in ongoing projects
  • Evangelise industry, public administrations and other research groups
• Installation and maintenance of infrastructure
  • Mailing lists (https://listas.fi.upm.es/mailman/listinfo/redlinkeddata), website, blog, data repositories and hosting (linkeddata.es), software and teaching material
• Creation of training itineraries
• Promotion of researcher mobility
• Organisation of events
  • Plenary meetings and workshops
  • Training workshops and courses
  • Thematic VoCamps, Linked Data meetups and working breakfasts

Page 47: Linked Data and Public Administration


Asociación Española de Linked Data