26
Kalampokis, Roberts, Karamanou, Tambouris, Tarabanis, Hermans Challenges on Developing Tools for Exploi=ng Linked Open Data Cubes

Challenges on Developing Tools for Exploiting Linked Open Data Cubes

Embed Size (px)

Citation preview

Page 1: Challenges on Developing Tools for Exploiting Linked Open Data Cubes

Kalampokis,  Roberts,  Karamanou,  Tambouris,  Tarabanis,  Hermans  

Challenges  on  Developing  Tools  for  Exploi=ng  Linked  Open  Data  

Cubes  

Page 2: Challenges on Developing Tools for Exploiting Linked Open Data Cubes

   

Bill  Roberts    

@billroberts  hCp://swirrl.com  

2  19  October  2014,  Riva  del  Garda,  Italy   ISWC  2014  –  SemStats  2014  

Page 3: Challenges on Developing Tools for Exploiting Linked Open Data Cubes

3  19  October  2014,  Riva  del  Garda,  Italy   ISWC  2014  –  SemStats  2014  

Page 4: Challenges on Developing Tools for Exploiting Linked Open Data Cubes

4  19  October  2014,  Riva  del  Garda,  Italy   ISWC  2014  –  SemStats  2014  

Page 5: Challenges on Developing Tools for Exploiting Linked Open Data Cubes

§ Central  Sta=s=cs  Office,  Ireland  § Department  for  Communi=es  and  Local  Government,  UK  § Sta=s=cs  Office  of  Flanders,  Belgium  

5  

Pilot  partners:  government  sta=s=cs  publishers  

19  October  2014,  Riva  del  Garda,  Italy   ISWC  2014  –  SemStats  2014  

Page 6: Challenges on Developing Tools for Exploiting Linked Open Data Cubes

     

hCp://www.opencube-­‐project.eu  hCp://www.opencube-­‐toolkit.eu  

6  19  October  2014,  Riva  del  Garda,  Italy   ISWC  2014  –  SemStats  2014  

Page 7: Challenges on Developing Tools for Exploiting Linked Open Data Cubes

 

7  

Publishing  Pla^orms  

19  October  2014,  Riva  del  Garda,  Italy   ISWC  2014  –  SemStats  2014  

Page 8: Challenges on Developing Tools for Exploiting Linked Open Data Cubes

§ hCp://opendatacommuni=es.org  § hCp://sta=s=csbeta.com  § hCp://data.hampshirehub.net  § hCp://data.surreycc.gov.uk  § hCp://gmdatastore.org.uk  

8  

Some  examples  of  PublishMyData  in  use  

19  October  2014,  Riva  del  Garda,  Italy   ISWC  2014  –  SemStats  2014  

Page 9: Challenges on Developing Tools for Exploiting Linked Open Data Cubes

§ Understanding  the  'shape'  of  the  data  § Selec=ng  the  slice(s)  you  want  § Viewing  it  easily  § Expor=ng  it  easily  § Accessing  via  API  (with/without  SPARQL)  § Combining  data  together  from  different  datasets,  different  publishers  § Knowing  whether  and  how  to  aggregate  it  

9  

Challenges  

19  October  2014,  Riva  del  Garda,  Italy   ISWC  2014  –  SemStats  2014  

Page 10: Challenges on Developing Tools for Exploiting Linked Open Data Cubes

§ Graher:    hCp://graher.org  § JSONstat2qb  § R2RML  

10  

Producing  RDF  Data  Cube  data  

19  October  2014,  Riva  del  Garda,  Italy   ISWC  2014  –  SemStats  2014  

Page 11: Challenges on Developing Tools for Exploiting Linked Open Data Cubes

§ Design  issues:  user  interfaces  and  user  experience  

§ Standardisa=on  issues:  describing  the  data  to  maximise  interoperability  

11  

Challenges  

19  October  2014,  Riva  del  Garda,  Italy   ISWC  2014  –  SemStats  2014  

Page 12: Challenges on Developing Tools for Exploiting Linked Open Data Cubes

§ Analysts  and  researchers  

§ Informa=on  seekers  

§ Developers  of  visualisa=ons  and  applica=ons  

12  

What  kind  of  users?  

19  October  2014,  Riva  del  Garda,  Italy   ISWC  2014  –  SemStats  2014  

Page 13: Challenges on Developing Tools for Exploiting Linked Open Data Cubes

13  

hCp://digitalpublishing.ons.gov.uk/2014/04/02/the-­‐persona-­‐touch/  

19  October  2014,  Riva  del  Garda,  Italy   ISWC  2014  –  SemStats  2014  

Page 14: Challenges on Developing Tools for Exploiting Linked Open Data Cubes

14  19  October  2014,  Riva  del  Garda,  Italy   ISWC  2014  –  SemStats  2014  

Page 15: Challenges on Developing Tools for Exploiting Linked Open Data Cubes

§ Possibly  lots  of  dimensions  § Possibly  long  lists  of  possible  values  § Possibly  'sparse'  cubes  § Ensuring  good  performance  of  tools  even  with  large  data  collec=ons  

15  

Understanding  the  shape  of  the  data  

19  October  2014,  Riva  del  Garda,  Italy   ISWC  2014  –  SemStats  2014  

Page 16: Challenges on Developing Tools for Exploiting Linked Open Data Cubes

16  19  October  2014,  Riva  del  Garda,  Italy   ISWC  2014  –  SemStats  2014  

Page 17: Challenges on Developing Tools for Exploiting Linked Open Data Cubes

§ CSV  and  Excel  downloads  of  extracts  § R  § Visualisa=on  libraries:  d3.js,  leaflet.js,  Google  maps/charts,  Tableau…  

17  

Play  nicely  with  other  tools  

19  October  2014,  Riva  del  Garda,  Italy   ISWC  2014  –  SemStats  2014  

Page 18: Challenges on Developing Tools for Exploiting Linked Open Data Cubes

§ ui:sortPriority  -­‐  other  op=ons?  § skos:broader  and  skos:narrower  are  not  enough  § è  XKOS  

§  Levels  § Knowing  whether  a  hierarchy  is  exhaus=ve  and  or  exclusive  § Hierarchies  change  over  =me  –  e.g.  administra=ve  geography  

18  

Ordering  and  hierarchy  of  code  lists  

19  October  2014,  Riva  del  Garda,  Italy   ISWC  2014  –  SemStats  2014  

Page 19: Challenges on Developing Tools for Exploiting Linked Open Data Cubes

19  19  October  2014,  Riva  del  Garda,  Italy   ISWC  2014  –  SemStats  2014  

Page 20: Challenges on Developing Tools for Exploiting Linked Open Data Cubes

§ Metadata  to  indicate  'aggregatability'  § XKOS  to  describe  hierarchies  

§ Combine  the  sta=s=cal  data  with  external  reference  and  structural  data  

§ Ra=os  –  link  to  numerator  and  denominator  observa=ons  

20  

Aggrega=on  

19  October  2014,  Riva  del  Garda,  Italy   ISWC  2014  –  SemStats  2014  

Page 21: Challenges on Developing Tools for Exploiting Linked Open Data Cubes

§ A  denser  network  of  interlinks  § BeCer  discovery  of  re-­‐usable  code  lists  and  ontologies  § Auto-­‐processing  of  equivalent  concepts  § Different  approaches  to  measure  proper=es:  

§ numberOfPeopleWithDemen=aInLondon  § numberOfPeopleWithDemen=a  (plus  refArea  =  London)  § numberOfPeople  (refArea=London,  condi=on=demen=a)  § number  (unitMeasure=People,  refArea=London,  condi=on=demen=a)  § obsValue  

§ Is  there  a  shortlist  of  standard  measures  that  would  be  useful?  

21  

Improving  interoperability  of  data  cubes  

19  October  2014,  Riva  del  Garda,  Italy   ISWC  2014  –  SemStats  2014  

Page 22: Challenges on Developing Tools for Exploiting Linked Open Data Cubes

§ Several  choices  for  represen=ng  measures  § Aggrega=on  § Hierarchical  code  lists  

§ Choices/paCerns  for  'where  to  put  the  seman=cs'  –  measure,  unit,  dimensions  

§ Recommend  use  of  XKOS?  § Metadata  for  aggregatability  (including  ra=os)  

22  

What's  missing  from  the  RDF  Data  Cube  vocabulary?  

19  October  2014,  Riva  del  Garda,  Italy   ISWC  2014  –  SemStats  2014  

Page 23: Challenges on Developing Tools for Exploiting Linked Open Data Cubes

23  19  October  2014,  Riva  del  Garda,  Italy   ISWC  2014  –  SemStats  2014  

Page 24: Challenges on Developing Tools for Exploiting Linked Open Data Cubes

24  19  October  2014,  Riva  del  Garda,  Italy   ISWC  2014  –  SemStats  2014  

Page 25: Challenges on Developing Tools for Exploiting Linked Open Data Cubes

§ We  s=ll  love  Linked  Data  and  RDF  Data  Cube!  § We've  persuaded  some  sta=s=cians  to  love  it  too  § Understand  the  audience  and  design  for  them  § Opportuni=es  for  improved  standardisa=on  and  guidance  

25  

Conclusion  

19  October  2014,  Riva  del  Garda,  Italy   ISWC  2014  –  SemStats  2014  

Page 26: Challenges on Developing Tools for Exploiting Linked Open Data Cubes

                               

Thanks!    

[email protected]  [email protected]  

26  19  October  2014,  Riva  del  Garda,  Italy   ISWC  2014  –  SemStats  2014