Online info2013 reconciliation

Preview:

DESCRIPTION

 

Citation preview

Reconciling ourselves to what's out there: how one dataset talks to another

Tony HirstDept of Computing and Communications,

The Open University UKO:I

I play with other people’s

data….

Clustering and Approximate

Matching

OpenRefine.org

Metaphone3 (soundalike)

metaphone( 'Epic Garments Limited’)EPKKRMNTSLMTT

metaphone( 'EPOCH GARMENTS LTD’)EPXKRMNTSLTT

Metaphone

Levenshtein (edit distance)

You know computers can do this anyway…

..it’s just that no-one’s told you how you can

do it on your computer with your data…

Reconcile your data

http://schoolofdata.org/2013/10/18/in-support-of-the-bangladeshi-garment-industries-data-expedition/

http://bit.ly/ScoDa-bg-reconcile

opencorporates.com

http://opencorporates.com/reconcile

cell.recon.match.name

cell.recon.match.id

In this way, we can make our data linkable…

Reconcile your data with what’s

out there

And why not have a go at

clustering too…?

Can you match your

data to itself?

O:iblog.ouseful.info

@psychemedia

Recommended