66
Open Source, Open Data Kirrily Robert Florida Linux Show, 2009

Open Source, Open Data

Embed Size (px)

DESCRIPTION

My presentation from Florida Linux Show 2009. Find out how open source's principles are being used outside of software, and how open source and open data can work together to change the world.

Citation preview

Page 1: Open Source, Open Data

Open Source, Open DataKirrily RobertFlorida Linux Show, 2009

Page 2: Open Source, Open Data
Page 3: Open Source, Open Data

From Open Source to Open Data

Page 4: Open Source, Open Data

1993

Me in 1993 My Linux desktop looked like this

Page 5: Open Source, Open Data

1993

• I started using Linux in 1993

• I was very excited by it, even though it was quite primitive at the time

• Other people thought I was a little crazy

Page 6: Open Source, Open Data

Image: Wikipedia Image: Engadget

1999

Google’s servers in 1999Jar Jar in 1999

Page 7: Open Source, Open Data

1999

• By 1999 Linux + open source was starting to take off

• Companies using and building services on Linux etc.

• We were calling it “Open Source” - a more marketable term for Free Software

Page 8: Open Source, Open Data

Four Software Freedomshttp://www.gnu.org/philosophy/free-sw.html

• Freedom to run the program

• Freedom to study the program and modify it for your own use

• Freedom to redistribute verbatim copies

• Freedom to improve the program, and release your improvements

Page 9: Open Source, Open Data

Free Culture

• A similar movement

• Make cultural works freely available

• Mostly over the Internet

Page 10: Open Source, Open Data

Free Culture

Page 11: Open Source, Open Data

Free Culture

Page 12: Open Source, Open Data

Free Culture

Page 13: Open Source, Open Data

Free Culturehttp://wiki.freeculture.org/Free_Culture_Definition

• Freedom to use the work

• Freedom to study the work and to apply knowledge acquired from it

• Freedom to make and redistribute copies

• Freedom to make changes and improvements, and to distribute derivative works

Page 14: Open Source, Open Data
Page 15: Open Source, Open Data

Image: masternewmedia.org

Page 16: Open Source, Open Data

What is Open Data?

Page 17: Open Source, Open Data

Data

Page 18: Open Source, Open Data

Image: himmelskratzer @ Flickr

Page 19: Open Source, Open Data

What is data?

• Ones and zeroes (obviously)

• But also filing cabinets, research archives, and other offline resources

• It’s not OPEN data unless you can get at it

Page 20: Open Source, Open Data

Open Data Freedoms

• Freedom to use the data

• Freedom to study the data and modify it for your own use

• Freedom to make and share verbatim copies

• Freedom to improve the data and redistribute the results

Page 21: Open Source, Open Data

Data availability

• Digital

• Online

• Well formatted

Page 22: Open Source, Open Data

Open Data Projects

Page 23: Open Source, Open Data

public.resource.org

• Created 2007 by Carl Malamud

• “Making Government Information More Accessible”

Page 24: Open Source, Open Data

public.resource.org

• SEC EDGAR records

• Patents database

• Copyright database

• Congressional records

• Legal decisions

• Fedflix

Page 25: Open Source, Open Data

Data.gov

• Founded 2008

• “Increase public access to high value, machine readable datasets generated by the Executive Branch of the Federal Government.”

Page 26: Open Source, Open Data

OpenStreetMap

Page 27: Open Source, Open Data

Compare...

Page 28: Open Source, Open Data

OpenStreetMap

Page 29: Open Source, Open Data

Open Library Project

Page 30: Open Source, Open Data

• CD data

• Tracks, artists, releases...

• CC license

Page 31: Open Source, Open Data

Flickr

• Images

• Metadata• tags, timestamps, geolocations, etc.

• Range of CC licenses and permissive TOS

Page 32: Open Source, Open Data

Infochimps

• Large data sets

• Various licenses

• Tools for transformation

Page 33: Open Source, Open Data

• Open data about “everything”

• 8.5m concepts

• CC-BY license

• API and data dumps

Page 34: Open Source, Open Data

2,416,683 books

Page 35: Open Source, Open Data

16,608 ships

Page 36: Open Source, Open Data

488 cheeses

Page 37: Open Source, Open Data

Structured data { "name": "Asiago cheese" "id": "/en/asiago_cheese", "region": [{ "id": "/en/asiago", "name": "Asiago", "type" : "/location/location"

}], "source_of_milk": [{ "id": "/en/cattle", "name": "Cow", "type" : "/biology/organism_classification" }] }

Page 38: Open Source, Open Data

Open Data Apps

Page 39: Open Source, Open Data
Page 40: Open Source, Open Data
Page 41: Open Source, Open Data

• Apps for America competition

• Open source and open data

• Round 1: various data sources

• Round 2: Data.gov

Page 42: Open Source, Open Data

Legistalker

Page 43: Open Source, Open Data

Filibusted

Page 44: Open Source, Open Data

Where the money goes

Page 45: Open Source, Open Data

Open Source for Open Data

Page 46: Open Source, Open Data

What can open source do?

Input

Processing

Output

Scrape

Munge

Visualise

Page 47: Open Source, Open Data

Scraping data

• APIs• XML, RSS, JSON...

• Downloadable data sets• XML, Excel, CSV, triple dumps...

• Beautiful Soup (Python)• http://www.crummy.com/software/

Page 48: Open Source, Open Data

Munging data

• Perl• http://perl.org/

• R (statistical analysis)• http://r-project.org/

• Hadoop (parallel data processing)• http://hadoop.apache.org/

Page 49: Open Source, Open Data

Visualisations

• MIT Simile• http://simile.mit.edu/

• Processing• http://processing.org/

Page 50: Open Source, Open Data
Page 51: Open Source, Open Data

http://itoworld.com

Page 52: Open Source, Open Data

Semantic Web

• Describe meaning, not markup

• Triples: subject, predicate, object

• Expression: RDF

Page 53: Open Source, Open Data

Linked Open Data

Page 54: Open Source, Open Data

Semantic web tools

• Triple stores• Sesame, BigData, Virtuoso...

• Libraries• RDFLib (Python), Redland RDf (librdf)...

Page 55: Open Source, Open Data

Freebase Acre

Page 56: Open Source, Open Data

Open source for open data

• Low barrier to entry

• Hooks in to Freebase data

• Share and clone apps

• Apps are BSD licensed

Page 57: Open Source, Open Data

FMDB

Page 58: Open Source, Open Data

Gendered names app

Page 59: Open Source, Open Data

Query editor

Page 60: Open Source, Open Data

Clone!

Page 61: Open Source, Open Data

http://freebase.com/developer

Page 62: Open Source, Open Data

Where next?

Page 63: Open Source, Open Data

Open Data: Issues

• License clarity

• Govt + Corporate acceptance

• Developer literacy

• What do we DO with it?

Page 64: Open Source, Open Data

What do we do with it?

Page 65: Open Source, Open Data

What do we do with it?

• 10 years ago we were asking the same questions of Open Source

• With Open Data, we are just starting to realise its potential

• Please join us!

Page 66: Open Source, Open Data

Keep in touch

• Email• [email protected]

• Freebase blog• http://blog.freebase.com/

• Twitter• @fbase