EDF2012 Karl Morteo - data.fao.org

Page 1: EDF2012   Karl Morteo - data.fao.org

#FAOdata

07/06/2012 data.fao.org - European Data Forum 2012 1

http://data.fao.org Why, What and How

Karl Morteo , Project Manager, IT Division, UN-FAO [email protected]

Page 2: EDF2012   Karl Morteo - data.fao.org

Food and Agriculture Organization of the United Nations

For a world without hunger

Page 3: EDF2012   Karl Morteo - data.fao.org

Topics

- Animal Production & Health
- Economics & Policy
- Education & Extension
- Engineering, Technology & Research
- Farming Practices & Systems
- Fisheries & Aquaculture
- Food Security
- Food safety & Human nutrition
- Forestry
- Natural Resources & Environment
- Plant Production & Protection
- Rural & Social Development
- Climate, Energy and Tenure
- Land and Water
- Agriculture

Page 4: EDF2012   Karl Morteo - data.fao.org

The Need

Recommendations from Evaluations, Review and Consultations

- Unifying the fragmented and “linear” information storage and dissemination in the area of the technical systems
- Consolidating the fragmented and duplicated Information Systems and Information Management environments presently distributed across many organizational units
- A top priority for the Information Technology Division is a consistent, integrated, authoritative and easy-to-access corporate data repository

Page 5: EDF2012   Karl Morteo - data.fao.org

Uniting Our Data

http://data.fao.org is a one-stop shop that aggregates, integrates, and catalogues data from multiple sources within the Organization. These entries cover topics related to nutrition, food and agriculture and include data such as statistics, maps, pictures, documents and more.

Page 6: EDF2012   Karl Morteo - data.fao.org

Overview of Deliverables

- Web Site: http://data.fao.org for data consumers, managers and information systems integrators
- Data Warehouse Repository (supporting statistics, maps and digital assets)
- Tools for data and metadata management including support for data hotels
- A platform for integration and exchange
- Integration of existing information systems
- Harmonisation and de-duplication of data
- Widgets to embed services in websites, including charts, maps and content
- Advocacy and communication
- Project management and administration

Page 7: EDF2012   Karl Morteo - data.fao.org

Mantra

1. Uniting our data – data.fao.org tagline
2. Engage not just disseminate (harness the crowd)
3. Mobile First – J. Boye – Morgan Stanley
4. Serve the data in the most convenient format for all users - Linked and Open Data
5. Eat your own dog food - http://en.wikipedia.org/wiki/Eating_your_own_dog_food

Page 8: EDF2012   Karl Morteo - data.fao.org

The Website

Should be the home screen

Page 9: EDF2012   Karl Morteo - data.fao.org

Website - Use cases

1. Browse and share
2. Find and use (e.g. journalists preparing reports)
3. Query and analyze
4. Just give me the data

Page 10: EDF2012   Karl Morteo - data.fao.org

Site structure

6 official languages, Login, Help

Full text search

Navigation by Topics, Concepts, Tags, Places, Catalogue

Structured search

Filter by object types

Page 11: EDF2012   Karl Morteo - data.fao.org

Statistics, maps, pictures, documents and more.

Page 12: EDF2012   Karl Morteo - data.fao.org

Search

Picture from 123RF

Page 13: EDF2012   Karl Morteo - data.fao.org

Navigate

Page 14: EDF2012   Karl Morteo - data.fao.org

Landing Page

Page 15: EDF2012   Karl Morteo - data.fao.org

Engage, not just disseminate

Data in the most convenient format for you

RDF, JSON, XML, CSV, Excel, RoamBi, GeoTIFF, WMS
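
As a rough illustration of serving one dataset in several of the formats listed above, here is a minimal sketch using only the Python standard library. The record fields and values (`country`, `year`, `value`) and the function names are invented for the example; they are not part of any data.fao.org API, and only three of the listed formats are covered.

```python
import csv
import io
import json
import xml.etree.ElementTree as ET

# Hypothetical sample records; field names and values are made up for illustration.
RECORDS = [
    {"country": "ARG", "year": 2010, "value": 1.5},
    {"country": "ITA", "year": 2010, "value": 2.0},
]

def to_json(records):
    """Serialise the records as a JSON array."""
    return json.dumps(records)

def to_csv(records):
    """Serialise the records as CSV with a header row."""
    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer, fieldnames=["country", "year", "value"], lineterminator="\n"
    )
    writer.writeheader()
    writer.writerows(records)
    return buffer.getvalue()

def to_xml(records):
    """Serialise the records as a simple XML document."""
    root = ET.Element("records")
    for record in records:
        item = ET.SubElement(root, "record")
        for key, value in record.items():
            ET.SubElement(item, key).text = str(value)
    return ET.tostring(root, encoding="unicode")

# One entry per supported format; adding a format means adding one function.
FORMATTERS = {"json": to_json, "csv": to_csv, "xml": to_xml}

def render(records, fmt):
    """Return the records in the requested format."""
    return FORMATTERS[fmt](records)
```

Keeping the formatters in a lookup table means the consumer picks the format while the data itself is stored once.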

Page 16: EDF2012   Karl Morteo - data.fao.org

Engage, not just disseminate

Login and share with popular Social Networks: Google+, Facebook, Twitter

Expertise at FAO: Curate, Create, Link, Tag, Share, Comment

Page 17: EDF2012   Karl Morteo - data.fao.org

Website – Statistical Data

Page 18: EDF2012   Karl Morteo - data.fao.org

Mobile First - Responsive Design

Page 19: EDF2012   Karl Morteo - data.fao.org

Data Loading

- 10 of 16 knowledge divisions
- 356,000,000 statistical values
- 734,000 geo layers
- 1,500,000 statistical maps
- 435 documents
- 90 pictures
- 40+ terabytes (2 stats, 8 maps online)
- 25 information systems

Page 20: EDF2012   Karl Morteo - data.fao.org

finding data will be this easy...

Release One

12 December 2012

For more information, visit us at data.fao.org

Picture from Goodfon

Page 21: EDF2012   Karl Morteo - data.fao.org

Guiding Principles

- Custodian only: providing the receptacle; the data ownership, rights and responsibilities stay with the contributor
- Will not replace existing thematic ‘brands’
- Open to all FAO contributors
- One-stop shop: single point of access and reference for all data

Page 22: EDF2012   Karl Morteo - data.fao.org

Data Flow and Integration

[Diagram: four data sources feeding the stages Ingest, Harmonise, Integrate, Enrich and Publish]

Page 23: EDF2012   Karl Morteo - data.fao.org

data.fao.org - The Big Picture

- Orchestration and Integration
- Content: Documents, Pictures, Video, Multimedia, Pages
- Statistics: Statistical Data Warehouse, Time Series, Indicators, Observations
- Maps: Geospatial, Raster, Vector, Point
- Catalogue: Identity, Metadata, Linked Data, ...
- Infrastructure: Logging, Caching, Security, Audit, ...
- Website
- Specialised application(s) (consume/provide)
- Search: Full text, Structured
- Services and Widgets

Page 24: EDF2012   Karl Morteo - data.fao.org

Catalogue, UUIDs and APIs

Every asset is registered in the Catalogue with a Universally Unique Identifier (UUID), with a corresponding URI (http://ref.data.fao.org/UUID). Assets can also be accessed and managed through an Application Programming Interface (API).

- UUID: 11ff9540-be95-46c6-b2bc-c5874d21d4d1
- URI: http://uri.fao.org/11ff9540-be95-46c6-b2bc-c5874d21d4d1
- API: http://api.data.fao.org/resources/images/countryprofiles/flag-arg
- Friendly-URL: http://data.fao.org/resources/images/countryprofiles/flag-arg
- Resource relationships: triple = subject + predicate + object
- Resource ownership & authorizations: workspaces, public/protected/private, users, roles
- Tags & Categories
- Search: structured, full-text, XQuery, SPARQL
- Localization
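
The relationship between a UUID, its URI and a subject-predicate-object triple can be sketched as follows. This is a minimal illustration, not the catalogue's implementation: the base `http://uri.fao.org/` is taken from the example URI on this slide, while the helper names, the predicate and the object URI are hypothetical.

```python
import uuid
from typing import NamedTuple

# Base taken from the URI shown on the slide; whether every asset resolves
# under this pattern is an assumption made for this sketch.
URI_BASE = "http://uri.fao.org/"

class Triple(NamedTuple):
    """A resource relationship: subject + predicate + object."""
    subject: str
    predicate: str
    object: str

def asset_uri(asset_id: str) -> str:
    """Validate a UUID string and return the corresponding URI."""
    parsed = uuid.UUID(asset_id)  # raises ValueError on malformed input
    return URI_BASE + str(parsed)

flag_uri = asset_uri("11ff9540-be95-46c6-b2bc-c5874d21d4d1")

# Hypothetical relationship; the predicate and object URIs are invented examples.
relation = Triple(
    subject=flag_uri,
    predicate="http://example.org/terms/depicts",
    object="http://example.org/countries/ARG",
)
```

Parsing the identifier with `uuid.UUID` before building the URI rejects malformed identifiers early instead of producing a URI that cannot resolve.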

Page 25: EDF2012   Karl Morteo - data.fao.org

Types of Metadata

Structural: typically derived from the physical data, its organization and/or structure. Examples include the dimensions of a data cube, the number of facts in a data set or the image file format.

Reference: describes the data and is the information used to understand it. Examples include title, author, subjects, keywords, publisher, method of collection and processing, data source and references.

Administrative: technical information regarding data storage and rights management, including ownership, permissions, visibility, security, rights, backup, audit and preservation.

Social: describes how the community consumes, uses and categorises the data. Examples include number of citations, download counts, user ratings, comments and tagging.

The cure for information overload is…more information!
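
One way to picture the four metadata types is as separate groups of fields attached to a single asset. The sketch below is an assumption about how such a record could be modelled; the class name, field keys and example values are hypothetical and do not reflect the actual data.fao.org schema.

```python
from dataclasses import dataclass, field

@dataclass
class AssetMetadata:
    """Groups an asset's metadata by the four types named above."""
    structural: dict = field(default_factory=dict)      # derived from the data itself
    reference: dict = field(default_factory=dict)       # describes the data
    administrative: dict = field(default_factory=dict)  # storage and rights
    social: dict = field(default_factory=dict)          # community usage

    def record_download(self) -> None:
        """Increment the social download counter."""
        self.social["downloads"] = self.social.get("downloads", 0) + 1

# Hypothetical example values, invented for illustration.
flag = AssetMetadata(
    structural={"format": "PNG"},
    reference={"title": "Flag of Argentina"},
    administrative={"visibility": "public"},
)
flag.record_download()
flag.record_download()
```

Keeping social metadata in its own group lets usage counters change frequently without touching the curated reference fields.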

Page 26: EDF2012   Karl Morteo - data.fao.org

Metadata Standards

Asset        Standards / Formats
Statistics   SDMX, DDI
Maps         OGC, ISO 19115
Documents    Dublin Core, MODS, FRBR
Pictures     IPTC, XMP
Tags         …
… more

Page 27: EDF2012   Karl Morteo - data.fao.org

AGROVOC

UN-FAO corporate, multilingual thesaurus of topics related to agriculture, forestry, fisheries, environment and more. Available in more than 20 languages, with an average of 40,000 terms. Published as Linked Data and aligned with twelve vocabularies. http://aims.fao.org/agrovoc/lod

Information Backbone

Linking inside and outside

Exact/Broader/Narrower/Related
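
The four link types above behave differently when followed in reverse: a "broader" link implies a "narrower" link in the other direction, while "exact" and "related" are symmetric. The sketch below illustrates that idea under the assumption that the links follow SKOS-style semantics, which the slide does not state; the concept identifiers are hypothetical.

```python
from enum import Enum

class LinkType(Enum):
    EXACT = "exact"
    BROADER = "broader"
    NARROWER = "narrower"
    RELATED = "related"

# Reading each link as (source, type, target): BROADER means the target is
# the broader concept, so its reverse is NARROWER; EXACT and RELATED are symmetric.
INVERSE = {
    LinkType.EXACT: LinkType.EXACT,
    LinkType.BROADER: LinkType.NARROWER,
    LinkType.NARROWER: LinkType.BROADER,
    LinkType.RELATED: LinkType.RELATED,
}

def with_inverses(links):
    """Return the given links plus the reverse link each one implies."""
    result = set(links)
    for source, link_type, target in links:
        result.add((target, INVERSE[link_type], source))
    return result

# Hypothetical concept identifiers, invented for illustration.
links = {
    ("agrovoc:maize", LinkType.BROADER, "agrovoc:cereals"),
    ("agrovoc:maize", LinkType.EXACT, "other:corn"),
}
closed = with_inverses(links)
```

The first link stays inside one vocabulary and the second crosses into another, mirroring "linking inside and outside".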

Page 28: EDF2012   Karl Morteo - data.fao.org

LOD2: Publink Consultation Program

- Experience, Guidance and Best Practices
- Linked Data Model to expose social metadata
- Alignment and LOD-ification
- Central Product Classification in RDF, integrated in the data catalogue of data.fao.org

Page 29: EDF2012   Karl Morteo - data.fao.org

UN-FAO geopolitical ontology. Available in 6+1 languages. Includes FAO, UN and internationally recognized data sources. Records historical changes.

Published as Linked Data, RDF and OWL http://www.fao.org/countryprofiles/geoinfo/geopolitical/resource/

FAO Geopolitical Ontology

Page 30: EDF2012   Karl Morteo - data.fao.org

FAO Geopolitical Ontology

- Names (short, official), codes, area types, group membership
- Supporting data: coordinates, land borders, currency names and codes, adjectives of nationality
- Basic statistical data: country area, land area, agricultural land area, population and GDP

Page 31: EDF2012   Karl Morteo - data.fao.org

APIs, Widgets and Embedded Content

http://review-data.fao.org/developers

What’s a widget?

A program that performs a simple function and can be embedded into a webpage (e.g. weather report or stock quote)

Page 32: EDF2012   Karl Morteo - data.fao.org

Widgets – Country Profiles

CountryProfiles = FAO site + Widgets

Page 33: EDF2012   Karl Morteo - data.fao.org

Country Information

Country Profiles www.fao.org

Country Briefs www.fao.org

FAO Representations websites (Sub-regional Office for the Pacific Islands)

Country Office Information Network coin.fao.org

International Plant Protection Convention Country Profiles www.ippc.int

Names of Countries http://termportal.fao.org/faonocs/main/start.do

Page 34: EDF2012   Karl Morteo - data.fao.org

Web Application Example - Key Water Indicator Portal

http://www.unwater.org/statistics_KWIP.html

Page 35: EDF2012   Karl Morteo - data.fao.org

Global Agro-Ecological Zones

Information at 3 input levels (high, intermediate, low) on:

- agro-ecological zones
- agro-climatically attainable yields
- yield constraints
- crop calendars
- agro-ecological suitability and productivity assessment
- potential production estimates
- actual yield and production
- yield and production gaps

Page 36: EDF2012   Karl Morteo - data.fao.org

Global Agro-Ecological Zones

THEMES: 5 thematic areas (> 300,000 global datasets, mainly at 5 arc-minutes, with core layers also at 30 arc-seconds)

CROPS: 11 crop groups, 49 crops, 92 crop types and 280 crop/LUTs; downscaling is available for 23 crops/commodities; yield and production gap analysis for 17 crops/commodities

WATER SUPPLY: 5 water supply types (rain-fed, irrigation, gravity, sprinkler, drip)

INPUT LEVELS: 4 input levels (High, Intermediate, Low, Mixed)

TIME PERIOD: Historical 1961-2000, 30-year average (1961-1990) and future (2020s, 2050s, 2080s)
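
To give a sense of scale for the two resolutions mentioned, the arithmetic below works out how many cells a raster has at each one. It assumes a grid covering the full globe (360 degrees of longitude by 180 degrees of latitude); the actual GAEZ layers may use a different extent, so the figures are indicative only.

```python
ARC_SECONDS_PER_DEGREE = 3600

def global_grid_shape(resolution_arc_seconds: int) -> tuple[int, int]:
    """Return (rows, columns) for a grid covering 180 x 360 degrees."""
    cells_per_degree = ARC_SECONDS_PER_DEGREE // resolution_arc_seconds
    return 180 * cells_per_degree, 360 * cells_per_degree

rows_5min, cols_5min = global_grid_shape(5 * 60)  # 5 arc-minutes
rows_30s, cols_30s = global_grid_shape(30)        # 30 arc-seconds

print(rows_5min, cols_5min, rows_5min * cols_5min)  # 2160 4320 9331200
print(rows_30s, cols_30s, rows_30s * cols_30s)      # 21600 43200 933120000
```

A 30 arc-second layer therefore holds 100 times as many cells as a 5 arc-minute layer, which fits with only the core layers being offered at the finer resolution.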

Page 37: EDF2012   Karl Morteo - data.fao.org

http://gaez.fao.org

Page 38: EDF2012   Karl Morteo - data.fao.org

data.fao.org - The Technology

- Orchestration and Integration
- Content: Documents, Pictures, Video, Multimedia, Pages
- Statistics: Statistical Data Warehouse, Time Series, Indicators, Observations
- Maps: Geospatial, Raster, Vector, Point
- Catalogue: Identity, Metadata, Linked Data, ...
- Infrastructure: Logging, Caching, Security, Audit, ...
- Website
- Specialised application(s) (consume/provide)
- Search: Full text, Structured
- Services and Widgets

Page 39: EDF2012   Karl Morteo - data.fao.org

Resource Catalog

Stores and their query languages:

- Triple Store (Ontotext): SPARQL
- XML-DB: XQuery
- Full-text Index (SOLR): Full-text query
- RDBMS (Postgres): SQL/JPA-QL

Business Layer (JEE)

Service Layer

- SOAP Web Service interface
- RESTful interface

Enterprise Service Bus (ESB)

- Other data.fao.org Subsystems
- External Applications
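
The layering above suggests that each kind of query is answered by a different store. A minimal sketch of such routing is shown below; the pairing of query language to store is read off the slide's layout and is an assumption, and the backend labels and function name are hypothetical rather than part of the real service layer.

```python
# Pairing of query language and backend is inferred from the slide's layout
# and is an assumption of this sketch; the labels are illustrative only.
BACKENDS = {
    "sparql": "triple-store",
    "xquery": "xml-db",
    "full-text": "solr-index",
    "sql": "postgres",
}

def route(query_language: str) -> str:
    """Return the name of the store that should answer this kind of query."""
    key = query_language.strip().lower()
    if key not in BACKENDS:
        raise ValueError(f"unsupported query language: {query_language!r}")
    return BACKENDS[key]
```

For example, `route("SPARQL")` returns `"triple-store"`, so a caller of the service layer does not need to know which store sits behind each query language.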

Page 40: EDF2012   Karl Morteo - data.fao.org

Technologies – Just 1 or 2 ;-)

Opscode – Chef, Red Hat – RHQ, Nagios Enterprises – Nagios, Jenkins-CI (open source) – Jenkins, Apache Maven, Apache – SVN, Atlassian – JIRA, Apache – JMeter, SourceForge (open source) – JUnit, Oracle – Java, Liferay – Liferay Portal, PostgreSQL – PostgreSQL, PostGIS, pgbouncer – pgbouncer, Pentaho – Pentaho BI, Analytical Labs – Saiku, Talend – Talend Open Studio, Red Hat/JBoss – Application Server, VMware – Server Virtualization, Exceliance – HAProxy, Red Hat – Enterprise Linux Server, GeoNetwork OpenSource – GeoNetwork, OpenGeo, GeoSolutions, Fedora Commons, Refractions Research – GeoServer, Ontotext – OWLIM, Red Hat/JBoss – JBoss AS / JEE, Alfresco – Activiti BPM Platform, Jasig – CAS Client, Batchwork Software – Doc2Doc, Google Code (open source) – ZXing ("Zebra Crossing"), Highsoft Solutions AS – Highcharts, Twitter – Bootstrap, Liferay – Alloy-UI, jQuery – jQuery UI, Geert de Deckere – Graph Up, and then there are all the SaaS products ...

Page 41: EDF2012   Karl Morteo - data.fao.org

Some Big Bits

8 Clustered and Load Balanced

Clustered and Load Balanced

~2TB

ESB and BPM

Page 42: EDF2012   Karl Morteo - data.fao.org

Infrastructure

- Fine-grained monitoring and alerts: RHQ + Nagios
- Backup: VM images and DB redundancy
- 14* physical servers, 40+ production VMs
- 100 TB SAN, 24-port 1 Gigabit switch
- DevOps: developers and sysadmins working together as one to deliver agile, dependable and responsive solutions

Changing the way we do IT

Page 43: EDF2012   Karl Morteo - data.fao.org

Build Lifecycle – Continuous Integration

Page 44: EDF2012   Karl Morteo - data.fao.org

Thank you for listening …

Questions?

Thank you & Questions