#FAOdata
07/06/2012 data.fao.org - European Data Forum 2012 1
http://data.fao.org Why, What and How
Karl Morteo, Project Manager, IT Division, UN-FAO [email protected]
Food and Agriculture Organization of the United Nations
For a world without hunger
Topics
- Animal Production & Health
- Economics & Policy
- Education & Extension
- Engineering, Technology & Research
- Farming Practices & Systems
- Fisheries & Aquaculture
- Food Security
- Food safety & Human nutrition
- Forestry
- Natural Resources & Environment
- Plant Production & Protection
- Rural & Social Development
- Climate, Energy and Tenure
- Land and Water
- Agriculture
The Need
Recommendations from Evaluations, Review and Consultations:
- Unifying the fragmented and “linear” information storage and dissemination in the area of the technical systems
- Consolidating the fragmented and duplicated Information Systems and Information Management environments presently distributed across many organizational units
- A top priority for the Information Technology Division is a Consistent, Integrated, Authoritative and Easy to Access Corporate Data Repository
Uniting Our Data
http://data.fao.org is a one-stop shop that aggregates, integrates, and catalogues data from multiple sources within the Organization. These entries cover topics related to nutrition, food and agriculture and include data such as statistics, maps, pictures, documents and more.
Overview of Deliverables
- Web Site: http://data.fao.org for data consumers, managers and information systems integrators
- Data Warehouse Repository (supporting statistics, maps and digital assets)
- Tools for data and metadata management including support for data hotels
- A platform for integration and exchange
- Integration of existing information systems
- Harmonisation and de-duplication of data
- Widgets to embed services in websites, including charts, maps and content
- Advocacy and communication
- Project management and administration
Mantra
1. Uniting our data – data.fao.org tagline
2. Engage, not just disseminate (harness the crowd)
3. Mobile First – J. Boye – Morgan Stanley
4. Serve the data in the most convenient format for all users - Linked and Open Data
5. Eat your own dog food - http://en.wikipedia.org/wiki/Eating_your_own_dog_food
The Website
Should be the home screen
Website - Use cases
1. Browse and share
2. Find and use (e.g. journalists preparing reports)
3. Query and analyze
4. Just give me the data
Site structure
6 official languages, Login, Help
Full text search
Navigation by Topics, Concepts, Tags, Places, Catalogue
Structured search
Filter by object types
Statistics, maps, pictures, documents and more.
Search
Picture from 123RF
Navigate
Landing Page
Engage, not just disseminate
Data in the most convenient format for you
RDF, JSON, XML, CSV, Excel, RoamBi, GeoTIFF, WMS
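Serving one resource in several formats usually comes down to mapping a requested format name onto a media type. The sketch below is illustrative only: the table and the function name `mime_type_for` are assumptions, not part of any data.fao.org API, and RoamBi and WMS are left out because they are a product format and a service protocol rather than a single media type.

```python
# Hypothetical lookup table: format name (as listed on the slide) -> MIME type.
FORMAT_MIME_TYPES = {
    "rdf": "application/rdf+xml",
    "json": "application/json",
    "xml": "application/xml",
    "csv": "text/csv",
    "excel": "application/vnd.ms-excel",
    "geotiff": "image/tiff",
}


def mime_type_for(fmt: str, default: str = "application/json") -> str:
    """Return the MIME type for a format name, falling back to a default."""
    return FORMAT_MIME_TYPES.get(fmt.strip().lower(), default)


print(mime_type_for("CSV"))
```

A lookup with a default keeps unknown format requests from failing outright; a stricter service might reject them instead.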
Engage, not just disseminate
Login and share with popular Social Networks: Google+ Facebook Twitter
Expertise at FAO: Curate Create Link, Tag, Share Comment
Website – Statistical Data
Mobile First - Responsive Design
Data Loading
10 of 16 knowledge divisions
356,000,000 Statistical values
734,000 Geo Layers
1,500,000 Statistical Maps
435 Documents
90 Pictures
40+ Terabytes (2 Stats, 8 Maps online)
25 Information Systems
finding data will be this easy...
Release One
12 December 2012
For more information, visit us at data.fao.org
Picture from Goodfon
Guiding Principles
- Custodian only: providing the receptacle; the data ownership, rights and responsibilities stay with the contributor
- Will not replace existing thematic ‘brands’
- Open to all FAO contributors
- One-stop shop: single point of access and reference for all data
Data Flow and Integration
[Diagram labels: Ingest, Harmonise, Integrate, Enrich, Publish; four boxes labelled Data Source]
data.fao.org - The Big Picture
- Orchestration and Integration
- Content: Documents, Pictures, Video, Multimedia, Pages
- Statistics: Statistical Data Warehouse, Time Series, Indicators, Observations
- Maps: Geospatial, Raster, Vector, Point
- Catalogue: Identity, Metadata, Linked Data, ...
- Infrastructure: Logging, Caching, Security, Audit, ...
- Website
- Specialised application(s): consume/provide
- Search: Full text, Structured
- Services and Widgets
Catalogue, UUIDs and APIs
Every asset is registered in the Catalogue with a Universally Unique Identifier (UUID).
UUID: 11ff9540-be95-46c6-b2bc-c5874d21d4d1
With a corresponding URI: http://ref.data.fao.org/UUID
Assets can also be accessed and managed through an Application Programming Interface (API):
http://api.data.fao.org/resources/images/countryprofiles/flag-arg
URI: http://uri.fao.org/11ff9540-be95-46c6-b2bc-c5874d21d4d1
Friendly-URL: http://data.fao.org/resources/images/countryprofiles/flag-arg
- Resource relationships: triple = subject + predicate + object
- Resource ownership & authorizations: workspaces, public/protected/private, users, roles
- Tags & Categories
- Search: structured, full-text, XQuery, SPARQL
- Localization
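As a rough illustration of the identifier scheme, the sketch below validates a UUID, derives a URI from it using the `http://uri.fao.org/` prefix shown on the slide, and represents one resource relationship as a subject/predicate/object triple. The names `asset_uri` and `Triple` are assumptions for this example, and the predicate and object values are hypothetical, not actual catalogue data.

```python
import uuid
from typing import NamedTuple

URI_BASE = "http://uri.fao.org/"  # prefix taken from the slide's example URI


class Triple(NamedTuple):
    """A resource relationship: subject + predicate + object."""
    subject: str
    predicate: str
    object: str


def asset_uri(asset_id: str) -> str:
    """Build the URI for an asset; raises ValueError if the UUID is malformed."""
    parsed = uuid.UUID(asset_id)
    return URI_BASE + str(parsed)


flag_uri = asset_uri("11ff9540-be95-46c6-b2bc-c5874d21d4d1")

# Hypothetical relationship: the predicate is a Dublin Core term and the
# object is a placeholder URI; neither is taken from the real catalogue.
relation = Triple(
    subject=flag_uri,
    predicate="http://purl.org/dc/terms/subject",
    object="http://example.org/country/ARG",
)
print(relation)
```

Parsing the identifier with `uuid.UUID` before building the URI rejects malformed values early, which is one simple way to keep the catalogue's keys consistent.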
Types of Metadata
The cure for information overload is… more information!
- Structural: typically derived from the physical data, organization and/or structure. Examples include the dimensions of a data cube, the number of facts in a data set or the image file format.
- Reference: describes the data and is the information used to understand the data. Examples include title, author, subjects, keywords, publisher, method of collection and processing, data source and references.
- Administrative: technical information regarding data storage and rights management, including ownership, permissions, visibility, security, rights, backup, audit, and preservation.
- Social: describes how the community consumes, uses and categorises the data. Examples include number of citations, download counts, user ratings, comments and tagging.
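One way to picture the four metadata types is as separate sections of a single record per asset. This is a minimal sketch under that assumption: the class `AssetMetadata`, its field names and the sample values are hypothetical and do not describe the actual data.fao.org schema.

```python
from dataclasses import dataclass, field


@dataclass
class AssetMetadata:
    """Hypothetical record grouping metadata by the four types on the slide."""
    structural: dict = field(default_factory=dict)      # e.g. file format, cube dimensions
    reference: dict = field(default_factory=dict)       # e.g. title, author, keywords
    administrative: dict = field(default_factory=dict)  # e.g. ownership, visibility
    social: dict = field(default_factory=dict)          # e.g. downloads, ratings, tags

    def record_download(self) -> int:
        """Increment and return the social download counter."""
        self.social["download_count"] = self.social.get("download_count", 0) + 1
        return self.social["download_count"]


# Sample values are invented for illustration only.
example = AssetMetadata(
    structural={"file_format": "PNG"},
    reference={"title": "Sample image", "keywords": ["sample"]},
    administrative={"visibility": "public"},
)
print(example.record_download())
```

Keeping social metadata in its own section reflects that it changes with use, while structural and reference metadata are comparatively stable.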
Metadata Standards
Asset        Standards / Formats
Statistics   SDMX, DDI
Maps         OGC, ISO 19115
Documents    Dublin Core, MODS, FRBR
Pictures     IPTC, XMP
Tags         …
… more
AGROVOC
UN-FAO corporate, multilingual thesaurus of topics related to agriculture, forestry, fisheries, environment and more. Available in more than 20 languages, with an average of 40,000 terms. Published as Linked Data and aligned with twelve vocabularies. http://aims.fao.org/agrovoc/lod
Information Backbone
Linking inside and outside
Exact/Broader/Narrower/Related
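The four link types resemble the SKOS mapping properties, so a link between a thesaurus concept and a concept in another vocabulary could be sketched as below. This assumes the slide's Exact/Broader/Narrower/Related correspond to `skos:exactMatch`, `skos:broadMatch`, `skos:narrowMatch` and `skos:relatedMatch`; the slide itself does not name the properties, and the concept URIs are placeholders.

```python
SKOS = "http://www.w3.org/2004/02/skos/core#"

# Assumed correspondence between the slide's link types and SKOS mapping properties.
MAPPING_PROPERTIES = {
    "exact": SKOS + "exactMatch",
    "broader": SKOS + "broadMatch",
    "narrower": SKOS + "narrowMatch",
    "related": SKOS + "relatedMatch",
}


def link(source_uri: str, relation: str, target_uri: str) -> tuple:
    """Return a (subject, predicate, object) triple linking two concepts."""
    return (source_uri, MAPPING_PROPERTIES[relation.lower()], target_uri)


# Placeholder URIs, not real AGROVOC concept identifiers.
triple = link(
    "http://example.org/thesaurus/concept-a",
    "Exact",
    "http://example.org/other-vocabulary/concept-a",
)
print(triple)
```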
LOD2: Publink Consultation Program
- Experience, Guidance and Best Practices
- Linked Data Model to expose social metadata
- Alignment and LOD-ification Central Product Classification in RDF integrated in the data catalogue of data.fao.org
UN-FAO geopolitical ontology. Available in 6+1 languages. Includes FAO, UN and internationally recognized data sources. Records historical changes.
Published as Linked Data, RDF and OWL http://www.fao.org/countryprofiles/geoinfo/geopolitical/resource/
FAO Geopolitical Ontology
FAO Geopolitical Ontology
- Names (short, official), Codes, Area types, Groups membership
- Supporting data: coordinates, land borders, currency names and codes, adjectives of nationality
- Basic statistical data: country area, land area, agricultural land area, population and GDP
APIs, Widgets and Embedded Content
http://review-data.fao.org/developers
What’s a widget? A program that performs a simple function and can be embedded into a webpage (e.g. weather report or stock quote)
Widgets – Country Profiles
CountryProfiles = FAO site + Widgets
Country Information
Country Profiles www.fao.org
Country Briefs www.fao.org
FAO Representations websites (Sub-regional Office for the Pacific Islands)
Country Office Information Network coin.fao.org
International Plant Protection Convention Country Profiles www.ippc.int
Names of Countries http://termportal.fao.org/faonocs/main/start.do
Web Application Example - Key Water Indicator Portal
http://www.unwater.org/statistics_KWIP.html
Global Agro-Ecological Zones
Information at 3 input levels (high, intermediate, low) on:
- agro-ecological zones
- agro-climatically attainable yields
- yield constraints
- crop calendars
- agro-ecological suitability and productivity assessment
- potential production estimates
- actual yield and production
- yield and production gaps
Global Agro-Ecological Zones
THEMES: 5 thematic areas (> 300,000 global datasets at mainly 5 arc-minutes, also core layers at 30 arc-seconds)
CROPS: 11 crop groups, 49 crops, 92 crop types and 280 Crop/LUTs; downscaling available for 23 crops/commodities; yield and production gap analysis for 17 crops/commodities
WATER SUPPLY: 5 water supply types (rain-fed, irrigation, gravity, sprinkler, drip)
INPUT LEVELS: 4 input levels (High, Intermediate, Low, Mixed)
TIME PERIOD: Historical (1961-2000), 30-year average (1961-1990) and Future (2020s, 2050s, 2080s)
http://gaez.fao.org
data.fao.org - The Technology
- Orchestration and Integration
- Content: Documents, Pictures, Video, Multimedia, Pages
- Statistics: Statistical Data Warehouse, Time Series, Indicators, Observations
- Maps: Geospatial, Raster, Vector, Point
- Catalogue: Identity, Metadata, Linked Data, ...
- Infrastructure: Logging, Caching, Security, Audit, ...
- Website
- Specialised application(s): consume/provide
- Search: Full text, Structured
- Services and Widgets
Resource Catalog
KISAF: Linked Open Data – 24 November, 2011
Storage engines and their query languages:
- Triple Store (Ontotext): SPARQL
- XML-DB: XQuery
- Full-text Index (SOLR): Full-text query
- RDBMS (Postgres): SQL/JPA-QL
Business Layer (JEE)
Service Layer: SOAP Web Service interface, RESTful interface
Enterprise Service Bus (ESB)
Consumers: Other data.fao.org Subsystems, External Applications
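The layered design suggests that a service call is dispatched to a storage engine according to its query language. The sketch below shows that dispatch idea only. It assumes each query language on the slide is served by the engine listed in the same position (SPARQL by the triple store, XQuery by the XML database, full-text queries by the index, SQL by the RDBMS); the names `QueryLanguage`, `BACKENDS` and `route` are hypothetical and not the catalogue's actual business layer.

```python
from enum import Enum


class QueryLanguage(Enum):
    SPARQL = "sparql"
    XQUERY = "xquery"
    FULL_TEXT = "full-text"
    SQL = "sql"


# Assumed pairing of query languages with the storage engines on the slide.
BACKENDS = {
    QueryLanguage.SPARQL: "triple-store",
    QueryLanguage.XQUERY: "xml-db",
    QueryLanguage.FULL_TEXT: "full-text-index",
    QueryLanguage.SQL: "rdbms",
}


def route(language: str) -> str:
    """Return the backend name for a query language; ValueError if unsupported."""
    return BACKENDS[QueryLanguage(language.lower())]


print(route("SPARQL"))
```

Putting the routing behind one function mirrors the role of a service layer: callers name the kind of query they have, not the store that answers it.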
Technologies – Just 1 or 2 ;-)
Opscode – Chef, Red Hat – RHQ, Nagios Enterprises – Nagios, Jenkins-CI – Opensource – Jenkins, Apache Maven, Apache – SVN, Atlassian – JIRA, Apache – JMeter, Sourceforge Opensource – JUnit, Oracle – Java, Liferay – Liferay Portal, PostgreSQL – PostgreSQL PostGIS, Pgbouncer – pgbouncer, Pentaho – Pentaho BI, Analytical Labs – Saiku, Talend – Talend Open Studio, RedHat/JBoss – Application Server, VMware – Server Virtualization, Exceliance – HAProxy, Red Hat – Enterprise Linux Server, GeoNetwork OpenSource – GeoNetwork, OpenGeo, GeoSolutions, Fedora Commons, Refractions Research – GeoServer, Ontotext – OWLIM, RedHat/JBoss – JBoss AS / JEE, Alfresco – Activiti BPM Platform, Jasig – CAS Client, Batchwork Software – Doc2Doc, Google Code Opensource – ZXing ("Zebra Crossing"), Highsoft Solutions AS – highcharts, Twitter – Bootstrap, Liferay – Alloy-UI, JQuery – JQuery-ui, Geert de Deckere – Graph Up, and then there are all the SaaS products…
Some Big Bits
8 Clustered and Load Balanced
Clustered and Load Balanced
~2TB
ESB and BPM
Infrastructure
- Fine-grained monitoring and alerts – RHQ + Nagios
- Backup: VM images and DB redundancy
- 14* Physical Servers, 40+ Production VMs
- 100 TB SAN, 24 port 1 Gigabit switch
- DevOps: Developers and SysAdmins working together as one to deliver agile, dependable and responsive solutions
Changing the way we do IT
Build Lifecycle – Continuous Integration
Thank you for listening …
Questions?
Thank you & Questions