Upload
imarine283644
View
50
Download
0
Embed Size (px)
DESCRIPTION
Presentation Pasquale Pagano, CNR-ISTI, iMarine Technical Director on the services provided through the iMarine data e-infrastructure
Citation preview
How iMarine fulfils data needs in support of the Ecosystem Approach
Introduction
Pasquale Pagano (CNR) iMarine Technical Director pasquale.pagano@is<.cnr.it
From collabora<on to market and future growth 29th September 2014
Brussels, Belgium
iMarine
iMarine exploits a Hybrid Data Infrastructure by • combining over 500 soLware components • providing access to more than 25k datasets • serving more than 1000 jobs a day
iMarine capaci<es are offered as services
European Commission premises, DG CONNECT, 29th September 2014, Brussels
Data Compu<ng Applica<ons
European Commission premises, DG CONNECT, 29th September 2014, Brussels
iMarine Capaci<es
Data: Storage as Service
European Commission premises, DG CONNECT, 29th September 2014, Brussels
to host and maintain data
Database High-‐availability
Standard Ready-‐to-‐use
Cloud Storage Scalable Reliable Secure
Geographical DB Scalable
OGC Standard Privacy and A<ribu=on
Data: Applica<ons as a Service
European Commission premises, DG CONNECT, 29th September 2014, Brussels
to curate and manage data
Metadata Genera@on Geospa=al Data Biodiversity Data Sta=s=cal Data
Harmoniza@on Disambiguate
Validate Integrate and Consistency Check
Data Exchange OGC protocols DarwinCore
SDMX
iMarine
OBIS WoRMS
WoRDS
GBIF
CoL
ITIS
IRMNG NCBI
MyOcean
WOA
EuroStat
Data.FAO
…
Data
European Commission premises, DG CONNECT, 29th September 2014, Brussels
iMarine Registries
Valida<on
Enriching
Processing
Sharing
Data
Ontologies and Data
Warehouses
Biological and
Ecological Data
GeoSpa<al Data
Sta<s<cal Data
Documents
European Commission premises, DG CONNECT, 29th September 2014, Brussels
DarwinCore / ISO19139 >35 M Observa<ons (OBIS) ≈ 120 K Observed Species (OBIS) ≈ 500 K Taxa (WoRMS) >600 K Scien<fic Names (ITIS) >12 K Species Maps (AquaMaps) ≈ 600 Species Extent (FAO) … FishBase, SeaLifeBase … CoL, GBIF
SDMX * Ø FAO CodeLists Ø IRD CodeLists Ø FAO datasets Ø Eurostat Ø …
ISO19139 (OGC W*S) Ø 10 years Chemical and Physical variables in 2D space
Ø Ice concentra<on and velocity, Chlorophyll, Oxygen, Nitrate, Phosphate, Phytoplankton as carbon, Salinity, Temperature, …
Ø On-‐demand Chemical and Physical variables in 3D space Ø Apparent Oxygen U<liza<on, Dissolved Oxygen, Salinity, Temperature, …
> 350
varia
bles
OAI-‐PMH, OpenSearch Ø FAO Facksheets Ø Aqua<c Commons Ø Bioline Interna<onal Ø Biodiversity Heritage Ø OceanDocs Ø Nature, PenSoL
Journals Ø …
RDF, OWL Ø FAO FLOD Ø Marine Top Level Ontology Ø IRD Ecoscope Ø FactForge, Yago2 Ø …
Capaci<es: Compu<ng as Service
European Commission premises, DG CONNECT, 29th September 2014, Brussels
to process and extract knowledge
Scalable Easy to Manage Across Boundaries
Tailored
Elas@c Assignment of Compu=ng Assignment of Processors
Virtual Research Environment
Rich and Heterogeneous High Throughput Map-‐Reduce Parallel R
Capaci<es: Compu<ng as Service
European Commission premises, DG CONNECT, 29th September 2014, Brussels
Management and interpreta<on of biological and ecological data in the environment
Complete full life-‐cycle data framework, from observa<onal data to aggregated data repositories enriched with valida<on and analy<cal tools
Storage and interpreta<on of geospa<al explicit informa<on, including WPS processing
Flexible sharing, storage, repor<ng, search and retrieval, aggrega<on and projec<on facili<es
Applica<ons as a Service
European Commission premises, DG CONNECT, 29th September 2014, Brussels
A BUNDLE is a set of
services and technologies grouped
according to a family of related tasks for
achieving a common objec<ve
Occurrence and Taxonomic Data Discovery Occurrence Data Processing Species Distribu<on Modeling Species Distribu<on Maps Discovery Taxonomic Data Comparison Taxonomic Data Matching
Code List Discovery Code List Management Sta<s<cal Engine Tabular Data Discovery Tabular Data Enrichment Tabular Data Management Tabular Data Processing
Geospa<al Data Discovery Geospa<al Data Processing
Enhanced Documents Management Fact-‐sheets Management Informa<on Object Discovery Messaging Shared Workspace Social Networking Facili<es
Applica<ons as a Service
European Commission premises, DG CONNECT, 29th September 2014, Brussels
A BUNDLE is a set of
services and technologies grouped
according to a family of related tasks for
achieving a common objec<ve
European Commission premises, DG CONNECT, 29th September 2014, Brussels
Presence Points
(FishBase +
Obis)
Density Based Clustering DBSCAN
(with outliers)
Other methods are also available …
K-‐Means
X-‐Means
Features Clustering with StatsCube
Data Analysis with StatsCube Import
CodeLists
Validate Datasets
Analyse And
Project
European Commission premises, DG CONNECT, 29th September 2014, Brussels
Ecological Modeling with BiolCube
European Commission premises, DG CONNECT, 29th September 2014, Brussels
VS
FAO Eleutheronema tetradactylum
AquaMaps Eleutheronema tetradactylum
Maps Comparison with GeosCube
MEAN=0.81 VARIANCE=0.02 NUMBER_OF_ERRORS=6691 NUMBER_OF_COMPARISONS=259200 ACCURACY=97.42 MAXIMUM_ERROR=1.0 MAXIMUM_ERROR_POINT=3005:363:1 COHENS_KAPPA=0.218 COHENS_KAPPA_CLASSIFICATION_LANDIS_KOCH=Fair COHENS_KAPPA_CLASSIFICATION_FLEISS=Marginal TREND=EXPANSION RESOLUTION=0.5
European Commission premises, DG CONNECT, 29th September 2014, Brussels
Virtual Research Environment
European Commission premises, DG CONNECT, 29th September 2014, Brussels
to share and collaborate
Share Database Tables
Workflow Files
Communicate Post
Favourite Connec=on
Organize Dynamic VRE Crea=on
Secure Policy Control
iMarine Technology
• iMarine is powered by gCube
European Commission premises, DG CONNECT, 29th September 2014, Brussels
hnps://www.openhub.net/p/gCube
iMarine e-‐infrastructure
iMarine is exploi<ng D4Science.org
European Commission premises, DG CONNECT, 29th September 2014, Brussels
Geographically Distributed Compu<ng
Infrastructure
Across administra<ve boundaries
Across private and commercial
providers
Service Alloca<ons, Deployment,
Monitoring, and Opera<on
Uniform resource and data access
Opera<on Built on SLAs
Support monitoring, audi<ng, repor<ng, and no<fica<on
Trust Privacy, governance, and anribu<on
Security, trusted network
Mul<-‐tenant Delivery Model
Infrastructure as a Service
• Dynamic deployment • Hos<ng • Resource Lifecycle • Monitoring • Accoun<ng • Security
SoLware as a Service
• BiolCube • ConnectCube • GeosCube • StatsCube
Plaqorm as a Service
• FeatherWeightStack • SmartGears • Applica<onSupportLayer • SOA3
European Commission premises, DG CONNECT, 29th September 2014, Brussels
Landscape
D4Science e-‐Infrastructure
gCube Framework
gCube Apps
Discussion
www.i-‐marine.eu
i-‐marine.d4science.org
European Commission premises, DG CONNECT, 29th September 2014, Brussels
Thank You
Google Analy<cs iMarine portal
European Commission premises, DG CONNECT, 29th September 2014, Brussels