33
Co-ordination & Harmonisation of Advanced e-Infrastructures for Research and Education Data Sharing Research Infrastructures – Proposal n. 306819 Research Communities, Data Repositories and Science Gateways EUMEDCONNECT3 Meeting, Amman – 16 June 2013 Federico Ruggieri, INFN/GARR ([email protected])

Research Communities, Data Repositories and Science Gateways · Document Repositories (OADRs) and Data Repositories (DRs) Build a demonstrator to easy visualise and access OADRs and

  • Upload
    others

  • View
    10

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Research Communities, Data Repositories and Science Gateways · Document Repositories (OADRs) and Data Repositories (DRs) Build a demonstrator to easy visualise and access OADRs and

Co-ordination & Harmonisation of Advanced e-Infrastructures for Research and Education Data Sharing

Research Infrastructures – Proposal n. 306819

Research Communities, Data Repositories and Science Gateways

EUMEDCONNECT3 Meeting, Amman – 16 June 2013

Federico Ruggieri, INFN/GARR ([email protected])

Page 2: Research Communities, Data Repositories and Science Gateways · Document Repositories (OADRs) and Data Repositories (DRs) Build a demonstrator to easy visualise and access OADRs and

Outline

The CHAIN-REDS project Research Communities The CHAIN-REDS Knowledge Base The Science Gateways Summary and conclusions

Page 3: Research Communities, Data Repositories and Science Gateways · Document Repositories (OADRs) and Data Repositories (DRs) Build a demonstrator to easy visualise and access OADRs and

Regional Grid infrastructures

CNGrid

NKN & Garuda

EUAsiaGrid

SAGrid & SANREN

GISELA

3

Page 4: Research Communities, Data Repositories and Science Gateways · Document Repositories (OADRs) and Data Repositories (DRs) Build a demonstrator to easy visualise and access OADRs and

Status

Co-ordination & Harmonisation of Advanced e-Infrastructures for Research and Education Data Sharing

Research Infrastructures – Support Action Grant Agreement n. 306819 Total Costs of € 2.3 M Max. EC contribution: € 1.52 M Start date: 1 December 2012 Duration: 30 Months

4

Page 5: Research Communities, Data Repositories and Science Gateways · Document Repositories (OADRs) and Data Repositories (DRs) Build a demonstrator to easy visualise and access OADRs and

Partners and roles

5

INFN (IT) – Coordinator CIEMAT (ES) – WP4 Leader GRNET (GR) – WP3 Leader CESNET (CZ) – WP5 Leader UBUNTUNET (MW) – Africa CLARA (UR) – Latin America IHEP (CN) – China ASREN (DE) – Arab States SIGMA-ORIONIS (FR) – WP2 Leader C-DAC (IN) – India (Amendment)

Page 6: Research Communities, Data Repositories and Science Gateways · Document Repositories (OADRs) and Data Repositories (DRs) Build a demonstrator to easy visualise and access OADRs and

Action lines (1/2)

Distributed Computing Infrastructure DCI

• Provide ongoing support of the DCI road-map for intercontinental DCI collaboration, specified within the CHAIN project

Regional Operation Centres ROC

• Support stability of existing and emerging Regional Operation Centres. Cooperate with other projects & initiatives (e.g. AfricaConnect, TEIN3) to support the development of eInfrastructures and key VRCs in Africa, Asia, Latin America and the Middle-east

Clouds for Research and Education Cloud

• Support for coordination of Cloud developments for Research & Education with other regions (e.g. China, India, Latin America)

Page 7: Research Communities, Data Repositories and Science Gateways · Document Repositories (OADRs) and Data Repositories (DRs) Build a demonstrator to easy visualise and access OADRs and

Action lines (2/2)

Infrastructures and Repositories Data

• Extend the CHAIN Knowledge Base with information on Data Infrastructures: collecting issues, best practices and identifying data repositories of direct interest for VRCs Support the study of data infrastructures for a target subset of VRCs (e.g. Agricolture, Climate Change, Health, Biomedicine, etc.)

Science Gateways SG

• Promote the usage of Science Gateways as a means for attracting new communities and promote the use of eInfrastructures for every researcher

Federations of Identity Providers IDF

• Foster the creation of Identity Federations in cooperation with Certification Authorities; promote and coordinate their usage. Support integration of different AA approaches

Page 8: Research Communities, Data Repositories and Science Gateways · Document Repositories (OADRs) and Data Repositories (DRs) Build a demonstrator to easy visualise and access OADRs and

Support Regional Operation Centres http://roc.africa-grid.org

8

Page 9: Research Communities, Data Repositories and Science Gateways · Document Repositories (OADRs) and Data Repositories (DRs) Build a demonstrator to easy visualise and access OADRs and

VRCs addressed by the first CHAIN project

Several Virtual Research Communities in different domains have been contacted: Agriculture, Biomedicine, Climate Change, Digital Cultural Heritage, Health, High Energy Physics, Life Sciences, Seismology, Weather Forecast

MoUs signed with 6 VRCs Support for new VRCs: jModelTest, Seismology,

Climate Change, Agriculture Many applications demonstrated in the Worldwide

Interoperability Test with the Science Gateway

Page 10: Research Communities, Data Repositories and Science Gateways · Document Repositories (OADRs) and Data Repositories (DRs) Build a demonstrator to easy visualise and access OADRs and

New VRCs addressed

Agriculture (agINFRA project)

Open Geospatial Consortium (EarthServer)

EUDAT

Page 11: Research Communities, Data Repositories and Science Gateways · Document Repositories (OADRs) and Data Repositories (DRs) Build a demonstrator to easy visualise and access OADRs and

The CHAIN Knowledge Base (www.chain-project.eu/knowledge-base)

RREN(s) NREN NGI CA(s) Ident. Fed(s) ROC(s) Grid site(s) Application(s)

Largest e-Infrastructure related knowledge base. Information both from the survey and other sources for more than half of the countries of the world

Page 12: Research Communities, Data Repositories and Science Gateways · Document Repositories (OADRs) and Data Repositories (DRs) Build a demonstrator to easy visualise and access OADRs and

CHAIN-REDS program for Data Infrastructures

Identify standards to easily gather and access both Open Access Document Repositories (OADRs) and Data Repositories (DRs)

Build a demonstrator to easy visualise and access OADRs and DRs (both geo-views and tab-views)

Correlate OADRs and DRs to create linked data and discover new knowledge through semantic enrichment of metadata

Promote Data Infrastructure standards and identify new OADRs and DRs from regions addressed by the project (Africa, Middle-East and Gulf Region, Latin America, China, India, Far-East Asia)

Populate the demonstrator with these new repositories, add them to the semantic enrichment tool, and set-up at least two use-cases from different domains

Page 13: Research Communities, Data Repositories and Science Gateways · Document Repositories (OADRs) and Data Repositories (DRs) Build a demonstrator to easy visualise and access OADRs and

Open Access Document Repositories (OADRs)

• ∼2,500 repos • >33 M docs

Page 14: Research Communities, Data Repositories and Science Gateways · Document Repositories (OADRs) and Data Repositories (DRs) Build a demonstrator to easy visualise and access OADRs and

Open Access Document Repositories (OADRs)

14

Page 15: Research Communities, Data Repositories and Science Gateways · Document Repositories (OADRs) and Data Repositories (DRs) Build a demonstrator to easy visualise and access OADRs and

Data Repositories (DRs)

• >500 repos • Lots of data !

Page 16: Research Communities, Data Repositories and Science Gateways · Document Repositories (OADRs) and Data Repositories (DRs) Build a demonstrator to easy visualise and access OADRs and

Data Repositories (DRs)

Page 17: Research Communities, Data Repositories and Science Gateways · Document Repositories (OADRs) and Data Repositories (DRs) Build a demonstrator to easy visualise and access OADRs and

17

Extending the CHAIN-REDS KB with KLIOS

Page 18: Research Communities, Data Repositories and Science Gateways · Document Repositories (OADRs) and Data Repositories (DRs) Build a demonstrator to easy visualise and access OADRs and

Linked data semantic search

Semantic enrichment Metadata harvesting

Multi-layered architecture

Page 19: Research Communities, Data Repositories and Science Gateways · Document Repositories (OADRs) and Data Repositories (DRs) Build a demonstrator to easy visualise and access OADRs and

OA

DR

s

Dat

a R

epos

. OAI-PMH OAI-PMH

Harvester (running on grid/cloud)

Linked-data search engine

Semantic-web enrichment

End-points

Harvester (running on grid/cloud)

Multi-layered architecture

Page 20: Research Communities, Data Repositories and Science Gateways · Document Repositories (OADRs) and Data Repositories (DRs) Build a demonstrator to easy visualise and access OADRs and

20

Linked data semantic search (www.chain-project.eu/linked-data)

Page 21: Research Communities, Data Repositories and Science Gateways · Document Repositories (OADRs) and Data Repositories (DRs) Build a demonstrator to easy visualise and access OADRs and

....... Science

Gatew

ay

App. 1 App. 2 App. N

Embedded Applications Administrator Power User Basic User

Users from different

organisations having different

roles and privileges

Promote the Science Gateway model

Standard-based middleware-independent

Grid Engine

21

Europe, Africa, Asia Pacific, Latin America

Brasil China

India

Page 22: Research Communities, Data Repositories and Science Gateways · Document Repositories (OADRs) and Data Repositories (DRs) Build a demonstrator to easy visualise and access OADRs and

The agINFRA Science Gateway

22

http://aginfra-sg.ct.infn.it/

Page 23: Research Communities, Data Repositories and Science Gateways · Document Repositories (OADRs) and Data Repositories (DRs) Build a demonstrator to easy visualise and access OADRs and

WebGIS based ISIS

23

Page 24: Research Communities, Data Repositories and Science Gateways · Document Repositories (OADRs) and Data Repositories (DRs) Build a demonstrator to easy visualise and access OADRs and

DECIDE Science Gateway (www.eu-decide.eu)

Page 25: Research Communities, Data Repositories and Science Gateways · Document Repositories (OADRs) and Data Repositories (DRs) Build a demonstrator to easy visualise and access OADRs and

Africa Grid Science Gateway (http://sgw.africa-grid.org/

Page 26: Research Communities, Data Repositories and Science Gateways · Document Repositories (OADRs) and Data Repositories (DRs) Build a demonstrator to easy visualise and access OADRs and

Science Gateways and clouds – The MyCloud service

26

Powered by:

Page 27: Research Communities, Data Repositories and Science Gateways · Document Repositories (OADRs) and Data Repositories (DRs) Build a demonstrator to easy visualise and access OADRs and

27

The CHAIN-REDS Cloud Interoperability Demo

VM Market Place MyCloud

Basic VM1

Basic VM2

VRC1 VM1 VRC1

VM2

VRC2 VM1 VRC2

VM2

Node of cloud m/w 1 on site X

Node of cloud m/w 2 on site Y

Node of cloud m/w 3 on site Z

Page 28: Research Communities, Data Repositories and Science Gateways · Document Repositories (OADRs) and Data Repositories (DRs) Build a demonstrator to easy visualise and access OADRs and

Standards to be used

28

OCCI (http://occi-wg.org): For remote management of cloud computing infrastructure

CDMI (http://www.snia.org/cdmi): For cloud data management

Page 29: Research Communities, Data Repositories and Science Gateways · Document Repositories (OADRs) and Data Repositories (DRs) Build a demonstrator to easy visualise and access OADRs and

Clouds and Grids

Grid has been conceived as a resource sharing infrastructure Grids provide High Throuput Computing that fits many

applications Clouds have a clear business model that allowed companies to

provide services on the market Virtualisation and elastic computing are needed by several

scientific domains (e.g. interactive and web based applications) Clouds and Grids can cohexist in the Research & Education

domain provided that the strenghts of both can be merged

29

Page 30: Research Communities, Data Repositories and Science Gateways · Document Repositories (OADRs) and Data Repositories (DRs) Build a demonstrator to easy visualise and access OADRs and

R&E Clouds

Cloud infrastructure for R&E should be based on Open SW & Standards

Financial: Public institutions are frequently receiving projects’ driven funding.

Difficult to fund long term contracts with Cloud providers

Technical: Resource sharing is an issue if institutions get services from

different providers. Building securely across several administrative domains is difficult: Federation of Clouds is still not a reality

Long term preservation of data has still many issues to be addressed (e.g. what happens to the data after the project end ?)

Requirements of scientists evolve and new technical challenges appear that will push for innovation

30

Page 31: Research Communities, Data Repositories and Science Gateways · Document Repositories (OADRs) and Data Repositories (DRs) Build a demonstrator to easy visualise and access OADRs and

GARR-X & GARR-X Progress

Next Generation Network GARR-X Access to HTC, HPC, Grid &

Cloud Computing, Storage

GARR-X Progress for the South of Italy Grant of 46.5 M€ recently approved

by the Ministry of Education and Research

10-100 Gbps fiber connectivity

New cloud and storage infrastructure for R&E: 6.000 Cores, 6 PB storage

31

Page 32: Research Communities, Data Repositories and Science Gateways · Document Repositories (OADRs) and Data Repositories (DRs) Build a demonstrator to easy visualise and access OADRs and

Summary and conclusions The first usage of SG in several projects is very encouraging Data Infrastructures are becoming an essential component of e-

Infrastructures Next years’ biggest challenge will be to uniquely correlate scientific

papers with data used to write them with applications used to analyse them so to be able to go across the knowledge path both ways

Semantic-web and linked-data technologies can play a major role in this context and CHAIN-REDS aims to promote these standards in the targeted regions

OADRs’ and DRs’ managers/owners in are welcome to contact me to share their data within the CHAIN Knowledge Base (both in Africa and Latin America this is already happening)

CHAIN-REDS is also looking forward to receiving feedbacks from all interested organisations on the Knowledge Base and the semantic search service

Page 33: Research Communities, Data Repositories and Science Gateways · Document Repositories (OADRs) and Data Repositories (DRs) Build a demonstrator to easy visualise and access OADRs and

Co-ordination & Harmonisation of Advanced e-Infrastructures for Research and Education Data Sharing

Research Infrastructures – Proposal n. 306819

Questions ?