37
Horizon 2020 Open Research Data Pilot Sarah Jones Digital Curation Centre [email protected] Twitter: @sjDCC Webinar for the moderated course on the Horizon 2020 Open Research Data pilot, 10 th June 2016 https://www.fosteropenscience.eu/content/horizon-2020-open-research-data-pilot-0

Horizon 2020 Open Research Data Pilot · Horizon 2020 Open Research Data Pilot Sarah Jones Digital Curation Centre [email protected] Twitter: @sjDCC Webinar for the moderated

  • Upload
    others

  • View
    7

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Horizon 2020 Open Research Data Pilot · Horizon 2020 Open Research Data Pilot Sarah Jones Digital Curation Centre sarah.jones@glasgow.ac.uk Twitter: @sjDCC Webinar for the moderated

Horizon 2020 Open Research Data PilotSarah Jones

Digital Curation [email protected]

Twitter: @sjDCC

Webinar for the moderated course on the Horizon 2020 Open Research Data pilot, 10th June 2016

https://www.fosteropenscience.eu/content/horizon-2020-open-research-data-pilot-0

Page 2: Horizon 2020 Open Research Data Pilot · Horizon 2020 Open Research Data Pilot Sarah Jones Digital Curation Centre sarah.jones@glasgow.ac.uk Twitter: @sjDCC Webinar for the moderated

BACKGROUND

What are the drivers for open data?

Image CC-BY-SA-ND by Chris Smart www.flickr.com/photos/sigma/5297545749

Page 4: Horizon 2020 Open Research Data Pilot · Horizon 2020 Open Research Data Pilot Sarah Jones Digital Curation Centre sarah.jones@glasgow.ac.uk Twitter: @sjDCC Webinar for the moderated

Slide from Daniel Spichtinger and Celina Ramjoue

Page 5: Horizon 2020 Open Research Data Pilot · Horizon 2020 Open Research Data Pilot Sarah Jones Digital Curation Centre sarah.jones@glasgow.ac.uk Twitter: @sjDCC Webinar for the moderated

Development of EC Open Access policy

Pilot in FP7

Provision of support

OA mandate in H2020

Trialled in 7 areas

Expected to:

• Deposit articles into a repository

• Attempt to make these OA within 6 or 12 months

• Choice between green and gold routes

OA fees are eligible for reimbursement

Pilot supported and monitored through OpenAIRE

Each beneficiary must:

• Deposit machine-readable electronic copy in repository by the date of publication

• Ensure OA via green/gold routes within 6 or 12 month embargo

• Ensure bibliographic metadata is OA

• Aim to deposit research data

Page 6: Horizon 2020 Open Research Data Pilot · Horizon 2020 Open Research Data Pilot Sarah Jones Digital Curation Centre sarah.jones@glasgow.ac.uk Twitter: @sjDCC Webinar for the moderated

H2020 Open Research Data pilot

Guidelines on Data Management in Horizon 2020http://ec.europa.eu/research/participants/data/ref/h2020/grants_manual/hi/oa_pilot/h2020-hi-oa-data-mgt_en.pdf

Pilot in H2020

Provision of support ?

Take part and feed back to help shape future policy!

Page 7: Horizon 2020 Open Research Data Pilot · Horizon 2020 Open Research Data Pilot Sarah Jones Digital Curation Centre sarah.jones@glasgow.ac.uk Twitter: @sjDCC Webinar for the moderated

REQUIREMENTS OF THE ORD PILOT

What is expected of participants?

Image CC-BY-NC-SA by Ralf Appelt www.flickr.com/photos/adesigna/4090782772

Page 8: Horizon 2020 Open Research Data Pilot · Horizon 2020 Open Research Data Pilot Sarah Jones Digital Curation Centre sarah.jones@glasgow.ac.uk Twitter: @sjDCC Webinar for the moderated

Open Research Data (ORD) Pilot

Pilot focuses on research data specifically

‘Research data’ refers to information, in particular facts or numbers, collected to be examined and considered as a basis for reasoning, discussion or calculation.

In a research context, examples of data include statistics, results of experiments, measurements, observations resulting from fieldwork, survey results, interview recordings and images. The focus is on research data that is available in digital form.

Guidelines on Open Access to Scientific Publications and Research Data in Horizon 2020 v.2.1, 15 February 2016, p3

Page 9: Horizon 2020 Open Research Data Pilot · Horizon 2020 Open Research Data Pilot Sarah Jones Digital Curation Centre sarah.jones@glasgow.ac.uk Twitter: @sjDCC Webinar for the moderated

H2020 areas participating in the pilot (2016-17)

• Future and Emerging Technologies

• Research infrastructures

• Leadership in enabling and industrial technologies – Information and Communication Technologies

• Nanotechnologies, Advanced Materials, Advanced Manufacturing and Processing, and Biotechnology:  ‘nanosafety’ and ‘modelling’ topics

• Societal Challenge: Food security, sustainable agriculture and forestry, marine and maritime and inland water research and the bioeconomy - selected topics as specified in the work programme

• Societal Challenge: 'Climate Action, Environment, Resource Efficiency and Raw materials' – except raw materials

• Societal Challenge: 'Europe in a changing world – inclusive, innovative and reflective Societies'

• Science with and for Society

• Cross-cutting activities - focus areas – part Smart and Sustainable Cities

Projects in other areas can participate on a voluntary basis

Page 10: Horizon 2020 Open Research Data Pilot · Horizon 2020 Open Research Data Pilot Sarah Jones Digital Curation Centre sarah.jones@glasgow.ac.uk Twitter: @sjDCC Webinar for the moderated

The scope of participation is growing...

• In 2014-15 work programme, 7 areas participated in the pilot.

• In the 2016 work programme, new topics joined in 3 areas (research infrastructures, nanotechnologies and food security)

• All calls covered by the 2017 work programme will be part of the pilot. Effectively it’s moving from a pilot to a mandate.

Page 11: Horizon 2020 Open Research Data Pilot · Horizon 2020 Open Research Data Pilot Sarah Jones Digital Curation Centre sarah.jones@glasgow.ac.uk Twitter: @sjDCC Webinar for the moderated

• If results are expected to be commercially or industrially exploited

• If participation is incompatible with the need for confidentiality in connection with security issues

• Incompatible with existing rules on the protection of personal data

• Would jeopardise the achievement of the main aim of the action

• If the project will not generate / collect any research data

• If there are other legitimate reasons to not take part in the Pilot

Projects can opt out at any stage

Can opt out totally or partially (i.e. for some data only)

Should describe issues in the project DMP

Exemptions – reasons for opting out

Page 12: Horizon 2020 Open Research Data Pilot · Horizon 2020 Open Research Data Pilot Sarah Jones Digital Curation Centre sarah.jones@glasgow.ac.uk Twitter: @sjDCC Webinar for the moderated

Approach:

as open as possible, as closed as necessary

Image: ‘Balancing rocks’ by Viewminder CC-BY-SA-ND www.flickr.com/photos/light_seeker/7780857224

Page 13: Horizon 2020 Open Research Data Pilot · Horizon 2020 Open Research Data Pilot Sarah Jones Digital Curation Centre sarah.jones@glasgow.ac.uk Twitter: @sjDCC Webinar for the moderated

• The data, including associated metadata, needed to validate the results presented in scientific publications

• Other curated and/or raw data, including associated metadata, as specified in the data management plan

Doesn’t apply to all data (researchers to define as appropriate)

Don’t have to share data if inappropriate – exemptions apply

Which data does the ORD pilot apply to?

Page 14: Horizon 2020 Open Research Data Pilot · Horizon 2020 Open Research Data Pilot Sarah Jones Digital Curation Centre sarah.jones@glasgow.ac.uk Twitter: @sjDCC Webinar for the moderated

Beneficiaries participating in the ORD pilot will:

• Deposit data in a research data repository

• Take measures to enable third parties to access, mine, exploit, reproduce and disseminate (free of charge for any user) this research data

• Provide information via the chosen repository about tools and instruments necessary for validating the results (where possible, provide the tools and instruments themselves)

Key requirements of the ORD pilot

Page 15: Horizon 2020 Open Research Data Pilot · Horizon 2020 Open Research Data Pilot Sarah Jones Digital Curation Centre sarah.jones@glasgow.ac.uk Twitter: @sjDCC Webinar for the moderated

Data Management Plans

Projects participating in the pilot will be required to develop a Data Management plan (DMP), in which they will specify what data will be open.

Note that the Commission does NOT require applicants to submit a DMP at the proposal stage.

A DMP is therefore NOT part of the evaluation.

DMPs are a deliverable for those participating in the pilot.

Page 16: Horizon 2020 Open Research Data Pilot · Horizon 2020 Open Research Data Pilot Sarah Jones Digital Curation Centre sarah.jones@glasgow.ac.uk Twitter: @sjDCC Webinar for the moderated

Where relevant*, H2020 proposals can include a section on data management which is evaluated under the criterion ‘Impact’

• What types of data will the project generate/collect?

• What standards will be used?

• How will this data be shared/made available? If not, why?

• How will this data be curated and preserved?

* For “Research and Innovation actions” and “Innovation Actions”

• DMPs are a project deliverable for those participating in the open data pilot.

• Not a fixed document – should evolve and gain precision

– Deliver first version within initial 6 months of project

– More elaborate versions whenever important changes to the project occur. At least at the mid-term and final review.

Info on RDM: what and when

PROPOSAL STAGE IN PROJECT

Page 17: Horizon 2020 Open Research Data Pilot · Horizon 2020 Open Research Data Pilot Sarah Jones Digital Curation Centre sarah.jones@glasgow.ac.uk Twitter: @sjDCC Webinar for the moderated

Initial DMP (at 6 months)

The DMP should address the points below on a dataset by dataset basis:

• Data set reference and name

• Data set description

• Standards and metadata

• Data sharing

• Archiving and preservation (including storage and backup)

Page 18: Horizon 2020 Open Research Data Pilot · Horizon 2020 Open Research Data Pilot Sarah Jones Digital Curation Centre sarah.jones@glasgow.ac.uk Twitter: @sjDCC Webinar for the moderated

More elaborate DMPScientific research data should be easily:

1. DiscoverableAre the data and software discoverable and identifiable by a standard mechanism e.g. DOIs?

2. AccessibleAre the data and software accessible and under what conditions e.g. licenses, embargoes etc?

3. Assessable and intelligibleAre the data and software assessable and intelligible to third parties for scrutiny and peer-review? E.g. can judgements be made about their reliability and the competence of those who created them?

4. Useable beyond the original purpose for which it was collectedAre the data properly curated and stored together with the minimum software and documentation to be useable by third parties in the long-term?

5. Interoperable to specific quality standardsAre the data and software interoperable, allowing data exchange between researchers, institutions, countries etc? e.g. Adhering to standards and compliant with available applications

Page 19: Horizon 2020 Open Research Data Pilot · Horizon 2020 Open Research Data Pilot Sarah Jones Digital Curation Centre sarah.jones@glasgow.ac.uk Twitter: @sjDCC Webinar for the moderated

Feedback from the European Commission

LESSONS LEARNED

Image CC-BY-SA-ND by David D Wang https://www.flickr.com/photos/30326117@N08/3475108362

Page 20: Horizon 2020 Open Research Data Pilot · Horizon 2020 Open Research Data Pilot Sarah Jones Digital Curation Centre sarah.jones@glasgow.ac.uk Twitter: @sjDCC Webinar for the moderated

Slide from Daniel Spichtinger and Celina Ramjoue

Page 21: Horizon 2020 Open Research Data Pilot · Horizon 2020 Open Research Data Pilot Sarah Jones Digital Curation Centre sarah.jones@glasgow.ac.uk Twitter: @sjDCC Webinar for the moderated

Slide from Daniel Spichtinger and Celina Ramjoue

Page 22: Horizon 2020 Open Research Data Pilot · Horizon 2020 Open Research Data Pilot Sarah Jones Digital Curation Centre sarah.jones@glasgow.ac.uk Twitter: @sjDCC Webinar for the moderated

Lessons the EC has drawn (1)• Explantation is paramount!

– Misperception that 'open' bias will be evaluated positively

– Confusion: DMP versus data management section at submission stage

– Need to state that not everything must be open. In theory, it is possible to be in the ORD Pilot and not open any data.

– Emphasise flexibility (many opt-out / opt-in mechanisms)

• Emphasise the importance of feedback for policy in the next Framework Programme: being in the Pilot means co-shaping European policy on opening up research data

Content taken from slides by Daniel Spichtinger, Celina Ramjoue, Jarkko Siren and Jean-Francois

Dechamp

Page 23: Horizon 2020 Open Research Data Pilot · Horizon 2020 Open Research Data Pilot Sarah Jones Digital Curation Centre sarah.jones@glasgow.ac.uk Twitter: @sjDCC Webinar for the moderated

Lessons the EC has drawn (2)

• Helps to re-frame ORD Pilot as "Data Management Pilot"  – Stress the fact that researcher has freedom and responsibility via

DMP. Excellent research must include excellent data management.

– Underline overall aim: kick-start a virtous circle and change of culture

• Questions about eligibility of data management costs

• Tools and support needed for data management / DMPs Content taken from slides by Daniel Spichtinger,

Celina Ramjoue, Jarkko Siren and Jean-Francois Dechamp

Page 24: Horizon 2020 Open Research Data Pilot · Horizon 2020 Open Research Data Pilot Sarah Jones Digital Curation Centre sarah.jones@glasgow.ac.uk Twitter: @sjDCC Webinar for the moderated

How to comply with the pilot requirements?

OPEN RESEARCH DATA

Image CC-BY-NC-SA by Tom Magllery www.flickr.com/photos/lwr/13442910354

Page 25: Horizon 2020 Open Research Data Pilot · Horizon 2020 Open Research Data Pilot Sarah Jones Digital Curation Centre sarah.jones@glasgow.ac.uk Twitter: @sjDCC Webinar for the moderated

How can researchers make data open?

1. Choose the dataset(s) to share• What can be made open? This step may need to be

revisited if problems are encountered later.

2. Apply an open license 

• Determine what IP exists. Apply a suitable licence e.g. CC-BY

3. Make the data available • Provide the data in a suitable format. Use repositories.

 

4. Make it discoverable • Post on the web, get a unique ID, register in catalogues…

https://okfn.org

Page 26: Horizon 2020 Open Research Data Pilot · Horizon 2020 Open Research Data Pilot Sarah Jones Digital Curation Centre sarah.jones@glasgow.ac.uk Twitter: @sjDCC Webinar for the moderated

http://www.dcc.ac.uk/resources/how-guides/license-research-data

Licensing research data openly

This DCC guide outlines the pros and cons of each approach and gives practical advice on how to implement a data licence

Page 27: Horizon 2020 Open Research Data Pilot · Horizon 2020 Open Research Data Pilot Sarah Jones Digital Curation Centre sarah.jones@glasgow.ac.uk Twitter: @sjDCC Webinar for the moderated

Which licenses are appropriate?

Creative Commons clauses that limit sharing

NC NonCommercialWhat counts as commercial?ND NoDerivativesSeverely restricts use

These clauses are not open licenses

Horizon 2020 Open Access guidelines point to:

or

Page 28: Horizon 2020 Open Research Data Pilot · Horizon 2020 Open Research Data Pilot Sarah Jones Digital Curation Centre sarah.jones@glasgow.ac.uk Twitter: @sjDCC Webinar for the moderated

EUDAT licensing tool

Researchers can answer a series of questions to determine which licence(s) are appropriate to use

http://ufal.github.io/public-license-selector

Page 29: Horizon 2020 Open Research Data Pilot · Horizon 2020 Open Research Data Pilot Sarah Jones Digital Curation Centre sarah.jones@glasgow.ac.uk Twitter: @sjDCC Webinar for the moderated

Deposit in research data repositories

http://service.re3data.org/search

The EC guidelines point to Re3data as one of the registries that can be searched to find a home for data

Page 30: Horizon 2020 Open Research Data Pilot · Horizon 2020 Open Research Data Pilot Sarah Jones Digital Curation Centre sarah.jones@glasgow.ac.uk Twitter: @sjDCC Webinar for the moderated

Zenodo is a multi-disciplinary repository that can be used for the long-tail of research data

• An OpenAIRE-CERN joint effort

• Multidisciplinary repository accepting– Multiple data types– Publications– Software

• Assigns a Digital Object Identifier (DOI)

• Links funding, publications, data & software

www.zenodo.org

Zenodo

Page 31: Horizon 2020 Open Research Data Pilot · Horizon 2020 Open Research Data Pilot Sarah Jones Digital Curation Centre sarah.jones@glasgow.ac.uk Twitter: @sjDCC Webinar for the moderated

Metadata and documentation• Metadata is basic descriptive information to help others identify and

understand the structure of the data e.g. title, author...

• Documentation provides the wider context e.g. the methodology / workflow, software, tools and any information needed to understand and reuse the data

• Relevant standards should be used for interoperability – check out the Metadata Standards Directory from the Research Data Alliance

http://rd-alliance.github.io/metadata-directory

Page 32: Horizon 2020 Open Research Data Pilot · Horizon 2020 Open Research Data Pilot Sarah Jones Digital Curation Centre sarah.jones@glasgow.ac.uk Twitter: @sjDCC Webinar for the moderated

OpenAIRE

http://vimeo.com/108790101

Open Access Infrastructure for research in Europe

• aggregates data on OA outputs

• mines & enriches it content by linking thing together

• provides services & APIs e.g. to generate publication lists and help with project reporting

www.openaire.eu

Page 33: Horizon 2020 Open Research Data Pilot · Horizon 2020 Open Research Data Pilot Sarah Jones Digital Curation Centre sarah.jones@glasgow.ac.uk Twitter: @sjDCC Webinar for the moderated

EUDAT servicesEUDAT offers a pan-European solution, providing a generic set of services to ensure minimum level of interoperability

Building common data services in close collaboration with 25+ communities

www.eudat.eu

Page 34: Horizon 2020 Open Research Data Pilot · Horizon 2020 Open Research Data Pilot Sarah Jones Digital Curation Centre sarah.jones@glasgow.ac.uk Twitter: @sjDCC Webinar for the moderated

FOSTER projectFacilitate Open Science Training for European Research

• Network of open access trainers

• Programme of open science courses

• Portal to training materials

• E-learning courses on open access and open data

www.fosteropenscience.eu

Page 35: Horizon 2020 Open Research Data Pilot · Horizon 2020 Open Research Data Pilot Sarah Jones Digital Curation Centre sarah.jones@glasgow.ac.uk Twitter: @sjDCC Webinar for the moderated

DMPonline

A web-based tool to help researchers write DMPs

Includes a template for Horizon 2020

https://dmponline.dcc.ac.uk

Page 37: Horizon 2020 Open Research Data Pilot · Horizon 2020 Open Research Data Pilot Sarah Jones Digital Curation Centre sarah.jones@glasgow.ac.uk Twitter: @sjDCC Webinar for the moderated

Thanks – any questions?

For more information check out the FOSTER e-learning course on the Horizon 2020 Open Research Data pilotwww.fosteropenscience.eu/content/horizon-2020-open-research-data-pilot-0