dans.knaw.nl DANS is an institute of KNAW and NWO FAIR data in trustworthy repositories EOSC Symposium 2019 Social & Cultural Data - taking the Users’ Perspective Ilona von Stein - DANS @DANSKNAW @FAIRsFAIR_EU 28 Nov 2019, Budapest

Page 1

dans.knaw.nl DANS is an institute of KNAW and NWO

FAIR data in trustworthy repositories

EOSC Symposium 2019

Social & Cultural Data - taking the Users’ Perspective

Ilona von Stein - DANS

@DANSKNAW @FAIRsFAIR_EU

28 Nov 2019, Budapest

Page 2

Outline

• FAIR data principles

• Data repositories

• FAIR-enabling data repositories

• FAIR data assessment: levels

• Conclusion and discussion

© Marjan Grootveld

Page 3

DANS is about keeping data FAIR

Institute of Dutch Academy and Research Funding Organisation (KNAW & NWO) since 2005

First predecessor dates back to 1964 (Steinmetz Foundation), Historical Data Archive 1989

Mission: promote and provide permanent access to digital research resources

Page 4

Core issue in research

Trust is a central element

The data re-user wants to know:

• Where do these data come from?

• How were they collected?

• What has happened with them along the way?

The data producer wants to know:

• How can I be sure that “they” interpret and use my data in the right way?

Illustration by Jørgen Stamp

digitalbevaring.dk CC BY 2.5 Denmark

Page 5

FAIR Guiding Principles

CREATING DATA

PROCESSING DATA

ANALYSING DATA

PRESERVING DATA

GIVING ACCESS TO DATA

RE-USING DATA

Simplified research data life cycle based on: https://www.ukdataservice.ac.uk/manage-data/lifecycle

Managing and documenting data through all stages helps to build trust.

http://www.nature.com/articles/sdata201618

www.force11.org/group/fairgroup/fairprinciples

Page 6

Everybody loves FAIR!

Everybody wants to be FAIR,

▪ but what does that mean?

▪ how to put the principles into practice?

▪ and how to measure FAIRness?

Images by kenwoodpress.com, Good Ware by flaticon.com, freebeesupply.com, openlibrary.org

Page 7

FAIRytale?

“Research data will not become nor stay FAIR by magic. We need skilled people, transparent processes, interoperable technologies and collaboration to build, operate and maintain research data infrastructures.”

Mari Kleemola, Finnish Social Science Data Archive/CoreTrustSeal Board, Secretary

https://tietoarkistoblogi.blogspot.com/2018/11/being-trustworthy-and-fair.html

Icon by Freepik from flaticon.com

Page 8

FAIR digital object / FAIR ecosystem

A model for FAIR Digital Objects

The components of a FAIR Ecosystem

Turning FAIR into reality: Final report and action plan from the European Commission Expert Group on FAIR Data

Page 9

Data repositories

A data repository [is a virtual place that] preserves, manages and provides access to many types of digital materials in a variety of formats*

• Institutional (institution or department)

• Discipline specific (research fields or subjects)

• Generic

Each may have specific requirements concerning e.g.

• data reuse

• file format and data structure

• types of metadata that can be used

Icon: https://icon-library.net/icon/data-repository-icon-5.html CC0 Public Domain License

* CoreTrustSeal glossary (coretrustseal.org), taken from the CASRAI dictionary (https://dictionary.casrai.org)

Page 10

Now, to the users' perspective ☺

Image by freepik.com

Why would they care about:

• FAIR data

• Data repositories

• FAIR-aligned data repositories

and

about the certification of all that?

Page 11

Why use a data repository?

Illustration: Ainsley Seago CC BY

• It makes life easier for researchers

• It builds scientific integrity and trust

• Your data remain:

- accessible

- understandable

- reusable

Repositories make and keep your data FAIR

Page 12

How do repositories make data FAIR?

For example:

• by providing persistent unique identifiers

- long-term findability, sustainable citations, appropriate academic credit

• by supporting findability through a public catalogue

- effective data discovery is key to data sharing

• by supporting you to add a usage license for the data

- clear terms and conditions that meet legal requirements

• by implementing and promoting metadata standards

- interoperability
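
As a rough illustration of why a persistent identifier supports sustainable citations, the sketch below turns a DOI written in several common forms into one resolvable https://doi.org/ link and builds a citation string from it. The function names, the simplified DOI pattern and the example metadata values are assumptions made for this illustration; they are not taken from the slides or from any repository's software.

```python
import re

# Simplified DOI shape (assumption): "10.", a registrant code, "/", a suffix.
DOI_PATTERN = re.compile(r"^10\.\d{4,9}/\S+$")

# Prefixes people commonly put in front of a DOI.
KNOWN_PREFIXES = (
    "https://doi.org/",
    "http://doi.org/",
    "https://dx.doi.org/",
    "http://dx.doi.org/",
    "doi:",
)


def normalise_doi(raw: str) -> str:
    """Return the bare DOI (e.g. '10.5281/zenodo.3409968') or raise ValueError."""
    doi = raw.strip()
    for prefix in KNOWN_PREFIXES:
        if doi.lower().startswith(prefix):
            doi = doi[len(prefix):]
            break
    if not DOI_PATTERN.match(doi):
        raise ValueError(f"not a DOI: {raw!r}")
    return doi


def doi_url(raw: str) -> str:
    """Return the resolvable link for a DOI, whatever form it was written in."""
    return "https://doi.org/" + normalise_doi(raw)


def cite(creator: str, year: int, title: str, publisher: str, raw_doi: str) -> str:
    """Build a simple dataset citation that ends in the persistent link."""
    return f"{creator} ({year}). {title}. {publisher}. {doi_url(raw_doi)}"


if __name__ == "__main__":
    # The DOI cited on this slide, written with an http:// prefix.
    print(doi_url("http://doi.org/10.5281/zenodo.3409968"))
    # Hypothetical metadata and DOI, for illustration only.
    print(cite("A. Researcher", 2019, "Example survey dataset",
               "Example Repository", "doi:10.1234/example"))
```

Because the citation carries the identifier rather than a web address chosen by the repository, the link can keep working even if the landing page moves.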

See also “Top 10 FAIR Data & Software Things”. Zenodo. http://doi.org/10.5281/zenodo.3409968

Illustrations: Jørgen Stamp: Digitalbevaring (CC-BY from http://digitalbevaring.dk/digital-bevaring/)

Page 13

How do repositories keep data FAIR?

They provide the long-term stewardship of FAIR digital objects, including curation activities, to ensure that the data remains FAIR over time

• Support for data producers (e.g. on file formats)

• Support for data users (e.g. on citation)

See also Mokrane & Recker, 2019. CoreTrustSeal-certified repositories: Enabling Findable, Accessible, Interoperable and Reusable (FAIR) data. https://ipres2019.org/static/pdf/iPres2019_paper_74.pdf

Image: https://datasupport.researchdata.nl

Page 14

How to find your ultimate repository? (1)

General steps for finding a repository are:

• Use, if possible, a certified repository

• Use a disciplinary repository if there is one

• Alternatively, use the institutional repository, if you have one where the data will also be preserved for the long term

For giving (i.e. archiving, sharing) and taking (i.e. reusing) data
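
The selection steps above can be read as a simple ranking. The sketch below is one possible interpretation, not a rule stated on the slide: it treats certification as the first criterion, then prefers a disciplinary repository, then an institutional one that preserves data for the long term. The `Repository` class, its fields and the example repository names are hypothetical.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass(frozen=True)
class Repository:
    name: str
    kind: str                              # "disciplinary", "institutional" or "generic"
    certified: bool = False                # e.g. holds a certification such as CoreTrustSeal
    long_term_preservation: bool = False   # preserves deposited data for the long term


def rank(repo: Repository) -> tuple:
    """Lower tuples are preferred: certified first, then by kind of repository."""
    if repo.kind == "disciplinary":
        kind_rank = 0
    elif repo.kind == "institutional" and repo.long_term_preservation:
        kind_rank = 1
    else:
        kind_rank = 2
    return (0 if repo.certified else 1, kind_rank)


def choose_repository(candidates: Sequence[Repository]) -> Optional[Repository]:
    """Return the preferred candidate, or None when there are no candidates."""
    if not candidates:
        return None
    return min(candidates, key=rank)


if __name__ == "__main__":
    # Hypothetical candidates, for illustration only.
    options = [
        Repository("Generic Hub", "generic"),
        Repository("University Archive", "institutional", long_term_preservation=True),
        Repository("Field Archive", "disciplinary", certified=True),
    ]
    print(choose_repository(options).name)
```

In practice the choice also depends on funder and publisher requirements, so a ranking like this is a starting point for discussion rather than a decision procedure.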

Image: https://www.inlinepolicy.com/blog/can-data-sharing-survive-the-new-data-protection-regime

Page 15

How to find your ultimate repository? (2)

• re3data.org

• Global registry of research data repositories

• Funded by the German Research Foundation (DFG)

• Filtered search and browse options

Page 16

Select a trustworthy repository

✓ Certified repositories are assessed against a set of guidelines to evaluate their trustworthiness

✓ A certified repository typically provides several services that ensure the FAIRness of your dataset.

✓ A few certification frameworks exist that can be used to assess the quality of a repository

✓ CoreTrustSeal is in common use for this

Page 17

Worldwide network of core certified repositories

Page 18

FAIR data assessment: levels

DATA REPOSITORY

F4. (meta)data are registered or indexed in a searchable resource

+ TECHNOLOGIES

+ PROCEDURES

+ EXPERTISE

+ PEOPLE

(META)DATA

F1. (meta)data are assigned a globally unique and persistent identifier

F2. data are described with rich metadata

F3. metadata clearly and explicitly include the identifier of the data it describes
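
To make the two levels concrete, the sketch below checks F1 to F3 against a single metadata record and takes F4 as a fact about the repository. It is a minimal illustration under stated assumptions: the list of fields that counts as "rich" metadata, the identifier prefixes accepted as persistent, and all function and field names are hypothetical, and the sketch is not the assessment tooling developed in FAIRsFAIR.

```python
# Fields treated as "rich metadata" for F2 (an assumption for this sketch).
REQUIRED_FIELDS = ("title", "creator", "description", "date", "license")

# Identifier schemes accepted as persistent for F1 (an assumption for this sketch).
PID_PREFIXES = ("https://doi.org/", "https://hdl.handle.net/", "urn:nbn:")


def is_persistent_identifier(value: str) -> bool:
    """True if the value starts with one of the accepted identifier schemes."""
    return value.startswith(PID_PREFIXES)


def assess_findability(data_pid: str, metadata: dict, indexed_in_catalogue: bool) -> dict:
    """Return a pass/fail result for each of the four Findable principles."""
    return {
        # F1: the data have a globally unique and persistent identifier.
        "F1": is_persistent_identifier(data_pid),
        # F2: the data are described with rich metadata.
        "F2": all(metadata.get(name) for name in REQUIRED_FIELDS),
        # F3: the metadata explicitly include the identifier of the data.
        "F3": bool(data_pid) and metadata.get("identifier") == data_pid,
        # F4: a repository-level property, supplied by the caller.
        "F4": indexed_in_catalogue,
    }


if __name__ == "__main__":
    # Hypothetical record, for illustration only.
    pid = "https://doi.org/10.1234/example"
    record = {
        "identifier": pid,
        "title": "Example survey dataset",
        "creator": "A. Researcher",
        "description": "Responses to an example survey.",
        "date": "2019-11-28",
        "license": "CC BY 4.0",
    }
    print(assess_findability(pid, record, indexed_in_catalogue=True))
```

Note that F4 cannot be derived from the record itself, which is why it enters as a separate argument: it depends on the repository's technologies, procedures, expertise and people.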

Page 19

Assessment work in FAIRsFAIR

Focus on:

• Evaluation and certification of repositories that enable FAIR data

• Assessment of FAIR data within a repository

The FAIRsFAIR project aims to supply practical solutions for the use of FAIR data principles

Duration: March 2019 – Feb 2022

Budget: €10 million

22 partners from 8 member states

www.fairsfair.eu

@FAIRsFAIR_EU

Page 20

Evaluation and certification of repositories

• Support the FAIR alignment of certification schemes for data repositories, building on existing frameworks such as CoreTrustSeal

• Call for repositories' involvement, offering in-depth FAIR-aligned certification support, to expand the European network of trustworthy repositories enabling FAIR

• Provide an improved registry for finding and selecting relevant trustworthy repositories

Page 21

Assessment of data FAIRness

• Two use-cases:

- Manual FAIR self-assessment tool for researchers - prior to depositing

- Automatic assessment tool for data repositories – existing datasets

• This will be tested through a number of pilots

Page 22

Takeaway messages

• Trust is a central element in data sharing and data reuse

• FAIR-aligned repositories enhance the accessibility, understandability and reusability of data over time

• FAIR data assessment can be done at different levels and must include infrastructure

• Certified repositories keep FAIR data FAIR

Icon by Freepik from flaticon.com

Page 23

dans.knaw.nl DANS is an institute of KNAW and NWO

Thanks for your attention!

[email protected]

@DANSKNAW @FAIRsFAIR_EU

dans.knaw.nl/en

Acknowledgements: Marjan Grootveld, Ingrid Dillo, Mustapha Mokrane (DANS)