19
University Library. Report to the University of Sheffield Research Data Management Service Delivery Group Institutional Research Data Management Technical Infrastructures Date: 14/11/2014 Author: John A. Lewis

Institutional Research Data Management Technical ... · 2.7. KCL – The establishment of a register showing the location of research datasets, both at King’s and in external repositories

  • Upload
    others

  • View
    0

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Institutional Research Data Management Technical ... · 2.7. KCL – The establishment of a register showing the location of research datasets, both at King’s and in external repositories

University Library.

Report to the University of Sheffield Research Data Management Service Delivery Group

Institutional Research Data Management Technical Infrastructures

Date: 14/11/2014

Author: John A. Lewis

Page 2: Institutional Research Data Management Technical ... · 2.7. KCL – The establishment of a register showing the location of research datasets, both at King’s and in external repositories

Institutional Research Data Management Technical Infrastructures

The institutional research data management technical infrastructure provides the technical facilities for registering, cataloguing, storing, preserving and sharing research data. This document lists UK Universities that have established RDM technical infrastructures, to some extent, and briefly describes the main components of the infrastructure. A number of other UK universities that are planning such infrastructures, and some that are hosting discipline based repositories, are also listed and the infrastructure briefly described.

1. UK Universities with Research Data Repositories or Catalogues

1.1. Bristol http://data.bris.ac.uk/data/ [Fig.1] CKAN has been selected for the data repository and functions as a catalogue of research data. This integrates with the PURE CRIS, which functions as a catalogue of research outputs. The architecture is described at http://data.blogs.ilrt.org/2012/02/03/data-bris-architecture/.

1.2. Cambridge DSpace https://www.repository.cam.ac.uk/ [Fig.2] The DSpace platform institutional repository is now able to preserve and publish research data.

1.3. Edinburgh Datashare http://datashare.is.ed.ac.uk/ [Fig.3] This repository is based on DSpace. The technical infrastructure at Edinburgh involves integration with PURE, active data infrastructure and the DMPonline tool. Development of the infrastructure is discussed at http://libraryblogs.is.ed.ac.uk/blog/2013/12/06/the-four-quadrants-of-research-data-curation-systems/. 1.4. Essex Research Data http://researchdata.essex.ac.uk/ [Fig.4] This data repository is built on the EPrints platform, modified using the ReCollect plugin to accept datasets, as described in http://researchdataessex.wordpress.com/2014/09/. The service includes allocation of Datacite DOIs. Guidance is published at http://www.data-archive.ac.uk/media/391123/rdessex_recollectuserguide.pdf.

1.5. Open Research Exeter https://ore.exeter.ac.uk/repository/ [Fig.5] Institutional repository based on the DSpace platform, material may be deposited via Symplectic, although it is recommended to upload data via the ORE interface. ORE’s content includes journal articles, conference papers, working papers, reports, book chapters, videos, audio, images, multimedia research project outputs, raw data and analysed data. Exeter's three former repositories (The Exeter Research and Institutional Content Archive (ERIC), Digital Collections Online (DCO) and the Exeter Data Archive (EDA)) were merged into ORE in March 2013. The project final report at https://ore.exeter.ac.uk/repository/bitstream/handle/10871/14845/open_exeter_final_report_FINAL.pdf describes the development of the repository. 1.6. GSoA RADAR http://radar.gsa.ac.uk/ This EPrints based Glasgow School of Art institutional repository accepts a wide range of objects including research data. This repository was the subject of a case study for the KAPTUR project.

1.7. Glyndwr University Research Data Catalogue http://glynfo.glyndwr.ac.uk/course/view.php?id=41&section=11 This is a data catalogue facility of the bespoke CRIS.

1.8. Goldsmiths Research Data Catalogue http://eprints-data.gold.ac.uk/ Goldsmiths research data catalogue is built on the EPrints platform and results from the work done

for the KAPTUR project.

Page 3: Institutional Research Data Management Technical ... · 2.7. KCL – The establishment of a register showing the location of research datasets, both at King’s and in external repositories

1.9. Hertfordshire UHRA http://rdm.herts.ac.uk/rdm/uh-research-archive.html [Fig.6] This is a DSpace institutional repository that is being expanded to include a data catalogue and a research data archive http://www.herts.ac.uk/rdm/finishing/uh-research-archive.

1.10. Hull Hydra https://hydra.hull.ac.uk/ [Fig.7] The institutional repository at Hull is built on the Hydra micro-services architecture, which involves the Fedora repository system. The repository is designed to hold a wide range of digital resources including research datasets. The development of the infrastructure is described at http://www.dcc.ac.uk/resources/developing-rdm-services/storing-sharing-data-hull.

1.11. University of Lincoln Researcher Dashboard https://orbital.lincoln.ac.uk/ [Fig.8] The Researcher Dashboard is the interface for the Data deposit workflow, facilitated by the ‘Orbital Bridge’ application. This links the various components of the RDMI: a CKAN based data registry, an EPrints IR for published research papers, network storage and Lincoln’s Awards Management System. The infrastructure architecture is discussed at http://orbital.blogs.lincoln.ac.uk/2012/12/05/orbital-ams-ckan-eprints-datacite/ and http://orbital.blogs.lincoln.ac.uk/2012/12/06/orbital-deposit-of-dataset-records-to-the-lincoln-repository-workflow/.

1.12. LSE http://eprints.lse.ac.uk/

Researchers are recommended to deposit data in ‘LSE Research Online’; the EPrints institutional repository (which may now contain datasets). The LSE Digital Library http://digital.library.lse.ac.uk uses the Hydra platform for preservation and access of digital collections (using Fedora and SOLR but not the Blacklight discovery layer); interoperability between the Digital Library and the ePrints repository is planned.

1.13. University of Newcastle https://research.ncl.ac.uk/rdm/tools/ [Fig.9] A Research Data infrastructure has been implemented at Newcastle which includes a CKAN data portal (for archiving and publishing data) together with a number of in-house built systems – a MyProject (a project and awards management system), MyImpact (a researcher profile and publication information system), a Research Data Catalogue (linking data, projects and publications), a VRE and e-Science Central (research collaboration tools). Architectural considerations are discussed in the document at http://research.ncl.ac.uk/media/sites/researchwebsites/iridium/iridium_research_data_catalogue_specification_07_6_2013_v1_PT.pdf.

1.14. Oxford DataBank https://databank.ora.ox.ac.uk/ [Fig.10] Databank is the Fedora based data repository for the University of Oxford. Data may be stored and preserved in the long-term, retrieved and published from anywhere on the web. This is a component of the DataFlow infrastructure at Oxford, alongside DataStage which provides local management of active research data, including metadata annotation and a collaborative workflow, and the data catalogue, DataFinder. The RDM infrastructure also includes the Online Research Database Service (ORDS) http://ords.ox.ac.uk/ and the institutional repository, Oxford University Research Archive (ORA) http://ora.ox.ac.uk/. Researchers may upload full-text articles to ORA using Symplectic. ORA-Data will replace Databank as the archival store for digital data produced resulting from research by Oxford academics. This system will be Fedora based, with Hydra microservices managing workflows. ORA-Data is provided by the Bodleian Libraries and complements other data archives by providing a local archive for data not deposited elsewhere. ORA-Data will be used to store data that underpins scholarly publications, so that the data can be cited and accessed. ORA-Data is designed to hold records of datasets, irrespective of their location, if the actual data are stored elsewhere. A metadata record for datasets can be created and made available in ORA-Data that includes a link to the location of the dataset, and be harvested by Datafinder, Oxford’s dataset

Page 4: Institutional Research Data Management Technical ... · 2.7. KCL – The establishment of a register showing the location of research datasets, both at King’s and in external repositories

catalogue. DataStage facilitates the deposit of data (with accompanying metadata) into ORA-Data and other data repositories. http://www.bodleian.ox.ac.uk/bdlss/digital-services/data-archiving.

1.15. Oxford Brookes https://radar.brookes.ac.uk/radar/access/home.do

The Equella based ‘Research Archive and Digital Asset Registry’ is to be used to catalogue research

datasets.

1.16. C4DM-RDR http://c4dm.eecs.qmul.ac.uk/rdr/ The Research Data Repository at Queen Mary University of London, Centre for Digital Music is a Dspace based repository, specifically configured for long-term preservation and sharing of multimedia file formats.

1.17. St. Andrews https://risweb.st-andrews.ac.uk/portal/en/

The Pure Research Portal holds records of Datasets and other research outputs, and provides a data

registry / catalogue, for data held locally in access storage and archive storage. The DSpace

Institutional Repository, Research@StAndrews:FullText http://research-repository.st-

andrews.ac.uk/, currently holds full text papers, submitted through Pure. A research data repository

is to be piloted soon.

1.18. ePrints Southampton http://eprints.soton.ac.uk [Fig.11] The EPrints institutional repository at the University of Southampton has extended the existing the list of data types accepted to include datasets and experiments, using the ReCollect plugin. EPrints at Southampton now holds research data underlying published research (papers) outputs. Another strand of work, using Sharepoint to catalogue and share active data, has yet to be implemented. The University of Southampton has a federated approach to repository management and so there are a number of instances of ePrints being used by departments to curate their research outputs. Aspects of data systems architecture at Southampton are discussed in blog posts at http://datapool.soton.ac.uk/tag/data-systems-architecture/.

1.19. University of the Arts London Data Repository http://www.researchdata.arts.ac.uk/ This repository for research data is built on an EPrints platform.

1.20. UCA Research Online http://www.research.ucreative.ac.uk/ UCARO is the institutional repository and accepts a wide range of research outputs including research data. This is built on the EPrints platform.

1.21. UWE Research Data Repository http://researchdata.uwe.ac.uk/ [Fig.12] An instance of EPrints was modified for use as the data repository at UWE. The project developed its own metadata profile for research data, having decided against subscribing to the Datacite scheme and before the Recollect plugin became available http://www2.uwe.ac.uk/services/library/using_the_library/Services%20for%20researchers/Web%20version%20Objectives%20requirements%20and%20standards%20for%20a%20data%20repository%20v4.pdf.

1.22. Warwick http://wrap.warwick.ac.uk/ Researchers ‘register’ their datasets with the institution through WRAP (the EPrints based institutional repository) in the same way as they would give information about publications. Datasets may be deposited in WRAP.

Page 5: Institutional Research Data Management Technical ... · 2.7. KCL – The establishment of a register showing the location of research datasets, both at King’s and in external repositories

2. UK Universities with Research Data Repositories in development or not yet existing

2.1. Bath – Considering ePrints for the Data Registry, linking with PURE for the Data Catalogue,

possibly on top of Arkivum for data preservation.

2.2. Birmingham - planned development: BEAR Store, a data archive for long term storage, particularly of data associated with publications. EPrints based institutional repository http://eprints.bham.ac.uk/.

2.23. Cardiff – Nothing provided or planned. Eprints based institutional repository http://orca.cf.ac.uk/.

2.4. Durham – Nothing provided or planned. Eprints based institutional repository http://eprints.dur.ac.uk/.

2.5. Glasgow – Pilot EPrints research data repository with links to award information. Eprints based institutional repository, Enlighten http://eprints.gla.ac.uk/.

2.6. ICL – Library becoming licensed to mint DOIs and will be developing a data catalogue. Researchers are advised that the preferred repository will be the discipline specific repository, failing that they should use the Imperial College repository: the DSpace based institutional repository, Spiral http://spiral.imperial.ac.uk/.

2.7. KCL – The establishment of a register showing the location of research datasets, both at King’s and in external repositories is planned for delivery by the end of the 2014-15 session. ‘Research Portal’ is the PURE based institutional repository https://kclpure.kcl.ac.uk/portal/.

2.8. Lancaster – Hydra will possibly be used for the data repository. The Pure CRIS is to hold records

of Research Data outputs, to be made publicly available through Pure Portal. The CRIS may be

modified so that small datasets may also be stored. http://www.lancaster.ac.uk/library/information-

for/researchers/research-data-management/data-and-pure/

2.9. Leeds – Pilot Eprints data repository with Symplectics link, being tested as a continuation of the Roadmap project. Two instances of ePrints are envisaged, one for data repository, the other for Registry / catalogue. Have an EPrints based institutional repository http://eprints.whiterose.ac.uk/.

2.10. Liverpool – Nothing provided or planned. Currently have an EPrints based institutional repository https://research-archive.liv.ac.uk/.

2.11. Loughborough - Have announced that their infrastructure will involve Figshare for Institutions,

Arkivum and Symplectic http://ow.ly/DUAug.

2.12. Manchester – The University of Manchester Library is providing a Research Data Management service. No catalogue is yet available, but metadata is stored with DMP in the Research Data Management System. A data storage service is provided, and the University is currently working to provide a research data repository service. Manchester’s institutional repository, eScholar, is a Fedora institutional repository (supports submission of 16 main types of scholarly output) https://www.escholar.manchester.ac.uk/.

2.13. Nottingham - work is currently underway to roll out an institutional data repository/catalogue. Nottingham EPrints institutional repository content policy allows it to hold all types of materials except theses and dissertations http://eprints.nottingham.ac.uk/.

Page 6: Institutional Research Data Management Technical ... · 2.7. KCL – The establishment of a register showing the location of research datasets, both at King’s and in external repositories

2.14. QMU - QMRO, the institutional repository is on the DSpace platform, and this is likely to be the platform for the institutional data repository. The Centre for Digital Music (C4DM) at Queen Mary University of London has a DSpace repository for Digital Music Data (see above).

2.15. Queen’s University Belfast – No data repository / catalogue yet. Two outward facing repository systems: ‘QUB Research Portal’ http://pure.qub.ac.uk/portal/ is PURE based, and ‘Qcite’ http://qcite.qub.ac.uk/, the institutional repository, is DSpace based. Both hold only papers.

2.16. Sheffield – A number of systems are being considered for the RDM technical infrastructure, including ePrints (currently used for the shared institutional repository), Symplectic Elements (the CRIS), various digital preservation systems, Hydra, ‘Figshare for Institutions’ and Arkivum.

2.17. Strathclyde – considering ePrints data registry / repository, with PURE as a data catalogue.

2.18. Surrey – RDM Roadmap task list includes identification of metadata catalogue – possibly

Symplectic https://www.surrey.ac.uk/research/researchdata/RDM%20Roadmap(V2).pdf.

2.19. UCL – No data repository / catalogue, but UCL Research Data Archive and access facility are planned for pilot late 2014. The institutional repository, UCL Discovery, is run on the EPrints platform. UCL hosts CAVA http://www.ucl.ac.uk/ls/cava/, a Human Communication AV Archive with storage on the Library Services Digital Collections service (ExLibris Digitool platform).

2.20 UEL – Considering the use of ePrints repository for registry and catalogue (no CRIS) above

Arkivum for storage.

2.21. York - The University Information Directorate is working towards providing a University data catalogue and repository for data. ‘YODL’ https://dlib.york.ac.uk/yodl/app/home/index, a DAMS based on Fedora Commons platform, currently holds some Humanities research data. One proposal involves YODL holding accessible data, the Archivematica digital preservation system to manage archival storage, and the CRIS, Pure, providing a Data registry and Data catalogue (Pure Portal) which facilitates discovery and access to the data https://risweb.st-andrews.ac.uk/portal/en/activities/eurocris-strategic-membership-meeting%28db41094d-4635-4803-9e57-01b856e1de12%29.html.

Page 7: Institutional Research Data Management Technical ... · 2.7. KCL – The establishment of a register showing the location of research datasets, both at King’s and in external repositories

3. Discipline-based research data repositories hosted by UK HEIs

3.1. ADS (Archaeology Data Service) http://archaeologydataservice.ac.uk/ A disciplinary data repository based at the University of York; infrastructure includes the Fedora Commons repository platform.

3.2. Edina ShareGeo http://edina.ac.uk/projects/sharegeo/ Not an institutional repository, but based at The University of Edinburgh, here, DSpace has been customised to offer a repository that eases both the deposit and discovery of geospatial data.

3.3. Leeds DART Data Portal http://dartportal.leeds.ac.uk/ The Detection of Archaeological Residues using Remote-sensing Techniques (DART) research project maintains a CKAN data portal for the open data outputs from the project.

3.4. eCrystals at the University of Southampton http://ecrystals.chem.soton.ac.uk/ The University of Southampton department of Chemistry holds data from X-ray diffraction experiments in an EPrints repository.

3.5. UKDA http://www.data-archive.ac.uk/ Not an institutional, but a national social and economic research data repository based at the University of Essex. The UKDA provides the UK Data Service, which curates key quantitative and qualitative data, UK Data Service ReShare, curating data from ESRC funded research and the HDS

(successor to the AHDS). These are housed on a modified EPrints repository platform.

3.6. CARMEN Portal http://www.carmen.org.uk/portal The CARMEN Portal is a VRE to support e-Neuroscience, providing storage and processing services

over a Grid infrastructure. The CARMEN system is a three-tier web architecture consisting of a web

portal, an application layer and a storage layer, developed by a collaboration of researchers from 11

UK universities. The Java portal allows the user to access data and to create and run analysis tool on

remote servers. The storage layer is shared between MySQL databases and a SRB (Storage Resource

Broker) system. The application layer consists of Java servlets, providing a middleware layer that

bridges storage and portal.

John A. Lewis 14/11/2014

Page 8: Institutional Research Data Management Technical ... · 2.7. KCL – The establishment of a register showing the location of research datasets, both at King’s and in external repositories

CKAN Public

Research Data

Catalogue

Data Centre / Disciplinary Repository

Data Registry Data Archive /

Storage

Researcher & Project metadata

Dataset Discovery

Automatic Metadata

Capture

SWO

RD

2

Datase

t Me

tadata H

arvest O

AI-P

MH

Au

tom

atically cap

ture

d m

etad

ata

Data P

roce

ssing A

ctive Data

Arch

ive D

ata Exte

rnal

SWO

RD

2

SWO

RD

2

Researcher & Project metadata

Research Data Flow Metadata Flow

Institu

tion

/ V

irtual O

rg

Datase

t Me

tadata &

De

scriptio

n at p

roje

ct en

d

<- P

rivate

Pu

blic ->

Datase

t Pu

blicatio

n

Datase

t Pu

blicatio

n

<- Data Metadata ->

Data Capture

Instrument

PURE

PURE Portal Institutional Repository

CKAN Active Data

Registry

DMP Service

DMPlans

(Man

ual U

plo

ad)

Researcher

Datase

t Acce

ss & R

eu

se

Extern

ally Cu

rated

Datase

t Me

tadata &

Do

cum

en

tation

on

Datase

t Pu

blicatio

n (M

anu

al up

load

)

Datase

t Pu

blicatio

n

Datase

t Acce

ss by e

xtern

al collab

orato

rs be

fore

pro

ject e

nd

Me

tadata H

arvest o

n

Datase

t Pu

blicatio

n

1. DataBris

HR & RMS

Research Data Storage Facility

Active Data Storage

Local Research Data Archive (Curated)

Research Data for Public Access

Researcher

Metadata on Dataset Publication

Page 9: Institutional Research Data Management Technical ... · 2.7. KCL – The establishment of a register showing the location of research datasets, both at King’s and in external repositories

Data Centre / Disciplinary Repository

Data Registry Data Archive /

Storage

Dataset Discovery

Au

tom

atic Me

tadata

Cap

ture

SWO

RD

2

Datase

t Me

tadata H

arvest O

AI-P

MH

Au

tom

atically captu

red

me

tadata &

Data P

roce

ssing A

ctive Data

Arch

ive D

ata SW

OR

D2

SWO

RD

2

Researcher & Project metadata

Research Data Flow Metadata Flow

External Research

Data Archive ?

Datase

t Me

tadata o

n

Datase

t Pu

blicatio

n

<- Private

P

ub

lic ->

Datase

t Pu

blicatio

n

File m

etad

ata

<- Data Metadata ->

Dataset Preservation

Data Capture

Instrument

DMP Service

DMPlans

DSp

ace C

on

ne

ctor

Researcher

Researcher & Project metadata

Symplectic Elements

HR & RMS

Re

search

er &

Pro

ject m

etad

ata

Dataset Access & Reuse

Extern

ally Cu

rated

Datase

t Me

tadata &

Do

cum

en

tation

on

Datase

t Pu

blicatio

n (M

anu

al up

load

)

Locally Curated Dataset Metadata & Description at

project end (Manual Upload)

Extern

al

Institu

tion

/ V

irtual O

rg

Datase

t Pu

blicatio

n

2. DSpace Cambridge

Data Registry

Cache for Public Access

Local Research Data Archive

Dspace Institutional Repository

Datase

t Pu

blicatio

n

Active Data Storage

Data Catalogue

Datase

t De

po

sit at pro

ject e

nd

Researcher

SWO

RD

2

Datase

t Pu

blicatio

n ?

Datase

t Acce

ss & R

eu

se

Page 10: Institutional Research Data Management Technical ... · 2.7. KCL – The establishment of a register showing the location of research datasets, both at King’s and in external repositories

Data Catalogue Pure / Dspace ?

Data Centre / Disciplinary Repository

Data Registry Data Archive /

Storage

Dataset Discovery

Au

tom

atic Me

tadata

Cap

ture

Automatically captured metadata

Data P

roce

ssing A

ctive Data

Arch

ive D

ata

SWO

RD

2

Research Data Flow Metadata Flow

Research Data Vault

(Arkivum?)

Dataset Metadata on Dataset Publication

Datase

t Me

tadata &

Do

cum

en

tation

on

Pu

blicatio

n to

exte

rnal R

ep

osito

ry

Dataset Deposit at project end

<- Private

P

ub

lic ->

Datase

t P

ub

lication

<- Data Metadata ->

Data Capture

Instrument

DMP Service

?

DMPlans

Locally C

urate

d D

ataset M

etad

ata & D

escrip

tion

at pro

ject e

nd

(M

anu

al Up

load

)

Researcher

Dataset Access & Reuse

SWO

RD

2

Datase

t Pu

blicatio

n ?

Datase

t Acce

ss & R

eu

se

Extern

al

Institu

tion

/ V

irtual O

rg

SWO

RD

2

Datase

t Me

tadata H

arvest O

AI-P

MH

Researcher & Project metadata

3. Edinburgh Datashare

Data Asset Registry (Pure)

Data Storage for Public

Access

DataShare (DSpace)

Active Data Storage

DSp

ace M

etadata Sto

re

SWO

RD

2

HR & RMS

Researcher

SWORD2

Datase

t Me

tadata

Harve

st OA

I-PM

H

Datase

t Pu

blicatio

n

Extern

ally Cu

rated

Datase

t Me

tadata &

Do

cum

en

tation

on

Datase

t Pu

blicatio

n (M

anu

al up

load

) ?

Page 11: Institutional Research Data Management Technical ... · 2.7. KCL – The establishment of a register showing the location of research datasets, both at King’s and in external repositories

Data Centre / Disciplinary Repository

Data Registry Data Archive /

Storage

Dataset Discovery

Au

tom

atic Me

tadata

Cap

ture

Datase

t Me

tadata H

arvest O

AI-P

MH

Automatically captured metadata

Data P

roce

ssing A

ctive Data

Arch

ive D

ata

SWO

RD

2

Researcher & Project metadata

Research Data Flow Metadata Flow

<- Private

P

ub

lic ->

(File metadata)

<- Data Metadata ->

Data Capture

Instrument

CRIS?

Essex Research Data

(ePrints) Cache for

Public Access

ePrints Data

Storage ePrints

Metadata Catalogue

DMP Service

?

DMPlans

Active Data Storage

Datase

t Me

tadata &

De

scriptio

n at p

roje

ct en

d (M

anu

al Up

load

)

eP

rints C

on

ne

ctor

Researcher

Researcher & Project metadata

Dataset Access & Reuse

Extern

ally Cu

rated

Datase

t Me

tadata &

Do

cum

en

tation

on

Datase

t Pu

blicatio

n (M

anu

al up

load

)

Datase

t Acce

ss & R

eu

se

Extern

al

Institu

tion

/ V

irtual O

rg

SWO

RD

2

SWO

RD

2

Datase

t Pu

blicatio

n

Datase

t Pu

blicatio

n

Me

tadata H

arvest o

n

Datase

t Pu

blicatio

n

4. Essex Research Data

Researcher

HR & RMS

External Research

Data Archive ?

Datase

t De

po

sit at pro

ject e

nd

Dataset Preservation

Page 12: Institutional Research Data Management Technical ... · 2.7. KCL – The establishment of a register showing the location of research datasets, both at King’s and in external repositories

Data Centre / Disciplinary Repository

Data Registry Data Archive /

Storage

Dataset Discovery

Au

tom

atic Me

tadata

Cap

ture

Datase

t Me

tadata H

arvest O

AI-P

MH

Automatically captured metadata

Data P

roce

ssing A

ctive Data

Arch

ive D

ata

Researcher & Project metadata

Research Data Flow Metadata Flow

<- P

rivate

Pu

blic ->

(File metadata)

<- Data Metadata ->

Data Capture

Instrument

Symplectic

Open Research

Exeter (DSpace) Cache for

Public Access

DSpace Data

Storage DSpace

Metadata Catalogue

DMP Service

?

DMPlans

Active Data Storage

Datase

t Me

tadata &

De

scriptio

n at p

roje

ct en

d (M

anu

al Up

load

)

DSp

ace C

on

ne

ctor

Researcher

Researcher & Project metadata

Dataset Access & Reuse

Extern

ally Cu

rated

Datase

t Me

tadata &

Do

cum

en

tation

on

Datase

t Pu

blicatio

n (M

anu

al up

load

)

Extern

al

Institu

tion

/ V

irtual O

rg

SWO

RD

2

SWO

RD

2

Datase

t Pu

blicatio

n

SWO

RD

2

Datase

t Acce

ss & R

eu

se

Datase

t Pu

blicatio

n

Me

tadata H

arvest o

n

Datase

t Pu

blicatio

n

5. Open Research Exeter

Researcher

HR & RMS

External Research

Data Archive ?

Datase

t De

po

sit at pro

ject e

nd

Dataset Preservation

Page 13: Institutional Research Data Management Technical ... · 2.7. KCL – The establishment of a register showing the location of research datasets, both at King’s and in external repositories

Data Centre / Disciplinary Repository

Data Registry Data Archive /

Storage

Dataset Discovery

Automatic Metadata Capture

SWO

RD

2

Datase

t Me

tadata H

arvest O

AI-P

MH

Data P

roce

ssing A

ctive Data

Arch

ive D

ata Exte

rnal

SWO

RD

2

Research Data Flow Metadata Flow

Institu

tion

/ V

irtual O

rg

Datase

t Me

tadata o

n D

ataset P

ub

lication

<- P

rivate

Pu

blic ->

Datase

t Pu

blicatio

n

Locally curated?

<- Data Metadata ->

Data Capture

Instrument

Researcher

Researcher & Project metadata

HR & RMS

Researcher & Project metadata

Dataset Access & Reuse

Datase

t Pu

blicatio

n

Extern

ally Cu

rated

Datase

t Me

tadata &

Do

cum

en

tation

on

Datase

t Pu

blicatio

n (M

anu

al up

load

)

Dataset Metadata

Sync

Sharepoint Active Data Metadata Registry

Active Data Storage

Me

tadata H

arvest

6. Herts UHRA

Dataset Metadata & Description (Manual upload)

DMP Service

DMPlans

Dataset Metadata & Description at project end

Researcher

UHRA (DSpace)

Cache for Public Access

DSpace Data

Storage DSpace

Metadata Store

PURE

(PURE Portal) Research

Data Catalogue

SWO

RD

2

Datase

t Acce

ss & R

eu

se

Datase

t Pu

blicatio

n

Externally curated?

Dataset Discovery?

External Research

Data Archive (Arkivum)

Datase

t De

po

sit at pro

ject e

nd

Dataset Preservation

Page 14: Institutional Research Data Management Technical ... · 2.7. KCL – The establishment of a register showing the location of research datasets, both at King’s and in external repositories

Data Centre / Disciplinary Repository

Data Registry Data Archive /

Storage

Re

search

er

& P

roje

ct m

etad

ata

Dataset Discovery

Automatic Metadata Capture

SWO

RD

2

Datase

t Me

tadata H

arvest O

AI-P

MH

Datase

t Me

tadata &

De

scriptio

n

Data P

roce

ssing A

ctive Data

Arch

ive D

ata Exte

rnal SW

OR

D2

Researcher & Project metadata

Research Data Flow Metadata Flow

Institu

tion

/ V

irtual O

rg

Datase

t Me

tadata o

n

Datase

t Pu

blicatio

n

<- Private

P

ub

lic ->

Datase

t Pu

blicatio

n

File m

etad

ata

<- Data Metadata ->

Data Capture

Instrument

Converis

HR & URMS

Co

nve

riss Co

nn

ecto

r

Researcher

Re

search

er &

Pro

ject m

etad

ata

Dataset Access & Reuse

Datase

t Pu

blicatio

n

Extern

ally Cu

rated

Datase

t Me

tadata &

Do

cum

en

tation

on

Datase

t Pu

blicatio

n (M

anu

al up

load

)

Locally C

urate

d D

ataset M

etad

ata & D

escrip

tion

at pro

ject e

nd

(M

anu

al Up

load

)

Sharepoint & Sakai

Active Data Metadata Registry

Active Data

Storage

Data Catalogue (Hydra interface – Blacklight & SOLR)

Hull Hydra (Fedora)

7. Hull Hydra

DMP Service

DMPlans

Dataset Metadata & Description (Manual upload)

SWO

RD

2

Datase

t Acce

ss & R

eu

se

Datase

t Pu

blicatio

n

Fedora Cache Public Access

Local Research Data Archive Data Registry?

Externally Curated Dataset Metadata

External Research

Data Archive ?

Datase

t De

po

sit at pro

ject e

nd

Dataset Preservation

Researcher

Page 15: Institutional Research Data Management Technical ... · 2.7. KCL – The establishment of a register showing the location of research datasets, both at King’s and in external repositories

ePrints Data Catalogue

Data Centre / Disciplinary Repository

Data Registry Data Archive /

Storage

Project metadata

Dataset Discovery

SWO

RD

2

Datase

t Me

tadata

Harve

st OA

I-PM

H

Au

tom

atically captu

red

me

tadata

Data P

roce

ssing A

ctive Data

Arch

ive D

ata Exte

rnal

SWO

RD

2

Research Data Flow Metadata Flow

Institu

tion

/ V

irtual O

rg

Dataset Metadata

Dataset Metadata on

Dataset Publication

<- Private

P

ub

lic ->

Datase

t Pu

blicatio

n

Datacite DOI

<- Data Metadata ->

Orbital Bridge Lincoln CKAN

Cache for Public Access

Local Research Data

Archive Data Registry

DMP Service

DMPlans Dataset Metadata & Description at project end (Manual Upload)

Researcher

Re

search

er m

etad

ata

Datase

t Acce

ss & R

eu

se

Extern

ally Cu

rated

Datase

t Me

tadata &

Do

cum

en

tation

on

Datase

t Pu

blicatio

n (M

anu

al up

load

) 8. Lincoln Orbital

Researcher metadata Nucleus

AMS

SWO

RD

2

Researcher & Project metadata

ePrint record URL

Owncloud Active Data Metadata Registry

Active Data Storage

SWO

RD

2

Datase

t Acce

ss & R

eu

se

Datase

t Pu

blicatio

n

Researcher Automatic Metadata Capture

Data Capture

Instrument

External Research

Data Archive ?

Datase

t De

po

sit at pro

ject e

nd

Dataset Preservation

Page 16: Institutional Research Data Management Technical ... · 2.7. KCL – The establishment of a register showing the location of research datasets, both at King’s and in external repositories

Data Centre / Disciplinary Repository

Data Registry Data Archive /

Storage

Researcher & Project metadata

Dataset Discovery

SWO

RD

2

Datase

t Me

tadata H

arvest O

AI-P

MH

Au

tom

atically captu

red

me

tadata

Data P

roce

ssing A

ctive Data

Arch

ive D

ata

SWO

RD

2

Research Data Flow Metadata Flow

Dataset Metadata on Dataset Publication

<- Private

P

ub

lic ->

Datase

t Pu

blicatio

n

<- Data Metadata ->

My Projects

DMP Online

DMPlans

Researcher

Dataset Access & Reuse

Extern

ally Cu

rated

Datase

t Me

tadata &

Do

cum

en

tation

on

Datase

t Pu

blicatio

n (M

anu

al up

load

)

Datase

t Me

tadata &

De

scriptio

n at p

roje

ct en

d (M

anu

al Up

load

)

Extern

al

Institu

tion

/ V

irtual O

rg

Dataset Metadata Harvest

9. Newcastle

VRE & E-Science Central

Active Data Metadata Registry

Active Data Storage

Automatic Metadata Capture

Data Capture

Instrument

Newcastle

CKAN Cache for

Public Access

Local Research Data Archive

CKAN Metadata

Store (Catalogue)

Datase

t Acce

ss & R

eu

se

Datase

t Pu

blicatio

n

My Impact

Research Data

Catalogue

Researcher

External Research

Data Archive ?

Datase

t De

po

sit at pro

ject e

nd

Dataset Preservation

Page 17: Institutional Research Data Management Technical ... · 2.7. KCL – The establishment of a register showing the location of research datasets, both at King’s and in external repositories

Data Centre / Disciplinary Repository

Data Registry Data Archive /

Storage

Dataset Discovery

SWO

RD

2

Datase

t Me

tadata H

arvest O

AI-P

MH

Data P

roce

ssing A

ctive Data

Arch

ive D

ata

SWO

RD

2

Research Data Flow Metadata Flow

Dataset Metadata on

Dataset Publication

<- Private

P

ub

lic ->

Datase

t Pu

blicatio

n

<- Data Metadata ->

DMP Service

DMPlans

Researcher

Dataset Access & Reuse

Extern

ally Cu

rated

Datase

t Me

tadata &

Do

cum

en

tation

on

Datase

t Pu

blicatio

n (M

anu

al up

load

)

Datase

t Me

tadata &

De

scriptio

n at p

roje

ct en

d (M

anu

al Up

load

)

Extern

al

Institu

tion

/ V

irtual O

rg

10. Oxford ORA-DATA

DataStage Active Data Metadata Registry

Active Data Storage

Automatic Metadata Capture

Data Capture

Instrument

ORA-Data (Fedora &

Hydra)

DataBank

DataFinder

Datase

t Acce

ss & R

eu

se

Datase

t Pu

blicatio

n

Researcher & Project metadata

Symplectic Elements

HR & RMS

Re

search

er &

Pro

ject m

etad

ata

External Research

Data Archive (Arkivum)?

Datase

t De

po

sit at pro

ject e

nd

Dataset Preservation

Dataset Metadata & Description

(Manual Upload)

Datase

t Me

tadata &

De

scriptio

n at p

roje

ct en

d

DataBank

Researcher

Page 18: Institutional Research Data Management Technical ... · 2.7. KCL – The establishment of a register showing the location of research datasets, both at King’s and in external repositories

Data Centre / Disciplinary Repository

Data Registry Data Archive /

Storage

Dataset Discovery

SWO

RD

2

Datase

t Me

tadata H

arvest O

AI-P

MH

Au

tom

atically captu

red

me

tadata

Data P

roce

ssing A

ctive Data

Arch

ive D

ata

SWO

RD

2

Research Data Flow Metadata Flow

Dataset Metadata on

Dataset Publication

<- Private

P

ub

lic ->

Datase

t Pu

blicatio

n

<- Data Metadata ->

DMP Service

DMPlans

Researcher

Dataset Access & Reuse

Extern

ally Cu

rated

Datase

t Me

tadata &

Do

cum

en

tation

on

Datase

t Pu

blicatio

n (M

anu

al up

load

)

Datase

t Me

tadata &

De

scriptio

n at p

roje

ct en

d (M

anu

al Up

load

)

Extern

al

Institu

tion

/ V

irtual O

rg

Metadata Harvest

11. ePrints Soton

Sharepoint Active Data Metadata Registry

Active Data Storage

Automatic Metadata Capture

Data Capture

Instrument

ePrints Soton

Archive Data Storage

Data Registry

Datase

t Acce

ss & R

eu

se

Datase

t Pu

blicatio

n

Researcher & Project metadata

CRIS (bespoke)

HR & RMS

Researcher & Project metadata

Cache for Public Access

Data Catalogue

External Research

Data Archive ?

Datase

t De

po

sit at pro

ject e

nd

Dataset Preservation

Researcher

Page 19: Institutional Research Data Management Technical ... · 2.7. KCL – The establishment of a register showing the location of research datasets, both at King’s and in external repositories

Data Centre / Disciplinary Repository

Data Registry Data Archive /

Storage

Dataset Discovery

SWO

RD

2

Datase

t Me

tadata H

arvest O

AI-P

MH

Au

tom

atically captu

red

me

tadata

Data P

roce

ssing A

ctive Data

Arch

ive D

ata

SWO

RD

2

Research Data Flow Metadata Flow

Dataset Metadata on

Dataset Publication

<- Private

P

ub

lic ->

Datase

t Pu

blicatio

n

<- Data Metadata ->

DMP Service

DMPlans

Researcher

Dataset Access & Reuse

Extern

ally Cu

rated

Datase

t Me

tadata &

Do

cum

en

tation

on

Datase

t Pu

blicatio

n (M

anu

al up

load

)

Datase

t Me

tadata &

De

scriptio

n at p

roje

ct en

d (M

anu

al Up

load

)

Extern

al

Institu

tion

/ V

irtual O

rg

12. UWE

Sharepoint Active Data Metadata Registry

Active Data Storage

Automatic Metadata Capture

Data Capture

Instrument

UWE Research

Data Repository

(ePrints)

Archive Data Storage

Data Registry

Datase

t Acce

ss & R

eu

se

Datase

t Pu

blicatio

n

Researcher & Project metadata

CRIS (bespoke)

HR & RMS

Researcher & Project metadata

Cache for Public Access

Data Catalogue

External Research

Data Archive ?

Datase

t De

po

sit at pro

ject e

nd

Dataset Preservation

Researcher