Upload
others
View
0
Download
0
Embed Size (px)
Citation preview
University Library.
Report to the University of Sheffield Research Data Management Service Delivery Group
Institutional Research Data Management Technical Infrastructures
Date: 14/11/2014
Author: John A. Lewis
Institutional Research Data Management Technical Infrastructures
The institutional research data management technical infrastructure provides the technical facilities for registering, cataloguing, storing, preserving and sharing research data. This document lists UK Universities that have established RDM technical infrastructures, to some extent, and briefly describes the main components of the infrastructure. A number of other UK universities that are planning such infrastructures, and some that are hosting discipline based repositories, are also listed and the infrastructure briefly described.
1. UK Universities with Research Data Repositories or Catalogues
1.1. Bristol http://data.bris.ac.uk/data/ [Fig.1] CKAN has been selected for the data repository and functions as a catalogue of research data. This integrates with the PURE CRIS, which functions as a catalogue of research outputs. The architecture is described at http://data.blogs.ilrt.org/2012/02/03/data-bris-architecture/.
1.2. Cambridge DSpace https://www.repository.cam.ac.uk/ [Fig.2] The DSpace platform institutional repository is now able to preserve and publish research data.
1.3. Edinburgh Datashare http://datashare.is.ed.ac.uk/ [Fig.3] This repository is based on DSpace. The technical infrastructure at Edinburgh involves integration with PURE, active data infrastructure and the DMPonline tool. Development of the infrastructure is discussed at http://libraryblogs.is.ed.ac.uk/blog/2013/12/06/the-four-quadrants-of-research-data-curation-systems/. 1.4. Essex Research Data http://researchdata.essex.ac.uk/ [Fig.4] This data repository is built on the EPrints platform, modified using the ReCollect plugin to accept datasets, as described in http://researchdataessex.wordpress.com/2014/09/. The service includes allocation of Datacite DOIs. Guidance is published at http://www.data-archive.ac.uk/media/391123/rdessex_recollectuserguide.pdf.
1.5. Open Research Exeter https://ore.exeter.ac.uk/repository/ [Fig.5] Institutional repository based on the DSpace platform, material may be deposited via Symplectic, although it is recommended to upload data via the ORE interface. ORE’s content includes journal articles, conference papers, working papers, reports, book chapters, videos, audio, images, multimedia research project outputs, raw data and analysed data. Exeter's three former repositories (The Exeter Research and Institutional Content Archive (ERIC), Digital Collections Online (DCO) and the Exeter Data Archive (EDA)) were merged into ORE in March 2013. The project final report at https://ore.exeter.ac.uk/repository/bitstream/handle/10871/14845/open_exeter_final_report_FINAL.pdf describes the development of the repository. 1.6. GSoA RADAR http://radar.gsa.ac.uk/ This EPrints based Glasgow School of Art institutional repository accepts a wide range of objects including research data. This repository was the subject of a case study for the KAPTUR project.
1.7. Glyndwr University Research Data Catalogue http://glynfo.glyndwr.ac.uk/course/view.php?id=41§ion=11 This is a data catalogue facility of the bespoke CRIS.
1.8. Goldsmiths Research Data Catalogue http://eprints-data.gold.ac.uk/ Goldsmiths research data catalogue is built on the EPrints platform and results from the work done
for the KAPTUR project.
1.9. Hertfordshire UHRA http://rdm.herts.ac.uk/rdm/uh-research-archive.html [Fig.6] This is a DSpace institutional repository that is being expanded to include a data catalogue and a research data archive http://www.herts.ac.uk/rdm/finishing/uh-research-archive.
1.10. Hull Hydra https://hydra.hull.ac.uk/ [Fig.7] The institutional repository at Hull is built on the Hydra micro-services architecture, which involves the Fedora repository system. The repository is designed to hold a wide range of digital resources including research datasets. The development of the infrastructure is described at http://www.dcc.ac.uk/resources/developing-rdm-services/storing-sharing-data-hull.
1.11. University of Lincoln Researcher Dashboard https://orbital.lincoln.ac.uk/ [Fig.8] The Researcher Dashboard is the interface for the Data deposit workflow, facilitated by the ‘Orbital Bridge’ application. This links the various components of the RDMI: a CKAN based data registry, an EPrints IR for published research papers, network storage and Lincoln’s Awards Management System. The infrastructure architecture is discussed at http://orbital.blogs.lincoln.ac.uk/2012/12/05/orbital-ams-ckan-eprints-datacite/ and http://orbital.blogs.lincoln.ac.uk/2012/12/06/orbital-deposit-of-dataset-records-to-the-lincoln-repository-workflow/.
1.12. LSE http://eprints.lse.ac.uk/
Researchers are recommended to deposit data in ‘LSE Research Online’; the EPrints institutional repository (which may now contain datasets). The LSE Digital Library http://digital.library.lse.ac.uk uses the Hydra platform for preservation and access of digital collections (using Fedora and SOLR but not the Blacklight discovery layer); interoperability between the Digital Library and the ePrints repository is planned.
1.13. University of Newcastle https://research.ncl.ac.uk/rdm/tools/ [Fig.9] A Research Data infrastructure has been implemented at Newcastle which includes a CKAN data portal (for archiving and publishing data) together with a number of in-house built systems – a MyProject (a project and awards management system), MyImpact (a researcher profile and publication information system), a Research Data Catalogue (linking data, projects and publications), a VRE and e-Science Central (research collaboration tools). Architectural considerations are discussed in the document at http://research.ncl.ac.uk/media/sites/researchwebsites/iridium/iridium_research_data_catalogue_specification_07_6_2013_v1_PT.pdf.
1.14. Oxford DataBank https://databank.ora.ox.ac.uk/ [Fig.10] Databank is the Fedora based data repository for the University of Oxford. Data may be stored and preserved in the long-term, retrieved and published from anywhere on the web. This is a component of the DataFlow infrastructure at Oxford, alongside DataStage which provides local management of active research data, including metadata annotation and a collaborative workflow, and the data catalogue, DataFinder. The RDM infrastructure also includes the Online Research Database Service (ORDS) http://ords.ox.ac.uk/ and the institutional repository, Oxford University Research Archive (ORA) http://ora.ox.ac.uk/. Researchers may upload full-text articles to ORA using Symplectic. ORA-Data will replace Databank as the archival store for digital data produced resulting from research by Oxford academics. This system will be Fedora based, with Hydra microservices managing workflows. ORA-Data is provided by the Bodleian Libraries and complements other data archives by providing a local archive for data not deposited elsewhere. ORA-Data will be used to store data that underpins scholarly publications, so that the data can be cited and accessed. ORA-Data is designed to hold records of datasets, irrespective of their location, if the actual data are stored elsewhere. A metadata record for datasets can be created and made available in ORA-Data that includes a link to the location of the dataset, and be harvested by Datafinder, Oxford’s dataset
catalogue. DataStage facilitates the deposit of data (with accompanying metadata) into ORA-Data and other data repositories. http://www.bodleian.ox.ac.uk/bdlss/digital-services/data-archiving.
1.15. Oxford Brookes https://radar.brookes.ac.uk/radar/access/home.do
The Equella based ‘Research Archive and Digital Asset Registry’ is to be used to catalogue research
datasets.
1.16. C4DM-RDR http://c4dm.eecs.qmul.ac.uk/rdr/ The Research Data Repository at Queen Mary University of London, Centre for Digital Music is a Dspace based repository, specifically configured for long-term preservation and sharing of multimedia file formats.
1.17. St. Andrews https://risweb.st-andrews.ac.uk/portal/en/
The Pure Research Portal holds records of Datasets and other research outputs, and provides a data
registry / catalogue, for data held locally in access storage and archive storage. The DSpace
Institutional Repository, Research@StAndrews:FullText http://research-repository.st-
andrews.ac.uk/, currently holds full text papers, submitted through Pure. A research data repository
is to be piloted soon.
1.18. ePrints Southampton http://eprints.soton.ac.uk [Fig.11] The EPrints institutional repository at the University of Southampton has extended the existing the list of data types accepted to include datasets and experiments, using the ReCollect plugin. EPrints at Southampton now holds research data underlying published research (papers) outputs. Another strand of work, using Sharepoint to catalogue and share active data, has yet to be implemented. The University of Southampton has a federated approach to repository management and so there are a number of instances of ePrints being used by departments to curate their research outputs. Aspects of data systems architecture at Southampton are discussed in blog posts at http://datapool.soton.ac.uk/tag/data-systems-architecture/.
1.19. University of the Arts London Data Repository http://www.researchdata.arts.ac.uk/ This repository for research data is built on an EPrints platform.
1.20. UCA Research Online http://www.research.ucreative.ac.uk/ UCARO is the institutional repository and accepts a wide range of research outputs including research data. This is built on the EPrints platform.
1.21. UWE Research Data Repository http://researchdata.uwe.ac.uk/ [Fig.12] An instance of EPrints was modified for use as the data repository at UWE. The project developed its own metadata profile for research data, having decided against subscribing to the Datacite scheme and before the Recollect plugin became available http://www2.uwe.ac.uk/services/library/using_the_library/Services%20for%20researchers/Web%20version%20Objectives%20requirements%20and%20standards%20for%20a%20data%20repository%20v4.pdf.
1.22. Warwick http://wrap.warwick.ac.uk/ Researchers ‘register’ their datasets with the institution through WRAP (the EPrints based institutional repository) in the same way as they would give information about publications. Datasets may be deposited in WRAP.
2. UK Universities with Research Data Repositories in development or not yet existing
2.1. Bath – Considering ePrints for the Data Registry, linking with PURE for the Data Catalogue,
possibly on top of Arkivum for data preservation.
2.2. Birmingham - planned development: BEAR Store, a data archive for long term storage, particularly of data associated with publications. EPrints based institutional repository http://eprints.bham.ac.uk/.
2.23. Cardiff – Nothing provided or planned. Eprints based institutional repository http://orca.cf.ac.uk/.
2.4. Durham – Nothing provided or planned. Eprints based institutional repository http://eprints.dur.ac.uk/.
2.5. Glasgow – Pilot EPrints research data repository with links to award information. Eprints based institutional repository, Enlighten http://eprints.gla.ac.uk/.
2.6. ICL – Library becoming licensed to mint DOIs and will be developing a data catalogue. Researchers are advised that the preferred repository will be the discipline specific repository, failing that they should use the Imperial College repository: the DSpace based institutional repository, Spiral http://spiral.imperial.ac.uk/.
2.7. KCL – The establishment of a register showing the location of research datasets, both at King’s and in external repositories is planned for delivery by the end of the 2014-15 session. ‘Research Portal’ is the PURE based institutional repository https://kclpure.kcl.ac.uk/portal/.
2.8. Lancaster – Hydra will possibly be used for the data repository. The Pure CRIS is to hold records
of Research Data outputs, to be made publicly available through Pure Portal. The CRIS may be
modified so that small datasets may also be stored. http://www.lancaster.ac.uk/library/information-
for/researchers/research-data-management/data-and-pure/
2.9. Leeds – Pilot Eprints data repository with Symplectics link, being tested as a continuation of the Roadmap project. Two instances of ePrints are envisaged, one for data repository, the other for Registry / catalogue. Have an EPrints based institutional repository http://eprints.whiterose.ac.uk/.
2.10. Liverpool – Nothing provided or planned. Currently have an EPrints based institutional repository https://research-archive.liv.ac.uk/.
2.11. Loughborough - Have announced that their infrastructure will involve Figshare for Institutions,
Arkivum and Symplectic http://ow.ly/DUAug.
2.12. Manchester – The University of Manchester Library is providing a Research Data Management service. No catalogue is yet available, but metadata is stored with DMP in the Research Data Management System. A data storage service is provided, and the University is currently working to provide a research data repository service. Manchester’s institutional repository, eScholar, is a Fedora institutional repository (supports submission of 16 main types of scholarly output) https://www.escholar.manchester.ac.uk/.
2.13. Nottingham - work is currently underway to roll out an institutional data repository/catalogue. Nottingham EPrints institutional repository content policy allows it to hold all types of materials except theses and dissertations http://eprints.nottingham.ac.uk/.
2.14. QMU - QMRO, the institutional repository is on the DSpace platform, and this is likely to be the platform for the institutional data repository. The Centre for Digital Music (C4DM) at Queen Mary University of London has a DSpace repository for Digital Music Data (see above).
2.15. Queen’s University Belfast – No data repository / catalogue yet. Two outward facing repository systems: ‘QUB Research Portal’ http://pure.qub.ac.uk/portal/ is PURE based, and ‘Qcite’ http://qcite.qub.ac.uk/, the institutional repository, is DSpace based. Both hold only papers.
2.16. Sheffield – A number of systems are being considered for the RDM technical infrastructure, including ePrints (currently used for the shared institutional repository), Symplectic Elements (the CRIS), various digital preservation systems, Hydra, ‘Figshare for Institutions’ and Arkivum.
2.17. Strathclyde – considering ePrints data registry / repository, with PURE as a data catalogue.
2.18. Surrey – RDM Roadmap task list includes identification of metadata catalogue – possibly
Symplectic https://www.surrey.ac.uk/research/researchdata/RDM%20Roadmap(V2).pdf.
2.19. UCL – No data repository / catalogue, but UCL Research Data Archive and access facility are planned for pilot late 2014. The institutional repository, UCL Discovery, is run on the EPrints platform. UCL hosts CAVA http://www.ucl.ac.uk/ls/cava/, a Human Communication AV Archive with storage on the Library Services Digital Collections service (ExLibris Digitool platform).
2.20 UEL – Considering the use of ePrints repository for registry and catalogue (no CRIS) above
Arkivum for storage.
2.21. York - The University Information Directorate is working towards providing a University data catalogue and repository for data. ‘YODL’ https://dlib.york.ac.uk/yodl/app/home/index, a DAMS based on Fedora Commons platform, currently holds some Humanities research data. One proposal involves YODL holding accessible data, the Archivematica digital preservation system to manage archival storage, and the CRIS, Pure, providing a Data registry and Data catalogue (Pure Portal) which facilitates discovery and access to the data https://risweb.st-andrews.ac.uk/portal/en/activities/eurocris-strategic-membership-meeting%28db41094d-4635-4803-9e57-01b856e1de12%29.html.
3. Discipline-based research data repositories hosted by UK HEIs
3.1. ADS (Archaeology Data Service) http://archaeologydataservice.ac.uk/ A disciplinary data repository based at the University of York; infrastructure includes the Fedora Commons repository platform.
3.2. Edina ShareGeo http://edina.ac.uk/projects/sharegeo/ Not an institutional repository, but based at The University of Edinburgh, here, DSpace has been customised to offer a repository that eases both the deposit and discovery of geospatial data.
3.3. Leeds DART Data Portal http://dartportal.leeds.ac.uk/ The Detection of Archaeological Residues using Remote-sensing Techniques (DART) research project maintains a CKAN data portal for the open data outputs from the project.
3.4. eCrystals at the University of Southampton http://ecrystals.chem.soton.ac.uk/ The University of Southampton department of Chemistry holds data from X-ray diffraction experiments in an EPrints repository.
3.5. UKDA http://www.data-archive.ac.uk/ Not an institutional, but a national social and economic research data repository based at the University of Essex. The UKDA provides the UK Data Service, which curates key quantitative and qualitative data, UK Data Service ReShare, curating data from ESRC funded research and the HDS
(successor to the AHDS). These are housed on a modified EPrints repository platform.
3.6. CARMEN Portal http://www.carmen.org.uk/portal The CARMEN Portal is a VRE to support e-Neuroscience, providing storage and processing services
over a Grid infrastructure. The CARMEN system is a three-tier web architecture consisting of a web
portal, an application layer and a storage layer, developed by a collaboration of researchers from 11
UK universities. The Java portal allows the user to access data and to create and run analysis tool on
remote servers. The storage layer is shared between MySQL databases and a SRB (Storage Resource
Broker) system. The application layer consists of Java servlets, providing a middleware layer that
bridges storage and portal.
John A. Lewis 14/11/2014
CKAN Public
Research Data
Catalogue
Data Centre / Disciplinary Repository
Data Registry Data Archive /
Storage
Researcher & Project metadata
Dataset Discovery
Automatic Metadata
Capture
SWO
RD
2
Datase
t Me
tadata H
arvest O
AI-P
MH
Au
tom
atically cap
ture
d m
etad
ata
Data P
roce
ssing A
ctive Data
Arch
ive D
ata Exte
rnal
SWO
RD
2
SWO
RD
2
Researcher & Project metadata
Research Data Flow Metadata Flow
Institu
tion
/ V
irtual O
rg
Datase
t Me
tadata &
De
scriptio
n at p
roje
ct en
d
<- P
rivate
Pu
blic ->
Datase
t Pu
blicatio
n
Datase
t Pu
blicatio
n
<- Data Metadata ->
Data Capture
Instrument
PURE
PURE Portal Institutional Repository
CKAN Active Data
Registry
DMP Service
DMPlans
(Man
ual U
plo
ad)
Researcher
Datase
t Acce
ss & R
eu
se
Extern
ally Cu
rated
Datase
t Me
tadata &
Do
cum
en
tation
on
Datase
t Pu
blicatio
n (M
anu
al up
load
)
Datase
t Pu
blicatio
n
Datase
t Acce
ss by e
xtern
al collab
orato
rs be
fore
pro
ject e
nd
Me
tadata H
arvest o
n
Datase
t Pu
blicatio
n
1. DataBris
HR & RMS
Research Data Storage Facility
Active Data Storage
Local Research Data Archive (Curated)
Research Data for Public Access
Researcher
Metadata on Dataset Publication
Data Centre / Disciplinary Repository
Data Registry Data Archive /
Storage
Dataset Discovery
Au
tom
atic Me
tadata
Cap
ture
SWO
RD
2
Datase
t Me
tadata H
arvest O
AI-P
MH
Au
tom
atically captu
red
me
tadata &
Data P
roce
ssing A
ctive Data
Arch
ive D
ata SW
OR
D2
SWO
RD
2
Researcher & Project metadata
Research Data Flow Metadata Flow
External Research
Data Archive ?
Datase
t Me
tadata o
n
Datase
t Pu
blicatio
n
<- Private
P
ub
lic ->
Datase
t Pu
blicatio
n
File m
etad
ata
<- Data Metadata ->
Dataset Preservation
Data Capture
Instrument
DMP Service
DMPlans
DSp
ace C
on
ne
ctor
Researcher
Researcher & Project metadata
Symplectic Elements
HR & RMS
Re
search
er &
Pro
ject m
etad
ata
Dataset Access & Reuse
Extern
ally Cu
rated
Datase
t Me
tadata &
Do
cum
en
tation
on
Datase
t Pu
blicatio
n (M
anu
al up
load
)
Locally Curated Dataset Metadata & Description at
project end (Manual Upload)
Extern
al
Institu
tion
/ V
irtual O
rg
Datase
t Pu
blicatio
n
2. DSpace Cambridge
Data Registry
Cache for Public Access
Local Research Data Archive
Dspace Institutional Repository
Datase
t Pu
blicatio
n
Active Data Storage
Data Catalogue
Datase
t De
po
sit at pro
ject e
nd
Researcher
SWO
RD
2
Datase
t Pu
blicatio
n ?
Datase
t Acce
ss & R
eu
se
Data Catalogue Pure / Dspace ?
Data Centre / Disciplinary Repository
Data Registry Data Archive /
Storage
Dataset Discovery
Au
tom
atic Me
tadata
Cap
ture
Automatically captured metadata
Data P
roce
ssing A
ctive Data
Arch
ive D
ata
SWO
RD
2
Research Data Flow Metadata Flow
Research Data Vault
(Arkivum?)
Dataset Metadata on Dataset Publication
Datase
t Me
tadata &
Do
cum
en
tation
on
Pu
blicatio
n to
exte
rnal R
ep
osito
ry
Dataset Deposit at project end
<- Private
P
ub
lic ->
Datase
t P
ub
lication
<- Data Metadata ->
Data Capture
Instrument
DMP Service
?
DMPlans
Locally C
urate
d D
ataset M
etad
ata & D
escrip
tion
at pro
ject e
nd
(M
anu
al Up
load
)
Researcher
Dataset Access & Reuse
SWO
RD
2
Datase
t Pu
blicatio
n ?
Datase
t Acce
ss & R
eu
se
Extern
al
Institu
tion
/ V
irtual O
rg
SWO
RD
2
Datase
t Me
tadata H
arvest O
AI-P
MH
Researcher & Project metadata
3. Edinburgh Datashare
Data Asset Registry (Pure)
Data Storage for Public
Access
DataShare (DSpace)
Active Data Storage
DSp
ace M
etadata Sto
re
SWO
RD
2
HR & RMS
Researcher
SWORD2
Datase
t Me
tadata
Harve
st OA
I-PM
H
Datase
t Pu
blicatio
n
Extern
ally Cu
rated
Datase
t Me
tadata &
Do
cum
en
tation
on
Datase
t Pu
blicatio
n (M
anu
al up
load
) ?
Data Centre / Disciplinary Repository
Data Registry Data Archive /
Storage
Dataset Discovery
Au
tom
atic Me
tadata
Cap
ture
Datase
t Me
tadata H
arvest O
AI-P
MH
Automatically captured metadata
Data P
roce
ssing A
ctive Data
Arch
ive D
ata
SWO
RD
2
Researcher & Project metadata
Research Data Flow Metadata Flow
<- Private
P
ub
lic ->
(File metadata)
<- Data Metadata ->
Data Capture
Instrument
CRIS?
Essex Research Data
(ePrints) Cache for
Public Access
ePrints Data
Storage ePrints
Metadata Catalogue
DMP Service
?
DMPlans
Active Data Storage
Datase
t Me
tadata &
De
scriptio
n at p
roje
ct en
d (M
anu
al Up
load
)
eP
rints C
on
ne
ctor
Researcher
Researcher & Project metadata
Dataset Access & Reuse
Extern
ally Cu
rated
Datase
t Me
tadata &
Do
cum
en
tation
on
Datase
t Pu
blicatio
n (M
anu
al up
load
)
Datase
t Acce
ss & R
eu
se
Extern
al
Institu
tion
/ V
irtual O
rg
SWO
RD
2
SWO
RD
2
Datase
t Pu
blicatio
n
Datase
t Pu
blicatio
n
Me
tadata H
arvest o
n
Datase
t Pu
blicatio
n
4. Essex Research Data
Researcher
HR & RMS
External Research
Data Archive ?
Datase
t De
po
sit at pro
ject e
nd
Dataset Preservation
Data Centre / Disciplinary Repository
Data Registry Data Archive /
Storage
Dataset Discovery
Au
tom
atic Me
tadata
Cap
ture
Datase
t Me
tadata H
arvest O
AI-P
MH
Automatically captured metadata
Data P
roce
ssing A
ctive Data
Arch
ive D
ata
Researcher & Project metadata
Research Data Flow Metadata Flow
<- P
rivate
Pu
blic ->
(File metadata)
<- Data Metadata ->
Data Capture
Instrument
Symplectic
Open Research
Exeter (DSpace) Cache for
Public Access
DSpace Data
Storage DSpace
Metadata Catalogue
DMP Service
?
DMPlans
Active Data Storage
Datase
t Me
tadata &
De
scriptio
n at p
roje
ct en
d (M
anu
al Up
load
)
DSp
ace C
on
ne
ctor
Researcher
Researcher & Project metadata
Dataset Access & Reuse
Extern
ally Cu
rated
Datase
t Me
tadata &
Do
cum
en
tation
on
Datase
t Pu
blicatio
n (M
anu
al up
load
)
Extern
al
Institu
tion
/ V
irtual O
rg
SWO
RD
2
SWO
RD
2
Datase
t Pu
blicatio
n
SWO
RD
2
Datase
t Acce
ss & R
eu
se
Datase
t Pu
blicatio
n
Me
tadata H
arvest o
n
Datase
t Pu
blicatio
n
5. Open Research Exeter
Researcher
HR & RMS
External Research
Data Archive ?
Datase
t De
po
sit at pro
ject e
nd
Dataset Preservation
Data Centre / Disciplinary Repository
Data Registry Data Archive /
Storage
Dataset Discovery
Automatic Metadata Capture
SWO
RD
2
Datase
t Me
tadata H
arvest O
AI-P
MH
Data P
roce
ssing A
ctive Data
Arch
ive D
ata Exte
rnal
SWO
RD
2
Research Data Flow Metadata Flow
Institu
tion
/ V
irtual O
rg
Datase
t Me
tadata o
n D
ataset P
ub
lication
<- P
rivate
Pu
blic ->
Datase
t Pu
blicatio
n
Locally curated?
<- Data Metadata ->
Data Capture
Instrument
Researcher
Researcher & Project metadata
HR & RMS
Researcher & Project metadata
Dataset Access & Reuse
Datase
t Pu
blicatio
n
Extern
ally Cu
rated
Datase
t Me
tadata &
Do
cum
en
tation
on
Datase
t Pu
blicatio
n (M
anu
al up
load
)
Dataset Metadata
Sync
Sharepoint Active Data Metadata Registry
Active Data Storage
Me
tadata H
arvest
6. Herts UHRA
Dataset Metadata & Description (Manual upload)
DMP Service
DMPlans
Dataset Metadata & Description at project end
Researcher
UHRA (DSpace)
Cache for Public Access
DSpace Data
Storage DSpace
Metadata Store
PURE
(PURE Portal) Research
Data Catalogue
SWO
RD
2
Datase
t Acce
ss & R
eu
se
Datase
t Pu
blicatio
n
Externally curated?
Dataset Discovery?
External Research
Data Archive (Arkivum)
Datase
t De
po
sit at pro
ject e
nd
Dataset Preservation
Data Centre / Disciplinary Repository
Data Registry Data Archive /
Storage
Re
search
er
& P
roje
ct m
etad
ata
Dataset Discovery
Automatic Metadata Capture
SWO
RD
2
Datase
t Me
tadata H
arvest O
AI-P
MH
Datase
t Me
tadata &
De
scriptio
n
Data P
roce
ssing A
ctive Data
Arch
ive D
ata Exte
rnal SW
OR
D2
Researcher & Project metadata
Research Data Flow Metadata Flow
Institu
tion
/ V
irtual O
rg
Datase
t Me
tadata o
n
Datase
t Pu
blicatio
n
<- Private
P
ub
lic ->
Datase
t Pu
blicatio
n
File m
etad
ata
<- Data Metadata ->
Data Capture
Instrument
Converis
HR & URMS
Co
nve
riss Co
nn
ecto
r
Researcher
Re
search
er &
Pro
ject m
etad
ata
Dataset Access & Reuse
Datase
t Pu
blicatio
n
Extern
ally Cu
rated
Datase
t Me
tadata &
Do
cum
en
tation
on
Datase
t Pu
blicatio
n (M
anu
al up
load
)
Locally C
urate
d D
ataset M
etad
ata & D
escrip
tion
at pro
ject e
nd
(M
anu
al Up
load
)
Sharepoint & Sakai
Active Data Metadata Registry
Active Data
Storage
Data Catalogue (Hydra interface – Blacklight & SOLR)
Hull Hydra (Fedora)
7. Hull Hydra
DMP Service
DMPlans
Dataset Metadata & Description (Manual upload)
SWO
RD
2
Datase
t Acce
ss & R
eu
se
Datase
t Pu
blicatio
n
Fedora Cache Public Access
Local Research Data Archive Data Registry?
Externally Curated Dataset Metadata
External Research
Data Archive ?
Datase
t De
po
sit at pro
ject e
nd
Dataset Preservation
Researcher
ePrints Data Catalogue
Data Centre / Disciplinary Repository
Data Registry Data Archive /
Storage
Project metadata
Dataset Discovery
SWO
RD
2
Datase
t Me
tadata
Harve
st OA
I-PM
H
Au
tom
atically captu
red
me
tadata
Data P
roce
ssing A
ctive Data
Arch
ive D
ata Exte
rnal
SWO
RD
2
Research Data Flow Metadata Flow
Institu
tion
/ V
irtual O
rg
Dataset Metadata
Dataset Metadata on
Dataset Publication
<- Private
P
ub
lic ->
Datase
t Pu
blicatio
n
Datacite DOI
<- Data Metadata ->
Orbital Bridge Lincoln CKAN
Cache for Public Access
Local Research Data
Archive Data Registry
DMP Service
DMPlans Dataset Metadata & Description at project end (Manual Upload)
Researcher
Re
search
er m
etad
ata
Datase
t Acce
ss & R
eu
se
Extern
ally Cu
rated
Datase
t Me
tadata &
Do
cum
en
tation
on
Datase
t Pu
blicatio
n (M
anu
al up
load
) 8. Lincoln Orbital
Researcher metadata Nucleus
AMS
SWO
RD
2
Researcher & Project metadata
ePrint record URL
Owncloud Active Data Metadata Registry
Active Data Storage
SWO
RD
2
Datase
t Acce
ss & R
eu
se
Datase
t Pu
blicatio
n
Researcher Automatic Metadata Capture
Data Capture
Instrument
External Research
Data Archive ?
Datase
t De
po
sit at pro
ject e
nd
Dataset Preservation
Data Centre / Disciplinary Repository
Data Registry Data Archive /
Storage
Researcher & Project metadata
Dataset Discovery
SWO
RD
2
Datase
t Me
tadata H
arvest O
AI-P
MH
Au
tom
atically captu
red
me
tadata
Data P
roce
ssing A
ctive Data
Arch
ive D
ata
SWO
RD
2
Research Data Flow Metadata Flow
Dataset Metadata on Dataset Publication
<- Private
P
ub
lic ->
Datase
t Pu
blicatio
n
<- Data Metadata ->
My Projects
DMP Online
DMPlans
Researcher
Dataset Access & Reuse
Extern
ally Cu
rated
Datase
t Me
tadata &
Do
cum
en
tation
on
Datase
t Pu
blicatio
n (M
anu
al up
load
)
Datase
t Me
tadata &
De
scriptio
n at p
roje
ct en
d (M
anu
al Up
load
)
Extern
al
Institu
tion
/ V
irtual O
rg
Dataset Metadata Harvest
9. Newcastle
VRE & E-Science Central
Active Data Metadata Registry
Active Data Storage
Automatic Metadata Capture
Data Capture
Instrument
Newcastle
CKAN Cache for
Public Access
Local Research Data Archive
CKAN Metadata
Store (Catalogue)
Datase
t Acce
ss & R
eu
se
Datase
t Pu
blicatio
n
My Impact
Research Data
Catalogue
Researcher
External Research
Data Archive ?
Datase
t De
po
sit at pro
ject e
nd
Dataset Preservation
Data Centre / Disciplinary Repository
Data Registry Data Archive /
Storage
Dataset Discovery
SWO
RD
2
Datase
t Me
tadata H
arvest O
AI-P
MH
Data P
roce
ssing A
ctive Data
Arch
ive D
ata
SWO
RD
2
Research Data Flow Metadata Flow
Dataset Metadata on
Dataset Publication
<- Private
P
ub
lic ->
Datase
t Pu
blicatio
n
<- Data Metadata ->
DMP Service
DMPlans
Researcher
Dataset Access & Reuse
Extern
ally Cu
rated
Datase
t Me
tadata &
Do
cum
en
tation
on
Datase
t Pu
blicatio
n (M
anu
al up
load
)
Datase
t Me
tadata &
De
scriptio
n at p
roje
ct en
d (M
anu
al Up
load
)
Extern
al
Institu
tion
/ V
irtual O
rg
10. Oxford ORA-DATA
DataStage Active Data Metadata Registry
Active Data Storage
Automatic Metadata Capture
Data Capture
Instrument
ORA-Data (Fedora &
Hydra)
DataBank
DataFinder
Datase
t Acce
ss & R
eu
se
Datase
t Pu
blicatio
n
Researcher & Project metadata
Symplectic Elements
HR & RMS
Re
search
er &
Pro
ject m
etad
ata
External Research
Data Archive (Arkivum)?
Datase
t De
po
sit at pro
ject e
nd
Dataset Preservation
Dataset Metadata & Description
(Manual Upload)
Datase
t Me
tadata &
De
scriptio
n at p
roje
ct en
d
DataBank
Researcher
Data Centre / Disciplinary Repository
Data Registry Data Archive /
Storage
Dataset Discovery
SWO
RD
2
Datase
t Me
tadata H
arvest O
AI-P
MH
Au
tom
atically captu
red
me
tadata
Data P
roce
ssing A
ctive Data
Arch
ive D
ata
SWO
RD
2
Research Data Flow Metadata Flow
Dataset Metadata on
Dataset Publication
<- Private
P
ub
lic ->
Datase
t Pu
blicatio
n
<- Data Metadata ->
DMP Service
DMPlans
Researcher
Dataset Access & Reuse
Extern
ally Cu
rated
Datase
t Me
tadata &
Do
cum
en
tation
on
Datase
t Pu
blicatio
n (M
anu
al up
load
)
Datase
t Me
tadata &
De
scriptio
n at p
roje
ct en
d (M
anu
al Up
load
)
Extern
al
Institu
tion
/ V
irtual O
rg
Metadata Harvest
11. ePrints Soton
Sharepoint Active Data Metadata Registry
Active Data Storage
Automatic Metadata Capture
Data Capture
Instrument
ePrints Soton
Archive Data Storage
Data Registry
Datase
t Acce
ss & R
eu
se
Datase
t Pu
blicatio
n
Researcher & Project metadata
CRIS (bespoke)
HR & RMS
Researcher & Project metadata
Cache for Public Access
Data Catalogue
External Research
Data Archive ?
Datase
t De
po
sit at pro
ject e
nd
Dataset Preservation
Researcher
Data Centre / Disciplinary Repository
Data Registry Data Archive /
Storage
Dataset Discovery
SWO
RD
2
Datase
t Me
tadata H
arvest O
AI-P
MH
Au
tom
atically captu
red
me
tadata
Data P
roce
ssing A
ctive Data
Arch
ive D
ata
SWO
RD
2
Research Data Flow Metadata Flow
Dataset Metadata on
Dataset Publication
<- Private
P
ub
lic ->
Datase
t Pu
blicatio
n
<- Data Metadata ->
DMP Service
DMPlans
Researcher
Dataset Access & Reuse
Extern
ally Cu
rated
Datase
t Me
tadata &
Do
cum
en
tation
on
Datase
t Pu
blicatio
n (M
anu
al up
load
)
Datase
t Me
tadata &
De
scriptio
n at p
roje
ct en
d (M
anu
al Up
load
)
Extern
al
Institu
tion
/ V
irtual O
rg
12. UWE
Sharepoint Active Data Metadata Registry
Active Data Storage
Automatic Metadata Capture
Data Capture
Instrument
UWE Research
Data Repository
(ePrints)
Archive Data Storage
Data Registry
Datase
t Acce
ss & R
eu
se
Datase
t Pu
blicatio
n
Researcher & Project metadata
CRIS (bespoke)
HR & RMS
Researcher & Project metadata
Cache for Public Access
Data Catalogue
External Research
Data Archive ?
Datase
t De
po
sit at pro
ject e
nd
Dataset Preservation
Researcher