17
DATA PUBLISHING WORKFLOWS WITH DATAVERSE Eleni Castro ([email protected]) Institute for Quantitative Social Science (IQSS) Harvard University RDA 5 th Plenary WG RDA/WDS Publishing Data Workflows March 11, 2015

DATA PUBLISHING WORKFLOWS WITH DATAVERSEprojects.iq.harvard.edu/files/ojs-dvn/files/rda... · DATA PUBLISHING WORKFLOWS WITH DATAVERSE Eleni Castro ([email protected]) Institute

  • Upload
    others

  • View
    32

  • Download
    0

Embed Size (px)

Citation preview

Page 1: DATA PUBLISHING WORKFLOWS WITH DATAVERSEprojects.iq.harvard.edu/files/ojs-dvn/files/rda... · DATA PUBLISHING WORKFLOWS WITH DATAVERSE Eleni Castro (ecastro@fas.harvard.edu) Institute

DATA PUBLISHING WORKFLOWS WITH DATAVERSE Eleni Castro ([email protected]) Institute for Quantitative Social Science (IQSS) Harvard University

RDA 5th Plenary WG RDA/WDS Publishing Data Workflows March 11, 2015

Page 2: DATA PUBLISHING WORKFLOWS WITH DATAVERSEprojects.iq.harvard.edu/files/ojs-dvn/files/rda... · DATA PUBLISHING WORKFLOWS WITH DATAVERSE Eleni Castro (ecastro@fas.harvard.edu) Institute

An Integrated & Automated Journal / Data Publishing Workflow

Features for automatic

data citation insertion into

article.

Workflows + features for reviewing

data before article

publication.

Long term preservation +

persistent access to dataset.

New versions of a

dataset induce new research.

Automatic integration w/

data repositories

(common repository API).

Code

Submit

Review

Publish Reuse, Validate &

Extend

Prepare new submission

Features for automatic

data citation insertion into

article.

Workflows + features for reviewing

data before article

publication.

Long term preservation +

persistent access to dataset.

New versions of a

dataset induce new research.

Automatic integration w/

data repositories

(common repository API).

Code

Submit

Review

Publish Reuse, Validate &

Extend

Prepare new submission

Features for automatic

data citation insertion into

article.

Workflows + features for reviewing

data before article

publication.

Long term preservation +

persistent access to dataset.

New versions of a

dataset induce new research.

Automatic integration w/

data repositories

(common repository API).

Code

Submit

Review

Publish Reuse, Validate &

Extend

Prepare new submission

Features for automatic

data citation insertion into

article.

Workflows + features for reviewing

data before article

publication.

Long term preservation +

persistent access to dataset.

New versions of a

dataset induce new research.

Automatic integration w/

data repositories

(common repository API).

Code

Submit

Review

Publish Reuse, Validate &

Extend

Prepare new submission

Features for automatic

data citation insertion into

article.

Workflows + features for reviewing

data before article

publication.

Long term preservation +

persistent access to dataset.

New versions of a

dataset induce new research.

Automatic integration w/

data repositories

(common repository API).

Code

Submit

Review

Publish Reuse, Validate &

Extend

Prepare new submission

Features for automatic

data citation insertion into

article.

Workflows + features for reviewing

data before article

publication.

Long term preservation +

persistent access to dataset.

New versions of a

dataset induce new research.

Automatic integration w/

data repositories

(common repository API).

Code

Submit

Review

Publish Reuse, Validate &

Extend

Prepare new submission

Features for automatic

data citation insertion into

article.

Workflows + features for reviewing

data before article

publication.

Long term preservation +

persistent access to dataset.

New versions of a

dataset induce new research.

Automatic integration w/

data repositories

(common repository API).

Code

Submit

Review

Publish Reuse, Validate &

Extend

Prepare new submission

2

Journal

Repository

Page 3: DATA PUBLISHING WORKFLOWS WITH DATAVERSEprojects.iq.harvard.edu/files/ojs-dvn/files/rda... · DATA PUBLISHING WORKFLOWS WITH DATAVERSE Eleni Castro (ecastro@fas.harvard.edu) Institute

Current Workflows in Dataverse: To Connect Data to Journals A. Journals include Dataverse as a Recommended Repository

B. Authors Contribute Directly to a Journal’s Dataverse

C. Automated Integration of Journal + Dataverse (e.g., OJS)

3

Page 4: DATA PUBLISHING WORKFLOWS WITH DATAVERSEprojects.iq.harvard.edu/files/ojs-dvn/files/rda... · DATA PUBLISHING WORKFLOWS WITH DATAVERSE Eleni Castro (ecastro@fas.harvard.edu) Institute

Example of Option C: Phase 1 OJS / Dataverse Integration

ü  Integrating Open Journal Systems (OJS) with Dataverse ü  Reference Implementation: Automated via SWORD API

ü  Pilot with ~ 50 journals + expand to 1000s using OJS. ü  Dataverse plugin is automatically available w/ OJS. ü  Future: Embed Dataverse widgets into journal article.

http://projects.iq.harvard.edu/ojs-dvn

4

Project Details: 2012-2014 Project Details: 2012-2014 Project Details: 2012-2014

Project Details: 2012-2014

Page 5: DATA PUBLISHING WORKFLOWS WITH DATAVERSEprojects.iq.harvard.edu/files/ojs-dvn/files/rda... · DATA PUBLISHING WORKFLOWS WITH DATAVERSE Eleni Castro (ecastro@fas.harvard.edu) Institute

In the Backend: Technical Workflow

Client sends: ü  XML file: AtomPub "entry”

with Dublin Core Terms (e.g., title, creator, isReferencedBy (article citation), …)

ü  Zip file: All data files associated with that dataset.

Repository sends: ü  XML file: “Deposit Receipt”

send data citation from repository to client.

Plus updates from client to server during lifecycle (CRUD): In review, reject (delete), publish first version, update new versions.

5

Page 6: DATA PUBLISHING WORKFLOWS WITH DATAVERSEprojects.iq.harvard.edu/files/ojs-dvn/files/rda... · DATA PUBLISHING WORKFLOWS WITH DATAVERSE Eleni Castro (ecastro@fas.harvard.edu) Institute

On the Frontend: OJS Dataverse Plugin Walkthrough

6

Page 7: DATA PUBLISHING WORKFLOWS WITH DATAVERSEprojects.iq.harvard.edu/files/ojs-dvn/files/rda... · DATA PUBLISHING WORKFLOWS WITH DATAVERSE Eleni Castro (ecastro@fas.harvard.edu) Institute

Journal Manager Sets Up Plugin in OJS 7

Page 8: DATA PUBLISHING WORKFLOWS WITH DATAVERSEprojects.iq.harvard.edu/files/ojs-dvn/files/rda... · DATA PUBLISHING WORKFLOWS WITH DATAVERSE Eleni Castro (ecastro@fas.harvard.edu) Institute

Journal Manager Sets Up Data Policies

Read full Data Policies / Guidelines Template: http://bit.ly/1xkLjoZ

Including Guidelines for: 1)  Authors (data citation) 2)  Reviewers 3)  Copyeditors

8

Page 9: DATA PUBLISHING WORKFLOWS WITH DATAVERSEprojects.iq.harvard.edu/files/ojs-dvn/files/rda... · DATA PUBLISHING WORKFLOWS WITH DATAVERSE Eleni Castro (ecastro@fas.harvard.edu) Institute

Author Submits Manuscript + Data (1) 9

Page 10: DATA PUBLISHING WORKFLOWS WITH DATAVERSEprojects.iq.harvard.edu/files/ojs-dvn/files/rda... · DATA PUBLISHING WORKFLOWS WITH DATAVERSE Eleni Castro (ecastro@fas.harvard.edu) Institute

Author Submits Manuscript + Data (2)

Option to: (a) deposit into Dataverse OR; (b) if data is already in a repository can include the data citation (w/ persistent URL/identifier).

10

To-Do: Support for adding multiple datasets to a journal article.

Page 11: DATA PUBLISHING WORKFLOWS WITH DATAVERSEprojects.iq.harvard.edu/files/ojs-dvn/files/rda... · DATA PUBLISHING WORKFLOWS WITH DATAVERSE Eleni Castro (ecastro@fas.harvard.edu) Institute

Editor Reviews Article + Data 11

Page 12: DATA PUBLISHING WORKFLOWS WITH DATAVERSEprojects.iq.harvard.edu/files/ojs-dvn/files/rda... · DATA PUBLISHING WORKFLOWS WITH DATAVERSE Eleni Castro (ecastro@fas.harvard.edu) Institute

Approved = Data Published in Dataverse

When issue is published: 1) URL to Article displays in Dataverse. 2) Data Citation shows up in OJS Article (see next slide).

12

1

2

Page 13: DATA PUBLISHING WORKFLOWS WITH DATAVERSEprojects.iq.harvard.edu/files/ojs-dvn/files/rda... · DATA PUBLISHING WORKFLOWS WITH DATAVERSE Eleni Castro (ecastro@fas.harvard.edu) Institute

Article in OJS: Published w/ Data Citation

13

Page 14: DATA PUBLISHING WORKFLOWS WITH DATAVERSEprojects.iq.harvard.edu/files/ojs-dvn/files/rda... · DATA PUBLISHING WORKFLOWS WITH DATAVERSE Eleni Castro (ecastro@fas.harvard.edu) Institute

Video of OJS Dataverse Plugin Demo 14

http://bit.ly/1D1hphu

Page 15: DATA PUBLISHING WORKFLOWS WITH DATAVERSEprojects.iq.harvard.edu/files/ojs-dvn/files/rda... · DATA PUBLISHING WORKFLOWS WITH DATAVERSE Eleni Castro (ecastro@fas.harvard.edu) Institute

Phase 2: Expansion of API + Workflows

Features for automatic

data citation insertion into

article.

Workflows + features for reviewing

data before article

publication.

Long term preservation +

persistent access to dataset.

New versions of a

dataset induce new research.

Automatic integration w/

data repositories

(common repository API).

Code

Submit

Review

Publish Reuse, Validate &

Extend

Prepare new submission

Features for automatic

data citation insertion into

article.

Workflows + features for reviewing

data before article

publication.

Long term preservation +

persistent access to dataset.

New versions of a

dataset induce new research.

Automatic integration w/

data repositories

(common repository API).

Code

Submit

Review

Publish Reuse, Validate &

Extend

Prepare new submission

Features for automatic

data citation insertion into

article.

Workflows + features for reviewing

data before article

publication.

Long term preservation +

persistent access to dataset.

New versions of a

dataset induce new research.

Automatic integration w/

data repositories

(common repository API).

Code

Submit

Review

Publish Reuse, Validate &

Extend

Prepare new submission

Features for automatic

data citation insertion into

article.

Workflows + features for reviewing

data before article

publication.

Long term preservation +

persistent access to dataset.

New versions of a

dataset induce new research.

Automatic integration w/

data repositories

(common repository API).

Code

Submit

Review

Publish Reuse, Validate &

Extend

Prepare new submission

Features for automatic

data citation insertion into

article.

Workflows + features for reviewing

data before article

publication.

Long term preservation +

persistent access to dataset.

New versions of a

dataset induce new research.

Automatic integration w/

data repositories

(common repository API).

Code

Submit

Review

Publish Reuse, Validate &

Extend

Prepare new submission

Features for automatic

data citation insertion into

article.

Workflows + features for reviewing

data before article

publication.

Long term preservation +

persistent access to dataset.

New versions of a

dataset induce new research.

Automatic integration w/

data repositories

(common repository API).

Code

Submit

Review

Publish Reuse, Validate &

Extend

Prepare new submission

Features for automatic

data citation insertion into

article.

Workflows + features for reviewing

data before article

publication.

Long term preservation +

persistent access to dataset.

New versions of a

dataset induce new research.

Automatic integration w/

data repositories

(common repository API).

Code

Submit

Review

Publish Reuse, Validate &

Extend

Prepare new submission

15

2015-2016 (collaboration w/ Odum Institute)

1. Expand to more journals, publishing systems, & workflows 1. Expand to more journals, publishing systems, & workflows 1. Expand to more journals, publishing systems, & workflows

1.  Expand to more journals, publishing systems, & workflows 2.  Develop Community-Based Repository API Standard:

Work w/ RDA, WDS, Data FAIRport, FORCE11, CODATA, etc…

q  Should we extend the Repository API beyond SWORD? q  Support for additional Metadata Schemas & fields (non-DC)? q  Support for more/which dataset review workflows?

Project Goals

Project Questions

Page 16: DATA PUBLISHING WORKFLOWS WITH DATAVERSEprojects.iq.harvard.edu/files/ojs-dvn/files/rda... · DATA PUBLISHING WORKFLOWS WITH DATAVERSE Eleni Castro (ecastro@fas.harvard.edu) Institute

How Do I Get Involved?

16

1 1

Sign up to Contribute: Repositories Workshop + Dataverse Community Meeting June 9-11, 2015 @ Harvard http://bit.ly/1A51atJ

Find Out More: * Visit our Collaborations page: http://bit.ly/1Bg2nkw * Dataverse Project Site: http://dataverse.org

Contact Project Coordinator: Eleni Castro ([email protected])

1

2

3

Page 17: DATA PUBLISHING WORKFLOWS WITH DATAVERSEprojects.iq.harvard.edu/files/ojs-dvn/files/rda... · DATA PUBLISHING WORKFLOWS WITH DATAVERSE Eleni Castro (ecastro@fas.harvard.edu) Institute

Thank You! Any Questions?

17

Contact Me: Eleni Castro ([email protected])