20
CDL: Supporting the Research Life Cycle Perry Willett University of California Curation Center California Digital Library

CDL research lifecycle

Embed Size (px)

Citation preview

Page 1: CDL research lifecycle

CDL: Supporting the Research Life Cycle

Perry WillettUniversity of California Curation Center

California Digital Library

Page 2: CDL research lifecycle

University of California:• 10 campuses, 5 medical centers, 3 national laboratories• 238,000 students• 190,000 faculty members and staff• $4.7 billion in research funding and external grants

Page 3: CDL research lifecycle

California Digital Library: • Part of the University of California• Located organizationally in the Office of the President

Page 4: CDL research lifecycle

UC3:Partnership between CDL | 10 UC campuses | Peer institutions

Provide solutions, services, resources for digital assets

Pool & distribute diverse experience, expertise, & resources

Page 5: CDL research lifecycle

A life cycle approachCreate, edit, share, and save

data management plans

Curation repository: store, manage, and share research data

Self-service tool for metadata creation and submission to Merritt data

repository

Create and manage long-term identifiers

collect

Open Access publishing services / dynamic research platform

plan

manage share

Page 6: CDL research lifecycle

A life cycle approachCreate, edit, share, and save

data management plans

Curation repository: store, manage, and share research data

Create and manage long-term identifiers

plan

Open Access publishing services / dynamic research platform

Self-service tool for metadata creation and submission to Merritt data

repository

Page 7: CDL research lifecycle

DMPTool

• Connect researchers to resources to create a data management plan

• NSF and directorates, NIH, NEH, IMLS, foundations plus

• Customizable

Meeting funding agencies data management plan requirements

Primary Functions1. Step-by-step “wizard”

2. Templates and examples

3. Links to institutional resources and agency information

4. Plan publication and sharing

Page 8: CDL research lifecycle

• Precise identification of a dataset (DOI or ARK)

• Credit to data producers and data publishers

• A link from the traditional literature to the data

• Exposure and research metrics for datasets(Web of Knowledge, Google)

Primary Functions1. Create long term identifiers

2. Manage identifiers (and associated metadata) over time

3. Resolve identifiers

EZIDLong term identifiers made easy

@ezidCDL

Page 9: CDL research lifecycle

A life cycle approachCreate, edit, share, and save

data management plans

Curation repository: store, manage, and share research data

Create and manage long-term identifiers

collect

Open Access publishing services / dynamic research platform

Self-service tool for metadata creation and submission to Merritt data

repository

Page 10: CDL research lifecycle
Page 11: CDL research lifecycle
Page 12: CDL research lifecycle
Page 13: CDL research lifecycle
Page 14: CDL research lifecycle

A life cycle approachCreate, edit, share, and save

data management plans

Curation repository: store, manage, and share research data

Create and manage long-term identifiers

Open Access publishing services / dynamic research platform

Self-service tool for metadata creation and submission to Merritt data

repository share

Page 15: CDL research lifecycle

Merritt

• Developed and supported in-house• “Model free”

– No prescriptive requirements regarding format, structure, metadata, or genre

• UI and REST API• Strongly versioned

– Any change to data or metadata triggers a new version– All previous versions can be re-instantiated for retrieval– Intra-version compression (forward deltas) minimizes storage

duplication

Page 16: CDL research lifecycle

Merritt• Metadata

– User supplied: descriptive, …

– System augmented: technical, structural, provenance

• Replication and audit– Across two technologies

• OpenStack/Swift

• WAN NFS– Across two locations

• UCLA (two internal replicas)

• UCSD/SDSC (three internal replicas)

Page 17: CDL research lifecycle

eScholarship

Page 18: CDL research lifecycle

• UC’s institutional repository and publishing platform

• 90,000 publications• 78 open access journals• Repository for UC’s Open Access policy:

“…future research articles authored by faculty at all 10 campuses of UC will be made available to the public at no charge.”

eScholarship

Page 19: CDL research lifecycle

A life cycle approach

plan

collect

manage

share

Page 20: CDL research lifecycle

For more informationUC3 Data Management Planning Resources

http://www.cdlib.org/uc3https://dash.berkeley.eduhttps://dmptool.orghttp://ezid.cdlib.org

Twitter: – @ezidCDL – @UC3CDL– @TheDMPTool– @CalDigLib

Email: [email protected]