Upload
tyler-walters
View
1.586
Download
2
Tags:
Embed Size (px)
Citation preview
Title Here
Title Here, Optional or
Unit Identifier
Advancing the fourth paradigm of research:
Assimilating repositories into active
research phases
Tyler Walters
Dean, University Libraries, Virginia Tech
SPARC Conference, Kansas City, March 12, 2012
2
The Rise of
Virtual Environments
Repositories are being woven into “virtual ecosystems,”
they are holistic and support communities of practice
• Early stages / deposit: raw/early phase data, notes, etc.
• Annotating, sharing within research groups, commenting, etc.
• Research proposal writing, project planning, etc.
• Tools:
• Discovery, analysis, visualization, and text/data/image mining
are being used in concert with repositories
• Virtual communities and their communication tools
• e.g., social media and community networking capabilities
Which projects are highlighted?
3
TARDIS
Purdue University Research Repository | PURR
e-Research
Metaman MyTardis MyTardis
AustralianResearch Data
Commons
ORCA
MyTardis
RM4
Synchrotron
'
Institutional Research
Data Registry
MX1 / MX2 Beamline
Researcher's Computer
Protein Crystallography
Research Data Management Platform
(for raw, processed, refined, and published data)
Metadata
Extraction
Monash Raw Data
Data & Metadata
Metadata
LEGEND
Protein Crystallography Research Data and Metadata Workflow Version 0.93/6/2011
Computer
Cluster
Research Admin
Repository
InstitutionalResearch
Data Registry
MyTardis
InstitutionalResearch
Data Registry
Proposed
MyTardis
The Australian Repositories
for Diffraction ImageS
MetadataHarvester
Future?:
Virtual lab
system
From Capture
to Publication
Early Stage/Deposit
• Move curation upstream in the data/information life cycle
• Automatically capture metadata, defined by the data producers
• Provide facilities for annotation and mark-up of data
5
Early Stage:
The Active Curation Model
Active Curation
Social Media
Data
Metadata
Workflows
Review
Rating
Commenting
Tools and Toolkits
• A Critical Intersection in the ‘Virtual Ecosystem’ is:
• Developing toolkits for discovery, analysis, visualization,
and text/data/image mining… all are being used with
repositories
• Leveraging existing tools (open source and proprietary)
• Incorporating custom, discipline-specific tools
7
Tools + Repositories
Tools & Toolkits
8
Functionality:
• By data type
• Search
• Visualization
• Subsetting
• Analysis
• Services
Kepler
DMP-Tool
Investigator Toolkit Activities (from DataONE)
Plan
Collect
Assure
Describe
Preserve
Discover
Integrate
Analyze
Communities and
Communication
• Co-authorship
• Co-funding
• Micro-citation
• Shared project
repositories
• Shared tags
• Threaded discussions
• Quoting, forwarding, …
• (reviewing, commenting)
(slide from SEAD)
Working Group Support
11
• File share
• Wikis
• To do lists
• Blogs
• Calendars
• Forums
• Project notes
• Commenting
• Tagging
• Proposal
writing
How do IRs and “papers” fit in? IRs are being leveraged in these new developments
• Services over an
active content layer that is backed by/harvested into a federated archive infrastructure based on institutional resources
(slide from SEAD)
Institutional Repositories
Network of Data Producers
Web User Interface
Active Content Repository
Services Provided
Virtual Archives
User Network
Data Conservancy
IU ICPSR
Content Mining
Curation Decisions
Archival data
generation
Other services
RPI UIUC UM
Linked Data and Repositories
• Tag and annotate data
• Overlay it with reference data
• Organize it in domain terminology
• Link it to people, papers, projects,
conversations…
(slide from SEAD)
Thank you…
Tyler Walters
tywalters1 = Skype / Twitter
Acknowledgements for slides and conversations:
• Robert McDonald (Indiana), SEAD
• William Michener (New Mexico), DataONE
• Antohny Bietz and Steve Androulakis (Monash), TARDIS
• Michael Witt (Purdue), PURR
• Suzie Allard (Tennessee), ORNL DAAC
• Sayeed Chourdhury (Johns Hopkins), Data Conservancy
14