Upload
zoe-watson
View
212
Download
0
Tags:
Embed Size (px)
Citation preview
UKOLN is supported by:
Digital Libraries and e-Research: new horizons, new challenges?
Dr Liz Lyon, DirectorUKOLN, University of Bath, UK
8th International Bielefeld Conference
February 2006.
www.bath.ac.uk
a centre of expertise in digital information management
www.ukoln.ac.uk
This work is licensed under a Creative Commons LicenceAttribution-ShareAlike 2.0
8th International Bielefeld Conference 2
Overview
1. Data-intensive science - contextual drivers• Scientific: e-Research process• Socio-political: open access to data-sets• Technical: data curation and repository infrastructure
2. An update and exemplars from the UK
3. Some issues for libraries• Engagement and advocacy• Skills and expertise• Strategic position and profile
8th International Bielefeld Conference 3
(Very simple) e-Research Cycle and Data Curation
Formulate hypothesis / ideas, test, experiment, observe: data creation,
collection & capture
Adding value: Data linking, annotation,
visualisation, simulation
(New) knowledge extraction: data mining, modelling, analysis, synthesis
e-Infrastructure
Open access
Collaboration
Scholarly communications: data disclosure, publication, citation, discovery, re-use
Data management storage & validation: description, deposit,
self-archiving, preservation,
certification
Data processing
Data processingData processing
Data processing
Data processing
This work is licensed under a Creative Commons LicenseAttribution-ShareAlike 2.0
8th International Bielefeld Conference 4
(Very simple) e-Research Cycle and Data Curation
Formulate hypothesis / ideas, test, experiment, observe: data creation,
collection & capture
Adding value: Data linking, annotation,
visualisation, simulation
(New) knowledge extraction: data mining, modelling, analysis, synthesis
e-Infrastructure
Open access
Collaboration
Scholarly communications: data disclosure, publication, citation, discovery, re-use
Data management storage & validation: description, deposit,
self-archiving, preservation,
certification
Data processing
Data processingData processing
Data processing
Data processing
This work is licensed under a Creative Commons LicenseAttribution-ShareAlike 2.0
8th International Bielefeld Conference 5
8th International Bielefeld Conference 6
Engineering Product Information
EPSRC Grand Challenge Project, Prof Chris McMahon, University of Bath
8th International Bielefeld Conference 7
– Access Grid – Collaborative telematic art– Modify spaces for performers – Interplay: Hallucinations
8th International Bielefeld Conference 8
Library issues 1: Data capture & integration into research workflows• R4L Repository for the Laboratory Project (JISC-funded)
automated data capture from instrumentation, deposit of results (chemistry)
• SMART TEA electronic Laboratory notebook + annotations• How is primary research data captured in faculty and
academic departments?• Where and how is primary research data stored in your
institution?
8th International Bielefeld Conference 9
(Very simple) e-Research Cycle and Data Curation
Formulate hypothesis / ideas, test, experiment, observe: data creation,
collection & capture
Adding value: Data linking, annotation,
visualisation, simulation
(New) knowledge extraction: data mining, modelling, analysis, synthesis
e-Infrastructure
Open access
Collaboration
Scholarly communications: data disclosure, publication, citation, discovery, re-use
Data management storage & validation: description, deposit,
self-archiving, preservation,
certification
Data processing
Data processingData processing
Data processing
Data processing
This work is licensed under a Creative Commons LicenseAttribution-ShareAlike 2.0
8th International Bielefeld Conference 10
Digital repositories: a UK view in 2006
• Institutional repository trends D-Lib Magazine Sept 2005– Statistics: UK 31, (Germany 103, Sweden 25)– Policy: UK RCUK draft, (Germany YES), – National programmes: UK YES (Germany Sweden Netherlands)
• Pioneering work: eprints.org, ePrints UK, eBank UK……• University of Southampton has a Self-Archiving Policy and a
mandate rather than a recommendation• OpenDOAR Directory of Open Access repositories: Univ
Nottingham and Lund
• JISC £4M Digital Repository Programme + support : use cases, reference models, standards, deposit APIs, DigiRep wiki
8th International Bielefeld Conference 11
Federated repository architectures
fusion layer ‘repository federator’
repository repository repository repository repository
portal portal portal portal portal
heterogeneous - metadataformats, content formats,identifiers, packagingstandards
homogeneous - metadataformats, content formats,identifiers, packagingstandards
From Andy Powell: http://www.ukoln.ac.uk/distributed-systems/jisc-ie/arch/presentations/jiie-jcs-2005/
• Global
• Inter-disciplinary
• Cross-sectoral
• Multiple format types
• Data, eprints, images…….
• e-Framework: JISC & DEST
• Defining common services + domain-specific services + repository services
8th International Bielefeld Conference 12
Trusted digital repositories• Audit Checklist for Certification Draft August 2005• Research Libraries Group RLG-NARA Taskforce • Defined criteria under 4 categories
– Organisation– Functions, processes & procedures– Designated community & usability– Technologies & technical infrastructure
• UK Digital Curation Centre– Providing advice, tools and support services – 2nd DCC International Conference Glasgow November 21-22
http://www.dcc.ac.uk/
8th International Bielefeld Conference 13
Open access driver?
8th International Bielefeld Conference 14
Learning & Teaching workflows
Research & e-Science workflows
Aggregator services: national, commercial
Repositories : institutional, e-prints, subject, data, learning objects
Institutional presentation services: portals, Learning Management Systems, u/g, p/g courses, modules
Harvestingmetadata
Data creation / capture / gathering: laboratory experiments, Grids, fieldwork, surveys, media
Resource discovery, linking, embedding
Deposit / self-archiving
Peer-reviewed publications: journals, conference proceedings
Publication
Validation
Data analysis, transformation, mining, modelling
Resource discovery, linking, embedding
Deposit / self-archiving
Learning object creation, re-use
Searching , harvesting, embedding
Quality assurance bodies
Validation
Presentation services: subject, media-specific, data, commercial portals
Resource discovery, linking, embedding
The scholarly knowledge cycle.
Liz Lyon, Ariadne, July 2003.
This work is licensed under a Creative Commons LicenseAttribution-ShareAlike 2.0
© Liz Lyon (UKOLN, University of Bath), 2005
8th International Bielefeld Conference 15
eBank UK Project• Two key themes:
– Open access to datasets
– Linking research data to publications and to learning
• UKOLN (lead), University of Southampton, University of Manchester• Hybrid team: scientists, computer scientists and digital library specialists• e-Science application ‘Combechem’ : Grid-enabled combinatorial chemistry
+ National Crystallography Service
http://www.ukoln.ac.uk/projects/ebank-uk/
8th International Bielefeld Conference 16
A data repository entry ecrystals.chem.soton.ac.uk
8th International Bielefeld Conference 17
Access to the underlying data: complex objects
8th International Bielefeld Conference 18
Library issues 2: data descriptions• Validation, publication & discovery of
data models & schema• Complex objects metadata packaging
standards– METS– MPEG 21 DIDL
• Semantic descriptions– Formal controlled vocabularies– High-level and domain ontologies– Inter-disciplinary discovery
• Informal / social approaches Web 2.0 “folksonomies”
• eBank Application Profile publication• What data models and metadata
schema are in place?• Have librarians been involved in
their development?
8th International Bielefeld Conference 19
(Very simple) e-Research Cycle and Data Curation
Formulate hypothesis / ideas, test, experiment, observe: data creation,
collection & capture
Adding value: Data linking, annotation,
visualisation, simulation
(New) knowledge extraction: data mining, modelling, analysis, synthesis
e-Infrastructure
Open access
Collaboration
Scholarly communications: data disclosure, publication, citation, discovery, re-use
Data management storage & validation: description, deposit,
self-archiving, preservation,
certification
Data processing
Data processingData processing
Data processing
Data processing
This work is licensed under a Creative Commons LicenseAttribution-ShareAlike 2.0
8th International Bielefeld Conference 20
Discovering data:
Coles, S.J., Day, N.E., Murray-Rust, P., Rzepa, H.S., Zhang, Y., Org. Biomol. Chem., 2005, (10),1832-1834. DOI: 10.1039/b502828k
• Domain identifier: International Chemical Identifier (INChI) code• Google molecule using INChISlide from Simon Coles
8th International Bielefeld Conference 21
Library issues 3: Persistent identifiers for data citation
• How will they be used? We need use cases: depositor, author, service provider, reader, publisher?
• Schemes: DOI, Handle, ARK, PURL• Publication & citation of scientific primary data project
National Library for Science & Technology (TIB), University of Hanover, Germany. STD-DOI Project http://www.std-doi.de – DOI registry for datasets
• eBank is working with TIB to assign DOIs to crystal structure data
• What persistent identifiers have been assigned to your data?
• Was the Library involved in the process?
8th International Bielefeld Conference 22
(Very simple) e-Research Cycle and Data Curation
Formulate hypothesis / ideas, test, experiment, observe: data creation,
collection & capture
Adding value: Data linking, annotation,
visualisation, simulation
(New) knowledge extraction: data mining, modelling, analysis, synthesis
e-Infrastructure
Open access
Collaboration
Scholarly communications: data disclosure, publication, citation, discovery, re-use
Data management storage & validation: description, deposit,
self-archiving, preservation,
certification
Data processing
Data processingData processing
Data processing
Data processing
This work is licensed under a Creative Commons LicenseAttribution-ShareAlike 2.0
8th International Bielefeld Conference 23
Adding value: eBank linking data to
publications
8th International Bielefeld Conference 24
Linking research to learning - embedding eBank aggregator service in a science portal for student learners
8th International Bielefeld Conference 25
Integration into the curriculum and e-Learning workflows
• MChem course • Assess role in
Undergraduate Chemical Informatics courses
• Pedagogic evaluation• February – May 2006• Report & workshop to
follow.
8th International Bielefeld Conference 26
(Very simple) e-Research Cycle and Data Curation
Formulate hypothesis / ideas, test, experiment, observe: data creation,
collection & capture
Adding value: Data linking, annotation,
visualisation, simulation
(New) knowledge extraction: data mining, modelling, analysis, synthesis
e-Infrastructure
Open access
Collaboration
Scholarly communications: data disclosure, publication, citation, discovery, re-use
Data management storage & validation: description, deposit,
self-archiving, preservation,
certification
Data processing
Data processingData processing
Data processing
Data processing
This work is licensed under a Creative Commons LicenseAttribution-ShareAlike 2.0
8th International Bielefeld Conference 27
8th International Bielefeld Conference 28
Library issues 4: Adding value and repository services
• Adding value
- Linking, annotation, visualisation
• Repository services for knowledge extraction
- Mining (data, text, structures)
- Modelling (economic, climate, mathematical, biological)
- Analysis (statistical, lexical, pattern matching, gene)
• How is your data being used and re-used?
8th International Bielefeld Conference 29
Library issues 5: workforce development and capacity building
• NSF Draft Report 2005 Long-lived digital data collections
• “Data scientist” - hybrid skills • Facilitate collaboration:
researchers, data centres, digital libraries & archives communities
• How does your Library shape up?
• SWOT analysis
8th International Bielefeld Conference 30
STRENGTHS
Scholarly communications role
Links with academic community
Content / collection management / stewardship practice
Cataloguing, classification & metadata expertise
(e)-Service delivery function
WEAKNESSES
Historic “document tradition”
Synergies between physical & digital worlds are still evolving
Shortage of technical skills
Cautious approach to innovation
Vision? (“its not our problem….”)
THREATS
Paradigm shift in research will out-pace change in libraries
Researchers will (only?) use on-demand e-Services
Libraries may lose their role in scholarly communications and eResearch workflows
OPPORTUNITIES
Build on ePrints work & eLearning experience
Exploit links with researchers - they need your skills
Seek funding to engage in innovative projects & services
Develop local, regional, national, global partnerships
8th International Bielefeld Conference 31
Libraries: Facing the future?
• Develop leadership & vision for eResearch engagement• Review organisational structures
– Extend & re-profile the Faculty/Subject/Reference Librarian role? – Closer collaboration with Computing Services?
• Provide eServices for data– We “do” eLearning so why not eResearch?– Include in institutional digital asset management
• Promote professional development of staff– Awareness-raising activities, new skills– Greater engagement, hybrid roles and hybrid teams
• Build new partnerships, new business models • Facilitate Transformational Change in Libraries
Thank you.Questions?…..
More information: UKOLN http://www.ukoln.ac.uk/
UKOLN receives core funding from the Joint Information Systems Committee (JISC) and the Museums, Libraries & Archives Council
(MLA) and is based at the University of Bath, UK.