Digital | Curation | Centre An Introduction to the UK Digital Curation Centre Dr Liz Lyon, DCC Associate Director Outreach Director, UKOLN, University.

  • Published on
    28-Mar-2015

  • View
    213

  • Download
    1

Transcript

<ul><li>Slide 1</li></ul> <p>Digital | Curation | Centre An Introduction to the UK Digital Curation Centre Dr Liz Lyon, DCC Associate Director Outreach Director, UKOLN, University of Bath, UK Funded by: This work is licensed under a Creative Commons Licence Attribution-ShareAlike 2.0 CURL/SCONUL Workshop December 2005 Slide 2 Digital | Curation | Centre 2 Overview About the Digital Curation Centre Organisation and structure What is digital curation? e-Research cycle DCC activities Development activity Research agenda Advisory services Outreach programme Slide 3 Digital | Curation | Centre 3 UK Digital Curation Centre Development activities Research agenda Delivering services Outreach Programme http://www.dcc.ac.uk/ Slide 4 Digital | Curation | Centre 4 DCC people (some of them) Management &amp; Co-ordination Director Chris Rusbridge (University of Edinburgh) Community Support &amp; Outreach Led by Dr Liz Lyon (UKOLN, University of Bath) Service Definition &amp; Delivery Led by Professor Seamus Ross (HATII, University of Glasgow) Development Led by Dr David Giaretta (Astronomical Software &amp; Services, CCLRC) Research Led by Professor Peter Buneman (University of Edinburgh) Slide 5 Digital | Curation | Centre 5 For later use? In use now (and the future)? What is digital curation? Data preservationData curation StaticDynamic maintaining and adding value to a trusted body of digital information for current and future use Slide 6 Digital | Curation | Centre 6 (Very simple) e-Research Cycle and Data Curation Formulate hypothesis / ideas, test, experiment, observe: data creation, collection &amp; capture Adding value: Data linking, annotation, visualisation, simulation (New) knowledge extraction: data mining, modelling, analysis, synthesis e-Infrastructure Open access Collaboration Scholarly communications: data disclosure, publication, citation, discovery, re-use Data management storage &amp; validation: description, deposit, self-archiving, preservation, certification Data processing This work is licensed under a Creative Commons Licence Attribution-ShareAlike 2.0 Slide 7 Digital | Curation | Centre 7 (Very simple) e-Research Cycle and Data Curation Formulate hypothesis / ideas, test, experiment, observe: data creation, collection &amp; capture Adding value: Data linking, annotation, visualisation, simulation (New) knowledge extraction: data mining, modelling, analysis, synthesis e-Infrastructure Open access Collaboration Scholarly communications: data disclosure, publication, citation, discovery, re-use Data management storage &amp; validation: description, deposit, self-archiving, preservation, certification Data processing Slide 8 Digital | Curation | Centre 8 Slide 9 9 Engineering Product Information EPSRC Grand Challenge Project, Prof Chris McMahon, University of Bath Slide 10 Digital | Curation | Centre 10 Access Grid Collaborative telematic art Modify spaces for performers Interplay: Hallucinations Slide 11 Digital | Curation | Centre 11 Data capture &amp; integration into research workflows R4L Repository for the Laboratory Project (JISC- funded) automated data capture from instrumentation, deposit of results (chemistry) SMART TEA electronic Laboratory notebook + annotations Slide 12 Digital | Curation | Centre 12 (Very simple) e-Research Cycle and Data Curation Formulate hypothesis / ideas, test, experiment, observe: data creation, collection &amp; capture Adding value: Data linking, annotation, visualisation, simulation (New) knowledge extraction: data mining, modelling, analysis, synthesis e-Infrastructure Open access Collaboration Scholarly communications: data disclosure, publication, citation, discovery, re-use Data management storage &amp; validation: description, deposit, self-archiving, preservation, certification Data processing Slide 13 Digital | Curation | Centre 13 Learning &amp; Teaching workflows Research &amp; e-Science workflows Aggregator services: national, commercial Repositories : institutional, e-prints, subject, data, learning objects Institutional presentation services: portals, Learning Management Systems, u/g, p/g courses, modules Harvesting metadata Data creation / capture / gathering: laboratory experiments, Grids, fieldwork, surveys, media Resource discovery, linking, embedding Deposit / self- archiving Peer-reviewed publications: journals, conference proceedings Publication Validation Data analysis, transformation, mining, modelling Resource discovery, linking, embedding Deposit / self- archiving Learning object creation, re-use Searching, harvesting, embedding Quality assurance bodies Validation Presentation services: subject, media-specific, data, commercial portals Resource discovery, linking, embedding The scholarly knowledge cycle. Liz Lyon, Ariadne, July 2003. This work is licensed under a Creative Commons Licence Attribution-ShareAlike 2.0 Liz Lyon (UKOLN, University of Bath), 2005 Slide 14 Digital | Curation | Centre 14 Disciplinary data-centres Slide 15 Digital | Curation | Centre 15 eBank UK Project Two key themes: Open access to datasets Linking research data to publications and to learning UKOLN, University of Southampton, University of Manchester e-Science application Combechem : Grid-enabled combinatorial chemistry + National Crystallography Service Resource Discovery Network / PSIgate physical sciences portal http://www.ukoln.ac.uk/projects/ebank-uk/ Slide 16 Digital | Curation | Centre 16 A data repository entry Slide 17 Digital | Curation | Centre 17 Access to the underlying data: complex objects ecrystals.chem.soton.ac.uk Slide 18 Digital | Curation | Centre 18 Data descriptions Validation, publication &amp; discovery of data models &amp; schema Managing complex objects Metadata packaging standards METS MPEG 21 DIDL Semantic descriptions Formal controlled vocabularies High-level and domain ontologies Inter-disciplinary discovery Informal approaches Web 2.0 folksonomies Slide 19 Digital | Curation | Centre 19 Trusted digital repositories Audit Checklist for Certification Draft Report published August 2005 Research Libraries Group RLG-NARA Taskforce Defined criteria under 4 categories Organisation Functions, processes &amp; procedures Designated community &amp; usability Technologies &amp; technical infrastructure Slide 20 Digital | Curation | Centre 20 OAIS Reference Model Slide 21 Digital | Curation | Centre 21 DCC: Development DCC Approach to Digital Curation based on the Reference Model for an Open Archival Information System (OAIS); ISO standard, 14721: Monitoring international standards Development of a Representation Information (RI) registry/repository (DCC-RR) Recommendations for tools and methods for generating Representation Information Creating test-beds for digital curation tools Development info see http://dev.dcc.ac.uk for details of Wiki and email list open to all Slide 22 Digital | Curation | Centre 22 (Very simple) e-Research Cycle and Data Curation Formulate hypothesis / ideas, test, experiment, observe: data creation, collection &amp; capture Adding value: Data linking, annotation, visualisation, simulation (New) knowledge extraction: data mining, modelling, analysis, synthesis e-Infrastructure Open access Collaboration Scholarly communications: data disclosure, publication, citation, discovery, re-use Data management storage &amp; validation: description, deposit, self-archiving, preservation, certification Data processing Slide 23 Digital | Curation | Centre 23 Persistent identifiers for data citation Identify use cases: depositor, author, service provider, reader, publisher, ? Schemes: DOI, Handle, ARK, PURL Global identification: express as http URIs Added value services: CrossRef, resolution service, integration (Globus), look-up service Domain identifiers: e.g. International Chemical Identifier (INChI) codes Google molecules using InChIs demo: Peter Murray-Rust, University of Cambridge DCC Workshop June 2005 Glasgow Slide 24 Digital | Curation | Centre 24 One approach to data citation using DOIs Publication &amp; citation of scientific primary data project National Library for Science &amp; Technology (TIB), University of Hanover, Germany STD-DOI Project http://www.std-doi.de DOI registry for datasets Data publication agents: World Data Center Climate, GeoForschungsZentrum Potsdam Data requirements: quality control, long-term curation, use DOI resolver Exemplar data citation: Kamm, H; Machon, L; Donner, S (2004): Gas chromatography (KTB Field Lab), GFZ Potsdam. doi:10.1594/GFZ/ICDP/KTB/ktb- geoch-gaschr-p Slide 25 Digital | Curation | Centre 25 (Very simple) e-Research Cycle and Data Curation Formulate hypothesis / ideas, test, experiment, observe: data creation, collection &amp; capture Adding value: Data linking, annotation, visualisation, simulation (New) knowledge extraction: data mining, modelling, analysis, synthesis e-Infrastructure Open access Collaboration Scholarly communications: data disclosure, publication, citation, discovery, re-use Data management storage &amp; validation: description, deposit, self-archiving, preservation, certification Data processing Slide 26 Digital | Curation | Centre 26 Adding value: eBank linking data to publications Slide 27 Digital | Curation | Centre 27 Linking research to learning - embedding eBank aggregator service in a science portal for student learners Slide 28 Digital | Curation | Centre 28 Adding value through annotation DCC Research at the University of Edinburgh Scientific databases: Annotation scoping report AstroDAS: distributed annotation servers in astronomy New annotation model + prototype: top-ranked demonstration at recent DB conference Slide 29 Digital | Curation | Centre 29 DCC Research agenda Publishing &amp; integrating scientific databases Archiving past states of volatile databases Database provenance and annotation Organisational dynamics of trusted repositories Automating metadata extraction Cost-benefit analysis of data curation Rights and responsibilities Public domain, public interest, public funding paper Waelde &amp; McGinley Slide 30 Digital | Curation | Centre 30 (Very simple) e-Research Cycle and Data Curation Formulate hypothesis / ideas, test, experiment, observe: data creation, collection &amp; capture Adding value: Data linking, annotation, visualisation, simulation (New) knowledge extraction: data mining, modelling, analysis, synthesis e-Infrastructure Open access Collaboration Scholarly communications: data disclosure, publication, citation, discovery, re-use Data management storage &amp; validation: description, deposit, self-archiving, preservation, certification Data processing Slide 31 Digital | Curation | Centre 31 Facilitate post-processing and knowledge extraction Enable the acquisition of newly-derived information and knowledge Run complex algorithms over primary datasets Mining (data, text, structures) Modelling (economic, climate, mathematical, biological) Analysis (statistical, lexical, pattern matching, gene) Slide 32 Digital | Curation | Centre 32 Slide 33 Digital | Curation | Centre 33 DCC Case Study published: Wide Field Astronomy Unit Slide 34 Digital | Curation | Centre 34 Supporting the community DCC Outreach &amp; Services: HELPDESK@dcc.ac.uk (legal - technical guidance) Curation Manual 45 chapters planned, Briefing Papers Workshops: Future-proofing Institutional Web sites, Jan 19-20, London Information Days: regional 1 st International DCC Conference, Bath Sept 2005 PV2005 November, Edinburgh 2 nd International Conference November 2006 Glasgow tbc Slide 35 Digital | Curation | Centre 35 www.ijdc.net Peer-review Editorial Board Peter Buneman Editor (research) Production editor Richard Waller Papers for submission are very welcome! 1 st issue soon. Slide 36 Digital | Curation | Centre 36 Associates Network Goals Develop understanding, share best practice, advance research, promote recognition, develop consensus Membership International groups, national bodies, industry partners, funders, research groups, HEIs, FEIs, individuals Benefits Early access to R&amp;D outputs, advisory services, training, input to definition and design, community participation Discussion Forum www.dcc.ac.uk Please join us! Slide 37 Digital | Curation | Centre 37 Developing skills &amp; collaboration NSF Report : Data scientist Develop hybrid skills Embed in u/g, p/g curriculum Facilitate community collaboration: Researchers Data centres Libraries &amp; archives New roles??? Achieve cultural change Slide 38 Digital | Curation | Centre Thank you. Questions? e.lyon@ukoln.ac.uk Join the DCC Associates Network at www.dcc.ac.uk </p>

Recommended

View more >