2
Tourte, G. J. L., Tonkin, E. L., Valdes, P. J., & Ball, A. (2010). PEG- BOARD -- Open Access, Reuse and Preservation of Palæoclimate Data. Poster session presented at 6th International Digital Curation Conference, Chicago, IL, United States. Peer reviewed version Link to publication record in Explore Bristol Research PDF-document University of Bristol - Explore Bristol Research General rights This document is made available in accordance with publisher policies. Please cite only the published version using the reference above. Full terms of use are available: http://www.bristol.ac.uk/pure/about/ebr-terms.html

Tourte, G. J. L., Tonkin, E. L., Valdes, P. J., & Ball, A ...GregoryJ.L.Tourte School of Geographical Sciences The University of Bristol, UK [email protected] Emma L.Tonkin

  • Upload
    others

  • View
    1

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Tourte, G. J. L., Tonkin, E. L., Valdes, P. J., & Ball, A ...GregoryJ.L.Tourte School of Geographical Sciences The University of Bristol, UK g.j.l.tourte@bristol.ac.uk Emma L.Tonkin

Tourte, G. J. L., Tonkin, E. L., Valdes, P. J., & Ball, A. (2010). PEG-BOARD -- Open Access, Reuse and Preservation of Palæoclimate Data.Poster session presented at 6th International Digital Curation Conference,Chicago, IL, United States.

Peer reviewed version

Link to publication record in Explore Bristol ResearchPDF-document

University of Bristol - Explore Bristol ResearchGeneral rights

This document is made available in accordance with publisher policies. Please cite only the publishedversion using the reference above. Full terms of use are available:http://www.bristol.ac.uk/pure/about/ebr-terms.html

Page 2: Tourte, G. J. L., Tonkin, E. L., Valdes, P. J., & Ball, A ...GregoryJ.L.Tourte School of Geographical Sciences The University of Bristol, UK g.j.l.tourte@bristol.ac.uk Emma L.Tonkin

Introduction

The BRIDGE research group was set up in 2003, and aims to improve theunderstanding of natural climate and environmental variability and to usethis knowledge to predict future changes more accurately and assess itsimpact on all aspects of human society.

Data Preservation

The work on which the PEG-BOARD project focuses inherits from priordevelopments in the same area, including a software and service basisdesigned to support a primary activity of the research group -palæoclimate modeling via software simulation around sampledhistorical data. This service set is approximately fifteen years old.Palæoclimate simulations generate a great deal of data that is frequentlyreused by researchers in the sciences and humanities, worldwide.However, as a high-performance computing application, the quantities ofdata created are extremely large, meaning that participants havehistorically had to develop and apply de-facto preservation and datacompression or summarisation policies. Since types of informationrequired by users from areas as diverse as evolutionary biology,archæology and earth science vary greatly, the project has alsodeveloped,based on existing software, an in-house interface designed tosupport tailored information extraction from climate model information.

BRIDGE Data

The BRIDGE group has chosen to provide open access to data, withparticular focus on those working internally to the research group,external researchers in the same field, and researchers with aninterdisciplinary focus meshing in some way with the data available fromBRIDGE palæoclimatology research, such as archæologists, biologists, andearth scientists. As user requirements have become evident over the pastdecade, a web site has been developed in an ad hoc manner that hasenabled non-expert users to make use of a number of common methodsof individual or comparative analysis, that is, to make use of the data fortheir own purposes.

Educational Reuse

The proportion of internal and external data reuse is approximatelyequal. Educational reuse occurs periodically following the term-basedstructure. Another major form of reuse is the research-oriented reuse,unsurprisingly since the platform operates primarily as an e-Scienceresource.

Media Reuse

BBC's Incredible Human Journey is an example ofmedia use of a dataset developed from a set ofsimulations originally designed for a single purpose.It is noticeable that it has been accessed a numberof times, each time by different users. Whilst the firsttwo spikes in reuse are likely to do with its initialuse case, later accesses are pobably irrelevant, asthe documentary had already been aired. There is areasonably clean 'decay curve' visible in this graph,expected for resources with declining importanceover time.

Discussion & Conclusion

This study shows us that each dataset beats with a different pulse, andthat the key to gaining a better understanding of these patterns of use isto begin to model them according to the forces that drive eachunderlying process. To provide a detailed model for each form of reuse isperhaps beyond practical reach. Predictive modeling of reuse patterns ofa given dataset shares a characteristic with climate data modelling: thatis, that the behaviour of the wider system that drives interest and activityin a research domain might properly be described as having a chaoticelement. Through data mining, we also begin to see how data setsinterrelate.

In future work, we will continue our development of data managementpolicies and documentation, best practice and an enriched softwarebackend as far as possible generic to any sort of climate data and asaccessible as possible to most disciplines. We also intend to continue ourresearch in data mining, seeking to improve our identification of usagepatterns, and of relationships between datasets such as the obsolescenceand replacement pattern.

http://www.bridge.bristol.ac.uk/

http://www.paleo.bris.ac.uk/projects/peg-board/

PPEEGG--BBOOAARRDD -- OOppeenn AAcccceessss,, RReeuussee aanndd PPrreesseerrvvaattiioonn ooff PPaallææoocclliimmaattee DDaattaa

References

[Journal article] Higgins, S. (2008). The DCC Curation Lifecycle Model, International Journal of Digital

Curation, 3(1), 134-140.

[journal article] Jones, S., Ball, A., & Ekmekcioglu, C. (2008). The Data Audit Framework: A First Step in

the Data Management Challenge. International Journal of Digital Curation, 3(2), 2008.

[report] ISO 14721:2003. Space data and information transfer systems -- Open archival information

system -- Reference model. Geneva: International Organization for Standardization.

[proceedings] Tourte, G., Tonkin, E., & Valdes, P., 2010. (2010). The PEG-BOARD Project: A Case Study

for BRIDGE. In: 14th International Conference on Electronic Publishing.

0

100

200

300

400

500

600

700

800

900

1000

2008-0

3

2008-0

4

2008-0

5

2008-0

6

2008-0

7

2008-0

8

2008-0

9

2008-1

0

2008-1

1

2008-1

2

2009-0

1

2009-0

2

2009-0

3

2009-0

4

2009-0

5

2009-0

6

2009-0

7

2009-0

8

2009-0

9

2009-1

0

2009-1

1

2009-1

2

2010-0

1

2010-0

2

2010-0

3

2010-0

4

2010-0

5

Un

iqu

e u

sag

e e

ven

ts/m

on

th

Month

Figure 1: Example of educational reuse of a dataset over a 2 year period

Gregory J. L. TourteSchool of Geographical Sciences

The University of Bristol, UK

[email protected]

Emma L. TonkinUKOLN

The University of Bath, UK

[email protected]

Prof. Paul ValdesSchool of Geographical Sciences

The University of Bristol, UK

[email protected]

Alex BallUKOLN

The University of Bath, UK

[email protected]

0

2

4

6

8

10

12

14

2008-03

2008-04

2008-05

2008-06

2008-07

2008-08

2008-09

2008-10

2008-11

2008-12

2009-01

2009-02

2009-03

2009-04

2009-05

2009-06

2009-07

2009-08

2009-09

2009-10

2009-11

2009-12

2010-01

2010-02

2010-03

2010-04

2010-05

Un

iqu

e u

sag

e e

ven

ts/m

on

th

Month

Primary Investigator/CreatorResearch Group leader

External User/TargetPhD Student

0

50

100

150

200

250

2008-0

5

2008-0

6

2008-0

7

2008-0

8

2008-0

9

2008-1

0

2008-1

1

2008-1

2

2009-0

1

2009-0

2

2009-0

3

2009-0

4

2009-0

5

2009-0

6

2009-0

7

2009-0

8

2009-0

9

2009-1

0

2009-1

1

2009-1

2

2010-0

1

2010-0

2

2010-0

3

2010-0

4

2010-0

5

2010-0

6

2010-0

7

Un

iqu

e u

sag

e e

ven

ts/m

on

th

Month

xbooa

xboob

xboof

Figure 2: Media Reuse. Access to BBC's "The Incredible Journey" dataset

Figure 3: Example of usage decay curves