19
P088; Presented in Canberra, 27 th March, 2008 000: Presented in Fremantle on 20 th October, 2008 GAIA RESOURCES Experiences in mobilizing biodiversity data – the story of HermesLite Tim Carpenter and Piers Higgs

P088; Presented in Canberra, 27 th March, 2008

  • Upload
    foster

  • View
    28

  • Download
    0

Embed Size (px)

DESCRIPTION

GAIA RESOURCES. Experiences in mobilizing biodiversity data – the story of HermesLite Tim Carpenter and Piers Higgs. P088; Presented in Canberra, 27 th March, 2008. GR000: Presented in Fremantle on 20 th October, 2008. INTRODUCTION. Motivation HermesLite Overview Architecture - PowerPoint PPT Presentation

Citation preview

Page 1: P088; Presented in Canberra, 27 th  March, 2008

P088; Presented in Canberra, 27th March, 2008GR000: Presented in Fremantle on 20th October, 2008

GAIA RESOURCES

Experiences in mobilizing biodiversity data – the story of

HermesLite

Tim Carpenter and Piers Higgs

Page 2: P088; Presented in Canberra, 27 th  March, 2008

INTRODUCTION

• Motivation

• HermesLite• Overview• Architecture• Implementation

• Implementation at the WA Museum

• Issues

• Directions

Page 3: P088; Presented in Canberra, 27 th  March, 2008

MOTIVATION

• Based on our experiences with Museums:• Cost had to be minimal• No capacity to serve data• Need minimal infrastructural change• Remote management necessary

• Funding for WA Museum (WAM) was made available through the Australian Biological Resources Study (ABRS)

• Under existing support arrangements to the WAM, Gaia Resources undertook the actual work

Page 4: P088; Presented in Canberra, 27 th  March, 2008

HERMESLITE

Overview

• HermesLite is an application designed to help institutions serve data - in accepted formats and standards - where they don’t have the capacity or resources to do this themselves.

• Open Source• Written in Java• Easy to manage within an institution• Can be remotely managed

Page 5: P088; Presented in Canberra, 27 th  March, 2008

HERMESLITE

Overview

• Deployable in two configurations:• Data Retrieval mode: A liteweight,

extensible, configurable Java application to extract and send data to…

• Data Receiver and Storage mode: Extensible Java servlet application to receive, translate and store the data.

• The Data Receiver and Storage mode can also act as the Retrieval mode if databases are local.

Page 6: P088; Presented in Canberra, 27 th  March, 2008

HERMESLITE

Internal Collection Database

External Database

HermesLite

Retrieval mode

HermesLite

Receive & Store mode

XML

MapServer

TAPIRLink

HTTPS

Architecture

Internal ServerExternal Server

Page 7: P088; Presented in Canberra, 27 th  March, 2008

HERMESLITE

Implementation

• Data Retrieval mode:• Unpack distribution files onto server• Create a configuration file for each database• Set up Java Wrapper Service to run

appropriately• Data Receiver and Storage mode:

• Install HermesLite web application• Extend HermesLite classes• Drop in handler JAR to process data• Install other software, (e.g. TapirLink)

Page 8: P088; Presented in Canberra, 27 th  March, 2008

HERMESLITE

Implementation - Data Retrieval

• Configurationvia a simple properties filewithvariables

Page 9: P088; Presented in Canberra, 27 th  March, 2008

HERMESLITE

Implementation - Data Storage

• Extend HermesLite classes to process received rows againstchosen dataformat or standard

Page 10: P088; Presented in Canberra, 27 th  March, 2008

HERMESLITE

Implementation

• This can represent a considerable organisational change in the way data is delivered

• Be prepared to guide your organisation through this process

Page 11: P088; Presented in Canberra, 27 th  March, 2008

EXAMPLE

Western Australian Museum

• Considerable discussion, demonstration, etc (“social engineering”) – still ongoing after four months

• Selection statements for each database (restricting “sensitive” data) – being tuned all the time, curators define these

• Darwin Core format• Restricted the fields being delivered

Page 12: P088; Presented in Canberra, 27 th  March, 2008

EXAMPLE

Western Australian Museum

• Production went live on 11th October, 2008• Data services are IP restricted to OZCAM only

WAM Collection Database

Bureau Database

HermesLite

Retrieval mode

HermesLite

Receive & Store mode

XML

MapServer

TAPIRLink

HTTPS

WAM ServerGaia Resource Bureau Server

Page 13: P088; Presented in Canberra, 27 th  March, 2008

ISSUES

Two main issues struck during implementation:

• “Social Engineering”

• Life Sciences IDentifier (LSID) resolution

Page 14: P088; Presented in Canberra, 27 th  March, 2008

ISSUES

Social Engineering

• Are all parts of the organisation ready to publish their data? Consider:• Management?• Data custodians (curators)?• Information Technology staff/providers?• Customers (both “big” and “little”)?

• Requires considerable cultural changes to organisations, especially when revenue from data provision

Page 15: P088; Presented in Canberra, 27 th  March, 2008

ISSUES

LSIDs

• A bureau service has issues with LSID resolution:• Originating institution WAM

(www.museum.wa.gov.au)• Bureau service – hosting and resolving – at

Gaia Resources (www.gaiaresources.com.au)

• Difficulties in getting access to DNS records• Currently investigating opportunities to get

around this by using the OZCAM cache as the LSID resolver

Page 16: P088; Presented in Canberra, 27 th  March, 2008

DIRECTIONS

Future Directions

• Discussions with GBIF and the ALA about integrating with other provider toolkits

• Potential use within the Museum community in Australia under the Bureau service model

Page 17: P088; Presented in Canberra, 27 th  March, 2008

DIRECTIONS

Collection Database

Bureau Database

HermesLite

Retrieval mode

HermesLite

Receive & Store mode

MapServer

TAPIRLink

Bureau Server

Collection Database

HermesLite

Retrieval mode

SFTP

TAPIRLink

TAPIR XML

Collection Database

CSV

SFTP

HTT

PS

Page 18: P088; Presented in Canberra, 27 th  March, 2008

ACKNOWLEDGEMENTS

Would not be possible without:

• Funding from the ABRS

• Support from the WAM, both from management and the curators themselves

• Assistance and advice from the Faunal Collections Informatics Group members

Page 19: P088; Presented in Canberra, 27 th  March, 2008

MORE INFORMATION

For more information…

• Links and information available from the Gaia Resources web site, including links to the SourceForge project.

www.gaiaresources.com.au