Controlled Vocabularies and Data Integration Presentation to ICSM Metadata Working Group Q1 2020 Jenny Mahuika, Data Librarian TERN Data Services and Analytics [email protected]



Controlled Vocabularies and Data Integration

Presentation to ICSM Metadata Working Group Q1 2020

Jenny Mahuika, Data Librarian

TERN Data Services and Analytics

[email protected]


Introduction to the Terrestrial Ecosystem Research Network

The challenge of harmonising diverse data

Process and Examples

Current Status


Introduction to the Terrestrial Ecosystem Research Network


Australia’s Land Ecosystem Observatory


TERN Purpose [1]

National infrastructure for collecting, collating, storing and sharing Australia’s terrestrial ecosystem data sets and knowledge.

[1] TERN is supported by the Australian Government through the National Collaborative Research Infrastructure Strategy from 2009


TERN in Operation

• Satellite remote sensing products

• Land cover dynamics and phenology

• Vegetation composition and structure

• Fire dynamics and impacts

• Continental Soil & Landscape data

• Carbon, energy, water fluxes

• Phenocams

• Acoustic sensors

• Flora population

• Plot-based surveillance monitoring

• Soil samples, leaf tissue samples, LAI, basal area


The challenge of harmonising diverse data


Ecosystem science data

• Messy

– Combination of human and sensor observations at different spatial and temporal extents

• Diverse

– As above, but also different types and formats

– Point, grid, time-series, one-off, wide geographical extent


Structural Growth Form


• Objective: combine data from different sources into usable and trusted information


Controlled vocabularies provide an opportunity to harmonise at different scales and across different domains


Harmonisation

General > Specific

GCMD Science Keywords

ANZ Fields of Research

– Platforms, Instruments

– Observed properties

– etc


Vocabularies are key

– Platforms, Instruments – TERN vocabularies, based on SOSA ontology, aligned with GCMD

– Spatial regions – Australia’s Bioregions (IBRA), Ecoregions, States and Territories

– Spatial resolution, Temporal resolution, Content type – GCMD terms

– UoM – QUDT ontology

– Observed properties – TERN vocabulary, RDF, aligned with EnvThes

– Methods/procedures – TERN vocabulary, RDF

– Organisations, Projects, People – TERN vocabularies, based on schema.org

GCMD: https://gcmdservices.gsfc.nasa.gov/static/kms/ (many also available through ANDS)

EnvThes: http://vocabs.lter-europe.net/edg/tbl/EnvThes.editor
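
As a rough illustration of what a controlled vocabulary does for a single metadata field, the sketch below maps free-text labels onto a preferred term and flags anything it does not recognise. The vocabulary contents, the alternative labels and the `normalise` helper are all hypothetical; they are not taken from GCMD, QUDT or any TERN vocabulary.

```python
# Hypothetical content-type vocabulary: preferred label -> accepted alternative labels.
# These entries are illustrative only, not real GCMD or TERN terms.
CONTENT_TYPE_VOCAB = {
    "NetCDF": {"netcdf", "nc", "netcdf4"},
    "CSV": {"csv", "comma separated values"},
}


def normalise(label, vocab):
    """Return the preferred term for a free-text label, or None if it is unknown."""
    key = label.strip().lower()
    for preferred, alternatives in vocab.items():
        if key == preferred.lower() or key in alternatives:
            return preferred
    return None


# Labels as a data submitter might type them.
submitted = ["netCDF ", "csv", "Excel"]
resolved = [normalise(s, CONTENT_TYPE_VOCAB) for s in submitted]
print(resolved)
```

A label that resolves to `None` would be a candidate for rejection at submission time or for review as a new vocabulary term.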


Ultimate FOI

• IBRA

• Ecoregions

• Climatic regions

Platform

• eddy covariance flux

Instrument

• Kipp and Zonen –

• Pyranometer

• CNR1

Observed properties

• Radiation

Procedure

• procedure used

Spatial resolution

• Point Resolution

temporal resolution

• 30 minutes

content type

• NetCDF

Data from Flux tower


Ultimate FOI

• IBRA

• Ecoregions

• Climatic regions

Platform

• Ecology sites

Instrument

• Clinometer

Observed properties

• Vegetation Height

Procedure

• Vegetation Canopy Height Assessment Method

Spatial resolution

• 100 meters - < 250 meters

temporal resolution

• One-off

content type

• CSV

Data from Field Ecology
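
The two examples above describe very different data with the same set of descriptors. A minimal sketch of why that helps, assuming each dataset is held as a simple record: the dictionary keys, the `search` helper and the choice of a single instrument value per record are hypothetical, while the values are copied from the two slides.

```python
# Each record uses the same descriptor keys (facets); values come from the slides.
flux_tower = {
    "platform": "eddy covariance flux",
    "instrument": "Pyranometer",
    "observed_property": "Radiation",
    "procedure": "procedure used",  # placeholder text as shown on the slide
    "spatial_resolution": "Point Resolution",
    "temporal_resolution": "30 minutes",
    "content_type": "NetCDF",
}

field_ecology = {
    "platform": "Ecology sites",
    "instrument": "Clinometer",
    "observed_property": "Vegetation Height",
    "procedure": "Vegetation Canopy Height Assessment Method",
    "spatial_resolution": "100 meters - < 250 meters",
    "temporal_resolution": "One-off",
    "content_type": "CSV",
}

datasets = [flux_tower, field_ecology]


def search(records, **criteria):
    """Return the records whose facets match every given criterion exactly."""
    return [r for r in records if all(r.get(k) == v for k, v in criteria.items())]


# Both records expose the same facets, so one query function covers both sources.
same_facets = set(flux_tower) == set(field_ecology)
csv_platforms = [r["platform"] for r in search(datasets, content_type="CSV")]
print(same_facets, csv_platforms)
```

Exact string matching only works here because every value is drawn from a controlled list; with free-text descriptions the same query would need fuzzy matching or manual curation.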


Process and Examples


To-Be Process


Viewer


Data Submission

Acknowledgement: TERN acknowledges initial development of the tool and documentation by the Australian Ocean Data Network (AODN) and the Institute for Marine and Antarctic Studies (IMAS).


GeoNetwork


Data Discovery Portal


Current Status

• Goal: Improve data submission capabilities

– Status: SHaRED v3.0 Pilot

– Status: Testing metadata migration and refining process

• Goal: Adopt or develop controlled vocabulary to describe platform, instruments, Observable properties, UoM, Spatial and temporal resolution, organisations and people.

– Status: Adopted GCMD terms for spatial and temporal resolution

– Status: Adopted QUDT terms for UOM*

– Status: Developed organisations and people

– Status: Work in Progress: platforms, instruments, Observable properties


tern.org.au

Acknowledgements:

TERN Vocabs: https://linkeddata.tern.org.au

Data Access: https://portal.tern.org.au

Data Visualisation: https://maps.tern.org.au

Cloud and Virtual desktop platform: https://coesra.tern.org.au

https://ecocloud.org.au