Architecture for Electronic Field Guides Robert A. Morris Robert D. Stevenson UMASS-Boston


Page 1

Architecture for Electronic Field Guides

Robert A. Morris

Robert D. Stevenson

UMASS-Boston

Page 2

Electronic Field Guides

• Why are we interested in semantic processing?

• What do we do now?

Page 3

Semantically based discovery

1. In what semantic categories does the provider metadata place the provider data?

2. In what semantic categories does the application place the query subject?

3. To what semantic categories does processing the ontology expand 1 and 2 and thereby inform the application which providers to query?
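
The three questions above can be sketched as a small lookup over an "is-a" hierarchy. This is only an illustration, not the EFG implementation. The concept and provider names are borrowed from the "Simple novice semantic web" slide, but the edges in `IS_A`, the assignments in `PROVIDER_CATEGORY`, and the function names are assumptions made for the example. For brevity it expands only the query concept upward, not the provider categories.

```python
# Hypothetical mini-ontology: child concept -> parent concept ("is-a" edges).
# The novice reading "a flower is a flowering plant" is assumed here.
IS_A = {
    "Flower": "Plant",
    "Mammal": "Animal",
    "Plant": "Taxon",
    "Animal": "Taxon",
}

# Question 1: the category in which each provider's metadata places its data.
# These assignments are assumptions for illustration only.
PROVIDER_CATEGORY = {
    "CalFlora": "Plant",
    "USDA Plants": "Plant",
    "MSW": "Mammal",
    "ITIS": "Taxon",
}


def ancestors(concept):
    """Return the concept followed by everything it is-a, walking upward."""
    chain = [concept]
    while concept in IS_A:
        concept = IS_A[concept]
        chain.append(concept)
    return chain


def providers_for(query_concept):
    """Question 3: providers whose category is the query concept or an ancestor."""
    wanted = set(ancestors(query_concept))
    return sorted(p for p, c in PROVIDER_CATEGORY.items() if c in wanted)
```

With these assumed tables, `providers_for("Flower")` returns `['CalFlora', 'ITIS', 'USDA Plants']` and `providers_for("Mammal")` returns `['ITIS', 'MSW']`: the application placed the query subject in one category (question 2), and the hierarchy widened it to the providers worth querying.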

Page 4

A discovery scenario

• What is the scientific name of the flower whose common name is Alpine Lily?

• Issues
– What is a scientific name?
– What is a flower?
– What is a common name?
– What is a lily?
– What is an Alpine Lily?
– What is an alpine lily?
– Who says so?

Page 5

Alpine Lily is

• Lilium parvum
– CalFlora, CalAcademy
– www.blackdown-lilies.org.uk
– enature.com
– (also: Fairy Lily, Sierran Tiger Lily)
– ("Sierra Tiger Lily" in USDA Plants hence ITIS hence GBIF COL)

• Lloydia serotina
– USDA Plants hence ITIS hence GBIF COL
– Gresham (OR) School Steen Mountain Checklist (1995)
– ("Alp lily" at www.fs.fed.us)
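
One way to keep the "Who says so?" question answerable is to store each name assertion together with its source. The sketch below is an assumption about how such a table might look, not a description of the prototype. The rows are transcribed from this slide, and `ASSERTIONS` and `scientific_names` are hypothetical names. Matching is deliberately case-sensitive, because the scenario treats "Alpine Lily" and "alpine lily" as different questions.

```python
from collections import defaultdict

# (common name, scientific name, source) rows transcribed from the slide.
ASSERTIONS = [
    ("Alpine Lily", "Lilium parvum", "CalFlora"),
    ("Alpine Lily", "Lilium parvum", "CalAcademy"),
    ("Alpine Lily", "Lilium parvum", "www.blackdown-lilies.org.uk"),
    ("Alpine Lily", "Lilium parvum", "enature.com"),
    ("Alpine Lily", "Lloydia serotina", "USDA Plants"),
    ("Alpine Lily", "Lloydia serotina",
     "Gresham (OR) School Steen Mountain Checklist (1995)"),
]


def scientific_names(common_name):
    """Map each scientific name asserted for common_name to its sources."""
    by_name = defaultdict(list)
    for common, scientific, source in ASSERTIONS:
        if common == common_name:  # exact match: case is significant
            by_name[scientific].append(source)
    return dict(by_name)
```

Calling `scientific_names("Alpine Lily")` yields two keys, `Lilium parvum` and `Lloydia serotina`, each with the list of sources that assert it, so the caller can decide whom to trust instead of receiving a single answer.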

Page 6

Alpine Lily

[Photo: Lilium parvum. © 2000 John Game, from CalFlora]

[Photo: Lloydia serotina. ©? Betty Ford Alpine Garden]

Page 7

Concepts vary

Concept | Taxonomist means | Field naturalist means | Novice means
Scientific Name | ICZN, ICBN, … | ICZN, ICBN, … | A Latin name
Common Name | NA | Local usage | Whatever field guides say it is
Flower | Part of an angiosperm | Part of an angiosperm or an angiosperm | A flowering plant or part of a flowering plant
Lily | Species in Liliaceae family | Species in Liliaceae family | Species in Lily family
Alpine Lily | NA | Lloydia serotina; Lilium parvum | Lloydia serotina; Lilium parvum
alpine lily | NA | Lily that grows in an alpine setting | ???

Page 8

Simple novice semantic web

[Diagram. Concept nodes: Taxon, Animal, Plant, Flower, Mammal. Provider nodes: CalFlora, USDA Plants, MSW, ITIS. Edge types: "is-a"; "is-a, deduced by rule"; "serves".]

Page 9

Where to find prototype

• http://lionhead.cs.umb.edu/wsdemo
– Work of Hui Dong
– Select "Integrated Scientific Name Service"

• Generally: http://www.cs.umb.edu/efg

Page 10

Goals of the EFG Project

• ID and descriptive data services on the web

• No assumptions about structure of characters or character states

• Biologists should not have to depend on informatics professionals

• Participate in federations of (biodiversity, neuroscience, ...) data sources

Page 11

Architecture of a UMASS Boston Electronic Field Guide

[Diagram. Components: Query Engine Servlet; Presentation Layer XSLT Engine (Xalan); Back End (OStore). Connection labels: internet; http GET; POST; query formulation; java; xml; JDOM; servlet forwarding; html, specialized XML, ...]
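
The request flow suggested by this architecture slide can be sketched roughly as follows. The slide names a Java servlet, JDOM, Xalan, and an OStore back end; this sketch swaps in Python's standard library for all of them, and a plain function stands in for the XSLT step. The wiring between components, the record layout, and every function name here are assumptions for illustration, not the project's code.

```python
import xml.etree.ElementTree as ET
from html import escape
from urllib.parse import parse_qs

# Stand-in for the back end (the slide names OStore); illustrative records only.
BACK_END = [
    {"name": "Lilium parvum", "family": "Liliaceae"},
    {"name": "Hermathena oweni", "family": "Riodinidae"},
]


def query_engine(query_string):
    """Parse an http GET query, filter the back end, return the hits as XML."""
    wanted = {key: values[0] for key, values in parse_qs(query_string).items()}
    root = ET.Element("results")
    for record in BACK_END:
        if all(record.get(key) == value for key, value in wanted.items()):
            taxon = ET.SubElement(root, "taxon")
            for key, value in record.items():
                ET.SubElement(taxon, key).text = value
    return ET.tostring(root, encoding="unicode")


def presentation_layer(xml_text):
    """Render result XML as HTML; a stand-in for the XSLT (Xalan) stage."""
    root = ET.fromstring(xml_text)
    items = "".join(
        "<li>{} ({})</li>".format(
            escape(taxon.findtext("name")), escape(taxon.findtext("family"))
        )
        for taxon in root.findall("taxon")
    )
    return "<ul>{}</ul>".format(items)


def handle_get(query_string):
    """Forward the query engine's XML to the presentation layer."""
    return presentation_layer(query_engine(query_string))
```

For example, `handle_get("family=Liliaceae")` returns `<ul><li>Lilium parvum (Liliaceae)</li></ul>`. Keeping XML as the hand-off between the two stages means the same query result could also be rendered as "specialized XML" by swapping only the presentation function.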

Page 12

EFG Web Services

• Provide
– descriptive pages
  • Idiosyncratic but simple schema
  • TDWG SDD Schema (alpha .01...)
– Interactive and machine-mediated remote or local identification tools
– Metadata (Darwin Core?)
– Georeferenced checklists (collaboratively with digital gazetteers (ADL DG))
– recording of field observations

Page 13

EFG Web Services

• Query schema
– cgi prop1=value1&prop2=value2..
– regular expressions (more or less) on prop=value atoms
– DiGIR (TDWG Access to Biological Collections Data, Initial protocol for GBIF Electronic Catalog of Names). Designed generally but implemented for specimen collections data applications.
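
The first two query styles above can be illustrated with a short matcher. This is a minimal sketch assuming that each `prop=value` atom is treated as a regular expression that must match the whole property value. The slide only says "regular expressions (more or less)", so the exact matching rule, the sample records, and the names `matches` and `search` are assumptions.

```python
import re
from urllib.parse import parse_qsl

# Illustrative records; the property names are hypothetical.
RECORDS = [
    {"genus": "Lilium", "species": "parvum"},
    {"genus": "Lloydia", "species": "serotina"},
]


def matches(record, query_string):
    """True if every prop=pattern atom fully matches the record's property."""
    for prop, pattern in parse_qsl(query_string):
        value = record.get(prop)
        if value is None or re.fullmatch(pattern, value) is None:
            return False
    return True


def search(query_string):
    """Return the records that satisfy all atoms of a cgi-style query."""
    return [record for record in RECORDS if matches(record, query_string)]
```

Here `search("genus=L.*&species=parvum")` returns only the Lilium parvum record, while `search("genus=L.*")` returns both. Note that URL decoding turns `+` into a space, so a pattern containing `+` would need to be percent-encoded in a real request.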

Page 14

EFG plans and projects

• Elaborate on multimedia keys
• Semantic discovery
• K12 field guide production
• Trusted access control, filtering, and integration, e.g. protect geolocations of endangered species
• Ontology of invasive species, especially geographically based (cf. gazetteers)
• Integrated common/scientific name services

Page 15

[Photo: Hermathena oweni Schaus (Riodinidae). © 2000 William Haber, from UMB EFG Project]