The role of persistent identifiers in tracking taxon changes

Andrew C. Jones, Richard J. White, Ewen R. Orme
School of Computer Science, Cardiff University, UK
{Andrew.C.Jones | R.J.White | E.R.Orme} @cs.cardiff.ac.uk

Jones, White & Orme. Tracking Taxon Changes (TDWG 2009)

The Catalogue of Life

[Diagram components: GSD; GSD; GSD; CAS; Web front-end; other software clients of Catalogue of Life (e.g. using it as their “taxonomic backbone”)]

CoL in use

CoL & LSIDs

Concepts that stay the same

Sci. name 1
Synonyms: Sci. name 2, Sci. name 3, Sci. name 4
  • Dynamic checklist lsid: urn:lsid:catalogueoflife.org:taxon:<uuid 1>:dc
  • Annual checklist lsid: urn:lsid:catalogueoflife.org:taxon:<uuid 1>:ac2009

Sci. name 1
Synonyms: Sci. name 2, Sci. name 3, Sci. name 4
  • Dynamic checklist lsid: urn:lsid:catalogueoflife.org:taxon:<uuid 1>:dc
  • Annual checklist lsid: urn:lsid:catalogueoflife.org:taxon:<uuid 1>:ac2010
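The LSIDs on this slide keep the same UUID while only the final component changes between the dynamic checklist (dc) and the annual checklists (ac2009, ac2010). A minimal Python sketch of how a client might exploit that, assuming the general LSID layout urn:lsid:<authority>:<namespace>:<object>[:<revision>]; the helper names and the UUID value are hypothetical, not part of any CoL software:

```python
from typing import NamedTuple, Optional


class Lsid(NamedTuple):
    authority: str
    namespace: str
    object_id: str
    revision: Optional[str]  # e.g. "dc" or "ac2009" in the slide's examples


def parse_lsid(text: str) -> Lsid:
    """Split urn:lsid:<authority>:<namespace>:<object>[:<revision>] into parts."""
    parts = text.strip().split(":")
    if len(parts) not in (5, 6) or [p.lower() for p in parts[:2]] != ["urn", "lsid"]:
        raise ValueError(f"not an LSID: {text!r}")
    revision = parts[5] if len(parts) == 6 else None
    return Lsid(parts[2], parts[3], parts[4], revision)


def same_concept(a: str, b: str) -> bool:
    """True when two LSIDs differ at most in their revision (checklist edition)."""
    return parse_lsid(a)[:3] == parse_lsid(b)[:3]


# Hypothetical UUID standing in for <uuid 1> on the slide.
UUID_1 = "11111111-2222-3333-4444-555555555555"
dynamic = f"urn:lsid:catalogueoflife.org:taxon:{UUID_1}:dc"
annual_2009 = f"urn:lsid:catalogueoflife.org:taxon:{UUID_1}:ac2009"
print(same_concept(dynamic, annual_2009))
```

Comparing everything except the revision is a design choice of this sketch: it treats the UUID as the concept identity and the suffix as the edition in which the concept was published.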

Evolving concepts in dynamic & annual checklist

Concepts shown:
  • Sci. name 1. Synonyms: Sci. name 2, Sci. name 3, Sci. name 4
  • Sci. name 1. Synonyms: Sci. name 3
  • Sci. name 2. Synonyms: Sci. name 4
  • Sci. name 1. Synonyms: Sci. name 3, Sci. name 5
  • Sci. name 2. Synonyms: Sci. name 4

Dynamic checklist lsids:
  • urn:lsid:catalogueoflife.org:taxon:<uuid 1>:dc
  • urn:lsid:catalogueoflife.org:taxon:<uuid 2>:dc
  • urn:lsid:catalogueoflife.org:taxon:<uuid 3>:dc
  • urn:lsid:catalogueoflife.org:taxon:<uuid 4>:dc
  • urn:lsid:catalogueoflife.org:taxon:<uuid 3>:dc

Annual checklist lsids:
  • urn:lsid:catalogueoflife.org:taxon:<uuid 1>:ac2009
  • urn:lsid:catalogueoflife.org:taxon:<uuid 4>:ac2010
  • urn:lsid:catalogueoflife.org:taxon:<uuid 3>:ac2010

Data integration and the CoL

• Two sources of information about species x:
    Do they refer to the same concept?
      • Same persistent identifier
    If not, how are the concepts related; what can we infer?
      • Different persistent identifiers
      • Needs something like TCS

Specimen data & changing concepts

Using data associated with changing concepts

Pipistrellus pipistrellus sensu stricto (Common Pipistrelle; 45 kHz)

Pipistrellus pygmaeus (Soprano Pipistrelle; 55 kHz)

Pipistrellus pipistrellus sensu lato (45 & 55 kHz) (Pre-1999)

Don't know which new species these observations relate to ...

… but still applicable to genus Pipistrellus
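The pipistrelle case can be sketched in Python: an observation recorded against a concept that was later split cannot be assigned to one successor, so the most specific safe answer is the taxon the successors share. The lookup tables and the function name below are hypothetical illustrations built from the names on the slide, not data from the CoL:

```python
from typing import Dict, List

# Hypothetical table: older concept -> concepts that replaced it.
SUCCESSORS: Dict[str, List[str]] = {
    "Pipistrellus pipistrellus sensu lato": [
        "Pipistrellus pipistrellus sensu stricto",
        "Pipistrellus pygmaeus",
    ],
}

# Hypothetical table: concept -> parent taxon.
PARENT: Dict[str, str] = {
    "Pipistrellus pipistrellus sensu lato": "Pipistrellus",
    "Pipistrellus pipistrellus sensu stricto": "Pipistrellus",
    "Pipistrellus pygmaeus": "Pipistrellus",
}


def safest_current_taxon(recorded_concept: str) -> str:
    """Return the most specific current taxon a record can still be tied to."""
    successors = SUCCESSORS.get(recorded_concept, [])
    if not successors:
        return recorded_concept      # no recorded change: keep the concept
    if len(successors) == 1:
        return successors[0]         # one-to-one replacement
    parents = {PARENT[s] for s in successors}
    if len(parents) == 1:
        return parents.pop()         # split: fall back to the shared parent
    raise LookupError(f"successors of {recorded_concept!r} share no single parent")


print(safest_current_taxon("Pipistrellus pipistrellus sensu lato"))
```

This sketch only climbs one level; a real hierarchy would need a walk up to the lowest common ancestor.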

Worse still …

• Though CoL taxa have precise circumscription when defined …

• … it is difficult to know that concept precisely when applying a CoL persistent identifier

• Identification keys for CoL taxa?

Capturing taxon concept changes

• Changed persistent identifiers from source databases; or

• Detecting changes by comparison
    • Same synonyms, parent taxon, etc.?
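Detection by comparison could look roughly like the following Python sketch, which compares the accepted name, synonym set and parent taxon of records that share an identifier across two checklist editions. The record layout, the identifier "id-1" and the parent "Genus A" are assumptions made for illustration:

```python
from dataclasses import dataclass
from typing import Dict, FrozenSet, List


@dataclass(frozen=True)
class TaxonRecord:
    accepted_name: str
    synonyms: FrozenSet[str]
    parent: str


def changed_fields(old: TaxonRecord, new: TaxonRecord) -> List[str]:
    """Names of the compared fields whose values differ between two records."""
    return [
        field
        for field in ("accepted_name", "synonyms", "parent")
        if getattr(old, field) != getattr(new, field)
    ]


def diff_editions(
    old: Dict[str, TaxonRecord], new: Dict[str, TaxonRecord]
) -> Dict[str, List[str]]:
    """Map each identifier present in both editions to the fields that changed."""
    changes: Dict[str, List[str]] = {}
    for identifier in old.keys() & new.keys():
        fields = changed_fields(old[identifier], new[identifier])
        if fields:
            changes[identifier] = fields
    return changes


# Hypothetical editions: the taxon loses two synonyms between them.
edition_a = {
    "id-1": TaxonRecord(
        "Sci. name 1",
        frozenset({"Sci. name 2", "Sci. name 3", "Sci. name 4"}),
        "Genus A",
    )
}
edition_b = {"id-1": TaxonRecord("Sci. name 1", frozenset({"Sci. name 3"}), "Genus A")}
print(diff_editions(edition_a, edition_b))
```

A non-empty result for an identifier would suggest that the concept behind it has changed and may need a new persistent identifier.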

Representing the changes

• Persistent identifier metadata
    • Taxon concept relationships, e.g. isCongruentTo; includes; overlaps

• Granularity?
    • Many species changed due to underlying cause, e.g. splitting a genus?
    • Higher taxa need relationship metadata too
    • Additional explanatory metadata attached to species (set of relationships between relevant higher taxa)?
    • Explicit representation of the actions leading to change, e.g. “split”, “merge” & “transfer”?
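As a rough illustration of relationship metadata, the Python sketch below derives a relationship label by comparing the sets of names two concepts cover. Using name sets as a stand-in for circumscriptions is an assumption of this sketch, and the labels isIncludedIn and excludes are added here for completeness; only isCongruentTo, includes and overlaps appear on the slide:

```python
from typing import AbstractSet


def relate(a: AbstractSet[str], b: AbstractSet[str]) -> str:
    """Label how concept a relates to concept b, judged by the names they cover."""
    if a == b:
        return "isCongruentTo"
    if a > b:                 # a covers every name in b, plus more
        return "includes"
    if a < b:                 # b covers every name in a, plus more
        return "isIncludedIn"
    if a & b:                 # some names shared, neither contains the other
        return "overlaps"
    return "excludes"


# Hypothetical name sets: one broad concept split into two narrower ones.
before = {"Sci. name 1", "Sci. name 2", "Sci. name 3", "Sci. name 4"}
after_1 = {"Sci. name 1", "Sci. name 3"}
after_2 = {"Sci. name 2", "Sci. name 4"}
print(relate(before, after_1), relate(after_1, after_2))
```

A set comparison yields only the resulting relationships; it does not record whether a "split", "merge" or "transfer" produced them, which is why the slide asks about representing actions explicitly.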

Issues for discussion

• Differing perspectives of users, providers (and computer scientists)

• Need for conventions in describing evolving checklists

• Metadata describing actions, not just set relationships?

• Services to support data integration exploiting persistent identifiers

• When does a concept really change?

Some URLs ...

• 4D4Life project: http://www.4d4life.eu

• 4D4Life questionnaire: http://biodiversity.cs.cf.ac.uk/4d4life/