Experiences Developing a User-centric Presentation of A Domain-enhanced Provenance Data Model

Cynthia Chang (1), Stephan Zednik (1), Chris Lynnes (2), Peter Fox (1), Deborah McGuinness (1), Gregory Leptoukh (2), Jianfu Pan (3)

1. Tetherless World Constellation, Rensselaer Polytechnic Inst., Troy, NY, United States

2. NASA Goddard Space Flight Center, Greenbelt, MD, United States

3. Adnet Systems, Inc.

IN43C-06




Giovanni Earth Science Data Visualization & Analysis Tool

• Developed and hosted by NASA/GSFC

• Multi-sensor and model data analysis and visualization online tool

• Supports dozens of visualization types

• Generate dataset comparisons

• ~1500 Parameters

• Used by modelers, researchers, policy makers, students, teachers, etc.


Challenge

• Giovanni allows users to run analyses with most of the data processing performed before or in Giovanni

• This is the primary value add of Giovanni, but it presents the possibility of users generating and using results they do not fully understand

• We are challenged to instrument the system to help users understand results


Challenge (cont.)

Particularly when the fitness for use of the results may be in question…

• Unexpected and unexplained anomalies possible in analysis and comparison results

• Users may not have the expertise to recognize or understand anomalous results

• Many anomalies/biases are the result of complex sets of interrelated conditions


South Pacific Anomaly

[Figure; label: Anomaly]


…is caused by an Overpass Time Difference


Multi-Sensor Data Synergy Advisor (MDSA)

• Construct knowledge base of service operations and inputs
  – Giovanni service operations to be performed
  – Provenance of selected data sources

• Analyze knowledgebase for potential or known anomalies
  – Expert rules
    • Utilize provenance, science, and data processing info
  – Semantic comparison of data source and processing provenance properties

• Advise
  – Compile an advisory report for the user
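The construct / analyze / advise steps above can be sketched in Python. This is a minimal illustration under stated assumptions, not the MDSA implementation: the dictionary-based knowledge base, the `Advisory` record, the `overpass_time_rule` function, the `overpass_hour` property, the source names, and the one-hour threshold are all hypothetical and chosen only to mirror the overpass time example from the earlier slide.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional

# Hypothetical knowledge base: one record of numeric properties per data source.
KnowledgeBase = Dict[str, Dict[str, float]]


@dataclass
class Advisory:
    rule: str      # name of the rule that fired
    message: str   # text shown to the user in the advisory report


# An expert rule inspects the knowledge base and may return an advisory.
Rule = Callable[[KnowledgeBase], Optional[Advisory]]


def overpass_time_rule(kb: KnowledgeBase,
                       max_gap_hours: float = 1.0) -> Optional[Advisory]:
    """Illustrative rule: flag sources whose overpass times differ widely.

    The threshold is an assumption made for this sketch.
    """
    times = {name: props["overpass_hour"]
             for name, props in kb.items() if "overpass_hour" in props}
    if len(times) < 2:
        return None  # nothing to compare
    gap = max(times.values()) - min(times.values())
    if gap > max_gap_hours:
        return Advisory(
            rule="overpass_time_difference",
            message=f"Overpass times differ by {gap:.1f} h; "
                    "comparison results may show artifacts.",
        )
    return None


def advise(kb: KnowledgeBase, rules: List[Rule]) -> List[Advisory]:
    """Run every rule over the knowledge base and collect the advisories."""
    advisories = []
    for rule in rules:
        result = rule(kb)
        if result is not None:
            advisories.append(result)
    return advisories


# Step 1: construct the knowledge base (values are illustrative only).
kb = {
    "source_a": {"overpass_hour": 10.5},
    "source_b": {"overpass_hour": 13.5},
}
# Steps 2 and 3: analyze with the expert rules and compile the report.
report = advise(kb, [overpass_time_rule])
for advisory in report:
    print(f"[{advisory.rule}] {advisory.message}")
```

Keeping each rule as an independent function means new expert rules can be added to the list without touching the code that compiles the report.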


Multi-Domain Knowledgebase

• Capture Giovanni and Data Source provenance

• Associate provenance entities with domain metadata described by independent domain models

• Expert Rules operate over multi-domain knowledgebase, test for conditions that may affect result’s fitness for use


Advisor Knowledge Base

Advisor Rules test for potential anomalies and create associations between service metadata and anomaly metadata in the Advisor KB


Advisor Presentation Requirements

• Present metadata that can affect fitness for use of result

• When analysis is a comparison

– Make obvious which properties are comparable

– Highlight differences where present

• Present visuals (where possible) and descriptive text for any anomalies predicted by expert ruleset

• Presentation must be understandable by Earth Scientists


Advisory Report

• Tabular representation of the semantic equivalence of comparable data source and processing properties

• Advise of and describe potential data anomalies/bias
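A tabular comparison of this kind can be sketched as follows. This is only an illustration: the function names `compare_sources` and `format_table`, the property names, and the sample values are hypothetical, and plain string equality stands in for the semantic equivalence test the slide refers to.

```python
from typing import Dict, List, Sequence, Tuple

Row = Tuple[str, str, str, str]


def compare_sources(a: Dict[str, str], b: Dict[str, str]) -> List[Row]:
    """Return one row per property: (property, value_a, value_b, status).

    String equality is an assumption standing in for semantic equivalence.
    """
    rows: List[Row] = []
    for prop in sorted(set(a) | set(b)):
        va, vb = a.get(prop), b.get(prop)
        if va is None or vb is None:
            status = "not comparable"   # property missing on one side
        elif va == vb:
            status = "same"
        else:
            status = "DIFFERENT"        # highlighted for the user
        rows.append((prop, va or "-", vb or "-", status))
    return rows


def format_table(rows: Sequence[Row],
                 headers: Row = ("property", "source A", "source B", "status")) -> str:
    """Render the rows as a fixed-width plain-text table."""
    table = [tuple(headers)] + list(rows)
    widths = [max(len(row[i]) for row in table) for i in range(len(headers))]
    lines = ["  ".join(cell.ljust(w) for cell, w in zip(row, widths)).rstrip()
             for row in table]
    return "\n".join(lines)


# Illustrative property values only; not taken from any real dataset.
source_a = {"overpass_time": "10:30", "processing_level": "L3"}
source_b = {"overpass_time": "13:30", "processing_level": "L3",
            "spatial_resolution": "1 degree"}

rows = compare_sources(source_a, source_b)
print(format_table(rows))
```

Listing every property, including those present on only one side, makes it visible which properties are comparable before any difference is highlighted.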


Giovanni Provenance Visualization Requirements

• Exercise existing provenance visualization tools to show Giovanni processing provenance

• Visualization tool must support access to multi-domain metadata knowledgebase (not just provenance metadata)

– Science metadata adds domain context to entities in the provenance trace

• Presentation must be understandable by Earth Scientists
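One way to picture science metadata adding domain context to a provenance trace is the sketch below. It is an assumption-laden illustration, not the visualization tool used in the project: the `describe_trace` function, the `derived_from` and `domain_metadata` dictionaries, and all entity names and values are hypothetical, and the trace is assumed to be acyclic.

```python
from typing import Dict, List


def describe_trace(entity: str,
                   derived_from: Dict[str, List[str]],
                   domain_metadata: Dict[str, Dict[str, str]],
                   depth: int = 0) -> List[str]:
    """Walk the provenance trace from a result back to its sources.

    Each entity is labeled with any domain metadata attached to it.
    Assumes the derivation graph has no cycles.
    """
    meta = domain_metadata.get(entity, {})
    context = ", ".join(f"{key}={value}" for key, value in sorted(meta.items()))
    label = f"{entity} [{context}]" if context else entity
    lines = ["  " * depth + label]
    for source in derived_from.get(entity, []):
        lines.extend(describe_trace(source, derived_from, domain_metadata, depth + 1))
    return lines


# Provenance only: which entity was derived from which (hypothetical names).
derived_from = {
    "comparison_result": ["regridded_a", "regridded_b"],
    "regridded_a": ["source_a"],
    "regridded_b": ["source_b"],
}
# Domain metadata kept in a separate model and joined by entity name.
domain_metadata = {
    "source_a": {"sensor": "sensor A", "overpass_time": "10:30"},
    "source_b": {"sensor": "sensor B", "overpass_time": "13:30"},
}

trace = describe_trace("comparison_result", derived_from, domain_metadata)
print("\n".join(trace))
```

Keeping the domain metadata in its own dictionary mirrors the idea of independent domain models that are associated with provenance entities rather than embedded in them.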


Domain-integrated Provenance Visualization


Domain-integrated detail view


Conclusion, Q&A

• Different provenance needs = different provenance presentation requirements

• Science User feedback
  – “Advisory report is very helpful”
  – “Too much provenance metadata to dig through, I need a simplified abstraction/view”

• Future Work
  – Advisor suggestions to correct for potential anomalies
  – Views/abstractions of provenance based on specific user group requirements
  – Continued iteration on visualization tools based on user requirements


References

• System Transparency, Or How I Learned to Worry about Meaning and Love Provenance!, Zednik, S., Fox, P., McGuinness, D., IPAW 2010, to appear in Springer conference proceedings.


Links

• Giovanni Earth Science Data Analysis Tool
  – http://disc.sci.gsfc.nasa.gov/giovanni/ (Production site)
  – http://giovanniplus-ts1.sci.gsfc.nasa.gov/daac-bin/G3/gui.cgi?instance_id=MDSA-case1 (MDSA site)

• MDSA
  – http://tw.rpi.edu/web/project/MDSA (Project site)

• PML
  – http://inference-web.org (Inference Web)
  – http://inference-web.org/2007/primer/ (PML Primer, 2007)