The Provenance Vocabulary
http://purl.org/net/provenance/

Olaf Hartig, Jun Zhao

W3C Provenance Incubator Group call, May 14, 2010

Brief Introduction to the Provenance Vocabulary (for W3C prov-xg)


Our Goal

● Integrate provenance metadata into the Web of data to enable information quality assessment

● Vocabulary to describe provenance of Linked Data on the Web

● Easy to use
    ● By people who provide Linked Data
    ● By developers of Linked Data publishing tools


Overview of the Vocabulary

● Defined as an OWL ontology

● Partitioned into:
    ● Core ontology
    ● Supplementary modules: types, integrity verification


[Figure: Overview of the Vocabulary]


Example

<> a prv:DataItem ;
    foaf:primaryTopic <http://example.org/gene/0030840> ;
    prv:createdBy [
        a prv:DataCreation ;
        prv:performedAt "2010-03-01...00:00"^^xsd:dateTime ;
        prv:performedBy <http://example.org/triplify> ;
        prv:usedGuideline _:a ;
        prv:usedData _:b ] .

<http://example.org/triplify> a prv:NonHumanActor ;
    rdfs:comment "Instance of Triplify V0.5" ;
    prv:operatedBy <http://olafhartig.de/foaf.rdf#olaf> .

The data item was created by a data creation process that was performed at a given time by the service ex:triplify, which is operated by myfoaf:olaf.
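To make the chain described above concrete, the following minimal Python sketch walks from a data item to the actors responsible for it. It assumes the example's triples are held as plain tuples rather than in an RDF library; the blank-node label `_:creation` and the helper names `objects` and `responsible_actors` are hypothetical and not part of the vocabulary.

```python
# Triples from the example, as plain (subject, predicate, object) tuples.
# "<>" stands for the described document; "_:creation" is a hypothetical
# label for the anonymous prv:DataCreation node.
TRIPLES = [
    ("<>", "rdf:type", "prv:DataItem"),
    ("<>", "prv:createdBy", "_:creation"),
    ("_:creation", "rdf:type", "prv:DataCreation"),
    ("_:creation", "prv:performedBy", "ex:triplify"),
    ("_:creation", "prv:usedGuideline", "_:a"),
    ("_:creation", "prv:usedData", "_:b"),
    ("ex:triplify", "rdf:type", "prv:NonHumanActor"),
    ("ex:triplify", "prv:operatedBy", "myfoaf:olaf"),
]


def objects(subject, predicate, triples=TRIPLES):
    """Return all objects of triples matching the given subject and predicate."""
    return [o for s, p, o in triples if s == subject and p == predicate]


def responsible_actors(data_item, triples=TRIPLES):
    """Follow prv:createdBy, then prv:performedBy, then prv:operatedBy."""
    actors = []
    for creation in objects(data_item, "prv:createdBy", triples):
        for performer in objects(creation, "prv:performedBy", triples):
            actors.append(performer)
            actors.extend(objects(performer, "prv:operatedBy", triples))
    return actors


print(responsible_actors("<>"))
```

A real consumer would more likely load the Turtle into an RDF store and run a query; the tuple list is used here only to keep the sketch self-contained.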


Example cont.

_:a a prvTypes:TriplifyConfiguration , prv:CreationGuideline ;
    prv:createdBy [
        a prv:DataCreation ;
        prv:performedBy <http://olafhartig.de/foaf.rdf#olaf> ] .

_:b a prv:DataItem ;
    prv:retrievedBy [
        a prv:DataAccess ;
        prv:performedAt "2010-03-01T12...00:00"^^xsd:dateTime ;
        prv:performedBy <http://example.org/triplify> ;
        prv:accessedService [
            a prv:DataProvidingService , prvTypes:JDBCService ;
            foaf:homepage <http://flybase.org/> ] ] .

The source data item _:b, which was used to create the data item described before, was retrieved by ex:triplify by accessing a public JDBC access point.


Design Principles

● Usability and understandability

● No specific granularity prescribed
    ● void:Dataset rdfs:subClassOf prv:DataItem
    ● rdf:Statement rdfs:subClassOf prv:DataItem

● Other vocabularies for more detailed descriptions of certain aspects
    ● OPMV, PML, HTTP vocab, Changeset, etc.

● Schema-level links to related vocabularies
    ● e.g. prv:Actor owl:equivalentClass foaf:Agent
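The granularity principle rests on the two rdfs:subClassOf statements listed above: a whole dataset and a single reified statement both count as a prv:DataItem. The sketch below illustrates that inference with a plain dictionary, assuming only those two axioms; the function name `is_data_item` is hypothetical, and a real application would leave this to an RDFS or OWL reasoner.

```python
# The two subclass axioms named on the slide, as child -> parent.
SUBCLASS_OF = {
    "void:Dataset": "prv:DataItem",
    "rdf:Statement": "prv:DataItem",
}


def is_data_item(cls, subclass_of=SUBCLASS_OF):
    """Return True if cls is prv:DataItem or reaches it via rdfs:subClassOf."""
    seen = set()
    while cls is not None and cls not in seen:
        if cls == "prv:DataItem":
            return True
        seen.add(cls)  # guard against cycles in the hierarchy
        cls = subclass_of.get(cls)
    return False


for name in ("void:Dataset", "rdf:Statement", "foaf:Person"):
    print(name, is_data_item(name))
```

Because both classes are covered, the same provenance properties can describe a full dataset or an individual triple without any change to the vocabulary.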


Provenance-enabled Publication

● Metadata components for widely used Linked Data publishing tools
    ● Triplify
    ● Pubby
    ● D2R Server

● Automatic generation of provenance metadata

● Simple configuration


Future Work

● Alignment with other provenance-related vocabularies and models

● Additional modules for specific aspects that are not covered by other vocabularies

● Integration in other publication tools


These slides have been created by Jun Zhao and Olaf Hartig

This work is licensed under a Creative Commons Attribution-Share Alike 3.0 License

(http://creativecommons.org/licenses/by-sa/3.0/)

Idea for a Mapping to OPM

Nodes: <>, ex:triplify, myfoaf:olaf, _:a, _:b

Candidate edge labels:

● wasGeneratedBy(prv:DataCreation) (on two edges)
● used(prv:usedData)
● used(prv:usedGuideline)
● wasControlledBy(prvTypes:DataCreatingService), or better wasControlledBy(prv:performedBy)?
● wasGeneratedBy(prv:retrievedBy), or better wasGeneratedBy(prv:DataAccess)?
● wasControlledBy(prv:DataAccessor), or better wasControlledBy(prv:performedBy)?
● wasControlledBy(prv:DataProvidingService), or better wasControlledBy(prv:accessedService)?
● wasControlledBy(prvTypes:DataCreatingEntity)
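One way to read the property-based candidates on this slide is as a lookup from Provenance Vocabulary properties to OPM edge types. The Python sketch below is only an illustration of that reading, not a settled mapping: the slide itself leaves the choice between class-based and property-based labels open, the entry for prv:createdBy is an added assumption (the slide labels that edge with the class prv:DataCreation), and the names `PRV_TO_OPM`, `to_opm_edges`, and `_:creation` are hypothetical.

```python
# Candidate mapping from Provenance Vocabulary properties to OPM edge types.
PRV_TO_OPM = {
    "prv:createdBy": "wasGeneratedBy",      # assumption: not listed on the slide
    "prv:retrievedBy": "wasGeneratedBy",
    "prv:usedData": "used",
    "prv:usedGuideline": "used",
    "prv:performedBy": "wasControlledBy",
    "prv:accessedService": "wasControlledBy",
}


def to_opm_edges(triples, mapping=PRV_TO_OPM):
    """Translate (subject, predicate, object) triples into
    (opm_edge, source, target) tuples, skipping unmapped predicates."""
    return [(mapping[p], s, o) for s, p, o in triples if p in mapping]


# A few triples from the earlier example; "_:creation" is a hypothetical
# label for the anonymous prv:DataCreation node.
example = [
    ("<>", "prv:createdBy", "_:creation"),
    ("_:creation", "prv:usedData", "_:b"),
    ("_:creation", "prv:performedBy", "ex:triplify"),
    ("ex:triplify", "rdfs:comment", "Instance of Triplify V0.5"),
]

for edge in to_opm_edges(example):
    print(edge)
```

Keeping the mapping in a single table makes it easy to swap in the class-based alternatives if those turn out to be the better fit.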