Realizing the GPRAMA using Government Linked Data

Preview:

DESCRIPTION

This presentation was given at the 2011 DoD symposium on SOA & Semantic Technology, and demonstrates the use of open standard metadata tags to implement the Government Performance and Results Act Modernization Act (GPRAMA) using topical examples like cloud computing, and the meaningful use of electronic health record exchanges.

Citation preview

Realizing the GPRAMA using Government Linked Data

George Thomas, HHS

DoD 2011 SOA & Semantic Technology Symposium

2011-07-13, 3:45-4:20pm, standards (green) track

2

About me…

• HHS OCIO – Office of Enterprise Architecture– Working on a variety of (mostly ACA related) modernization projects

• Data.gov PMO Semantic Web / Linked Data lead– TPC: open to all! – Send me an email if interested in participating…

• Clinical Quality Linked Data (CQLD)– With the Centers for Medicare and Medicaid Services (CMS)– See CQLD blog post on health.data.gov

• W3C Government Linked Data Working Group (GLD)– Co-chair, member only, see W3C GLD wiki– Focused on SW standards and best practices for OGD

• Graduate School– SOA Instructor– Part of their EA Certificate program

3

This Presentation

• Gov Performance & Results Act Modernization Act– GPRAMA overview

• IT Dashboard– Capital Planning Investment Control (CPIC) data

• Object Management Group Standards– Business Motivation Model (BMM)

• ‘Bizmo’ Linking Vocabulary– CPIC + BMM

• Agency Data Creation and Publication– Empowering content owners, exposing machine readable data

• Freebase Demo– Finding IT investments that support Federal Gov Goals

• Data Syndication and Aggregation– Architectures and tool examples

4

GPRAMA Brief Overview

• Strategic Planning– Qualifying the Ends

• Performance Planning– Quantifying the Means

• Reporting– Annual to quarterly

• Federal Priority Goals– Cross Agency, Government-wide

• Agency Priority Goals– Intra Agency

• (Formalizing new/existing Organizational Roles)• (Training…)

5

itdashboard.gov -> HHS Ex300

• Selecting columns to populate CSV

6

OMG: BMM + CPIC (+ PM + RM) = MPG

Model for Performance Driven Government

Business Motivation Model (BMM)

8

BMM Terms

• A desired result is a generalization of goals and objectives – A goal is something an Org is trying to achieve– An objective quantifies a goal, specifying timing and

measurement• A course of action is something an Org does to achieve a

desired result– A strategy is a broad, lasting course of action– A tactic is a narrow, fleeting course of action

• An influencer is something that can affect the Org’s ability to achieve its goals or implement its strategies

– An actuator is an influencer that can be considered as a quantity that can increase or decrease over time

• An assessment is a judgment of an influencer’s affect on an Org– Strengths, Weaknesses, Opportunities and Threats are

common kinds of assessments (SWOT)

9

BMM Business View: Means to Ends, Assets

10

OWL version of BMM

11

HHS Creates BMM Instance Model

12

HHS Publishes BMM Instance Data

13

Bizmo ‘Bridge’ Ontology in TBC-FE

14

Key /bizmo#Classes

15

Key Interlinking /bizmo#properties

– #supports properties link CPIC ex53/ex300’s• to BMM Strategies, Tactics, Goals, Objectives, etc.

– extending the Ex53/300 specs – without changing them!

– #maintainsExhibit53/300 link OrganizationalUnit’s• to CPIC investment information

16

Browsing SMW KB via Classes

17

Ontology Classes/Properties = Tags

• Create semantic annotations (part of wysiwyg editor tools)– auto-completion suggests tags to reuse from ontologies that have

been imported into the wiki (note existing tags from BMM ontology)

18

NHIN RDF/XML IEP: Export Excerpt

• Wiki instance data and metadata curation (SME edits) maintain class and property specs of ontologies imported off the Web when exported or accessed by other sites/users/apps

19

Browsing Linked Datasets on SMW

• Filtering through properties defined by Bizmo ontology• finds CPIC instance data that is linked to BMM instance data

20

SDW = SNS enabled LOD• This presentation is summarized by the interactive data below• And - it’s a Wiki – SME’s can easily add annotations and data!

21

Freebase Demo (1 of 4) – CPIC+BMM

• Using a modern browser, search for ‘Exhibit 53’– On http://freebase.com/labs/parallax

• Select ‘Exhibit 53 collection (2 topics)’ which takes you to;– http://www.freebase.com/labs/parallax/browse.html?type=%2Fbase%2Fbizmo%2Fe53

22

Freebase Demo (2 of 4) – CPIC+BMM

• Two ‘topic’ (instances of type Exhibit 53) are returned– From the schemas defined in the bizmo ‘base’

• One HHS and one GSA Exhibit 53– Data sourced from itdashboard.gov extracts (circa ~2009 – perseverance!)

• Click on ‘Contains’ (on the right) in the ‘Connections’ browser– Faceted browsing, RDF properties show up as ‘connections’…

23

Freebase Demo (3 of 4) – CPIC+BMM

• Three ‘topic’ (instances of type Exhibit 53 Recordset) are returned– All from HHS, these are Ex53 ‘row’ entries (note other facets on the left)

• Which of these supports an Administration Goal?– Click on ‘more connections’, then type ‘goal’ to filter properties on the fly

• Click on the ‘Supports Federal Goal’ property link (on the right)– in the ‘connections’ browser to filter the Exhibit 53 Recordsets

24

Freebase Demo (4 of 4) – CPIC+BMM

• One ‘topic’ (instance of type Goal) is returned– The Ex53 Recordset entry for the National Health Information Network

• Which links to an Administration Goal– ‘Health Care Reform’ (these are notional/exemplary instances…)

• You’ve just browsed from all Ex53 entries to a specific entry– Via the connections in the RDF Schema (as described previously)

25

Google’s PubSubHubBub (PuSH)

Hubs

Subscribers

Publishers

26

Google’s PubSubHubBub (PuSH)

• A feed URL (a "topic") declares its Hub server(s) in its Atom or RSS XML file, via <link rel="hub" ...>.

• The hub(s) can be run by the publisher of the feed, or can be a community hub that anybody can use. (Atom and RssFeeds are supported)

• A subscriber (a server that's interested in a topic), initially fetches the Atom URL as normal. If the Atom file declares its hubs, the subscriber can then avoid lame, repeated polling of the URL and can instead register with the feed's hub(s) and subscribe to updates.

• The subscriber subscribes to the Topic URL from the Topic URL's declared Hub(s).

• When the Publisher next updates the Topic URL, the publisher software pings the Hub(s) saying that there's an update.

• The hub efficiently fetches the published feed and multicasts the new/changed content out to all registered subscribers.

27

Sindice.com – SemWeb Index++

• Structured Data– RDF crawler

• Register URL’s– Manually– Automated

• Using Ping Submission API

• Check out– JS Widgets and

Sig.ma interface

28

Virtuoso Sponger

• Crawl/Ingest– Multiple

methods

• ‘Swiss Army Knife’

– Structured and unstructured formats

• Search– keyword, facet

29

Conclusions and Recommendations

• Linked Metadata– Bizmo relates/links – CPIC (what) with Motivation (why)

• Published Instance Data– Structured data – Gov Linked Data

• Decentralized, Federated Publishing– Agencies use light weight tools– Self-assessments and passbacks

• Syndication and Aggregation– Pull, push, feeds– Dashboard.data.gov?

30

Thank You! Questions?

• george dot thomas 1 at hhs dot gov• @georgethomas