Upload
ljs-infodocket
View
217
Download
0
Embed Size (px)
Citation preview
8/20/2019 Data in San Francisco: Meeting supply, spurring demand
1/40
Data in San Francisco:Meeting supply, spurring demand
City and County of San Francisco
Mayor Edwin M. Lee
Joy Bonaguro, Chief Data Officer
July 31, 2015
8/20/2019 Data in San Francisco: Meeting supply, spurring demand
2/40
Table of Contents
1. Executive Summary
2. Mission, Vision and Approach
3. Looking Back: The Year in Review
4. Looking Forward: Year 2 Goals and Strategies
Overview of Approach and Goals
Goal 1. Make timely data easily available
Goal 2. Improve the usability, quality and consistency of our data
Goal 3. Support increased use of data in decision-making
Goal 4. Identify and foster innovations in open data and data use
Goal 5. Continuously improve, scale, maintain and monitor our work
5. Priority, Resource and Contingency Analysis
6. Conclusion
Appendices
Appendix A. Acknowledgements
Appendix B. Detailed Accomplishments in Year 1
Appendix C. Quarterly Milestones for Year 2
Appendix D. Crosswalk between plan and Open Data Policy
Data in San Francisco: Meeting supply, spurring demand - Return to Top 2 of 40
8/20/2019 Data in San Francisco: Meeting supply, spurring demand
3/40
1. Executive Summary
Our Mission and Vision
At DataSF, we are working to transform the way the City works through the use of data. Our
mission is to empower use of the City’s Data. Our vision is that the City’s data is understood,documented, and of high quality. The data is published so that it is usable, timely, and
accessible, which supports broad and unanticipated uses of City data. City employees have the
skills and capacity to collect, manage, and use data effectively and efficiently across its lifecycle.
The Ultimate Impact of Our Work
Through the dissemination and use of City data, we can:
● Improve City services for residents and businesses,
● Generate jobs and economic activity and
● Increase resident engagement and empowerment.
These in turn support increased quality of life and work for San Francisco residents, employers,
and employees.
Our Key Accomplishments in Year 1
Below are some of our key accomplishments in Year 1. Section 3 of this document goes into
greater detail for each goal area.
Completed the dataset inventory
Our core charge in Year 1 was completing a dataset inventory to list all of the datasets in each
department. This was an immense task and took up a great deal of our effort and time in the last
year. Our Department Data Coordinators were key to this task and without them it would not
have been possible. Learn more in our blog post on the inventory.
Relaunched our open data portal and created a web home for DataSF
Our web presence needed a total overhaul to ensure that we could better support our users,
whether seeking data or working to publish it. In addition to the open data portal, we needed to
create enduring resources like our publishing and coordinators portal as well as our resource
library and blog. Learn more in our blog post on the redesign.
Standardized publishing methods and metadata requirements
Standardizing the publication of datasets ensures high quality publishing over time. Consistent
information about published datasets makes the data easier to use, fostering more and better
use of the data. We took into account best practices from around the world and the tailored
them to San Francisco to ensure quality publishing. Learn more in our blog post on metadata.
Established a Citywide open data license for published data
The City needed a licensing strategy designed for data. A single license reduces ambiguity for
users and ensures that our data can be fully leveraged by individuals and companies alike. We
officially adopted the Public Domain Dedication License (PDDL) to meet the particular needs of
Data in San Francisco: Meeting supply, spurring demand - Return to Top 3 of 40
http://datasf.org/blog/u-heart-metadata/http://datasf.org/blog/building-lighter-and-faster/http://datasf.org/blog/5-ways-to-scale-mountain-of-data/
8/20/2019 Data in San Francisco: Meeting supply, spurring demand
4/40
open data. Learn more in our blog post on PDDL.
Launched the Housing Data Hub
The Housing Data Hub, http://housing.datasf.org/ , is a single place to learn about affordable
housing data programs in San Francisco and the administrative data behind them - visualized
and easy to use. This was our first strategic release - the bundling of open data publication withproducts that put the data to immediate use. Learn more in our blog post on strategic releases.
Launched the Data Academy
Working in partnership with the City Services Auditor, we launched a training program that
covers the whole lifecycle of data - from planning, collection, management, analysis to design
and publishing. Classes are booked out and demand is insatiable. Read about Data Academy.
Developed a strategy to improve confidential data sharing
Internal confidential data sharing is hampered by a legal thicket and poorly integrated technical
systems. Working in partnership with the City Services Auditor and more than a dozen City
departments, we put together a strategy to promote data sharing that is efficient, effective,
consistent, secure, and appropriate.
Advocated for and obtained additional resources
Our resource strategy for Year 1 was to 1) seek institutional homes and partners for our work
and 2) pursue dedicated resources where appropriate and with good justification. This time last
year, we were a team of one. We doubled our team with the role of the Open Data Program
Manager last fall. And during the year, we put together business cases to double yet again with
new roles to support 1) open data services and 2) support execution of our confidential data
sharing project. We will continue to work closely with key partners around the City doing similar
work.
Our Roadmap for Year 2: From Foundation to Use
Our Year 1 plan was about building a foundation for the future and creating the institutional1
support to grow use and dissemination of data in San Francisco. In Year 2, we need to build
upon that foundation and ensure a ready and predictable supply of data that is addressing data
gaps and needs. If last year was about building the house, this year is about moving in and
throwing a big house-warming.
Year 2 Goals and Subgoals
For Year 2, we are structuring our work around five core goals and subgoals as needed.
Goal Subgoals (where appropriate)
Goal 1. Make timely data easily
available
1. Increase number and timeliness of
datasets on SF OpenData
2. Enable use of private data, while
appropriately protecting it
1 Read “Open Data in San Francisco: Institutionalizing an Initiative
” via google docs.
Data in San Francisco: Meeting supply, spurring demand - Return to Top 4 of 40
https://docs.google.com/document/d/1hvp_wls8KuJrfHW_NwX1qtyFR4EFdWCkxcULnNlhKNw/edit?usp=sharinghttp://datasf.org/academy/http://datasf.org/blog/housing-data-hub-launched/http://housing.datasf.org/http://datasf.org/blog/data-license-liberation-day/
8/20/2019 Data in San Francisco: Meeting supply, spurring demand
5/40
3. Streamline internal data access
Goal 2. Improve the usability, quality
and consistency of our data
Goal 3. Support increased use of datain decision-making 1. Increase internal capacity2. Support public capacity
3. Foster and incent a data culture
Goal 4. Identify and foster innovations
in open data and data use
Goal 5. Continuously improve, scale,
maintain and monitor our work
As we execute on these goals and supporting strategies we look forward to reporting on key
accomplishments next year. Below are a handful of accomplishments we plan to achieve thisyear:
● Fully deployed data automation as a service to ease data publication
● Deployed better, friendlier publishing for geographic data
● Identified methods to crowdsource collective intelligence about published datasets
● Launched new transparency websites
● Engaged our broader community around a handful of key issues or datasets
● Developed “Data Concierge” to streamline internal data access for City employees
● Established center to facilitate and standardize confidential data sharing
● Began to systematically tackle data quality
● Developed Data Academy into a professional development strategy● Enriched our data through effective storytelling
We encourage you to visit our website, at datasf.org/about to track our progress over the next
year. We will post quarterly reports on our strategic plan, including updates and revisions. You
can also view our publishing progress at datasf.org/progress.
Data in San Francisco: Meeting supply, spurring demand - Return to Top 5 of 40
http://datasf.org/progress/http://datasf.org/about/
8/20/2019 Data in San Francisco: Meeting supply, spurring demand
6/40
2. Mission, Vision and ApproachOur mission is to empower use of the City’s Data. Our vision is that the City’s data is
understood, documented, and of high quality. The data is published so that it is usable, timely,
and accessible, which supports broad and unanticipated uses of our data. City employees have
the skills and capacity to collect, manage, and use data effectively and efficiently across itslifecycle.
Like our Year 1 plan, our Year 2 plan is ambitious. To execute on our plan we will adhere to
some core approaches for how we manage our work:
1. Say no to perfection. We don’t have enough time for perfect. Something is better than
nothing and you can always improve it as you learn more.
2. Fail early and often. Failing is ok - not learning from a failure is not ok. Small
experiments, failed or successful inform our next steps.
3. Plan for the future. Create infrastructure and systems for future growth - but solve
immediate problems and pain points along the way
4. Use long division. If a problem seems too big, break it into manageable bits. There’s
always a hook or a starting point to move something forward.
5. No ugly, old IT. We leverage existing, modern, and light-weight tools and we want our
designs to be beautiful, inviting but also a little fun.
6. Use storytelling and data. We must work to find the people in the data and tell their
story. Data without people is just academic.
7. Seek institutional homes. Distribute, share and foster excellence. While we may
incubate programs, ideas or projects, we ultimately need to find a full-time home.
8. Learn to infinity and listen with humility. Continuously learn from ourselves and
others and build on existing frameworks. “Not invented here” attitudes are strictly
prohibited.
9. Start with problems, move to opportunities. We start with people's needs and
problems but also use the chance to show them some cool, new stuff for the future.
10. If we don’t start now, we’ll never get there. We don’t want to look back in five years
and think “if we had just…”. Every shady street started with a row of saplings.
Data in San Francisco: Meeting supply, spurring demand - Return to Top 6 of 40
8/20/2019 Data in San Francisco: Meeting supply, spurring demand
7/40
3. Looking Back: The Year in Review
Summary and Reflections
Building the Foundation
In Year 1 our focus was defining the scope of the program, identifying and developing keypartnerships, and of course, building out the programmatic infrastructure, including core
services, business processes, and roles and responsibilities. The work we completed in the last
year provides the foundation upon which we will build our data work for the City. While the
foundation is not yet complete, we have made tremendous progress.
It Takes a Village
A huge portion of that progress is due to key partnerships with the Department of Technology
(in particular, the DT GIS team for data automation and services) and the Controller’s City
Services Auditor (for a variety of projects). These partnerships allowed us to execute on several
components of our strategic plan that were not resourced at the start of last year. We expect
these partnerships to grow and strengthen over the next year and we are already exploring newpartnerships within and outside the City.
Much of our open data work this year would not have been possible without our Data
Coordinators. Our coordinators were essential in conducting major aspects of our Year 1 plan,
including the dataset inventory that lists all datasets held by the City and County. The effort and
quality of their contribution cannot be understated. Thank you Data Coordinators!
We also received a huge infusion of talent and energy from our interns and graduate students
throughout the year. And the partnership that has emerged with our local Code for America
Brigade, Code for San Francisco, has been invaluable - not only with projects but as a means of
keeping us real.Lastly, we added an incredibly talented person to the core open data team - Jason Lally. His
passion, insight and effort as our Open Data Program Manager has been at the heart of almost
every key accomplishment this year.
Appendix A includes a detailed list of the many thanks we owe from this last year.
Program Highlights
In the sections below, we cover highlights for each goal. Appendix B includes a link to our final
milestone report and includes a summary table describing the accomplishments by strategy in
greater detail.
Data in San Francisco: Meeting supply, spurring demand - Return to Top 7 of 40
8/20/2019 Data in San Francisco: Meeting supply, spurring demand
8/40
Goal 1. Increase number and timeliness of datasets on DataSF
Completed the dataset inventory
Our core charge in Year 1 was
completing a dataset inventory to list all
of the datasets in each department. Thiswas an immense task and took up a
great deal of our effort and time in the
last year.
The full inventory is published as a
dataset on SF OpenData or you can
view the visual link as shown in the
screenshot (this uses a new feature on
the data portal called Data Lens).
Our Data Coordinators were a key part
of making this successful and it would
not have been possible without them.
They are the true heroes in this effort.
Given that we had found little guidance on how to conduct a comprehensive data inventory, we
documented our approach and reflections in a blog post “5 Ways to Scale the Mountain of Data
in Your Organization.” Our hope is that other open data programs can learn from our
experience. We also made all of our materials available in our Resource Library.
As of the end of the fiscal year, 75% or 39 of 52 departments had completed or partially
completed the inventory. We will add additional departments on a rolling basis. In addition, we
are building a whole series of tools and resources on top of the data inventory - turning it into aplatform.
Developed three key methods to prioritize data for publication
Our dataset inventory includes over 700 datasets and counting and none of them come with a
magical publish button. So we have to prioritize our data for publication. We developed 3 key
methods to do so (and soon a 4th):
1. Department Drip. As part of the inventory, we asked departments to prioritize their data
as a function of value and data classification and to inform publishing plans
(forthcoming).
2. Endorse a Dataset. The data inventory can be used to elicit both internal and externalendorsements to publish data. While we haven’t built this yet, it is coming soon.
3. Strategic (or Thematic) Releases. One of the challenges of open data is that it often
involves the release of unrelated data in a haphazard manner. Strategic releases are
born out of a belief that simply publishing data is no longer sufficient. Open data
programs need to take on the role of adding value to open data versus simply posting it
and hoping for its use. One way is to release a body of data plus a product that puts the
data to use out of the gate. This can help open data become more relevant to a local
Data in San Francisco: Meeting supply, spurring demand - Return to Top 8 of 40
http://datasf.org/resources/http://datasf.org/blog/5-ways-to-scale-mountain-of-data/http://datasf.org/blog/5-ways-to-scale-mountain-of-data/https://data.sfgov.org/view/dztp-vt7zhttps://data.sfgov.org/City-Management-and-Ethics/Dataset-Inventory/y8fp-fbf5https://data.sfgov.org/City-Management-and-Ethics/Dataset-Inventory/y8fp-fbf5
8/20/2019 Data in San Francisco: Meeting supply, spurring demand
9/40
audience that is focused on issues, not just apps, which is what we did with the Housing
Data Hub.
We provide more details on our approach in our blog post “ How to Unstick Your Open Data
Publishing” and you can view the prioritization grid for the Department Drip strategy below:
Launched data automation as a service in partnership with Department of
Technology
One of our key criteria under this goal is the timely and regular publication of data. If we rely on
individuals to publish data, we will not be able to scale our program. So we partnered closely
with the Department of Technology’s GIS team to develop the business model and supporting
processes and technology to offer data automation as a service. Later this year we’ll be
publishing our ETL Toolkit that will document our work and serve as both an internal and
external reference.
Launched support programs and portals for
Data Coordinators and Publishers
As part of the dataset inventory, we developed a
program to actively engage our Data Coordinators,
including creating a Data Coordinator web portal,
workshops, webinars and a slew of online
resources. And we’ve started the process of better
supporting our publishers with the launch of the
publishers portal at end of year. We expect oursupport effort for Data Coordinators to decrease
and publishers to increase in the next year.
View the Coordinators Portal and Publishing Portal.
Data in San Francisco: Meeting supply, spurring demand - Return to Top 9 of 40
http://datasf.org/publishing/http://datasf.org/coordinators/http://datasf.org/blog/how-to-unstick-data-publishing/http://datasf.org/blog/how-to-unstick-data-publishing/http://housing.datasf.org/http://housing.datasf.org/
8/20/2019 Data in San Francisco: Meeting supply, spurring demand
10/40
Goal 2. Improve the usability of DataSF
Launched the new SF OpenData
Overhauling the open data platform was a core deliverable in Year 1. The image below shows
the before and after:
Our blog post, The New DataSF!, details more about the key usability changes we made.
Launched a new web home for our
overall program, DataSF
In addition to the portal overhaul, we
needed a new web presence to showcase
the rest of our work. This is also when we
branded the data portal to SF OpenData
and reserved DataSF for our overall
program.
Even better, the code we used to build the
website is freely available for others to
repurpose and use as you can read in our
blog post “Raising the digital barn”.
Collaborated on new portal features
Lastly, we partnered heavily with our vendor to introduce some new features to the portal. While
these are still in the works, we are excited about some of the new tools and features that will
help make open data easier to use for everyone - not just technical folks.
Data in San Francisco: Meeting supply, spurring demand - Return to Top 10 of 40
http://datasf.org/blog/building-lighter-and-faster/http://datasf.org/blog/the-new-datasf/
8/20/2019 Data in San Francisco: Meeting supply, spurring demand
11/40
Goal 3. Improve the usability, quality and consistency of our data
Created and deployed a metadata standard for SF
OpenData
Ensuring that our data is well documented prior to
publication is a key part of making it usable. Unfortunately,metadata (data about data) is usually an afterthought. We
made it front and center and upped the documentation
requirements to ensure that our data is not simply
published - it is published with information that can help
folks use it.
You can read more about the process we followed and
how we elicited community and City feedback in developing the standard in our two blog posts
“Metadata & Dating - More in Common than you Think…” and “U Heart Metadata”. We also
published all of our metadata research and materials in our Resource Library.
Reset and standardized how we publish data on SF
OpenData
Part way through the year, we realized we needed to
dedicate work to resetting the published data on DataSF.
Much of the data had been published in inconsistent ways,
with varying standards and restrictions. We codified
publishing guidelines and incorporated them into the
Publishing Portal.
The reset work is still underway but a key visible
accomplishment was the relaunch, in partnership with thePolice Department and the DT Open Data Services team, of police incidents as a single
multi-year dataset natively hosted on SF OpenData. Previously, the data had been published as
separate shapefiles for each year with only the last 30 days on SF OpenData natively. Native
hosting allows you to easily generate maps and other visuals as shown in this map which shows
all police incidents since 2003 in a single map.
Deployed a help desk for incorporating and tracking user feedback and questions
Understanding what questions users have about our data helps us improve how we publish it.
Our user feedback methods were limited to a nomination form provided by our vendor. In lieu of
this we created our own Contact Us form and are tracking data questions and requests in a
single place using an enterprise ticketing system. By codifying and quantifying this, we canbetter respond to user needs. In Year 2, we’ll be expanding the number and types of user
feedback mechanisms we use.
Data in San Francisco: Meeting supply, spurring demand - Return to Top 11 of 40
https://data.sfgov.org/Public-Safety/SFPD-Incidents-from-1-January-2003/tmnf-yvryhttps://data.sfgov.org/Public-Safety/SFPD-Incidents-from-1-January-2003/tmnf-yvryhttp://datasf.org/publishing/http://datasf.org/resources/http://datasf.org/blog/u-heart-metadata/http://datasf.org/blog/dating-data-what-do-you-look-for/
8/20/2019 Data in San Francisco: Meeting supply, spurring demand
12/40
Goal 4. Enable use of confidential data, while appropriatelyprotecting it
Goal 4 was largely dependent on resources.
Fortunately, we were able to partner in the Fall
with the City Services Auditor and several keyagencies to put together a comprehensive
strategy to address this goal. This was a key pivot
in our approach to focus on the use of data in the
context of coordinated care. The picture captures
an activity from one of our planning sessions.
Why Coordinated Care? Social service delivery
is in the midst of a migration from program to people centric care. Our most vulnerable
individuals touch multiple systems - education, human services, and criminal justice - which
have historically operated in silos. The transition to coordinated care will better meet the needs
of our clients by tailoring care to meet the needs of each individual, rather than administeringprograms with a one-size-fits-all approach.
A coordinated care approach is best carried out when multiple jurisdictions are able to share
data about the individuals they are jointly serving, so that efforts are not duplicated, and the
dosage of services is based on the right mix of supports. Unfortunately, most of our rules and
laws regarding data sharing were made within distinct verticals, such as health care, early
education, education, criminal justice etc. This legal thicket leads to an implementation thicket.
Each jurisdiction navigates this thicket afresh, which concentrates risk on individuals and
localities interpreting the law. In addition, to the legal work, we need coordinated policies and
procedures as well as the right mix of technology and supporting infrastructure.
The diagram below shows the focus of our project, which will be a multi-year effort, in the
context of coordinated care:
Data in San Francisco: Meeting supply, spurring demand - Return to Top 12 of 40
https://www.lucidchart.com/documents/edit/104ef250-2b3f-4882-b889-fa5b4616e979/0?callback=close&v=1317&s=612
8/20/2019 Data in San Francisco: Meeting supply, spurring demand
13/40
Goal 5. Support increased use of data in decision-making
Launched Data Academy in partnership with the City Services Auditor
Our analyst survey from last year demonstrated an unmet need for more training in data use,
collection, and visualization. Fortunately, the team in the City Services Auditor was offering a
few classes and we teamed up to expand the number, type and frequency. We also launched awebsite for the Data Academy. The demand has been incredible and the feedback very positive.
Below is a picture from our Basics of Information Design class.
Developed the Stat Starter Kit in partnership with the City Services Auditor
While the Data Academy targets individual skills, we also wanted to support department skills in
using data. A variety of departments expressed interest in starting “performance stat” programs.
To respond to this, we partnered with the performance team in the City Services Auditor who led
the way putting together a series of resources to help departments start “stat” programs. We’llbe launching the Stat Starter Kit early in Q1 of Year 2.
Launched the Housing Data Hub in partnership with a village
While the previous programs support individual and department skills in data, we also wanted to
leverage open data to improve public capacity to use and understand City data. While we are at
the beginning of this journey, we were excited to launch the Housing Data Hub
(housing.datasf.org) this year.
The Housing Data Hub is a single place to learn about policies and programs related to housing
affordability as well as the administrative data behind them - contextualized and visualized for
easy consumption. This is part of a key strategy we are pursuing, which is to publish our data in
a way that is more meaningful and accessible for our local stakeholders who care about local
issues, not just applications. Read more on our thinking on what we are calling strategic or
thematic open data releases. We think this is a key part of fostering a data-enabled policy
environment.
The screenshot below shows one of the “data browser” views on the Housing Data Hub that
incorporates just in time learning moments that help explain the data visualized below. In
Data in San Francisco: Meeting supply, spurring demand - Return to Top 13 of 40
http://datasf.org/blog/housing-data-hub-launched/http://datasf.org/blog/housing-data-hub-launched/http://housing.datasf.org/http://datasf.org/academy/
8/20/2019 Data in San Francisco: Meeting supply, spurring demand
14/40
addition, users can link back to the original data on SF OpenData by clicking on “Get the source
dataset”.
The Housing Data Hub was another great example of “it takes a village”. We received help from
each of the key departments but also volunteers from Code for San Francisco. You can visit the
Hub and read more about all of those who contributed.
Data in San Francisco: Meeting supply, spurring demand - Return to Top 14 of 40
http://housing.datasf.org/about/http://housing.datasf.org/http://housing.datasf.org/http://codeforsanfrancisco.org/
8/20/2019 Data in San Francisco: Meeting supply, spurring demand
15/40
Goal 6. Identify and foster innovations in open data and data use
Launched a blog and reclaimed our
twitter account
A key part of fostering innovation is
engagement and communications. Whenwe started last year, our Twitter account
had been abandoned and we had very few
ways of engaging and reaching our
audiences. While we have so much more
work to do here (and many things
upcoming), re-establishing our voice was a
key first step. You can read our blog at
DataSF Speaks and follow us on Twitter
@DataSF.
Launched the Resource Library Another way to foster innovation is to
document and share what we are doing so that other open data programs can benefit. We are
finding that folks use our online resources and will follow up with additional questions or
thoughts. We are also hearing from programs across the country (and occasionally world) that
have adapted our documents and resources for local use.
Data in San Francisco: Meeting supply, spurring demand - Return to Top 15 of 40
https://twitter.com/DataSFhttp://datasf.org/blog/
8/20/2019 Data in San Francisco: Meeting supply, spurring demand
16/40
Adopted a licensing strategy designed to foster
open data reuse
One of the key issues in publishing data is ensuring that
it can be legally reused. Unfortunately, this topic does
not get enough attention. We surveyed the landscape to
come up with a licensing strategy that would fit the
unique needs of open data. And then we worked closely
with our legal team to put it in place. You can read more
about what we did in our blog post “Data License
Liberation Day“ and our research and related
documentation is available via the Resource Library.
Data in San Francisco: Meeting supply, spurring demand - Return to Top 16 of 40
http://datasf.org/resources/http://datasf.org/blog/data-license-liberation-day/http://datasf.org/blog/data-license-liberation-day/
8/20/2019 Data in San Francisco: Meeting supply, spurring demand
17/40
4. Looking Forward: Year 2 Goals and Strategies
Overview of Approach and Goals
From Foundation to Meeting Supply/Spurring Demand
If Year 1 was about building the foundation, Year 2 is about buying furniture, painting the walls,hanging photos and throwing a housewarming party. It’s time to open the doors and not just let
folks in, but deliver the invite in-person. That’s why the theme for Year 2 is to fill out the supply
of data but also ensure that it’s being used by a broader range of people.
Goal Shifts in Year 2
We expected our Year 1 goals to hold steady for the next three years. While this is generally
true, we modified our goals to reflect some key insights:
● Two of our goals fit nicely under a broader goal of making timely data easily available.
As a result, we consolidated the following two goals under a broader goal of “make
timely data easily available”:
○ Increase number and timeliness of datasets on SF OpenData
○ Enable use of private data, while appropriately protecting it
● Our web presence demanded a huge amount of effort and focus in Year 1 to update it
and establish a new, comprehensive presence. While, we will continuously improve our
online tools, the goal now fits more appropriately under a goal centered on continuous
improvement and organizational excellence for the program.
Year 2 Goals and Subgoals
Goal Subgoals (where appropriate)
Goal 1. Make timely data easilyavailable
1. Increase number and timeliness ofdatasets on SF OpenData
2. Enable use of private data, while
appropriately protecting it
3. Streamline internal data access
Goal 2. Improve the usability, quality
and consistency of our data
Goal 3. Support increased use of data
in decision-making
1. Increase internal capacity
2. Support public capacity
3. Foster and incent a data culture
Goal 4. Identify and foster innovations
in open data and data use
Goal 5. Continuously improve, scale,
maintain and monitor our work
Data in San Francisco: Meeting supply, spurring demand - Return to Top 17 of 40
8/20/2019 Data in San Francisco: Meeting supply, spurring demand
18/40
These goals continue to align with the three core challenges we identified for effective data use:
1) knowing what data we have, 2) having effective and efficient means of accessing it and 3)
using data effectively.
Challenges
Knowledge Access Ability
Goal 1. Make timely data easily available
Goal 2. Improve the usability, quality and consistency of our
data
Goal 3. Support increased
use of data in
decision-making
Goal 4. Identify and foster innovations in open data and data
use
Goal 5. Continuously improve, scale, maintain and monitor our work
The following sections describe the strategies in support of these goals. Appendix C provides a
link to a quarterly timeline and set of milestones for Year 2 and Appendix D provides a cross
walk with our open data policy that details how we are meeting the provisions of the legislation.
Data in San Francisco: Meeting supply, spurring demand - Return to Top 18 of 40
8/20/2019 Data in San Francisco: Meeting supply, spurring demand
19/40
Goal 1.Make timely data easily available
A precursor to using data is access. Open data, published on a shared platform, is one way of
making data available. In the near term we need to ensure that we are publishing or plan to
publish the City’s data when allowed. We should also publish the data at a frequency that
matches the rate of data change. For example, datasets that change daily should be refreshed
daily. Some data is only allowed to be shared internally as it may be protected by law or is not
available to be published in the near term. For these datasets, we need to ensure that we have
effective and efficient means of accessing and sharing data when it is appropriate to do so.
Subgoal 1.1 Increase number and timeliness of datasets on SF OpenData
Strategy 1.1. Continue to mature our program to automate publication of data. One of thekey challenges in opening data is extracting it from legacy systems and then preparing it for
broader consumption. Older systems were not designed with data exporting or sharing in mind.
Proprietary data formats need to be converted into modern, open formats, or the data may need
to be reorganized or structured in a way that supports public distribution. Lastly, the processes
that extract, transform and load data should be automated, such that after the initial
configuration, we have little to no overhead other than monitoring the ongoing process. In sum,
our automation program (activities summarized as extract, transform and load - ETL) is a critical
part of our overall program as it will support the key processes that ensure our data is extracted
appropriately and published in a timely manner on DataSF.
While we made excellent progress in Year 1, we need to ensure that our program continues to
develop to obtain economies of scale and to be sustainable. Key elements of this will be to
formalize business processes via automation, develop dedicated resources, scale and
standardize our technical implementation, and track and measure program performance.
Strategy 1.2. Develop self-service model for data automation for large departments.
While, we’ve committed to providing data automation as a central service, we recognize that
some departments are capable of and should have control over their data automation work. At
the same time, we want to ensure consistency and quality in the automation of data. Developing
a self-service model will help us obtain both goals.
Strategy 1.3. Target departments for wholesale data automation. During our data inventory(when we listed all datasets held across the city), we included a step that covered a list of
systems. Our analysis of this list suggests that some departments are good candidates for
wholesale automation - that is, their technical environment is homogenous and they have a key
technical contact that can streamline the work. For these departments, we will seek to automate
the publication of their data as a single project (versus relying on department publishing plans).
Data in San Francisco: Meeting supply, spurring demand - Return to Top 19 of 40
8/20/2019 Data in San Francisco: Meeting supply, spurring demand
20/40
Strategy 1.4. Develop a geographic data access and publishing strategy. Our experience in
Year 1, suggested that we need to have a distinct strategy for publishing geographic data, in
particular data that consists of polygons (shapes/boundaries like police districts) and polylines
(lines like streets). The canonical geographic datasets in the City are broadly used both
internally and externally and have particular shared value that requires a more deliberateprocess for publishing (including geographic tools), data management, and communicating
metadata. The Department of Technology’s GIS team is a key partner in this work.
Strategy 1.5. Establish methods to ensure SF licensing and publication of data for new
information systems. While extracting data from legacy systems is painful, new systems
should be built with open data as a standard output. Any new information system should be
required to have automated outputs to support broader publication and dissemination of the
city’s data, while retaining the appropriate licensing. In Year 1, we were surprised to find little to
no best practices in this area. As a result we shifted the timing of this work and will seek to
complete it in Year 2.
Subgoal 1.2 Enable use of private data, while appropriately protecting it
Strategy 1.6. Create “ShareSF” hub and develop supporting resources and business
processes. As mentioned in the looking back section, we pivoted to a broader strategy for
confidential data sharing in Year 1. As part of that strategy, our office was tasked with
developing a “ShareSF” hub to facilitate internal confidential data sharing. Under this strategy,
we will develop the programmatic components of a hub, including standard business processes,
shared resources, legal frameworks, and governance.
Strategy 1.7. Explore technical solutions for confidential data sharing. The “ShareSF”
strategy also calls for the exploration of technical solutions to confidential data sharing. While
we expect this to have partial overlap with Strategy 1.11 below, we anticipate specific needs
and requirements related to implementing technical controls for legally protected data.
Strategy 1.8. Create a process for accessing your individual data. A process for accessing
data that the City holds about you will increase transparency and may help improve data quality.
Our work in Year 1 suggested that this is best incorporated into existing systems and processes
for data and information requests. As a result, we expect to wrap this process up in Year 2 and
will focus on guidance and outreach to educate departments on this type of request.
Data in San Francisco: Meeting supply, spurring demand - Return to Top 20 of 40
8/20/2019 Data in San Francisco: Meeting supply, spurring demand
21/40
Subgoal 1.3 Streamline internal data access
Through our City Analyst survey we have quantified the need for more effective and efficient
means of accessing data between departments. While the open data portal is a key repository
that we expect to leverage, some data either will not yet be available on the data portal or will be
available in a format that is less amenable to internal data work. In some cases, this subgoal will
complement Subgoal 1.2 for private data sharing.
Strategy 1.9. Develop methods to connect internal users to datasets. The dataset inventory
we created in Year 1 was a major step towards understanding the scope of the City’s data
holdings and addressing one of the key barriers - knowledge of data. Now that we have this list,
not only do we need to maintain it, we need to leverage it to support internal data access. While
other methods may emerge, tools built on top of the data inventory can support internal data
access for datasets that are not yet published (or not published in the best format for internal
use).Strategy 1.10. Integrate internal data access needs into emerging technology strategies.
As part of the Committee on Information Technology (COIT), the City has embarked on two key
strategies: 1) Shared services and 2) Public experience. We will participate in the development
of these strategies to ensure that the data access challenges we have identified are addressed
in these broader, long-term strategies.
Strategy 1.11. Explore options to develop shared data systems for internal use. The
number and variety of backend systems in the City is vast. While the open data portal may be
one shared system, we would like to explore options related to a more robust enterprise layer
for data access and management.
Data in San Francisco: Meeting supply, spurring demand - Return to Top 21 of 40
8/20/2019 Data in San Francisco: Meeting supply, spurring demand
22/40
Goal 2. Improve the usability, quality and consistency of our data
While Goal 1 provides access to the City’s data, the ultimate value of the data depends on its
usability, quality, and consistency. Usability helps us understand the data - what is it, how is it
collected, when is it published - the basic documentation that supports use of the data. Quality
speaks to how reliable and complete the data is - can we trust the conclusions or decisions we
make based on the data? Consistency helps us combine data from different systems, by using
consistent definitions across datasets, whether it’s race or ethnicity, service categories, target
populations, location, etc.
Strategy 2.1. Develop comprehensive data quality strategy for the City; implement via
pilots and broader COIT strategies. Our Year 1 experience suggested that the City would
benefit from a data quality framework and roadmap. We expect this to be a multi-year strategyin terms of development and execution. Over the next year we will identify motivated pilots to
roll out our data quality strategy. Research suggests that aligned pilots over time are the most
effective way to pursue a broader data quality approach. Pilots will likely include data
consistency standards, data model alignment, and data management guidance and tools.
As mentioned in Strategy 1.10, the City has embarked on two key technology strategies: 1)
Shared services and 2) Public experience. These strategies represent an additional opportunity
to insert codified data quality practices and policies into a broader strategy.
Strategy 2.2. Conduct targeted data quality improvements. During the middle of last year,
we adopted this as a new strategy. Our central position in the City allows us to identify
cross-department data quality concerns. As a result, we will occasionally participate in and even
lead, if needed, a targeted effort to improve data quality. While this strategy is no substitute for a
broader strategy, it can fill certain critical data gaps.
Strategy 2.3. Provide mechanisms to elicit and track feedback and learnings from data
users. We discovered in Year 1 that we had a paucity of feedback mechanisms. While creating
our help desk was a first step, we need richer and more scalable approaches for user feedback.
Some of these we expect from our vendor, but others may require new tools, partnerships, or
types of engagements. New tools may include testing social data dictionaries or data wiki
pages. And we must also explore offline options for engagement (e.g. working groups).
Data in San Francisco: Meeting supply, spurring demand - Return to Top 22 of 40
8/20/2019 Data in San Francisco: Meeting supply, spurring demand
23/40
Goal 3.Support increased use of data in decision-making
Once data is available, we need to use it. Effective use consists of individual and departmentcapacity as well as a broader public capacity for using data in decision-making. Capacity
consists of shared data and access, as well as data literacy, analytics, managing with data,
and displaying and communicating data. We need to match the availability of data with the
capacity to use data, both in terms of people and technology.
Subgoal 3.1 Increase internal capacity
Strategy 3.1. Grow Data Academy and explore methods to institutionalize as part of
professional development. Last fall we launched the Data Academy in partnership with the
City Services Auditor (CSA). The demand for courses has been high with every course at
capacity and with a waitlist. For Year 2, we want to add classes, bring in external trainers, and
explore ways to leverage massive open online courses. Part of the curriculum extension will be
to incorporate classes that are targeted at managerial and leadership roles. In addition, we want
to explore integrating Data Academy courses into formal training venues or as part of job series.
Viewing data literacy as a professional development strategy versus a series of ad hoc trainings
will be key to transforming data capacity across the City - at both department and individual staff
levels.
Strategy 3.2. Provide enduring materials and resources for data tools and techniques.
While the Data Academy provides an opportunity for direct training, we want to supplement that
with enduring resources that are available outside of the classroom and to serve a broader
audience. In particular, we will explore how to showcase tools or other resources and provide
supplementary materials, e.g. guidance or tool guides. A particular focus will be on
geographic/mapping tools as well as data quality tools. This could also include exploring means
to better distribute previous analyses or work.
Strategy 3.3. Help establish department stat programs based on department readiness.
We will continue to partner with the City Services Auditor and the strengthened Performance
Management team within the office. We expect to be in a supporting and partnering role and will
focus on enhancing or extending their work, not leading.
Strategy 3.4. Explore opportunities to supplement analytical capacity. While the City has agreat deal of analytical talent, we are interested in enhancing both the amount and type of
analytical capacity. Opportunities may exist for partnerships with external organizations, working
with volunteers, issuing challenges or enhancing existing staff.
Data in San Francisco: Meeting supply, spurring demand - Return to Top 23 of 40
8/20/2019 Data in San Francisco: Meeting supply, spurring demand
24/40
Subgoal 3.2 Support public capacity
Strategy 3.5. Continue to develop our portfolio of transparency tools and websites.
Transparency tools and websites go beyond simply publishing data to transforming the data intoinformation that can be consumed and understood by the general population. The Housing Data
Hub is one example. Each of these tools provides policy makers and the public with ready
access to City data contextualized and presented in a way that informs decision-making.
Typically, these sites are built on open data. We will continue to develop our own sites as well
as partner and/or promote sites being built by City departments.
Strategy 3.6. Explore methods to increase public capacity for data use. Transparency
websites are one form of capacity building, but they rely on a single channel, a website, to
engage the public about City data. We are interested in exploring other methods, whether it is
trainings at the Library, workshops at community or neighborhood events, or collaborative
problem-solving. We expect any additional methods will also increase our own capacity topresent the City’s data more effectively and to be more responsive to the broader community.
Subgoal 3.3 Foster and incent a data culture
Strategy 3.7. Explore the creation of shared frameworks for data and evaluation. A
common language and approach to data-driven decision making can help set a roadmap and
ease the effort needed from departments. For example, imagine if any new initiative included a
data, evaluation and performance management strategy. This goes beyond simply requiring
evaluation to the continuous measuring and retooling of policies and programs based on astream of real time data and experimentation integrated into program management and
processes. Instead of a pre/post appendage - data and evaluation is part of the team.
We will explore creating a shared framework to inform the launch of new programs, including
defining key outcomes, the data and evaluation plan, and performance management needs. For
example, a data plan could address data sourcing and collection needs, data sharing
requirements and data model creation. It could also address how to integrate data needs into
business processes and technical systems. Lastly, it could discuss how to create management
tools, including measures, dashboards, staffing and business processes.
The framework could be implemented or tested in a variety of ways from pilots to training to
policy.
Strategy 3.8. Explore creation of data-related peer networks. Data-related peer networks
could help foster cross-department problem solving by connecting colleagues with related
domain expertise. Employees could share ideas for data use and tools and also identify
opportunities to collaborate on cross-department data initiatives.
Strategy 3.9. Communicate the benefits of data-driven decision-making. Clarifying the
Data in San Francisco: Meeting supply, spurring demand - Return to Top 24 of 40
http://housing.datasf.org/http://housing.datasf.org/
8/20/2019 Data in San Francisco: Meeting supply, spurring demand
25/40
value of data-driven decision-making and the tangible benefits requires storytelling. And within
the City, we’ve heard that one of the primary drivers to adopt Stat programs was hearing what
other groups are doing. We need to be better at collecting and communicating stories about
effective data use. Not only does this spur new ideas, it showcases the teams that are doing
good work, thereby encouraging more.
Data in San Francisco: Meeting supply, spurring demand - Return to Top 25 of 40
8/20/2019 Data in San Francisco: Meeting supply, spurring demand
26/40
Goal 4.Identify and foster innovations in open data and data use
The pace of change in the open data, analytics, and visualization spaces is breathtaking. Weneed to not only ensure we are aware of innovations, but we need to selectively identify and
nurture innovation in order to ensure that the City and our stakeholders benefit from changes
in technology and the experiences of others.
Strategy 4.1. Maintain ongoing reviews of best practices and the changing technology
landscape. To ensure that San Francisco maintains its leadership position in open data, we
have to stay abreast of emerging best practices and changes in technology that can better
support or even transform our program. In part, this will be a natural result of our
communications and engagement strategy, but retaining it as a specific strategy will help ensure
that we are making regular and conscious efforts to assess the rapidly changing landscape.This approach was validated in Year 1, as our quarterly technology landscape sessions resulted
in several pivots or technology changes.
Strategy 4.2. Target opportunities to improve data-centric services. The City provides a
variety of services and some of these are heavily mediated by data and/or technology and may
be cross-departmental. Our experience in Year 1 showed that we have a role to play in guiding
or informing these types of projects. As this type of work risks stretching our capacity, we will
have criteria for participating, including expected impact and level of departmental resources
and commitment. Wherever possible, we will roll the projects or the lessons learned into the
larger shared services and public experience strategies discussed in Strategy 1.10.
Strategy 4.3. Selectively partner in or promote data-centric initiatives. Through our
engagement strategy and ongoing reviews we hope to identify opportunities for targeted data
initiatives or partnerships that involve organizations or people outside of the City. We believe
external organizations or perspectives may bring a new approach to existing City challenges or
help extend City services. We will also seek opportunities to collaborate with other
governments. Part of this work will be to develop clear criteria on when and how we should
participate in partnerships as well as methods to elicit external help.
Data in San Francisco: Meeting supply, spurring demand - Return to Top 26 of 40
8/20/2019 Data in San Francisco: Meeting supply, spurring demand
27/40
Goal 5.Continuously improve, scale, maintain and monitor our work
A culture of continuous improvement ensures that we always work to identify where and howwe can improve. In some cases, this may be a deliberate choice to not improve if the benefits
are less than the effort required. In addition, due to the small size of our team, we need to
deliberately seek ways to scale our work both in execution and impact. During each project or
activity, we continuously ask ourselves - can we scale this? If the answer is no, we need to
change or on occasion, limit our effort. Lastly, any work we have accomplished needs a
deliberate maintenance strategy if we have future need for it.
Activity 5.1. Maintain data catalogs. The dataset inventory that we completed in Year 1 was
an enormous undertaking. We need to maintain the resulting list so we can use it to broadly
facilitate internal data access and to track data as it changes over time.
Activity 5.2. Maintain, and iterate as needed, methods for prioritizing datasets. We will
need to fully deploy and then maintain our various methods for prioritizing the publication of
datasets. If new methods emerge, we will incorporate them into our plan.
Activity 5.3. Continuously improve our web presence and supporting processes and
materials to better meet the needs of our users. While we will seek to increase the means in
which we engage users, our website and supporting tools will likely remain the key point of
interactions. As such, we must ensure that they are meeting the needs of our many users,
including data publishers, consumers and residents.
Activity 5.4. Continue to partner with Socrata to inform the development of the portal. SFOpenData, our data portal, is a key part of our web presence and how we meet the needs of our
users. We will continue to partner with our open data portal vendor to incorporate our user’s
needs into the portal’s roadmap.
Activity 5.5. Continuously improve outreach and support for Data Coordinators and
publishers. We need to continue to support our Data Coordinators. We do expect our support
of data publishers to increase in Year 2 both due to the expected increase in publication post
dataset inventory and to continuously improve the publishing process.
Activity 5.6. Grow and broaden communications and engagement activities. In Year 1, our
communications and engagement was focused largely on completing the dataset inventory andengaging our Data Coordinators. In Year 2, we must grow the scope and nature of our outreach.
Not only did our survey suggest we are not reaching key parts of City staff, we know that we are
not engaging most neighborhoods and communities writ large. Now that we have the key digital
channels in place (social media, blog, website), we can build and extend our work. The core
goal in this strategy is to broaden awareness and then use of the tools we are providing.
Data in San Francisco: Meeting supply, spurring demand - Return to Top 27 of 40
8/20/2019 Data in San Francisco: Meeting supply, spurring demand
28/40
Activity 5.7. Track and measure our progress. In Year 1, we established a framework and set
of metrics for tracking our work. We need to maintain that work, automate reporting wherever
possible, and make changes as our work evolves. In addition, this requires some level of
conscious data collection, whether through surveys, workshops or case studies.
Activity 5.8. Conduct ongoing planning. To ensure our work is on track, we must conduct
ongoing planning. Last year we established monthly and quarterly planning meetings that
ensured we were meeting the goals of our workplan or if needed, reevaluating our approach.
Data in San Francisco: Meeting supply, spurring demand - Return to Top 28 of 40
8/20/2019 Data in San Francisco: Meeting supply, spurring demand
29/40
5. Priority, Resource and Contingency AnalysisThe Open Data Ordinance mandates some of our activities, while others are either in the critical
path for broader work or a key part of setting a platform for future success. As a result, we
prioritized our strategies using the MoSCow method in the context of what we think we must
accomplish in Year 2 (M=Must, S=Should, C=Could). This does not mean that certain activities2
will not become “musts” or “shoulds” over time.
We then identified resource gaps as follows:
● No - no resource gap
● Yes - we do not believe we can be successful with existing resources
● Partial - the strategy can be supported at some level with current resources, but should
be supplemented to ensure success
We then characterized the gap based on type of need:
● Ongoing - requires a sustainable resource plan as we expect to be actively developing
or maintaining this activity over the mid to long term
● Project - requires a one time solution to resource
Lastly, the table includes a brief contingency strategy if we are unable to close the resource
gap.
Table: Prioritization, Gap Analysis and Contingency Plan
Strategy M S C Gap
Type of
Need
Contingency Strategy if Unable
to Close Gap
Strategy 1.1. Continue to mature
our program to automate publication
of data.
X Partial Ongoing We plan to close this gap by hiring
a new role for open data services.
This gap will exist until we completethe hire and onboarding. If we are
unable to hire the right mix of skills
we will plan to reallocate
responsibility among existing staff
and selectively partner with a
handful of departments with related
expertise.
Strategy 1.2. Develop self-service
model for data automation for large
departments.
X Partial Ongoing
Strategy 1.3. Target departments
for wholesale data automation.
X Partial Project
Strategy 1.4. Develop a geographic
data access and publishing
strategy.
X Partial Ongoing
Strategy 1.5. Establish methods to
ensure SF licensing and publication
of data for new information systems
X Partial Project We will seek external and internal
partners for help developing this.
Strategy 1.6. Create “ShareSF” hub
and develop supporting resources
and business processes.
X Partial Ongoing We plan to hire later this year and
that will partially close the gap; We
will also rely on key department
partnerships and will seek external
funding.
2 MoSCoW prioritization is traditionally used in software development to determine what requirements you
Must have, Should have, Could have, and Won’t have. In our case, we used it to prioritize our activities.
Data in San Francisco: Meeting supply, spurring demand - Return to Top 29 of 40
8/20/2019 Data in San Francisco: Meeting supply, spurring demand
30/40
Strategy 1.7. Explore technical
solutions for confidential data
sharing.
X Partial Ongoing We will scale to our capacity and
may seek external funding.
Strategy 1.8. Create a process for
accessing your individual data.
X Partial Project We will rely on interns and partner
with the Public Information Officers
to complete this.
Strategy 1.9. Develop methods to
connect internal users to datasets.
X Partial TBD We will tailor sub-projects to scale
to our capacity and will reexamine
based on need.
Strategy 1.10. Integrate internal
data access needs into emerging
technology strategies.
X No
Strategy 1.11. Explore options to
develop shared data systems for
internal use.
X No
Strategy 2.1. Develop
comprehensive data quality strategy
for the City; implement via pilots
and broader COIT strategies
X Partial Ongoing We will seek department partners
and we may seek external funding.
Strategy 2.2. Conduct targeted data
quality improvements.
X Partial Ongoing We will only engage in projects with
department engagement and
resource commitment.
Strategy 2.3. Provide mechanisms
to elicit and track feedback and
learnings from data users.
X Partial Ongoing We will scale for our capacity and
seek external partners to help
frame and move this forward.
Strategy 3.1. Grow Data Academy
and explore methods to
institutionalize as part of
professional development.
X Partial Ongoing We will seek to expand our
department partnership to HR and
also explore bringing in external
teachers for advanced topic areas.
Strategy 3.2. Provide enduring
materials and resources for data
tools and techniques.
X Partial Ongoing We will scale subprojects based on
our capacity and encourage
departments to contribute.
Strategy 3.3. Help establish
department stat programs based on
department readiness.
X No
Strategy 3.4. Explore opportunities
to supplement analytical capacity.
X Partial Ongoing We will scale for our capacity and
seek external funding.
Strategy 3.5. Continue to develop
our portfolio of transparency tools
and websites.
X Partial Ongoing We will scale for our capacity, seek
external funding, and require
committed department partners.
Strategy 3.6. Explore methods to
increase public capacity for data
use.
X Partial Ongoing We will scale for our capacity and
seek external funding.
Strategy 3.7. Explore the creation of
shared frameworks for data and
evaluation.
X Partial Ongoing We will scale for our capacity, seek
external funding, and seek
department partners.
Data in San Francisco: Meeting supply, spurring demand - Return to Top 30 of 40
8/20/2019 Data in San Francisco: Meeting supply, spurring demand
31/40
Strategy 3.8. Explore creation of
data-related peer networks.
X No
Strategy 3.9. Communicate the
benefits of data-driven
decision-making.
X Partial Ongoing We will scale for our capacity and
may seek external funding or
partnerships.
Strategy 4.1. Maintain ongoingreviews of best practices and the
changing technology landscape.
X No
Strategy 4.2. Target opportunities to
improve data-centric services.
X Partial Ongoing We will scale for our capacity and
require committed department
partners.
Strategy 4.3. Selectively partner in
or promote data-centric initiatives
X Partial Ongoing We will scale for our capacity and
may seek external funding or
partners.
Data in San Francisco: Meeting supply, spurring demand - Return to Top 31 of 40
8/20/2019 Data in San Francisco: Meeting supply, spurring demand
32/40
6. ConclusionData can feel dry, boring and academic. At the same time, everyone loves a good story. But
every story has a rich vein of data threaded throughout, describing a pattern and illuminating a
path forward. It’s only when we link the data narratives that underlie our stories that we are able
to make new connections that lead to new insights about what is working or what is possible.This plan is not about data for data’s sake. This plan is about transforming how we enrich our
understanding, our experience and our City with data.
Data in San Francisco: Meeting supply, spurring demand - Return to Top 32 of 40
8/20/2019 Data in San Francisco: Meeting supply, spurring demand
33/40
Appendices
Appendix A. Acknowledgements
A number of people, too numerous to list, have contributed to our work, our thinking and our
inspiration. Below are a handful of thanks - we may have missed some, if so, our apologies!Our local brigade, Code for San Francisco, has become a fantastic partner - we learn from and
with them and value the relationships that have developed. Thank you especially to Jesse
Biroscak, Maddie Suda, Julio Feliciano, Judy van Soldt, and Katherine Nemacher.
Many thanks to my colleagues in other places for sharing their worries, their challenges and
their solutions. I love that we are on this journey together! Barbara Cohn, Stuart Drown, Laura
Meixell, Abhi Nemani, Andrew Nicklin, Maksim Pecherskiy, Tom Schenk, and Tim Wisniewski.
Our Internal Advisory Group provided guidance and strategic direction. Many thanks to Carmen
Chu, Miguel Gamiño, Luis Herrera, Kate Howard, Steve Kawa, Ed Reiskin, and Ben Rosenfield.
The following people have become friends and thought partners throughout this process: Anthony Ababon, Krista Canellakis, Cyndy Comerford, Ted Conrad, Jason Cunningham,
Rebecca Foster, Luke Fretwell, Jane Gong, Kate Howard, Chanda Ikeda, Matthias Jaime, Lani
Kent, Kelly Kirkpatrick, Carol Lu, Andy Maimoni, Ashley Meyers, Jay Nath, Tajel Shah, Chris
Simi, Peg Stevenson, John Tucker, Marisa Pereira Tully, and Melissa Whitehouse.
The following folks have been key parts of making everything happen. Their insight,
commitment, and persistence have helped all we have done be successful this year: Jason
Lally, Jeff Johnson, Samuel Valdez, Sherman Luk, Jessie Rubin, Andrew Ju, Kyle Patterson,
Laura Marshall and Kyra Sikora. Read more about their contributions here:
http://datasf.org/about/.
And we’ve had an amazing stream of interns who have been critical to so many projects. Thank
you each for your energy and commitment: Peri Weisberg, Erica Finkle, Laura Gerhardt,
Christina Malamut, Charlotte Hill, Dan Wilcox, Evgenia Likhovtseva, and Marcelo Milanello.
Read more about their contributions here: http://datasf.org/about/.
Last, but so far from least our Data Coordinators - our core activity and output from Year 1
would not exist without the collective effort of our Data Coordinators and other supporting staff:
Mullane Ahern, Darrell Ascano, Colleen Burke-Hill, Carol Chapman, Eddy Ching, Mike Choi,
Joanne T. Chou, Marina Coleridge, Robert Collins, Elise Crane, Keith DeMartini, Matt Dorsey,
Sarah Duffy, Tiarra Earls, Kevin Edwards, Penni Eigster, Sandra Eng, Cheong-Tseng Eng,
Cynthia Goldstein, Zihong Gorman, Brandon Grissom, Michele Gutierrez-Canepa, John Halpin,David Hardy (LT), Kurian Joseph, Jennifer (Zoey) Kroll, Michael Lambert, Craig Lee, Alexander
Levitsky, Brent Lewis, Thomas Lindman, Ferry Lo, Jose Luis Perla, Andy Maimoni, Maria X
Martinez, Steven Massey, Eddie McCaffrey, Maria McKee, Jesus Mora, John Murray, Wilson
Ng, Stephanie Nguyen, Eric Pawlowsky, Jeff Pera, Joshua Raphael, Stacy T. Robson,
Guillermo Rodriquez, Leah Rothstein, Ken Salmon, Valeri Shilov, Mitch Sutton, Marianne
Thompson, Charles Thompson, Anne Trickey, Alan Tse, Tyler Vu, Mike Webster, Chris
Data in San Francisco: Meeting supply, spurring demand - Return to Top 33 of 40
http://datasf.org/about/http://datasf.org/about/
8/20/2019 Data in San Francisco: Meeting supply, spurring demand
34/40
Wisniewsky, Gloria Woo, Mike Wynne, and Theresa Zighera.
Appendix B. Detailed Accomplishments in Year 1
For each of our goals and strategies from Year 1, we highlight the key accomplishments and
status for each strategy. Our quarterly milestones (google doc) represents an accounting by
quarter of our milestones and our progress on them per our strategic plan. The quarters coverFiscal Year 2014-2015, which started on July 1, 2014 and completed on June 30, 2015. Below
is a high level summary for each strategy by goal.
Goal 1: Increase number and timeliness of datasets on DataSF
Strategy Key Accomplishments / Status
Strategy 1.1. Establish the role of data
coordinators and support development of data
catalogs.
● 52 data coordinators appointed
● (75%) of department inventories complete
● Inventory published on SF OpenData
Strategy 1.2. Develop methods to inform the
prioritization of datasets for publication.
● Developed 4 methods to prioritize datasets
● Streamlined data nomination process and
deployed help desk and request tracking
Strategy 1.3. Develop metrics to track and
measure progress in publishing open data.
● Developed progress measures and KPIs to
support quarterly report
● Developed evaluation framework for measuring
impact of open data
● Coming soon: Public launch of department
publishing plans and automated reporting
Strategy 1.4. Develop our program to
automate publication of data.
● Developed business case in partnership with DT
● Created program and services model
● Established technology, business processes and
support documents
● Secured full time resource for program
Strategy 1.5. Develop an outreach and
support program for data coordinators and
other data publishers.
● Created Data Coordinator Portal and supporting
tools, templates and training
● Created Publisher Portal with standard publication
process and training
● Developed a submission process and packet
● Created series of guidebooks and conducted in
person and online trainings
Strategy 1.6. Establish methods to ensure SF
licensing and publication of data for new
information systems.
● In progress, project was delayed due to lack of
best practices and resource constraints
Goal 2: Improve usability of DataSF
Strategy Key Accomplishments / Status
Strategy 2.1. Better leverage existing services
and features from Socrata.
● Conducted analysis and rolled into other strategies
Strategy 2.2. Partner closely with Socrata to
inform the development of the portal.
● Joined customer advisory board
● Participated in usability testing of new key feature
● Actively participate in roadmap and direction
● Participate in monthly roadmap meeting
Data in San Francisco: Meeting supply, spurring demand - Return to Top 34 of 40
https://docs.google.com/spreadsheets/d/1M30oyAFUO6TXkmZ1jqGXTNGdvsTiXh5V7oS7vUCKRJ0/edit?usp=sharing
8/20/2019 Data in San Francisco: Meeting supply, spurring demand
35/40
Strategy 2.3. Redesign our web presence and
supporting processes and materials to better
meet the needs of our users.
● Redesigned and launched new data portal
● Launched new web home
● Early partner in technology preview for new
dataset design
Goal 3: Improve the usability, quality, and consistency of our data
Strategy Key Accomplishments / Status
Strategy 3.1. Establish metadata standards for
published data.
● Created and implemented new standard
Strategy 3.2. Establish mechanisms to elicit
and track feedback and learnings from data
users.
● Analysis suggested gap; will deploynew methods
in Y2
Strategy 3.3. Explore the creation of data
quality processes and measures.
● Conducted research and laid out approach for Y2,
including a data playbook and identified initial
partners or topics for pilots
*NEW* Strategy 3.4 Conduct targeted data
quality improvements
● Worked to incorporate inclusionary housing
program data needs into upstream planningbusiness process; turned into broader housing
data pipeline project that will extend into Y2
*NEW* Strategy 3.5 Reset and standardize
datasets on DataSF
● Complete and in monitoring mode
● Created standard guidelines
Goal 4: Enable use of private data, while appropriately protecting it
Strategy Key Accomplishments / Status
Strategy 4.1. Create a data classification and
sharing standard.
*REVISED* Develop a strategy to enable
internal data sharing
● In partnership with CSA, convened departments
in HSS and developed a multi-year strategy
● Obtained dedicated resources to support going
forwardStrategy 4.2. Create a process for accessing
your individual data.
● Modified strategy to leverage existing processes;
will deploy in Y2
Goal 5: Support increased use of data in decision-making
Strategy Key Accomplishments / Status
Strategy 5.1. Establish a training curriculum to
support increased use of data in
decision-making.
● In partnership with CSA, Launched Data
Academy in Fall, all classes booked out with
waiting lists
● Done department trainings after being
approached by depts
Strategy 5.2. Help establish department statprograms based on department readiness;
codify lessons learned and materials for
broader use
● Partnered with CSA to:○ Develop 2 case studies of department
Stat programs
○ develop assessment tool and guidebook
to creating stat programs
● Piloted approach in department, which is in
progress
● Will launch cumulative work as the Stat Starter
Kit in early Y2
Data in San Francisco: Meeting supply, spurring demand - Return to Top 35 of 40
http://datasf.org/publishing/guidelines/
8/20/2019 Data in San Francisco: Meeting supply, spurring demand
36/40
Strategy 5.3. Continue to develop our portfolio
of transparency tools and websites.
● Developed the Housing Data Hub in partnership
with multiple departments
Goal 6: Identify and foster innovations in open data and data use
Strategy Key Accomplishments / Status
Strategy 6.1. Develop and maintain a communicationsand engagement strategy.
● Conducted analysis and plan● Increased twitter following
● Established blog
● Created CDO listservs
Strategy 6.2. Conduct ongoing reviews of best
practices and the changing technology landscape.
● Conducted review and codified quarterly
process
Strategy 6.3. Identify and enable targeted data-centric
initiatives.
● Working to automate and analyze housing
inspections data from 3 departments; will
explore extension of work in Y2
Strategy 6.4. Establish a data licensing framework
and standard.
● Completed analysis and made
recommendation
● Obtained legal agreement withrecommended standard
● Rollout and transition strategy underway
Data in San Francisco: Meeting supply, spurring demand - Return to Top 36 of 40
http://housing.datasf.org/
8/20/2019 Data in San Francisco: Meeting supply, spurring demand
37/40
Appendix C. Quarterly Milestones for Year 2
For each of our strategies, we outline a set of quarterly milestones and expected resources.
Adjustments to the milestones may occur based on resources or other factors as discussed in
Section 5. You can view the milestones and related timeline in a google spreadsheet.
Appendix D. Crosswalk between plan and Open Data PolicySec. 22D.2. Chief Data Officer and City Departments
(a) Chief Data Officer
# Clause Implementation
(a) Chief Data Officer. In order to coordinate implementation, compliance, andexpansion of the City's Open Data Policy, the Mayor shall appoint a ChiefData Officer (CDO) for the City and County of San Francisco. The CDOshall be responsible for drafting rules and technical standards to implementthe open data policy, and determining within the boundaries of law whichdata sets are appropriate for public disclosure. In making this determination,the CDO shall balance the benefits of open data set forth in Section 22D.1,
with the need to protect from disclosure information that is proprietary orconfidential and that may be protected from disclosure in accordance withlaw. Nothing in the rules and technical standards shall compel or authorizethe disclosure of privileged information, law enforcement information,national security information, personal information, unless required by law.Nothing in the rules or technical standards shall compel or authorize thedisclosure of information which is prohibited by law.
This document serves tomeet the generalexpectations. Subgoal 1.2will protect proprietary orconfidential information.
(b) The CDO's duties shall include, but are not limited to the following: -
(b)(1) Draft rules and technical standards to implement the open data policy
ensuring the policy incorporates the following principles:
(b)(1)(A) (A) Data prioritized for publication should be of likely interest to the public; Deployed via Strategy 1.2in FY14-15; Maintained
via Activity 5.2 inFY15-16.
(b)(1)(B) (B) Data sets should be free of charge to the public through the web portal; Existing practice
(b)(1)(C) (C) Data sets shall not include privileged or confidential information, law
enforcement information, national security information, personal
information, proprietary information or information the disclosure of which is
prohibited by law; and
Managed via publicationprocess and Subgoal 1.2.
(b)(1)(D) (D) Data sets shall include, to the extent possible, metadata descriptions,
API documentation, and the description of licensing requirements. Common
core metadata shall, at a minimum, include fields for every dataset's title,
description, tags, last update, publisher, contact information, unique
identifier, and public access level as defined by the CDO.
Complete and managedvia publication process.
(b)(2) (2) Coordinate, maintain, and update the City's Open Data website,
currently known as "DataSF";
See Activity 5.3.
(b)(3) (3) Present the Open Data rules and technical standards to the
Committee on Information Technology (COIT) for adoption;
COIT is the forum used topass rules and technicalstandards.
(b)(4) (4) Provide education and analytic tools for City departments to improve
and assist with the release of open data to the public;
See Strategies 1.1, 1.2,1.3, 1.4, and Activity 5.5.
Data in San Francisco: Meeting supply, spurring demand - Return to Top 37 of 40
https://docs.google.com/spreadsheets/d/1rxXSf6Rsu2tJxV9gEnRWuVzcbfSmJrpwj-b_T8u8TWs/edit?usp=sharing
8/20/2019 Data in San Francisco: Meeting supply, spurring demand
38/40
(b)(5) (5) Assist departments by collecting and reviewing each department's
open data implementation plans and creating a template for the
departmental quarterly progress reports;
Complete and maintainedvia Activity 5.5.
(b)(6) (6) Present an annual citywide implementation plan to COIT, the Mayor,
and Board of Supervisors and respond, as necessary, to inquiries regarding
the implementation of the open data policy and the compliance ofdepartments with the deadlines established in this section.
This plan will bepresented to all of thesegroups.
(b)(7) (7) Help establish data standards within and outside the City through
collaboration with external organizations;
New standards will bedeveloped as needed.
(b)(8) (8) Assist City departments with analysis of City data sets to improve
decision making;
See Goal 3
(b)(9) (9) Establish a process for providing citizens with secure access to their
private data held by the City;
See Strategy 1.8
(b)(10) (10) Establish guidelines for licensing open data sets released by the City
and evaluate the merits and feasibility of making City data sets available
pursuant to a generic license, such as those offered by "Creative
Commons." Such a license could grant any user the right to copy, distribute,
display and create derivative works at no cost and with a minimum level of
conditions placed on the use; and,
Complete, will formalizevia COIT standard.
(b)(11) (11) Prior to issuing universally significant and substantial changes to rules
and standards, solicit comments from the public, including from individuals
and firms who have successfully developed applications using open data
sets.
Standard practice; Rulesand standards will also bepresented to COIT, apublic forum
(b) City Departments
# Clause Implementation
(b) Each City department, board, commission, and agency ("Department")
shall:
-
(b)(1) Make reasonable efforts to make publicly available all data sets under the
Department's control, provided however, that such disclosure shall be
consistent with the rules and technical standards drafted by the CDO and
adopted by COIT and with applicable law, including laws related to privacy;
Supported by Strategies1.1-1.4 and Activity 5.5.
(b)(2) Review department data sets for potential inclusion on DataSF and ensure
they comply with the rules and technical standards adopted by COIT;
Complete and maintainedby Activity 5.5.
(b)(3) Designate a Data Coordinator (DC) no later than three months after the
effective date of Ordinance No. 285-13, who will oversee implementation
and compliance with the Open Data Policy within his/her respectivedepartment. Each DC shall work with the CDO to implement the City's open
data policies and standards. The DC shall prepare an Open Data plan for
the Department which shall include:
Complete
(b)(3)(A) A timeline for the publication of the Department's open data and a summary
of open data efforts planned and/or underway in the Department;
Publication plans arepublicly available andupdated bi-annually.
(b)(3)(B) A summary description of all data sets under the control of each Complete other than
Data in San Francisco: Meeting supply, spurring demand - Return to Top 38 of 40
8/20/2019 Data in San Francisco: Meeting supply, spurring demand
39/40
Department (including data contained in already-operating information
technology systems);
rolling acceptances fromdepartments; DataInventory available on SFOpen Data.
(b)(3)(C) All public data sets proposed for inclusion on DataSF; See previous
(b)(3)(D) Quarterly updates of data sets available for publication. Centralized throughpublishing program
(b)(4) The DC's duties shall include, but are not limited to the following:
(b)(4)(A) No later than six months after the effective date of Ordinance No.285-13,
publish on DataSF, a catalogue of the Department's data that can be made
public, including both raw data sets and application programming interfaces
("API's").
Complete, thoughaccepting rollingsubmissions
(b)(4)(B) Appear before COIT and respond to questions regarding the Department's
compliance with the City's Open Data policies and standards;
Will be done as needed
(b)(4)(C) Conspicuously display his/her contact information (including name, phone
number or email address) on DataSF with his/her department's data sets;
Supported by central helpdesk to facilitate trackingand formalized via
Strategy 2.3.
(b)(4)(D) Monitor comments and public feedback on the Department's data sets on a
timely basis and provide a prompt response;
See previous
(b)(4)(E) Notify the Department of Technology upon publication of any updates or
corrective action;
Existing practice
(b)(4)(F) Work with the CDO to provide citizens with secure access to their own
private data by outlining the types of relevant information that can be made
available to individuals who request such information;
See Strategy 1.8
(b)(4)(G) Implement