It’s all about the data: A Managerial Perspective...R.D.Damhof –Oktober 2014 –Norske Oh…data...

Preview:

Citation preview

R.D.Damhof – Oktober 2014 – Norske

It’s all about the data: A Managerial Perspective

By Ronald Damhof

Email: ronald.damhof@prudenza.nl

Linkedin:

nl.linkedin.com/in/ronalddam

hof/

Twitter: RonaldDamhof

Blog: prudenza.typepad.com

Website:www.prudenza.nl

R.D.Damhof – Prudenza BV - Copyright - 22 mei 2014R.D.Damhof – Oktober 2014 – Norske

I am an opinionated kind a guy….

R.D.Damhof – Prudenza BV - Copyright - 22 mei 2014R.D.Damhof – Oktober 2014 – Norske

Who am I - My Data ManifestoThe X commandments of data management

I. Thou shall always respect

& consider the context.

Context is leading

R.D.Damhof – Prudenza BV - Copyright - 22 mei 2014R.D.Damhof – Oktober 2014 – Norske

Who am I - My Data ManifestoThe X commandments of data management

II. Thou shall love your

(meta)data. Data is the

ultimate proprietary

asset:

- Manage it

- Govern it

- Utilise it

But do it ethically

“Most companies manage their

parking lot better than their data” —Gartner, Frank Buytendijk (paraphrased)

R.D.Damhof – Prudenza BV - Copyright - 22 mei 2014R.D.Damhof – Oktober 2014 – Norske

Who am I - My Data ManifestoThe X commandments of data management

III.Thou shall stop centering

apps over data:data first

R.D.Damhof – Prudenza BV - Copyright - 22 mei 2014R.D.Damhof – Oktober 2014 – Norske

Who am I - My Data ManifestoThe X commandments of data management

IV.Thou shall strive for

accurate, relevant, timely,

reliable and accessible data:

It is all about the quality of

the product

Deming’s point 3 of 14:

”Cease dependence on inspection to

achieve quality. Eliminate the need for

massive inspection by building quality

into the product in the first place."

R.D.Damhof – Prudenza BV - Copyright - 22 mei 2014R.D.Damhof – Oktober 2014 – Norske

Who am I - My Data ManifestoThe X commandments of data management

V. Thou shall abstract

R.D.Damhof – Prudenza BV - Copyright - 22 mei 2014R.D.Damhof – Oktober 2014 – Norske

Who am I - My Data ManifestoThe X commandments of data management

VI.Thou shall make a

‘fundamentalistic’ separation

between facts & context

R.D.Damhof – Prudenza BV - Copyright - 22 mei 2014R.D.Damhof – Oktober 2014 – Norske

VI.Thou shall not forsake

time

R.D.Damhof – Prudenza BV - Copyright - 22 mei 2014R.D.Damhof – Oktober 2014 – Norske

Who am I - My Data ManifestoThe X commandments of data management

VIII.Thou shall uphold, improve

and teach the science and

practice of Information- &

data modeling

R.D.Damhof – Prudenza BV - Copyright - 22 mei 2014R.D.Damhof – Oktober 2014 – Norske

Who am I - My Data ManifestoThe X commandments of data management

IX.Thou shall Specify,

Standardise, Automate &

Productise

R.D.Damhof – Prudenza BV - Copyright - 22 mei 2014R.D.Damhof – Oktober 2014 – Norske

Who am I - My Data ManifestoThe X commandments of data management

X. Thou can not buy your

way out of the data

misery you are in

R.D.Damhof – Prudenza BV - Copyright - 22 mei 2014R.D.Damhof – Oktober 2014 – Norske

‘XI’There is a new saviour in town. Its name is Hadoop

and it calls to us from its mountain:

‘we got a lake and thou shall throw all your data in

it. The water will be clean so you can drink it, the

water will flow so it will irrigate your lands, grow

your stock, feed your kids and of course bring you

world peace…..’

Who am I - My Data ManifestoThe X commandments of data management

R.D.Damhof – Oktober 2014 – Norske

R.D.Damhof – Prudenza BV - Copyright - 22 mei 2014R.D.Damhof – Oktober 2014 – Norske

R.D.Damhof – Oktober 2014 – Norske

R.D.Damhof – Oktober 2014 – Norske

Logistics & Manufacturing

R.D.Damhof – Oktober 2014 – Norske

R.D.Damhof – Oktober 2014 – Norske

Push/Supply/Source driven Pull/Demand/Product driven

Mass deployment Control > Agility Validation of “ingredients”

Repeatable & predictable processes Standardized processes High level of automation Relatively high IT/Data expertise

Piece deployment Agility > Control Plausibility User-friendliness Relatively low IT expertise Domain expertise essential

All facts, fully temporal Truth, Interpretation, Context

Business Rules Downstream

The Data Push Pull Point

R.D.Damhof – Oktober 2014 – Norske

Systematic

Opportunistic

User & developer are separated Defensive Governance Focus on non-functionals Centralised Proper system development

User = developer Offensive governance Decentralised “System development” in production

The Development Style

R.D.Damhof – Oktober 2014 – Norske

Development

Style

Systematic

Opportunistic

I II

III IV

Research,

Innovation &

Design

“Shadow IT,

Incubation,

Ad-hoc,

Once off”

Push/Supply/Source driven Pull/Demand/Product driven

Data

Push/Pull

Point

ContextFacts

A Data Deployment Quadrant

R.D.Damhof – Oktober 2014 – Norske

6 Applications of the Quadrant

R.D.Damhof – Oktober 2014 – Norske

(1) How we produce

R.D.Damhof – Oktober 2014 – Norske

How we produce, process variants

R.D.Damhof – Oktober 2014 – Norske

Production-line: Data orientation

Data Products Information Products

Access to data

Analytical tools

Processing Power

Production-line: Forms orientation

Eg. XBRL/JSON

How we produce, production lines

Production-line: Poly Structured

R.D.Damhof – Oktober 2014 – Norske

(2) How we automate

R.D.Damhof – Oktober 2014 – Norske

Rephrased - somewhat more nerdy:

• Model-driven, metadata driven

Or

• Declarative instead of Imperative

Rephrased - somewhat more popular:

“In Data, the developer is the data modeller”

(2) How we automate

R.D.Damhof – Oktober 2014 – Norske

(3) How we organize

R.D.Damhof – Oktober 2014 – Norske

To centralize or to decentralize

R.D.Damhof – Oktober 2014 – Norske

(4) How do people excel

R.D.Damhof – Oktober 2014 – Norske

Storage: (R)DBMS

Processing: Automation Software

Data Quality: Validation, Profiling

Development: Data Modeling

Accessibility: Data Virtualization

Storage: Pattern based

Processing: Automation/limited ETL

Data Quality: DQ rules/dashboards

User tooling: Reporting, dashboards,

Data Visualization

Storage: Analytical

Processing: Preptools for Data Analyst

User tooling: Advanced Analytics,

Data Visualization

(5) How about Technology

R.D.Damhof – Oktober 2014 – Norske

(6) Business-,Information- or

Data Modeling is key

Conceptual

Logical

e.g Data Vault,

Anchor Model

e.g. Dimensional,

hierarchical,flat

At least the Logical Model

drives the technical data

architecture, design and

implementation

Ontology

Facts

Relational

Natural Language

R.D.Damhof – Oktober 2014 – Norske

Oh…data warehouse?

• DWH in Netherlands - since 2007 - have increasingly been split-up between

facts (Quadrant 1) and context (Quadrant 2).

• Quadrant 1 is morphing into an ‘Integrated (Meta)Data Environment’. An

holistic view on data. Not only accepting feeds from other apps, but being

the system-of-record for apps.

• At least integrated on the logical level, preferably on the conceptual level.

• The (meta)data(quality) is fiercely managed & governed centrally in orgs.

• Interestingly; quadrant II is becoming the classic - more Kimball style DWH,

but where conformity is implicit and (technically) data virtualisation is key.

• Central Data Competencies, Decentral (close to demand) BI Competencies