rediscoveringBI - CDM Media[P16]Bringing BI and Big Data Together three things that make a “big”...

rediscoveringBI

After the big dAtA pArty04April 2013

issue 7

radiant Advisors publication

The Big DaTahoneyMoon

Over AlreAdy?

Bi anD Big DaTa

bringing them tOgether

Big DaTa vs.DaTa ManageMenT

Bi's BigQUesTion

A zerO-sum scenAriO

hAs the bubble burst?

2 • rediscoveringBI Magazine • #rediscoveringBI rediscoveringBI Magazine • #rediscoveringBI • 3

editor in Chief Lindy Ryanlindy.ryan@radiantadvisors.com

ContributorDr. Barry Devlinbarry@9sight.com

ContributorKrish Krishnanrkrish1124@yahoo.com

ContributorJohn O’Brienjohn.obrien@radiantadvisors.com

Distinguished WriterStephen Swoyerstephen.swoyer@gmail.com

art DirectorBrendan Fergusonbrendan.ferguson@radiantadvisors.com

For More information:info@radiantadvisors.com

frOm the editOr“Big data” is being routinely paired with proportionally “big” descrip-

tors: innovative, revolutionary, and even (in this editor’s humble opinion,

the mother-of-all-big-descriptors) transformative. With the inexorable

momentum of cyber-journalism keeping it affixed atop industry head-

lines, big data has indeed earned itself quite the reputation, complete

with a stalwart following of industry pundits, papers, and conferences.

In fact, the whole big-data-thing has mutated into a sort-of larger-

than-life caricature of promise and possibility – and, of course, power.

Let’s face it: big data is the Incredible Hulk of BI -- gargantuan, brilliant,

and, yes, sometimes even a bit hyper-aggressive. For many the mild-

mannered, Bruce Banner-esque data analyst out there, big data is, for

better or worse, the remarkably-regenerative, impulsive alter ego of the

industry, eager to show with brute force just how much we can really do

with all our data – what the tangible extent of all that big data power

is, so to speak.

Yet, as with the inaugural debut of Stan Lee’s destructive antihero’s in

The Incredible Hulk #1, in early 2013 we still haven’t begun to really see

what big data can do yet. Not even close.

In this month’s edition of RediscoveringBI, authors Dr. Barry Devlin, Krish

Krishnan, Stephen Swoyer, and John O’Brien each explore different fac-

ets of that very construct, asking has the big data bubble actually burst,

or is its honeymoon phase just over? How much is just hype? And, how

much is only a precursor to what we’ll continue to see buzzing around

and reinventing the industry?

What’s next for BI’s Incredible Hulk antihero, Big Data?

Lindy R

rediscoveringBI April 2013, issue 7

SPOTLIGHT

[P4]The Honeymoon is Over for Big Data big data, it turns out, means precisely nothing and imprecisely anything you

want it to mean.

[By Dr. Barry Devlin]

[P16]Bringing BI and Big Data Together three

things that make a “big” difference when

implementing big data.

[By John o’Brien]

[P10] Twilight of the (DM) Idols big data is already a

disruptive force: at once democratizing,

reconfiguring, and destructive.

[By stephen swoyer]

[P8] Has the Big Data Bubble Burst? the bi industry

is abuzz with one new question: is big

data done?

[By Krish Krishnan]

EDITOR’S PICK

[P7] Big Data Revolution Are we becom-

ing no more than sentient founts

of data? mayer-schönberger and

cukier put the pulse back in the

big data conversation.

[By Lindy Ryan]

[P13] Big Impact: Big Data two use cases for big

data are having a big impact, at least

from a data management perspective.

[P15] A Kludge Too Far? [By stephen swoyer]

rediscoveringBI

After the big dAtA pArty04April 2013

issue 7

radiant Advisors publication

The Big DaTahoneyMoon

Over AlreAdy?

Bi anD Big DaTa

bringing them tOgether

Big DaTa vs.DaTa ManageMenT

Bi's BigQUesTion

A zerO-sum scenAriO

hAs the bubble burst?

Lindy Ryan

Editor in Chief

Radiant Advisors

feAtures

sidebAr

4 • rediscoveringBI Magazine • #rediscoveringBI

Evolution vs. RevolutionThis is an excellent, thought-provoking article. I believe

that you are correct in the assertion that an Architectural

Reckoning is underway. In fact, I believe it has been underway

for at least 10 years.

To focus on technology in general and Hadoop in particular is,

however, to miss the point. The Reckoning is being driven by

the intersection of business needs and technology advances.

Both sides can be summed up as “faster and smarter” – and

they are mutually reinforcing. I call this the “biz-tech ecosys-

tem”. On the technology side, Hadoop is and will be part of

it. So will a range of other data management technologies,

including relational databases, for sure. And I believe that

the various approaches – column, in-memory and more – will

be combined into a hybrid approach more powerful than

any RDBMS we have today. And we will need that, because

I am certain that the Data Warehouse – in a new, more cir-

cumscribed role, but central to consistency and reliability

for information that must be of high quality – will continue

to thrive. (And many thanks for the historical positioning of

Paul’s and my paper from 1988!) I call this new role “Core

Business Information.”

As you also pointed out, it’s not just about data management.

What is happening is also changing application development,

as well as process and business modeling and implementa-

tion. Collaborative and social computing are also vital com-

ponents of the mix. So, yes, an inter-disciplinary approach will

be needed – not just within IT but across the business – IT

divide.

We are also in somewhat of a positive feedback loop – and as

anyone who has ever put a microphone in front of speakers

knows, the result rapidly becomes very unpleasant. So, we do

need to step back from the hype of big data and recognize the

dangers as well as the opportunities.

My bottom line: yes, we are in a time of Architectural

Reckoning (is this the same as a Paradigm Shift?) but continu-

ity of thinking and a mindset of evolution rather than revolu-

tion are vital. I’m trying to capture this in my long-awaited (by

me, anyway) second book.

- Dr. Barry Devlin (Editor's Note: See The Honeymoon is Over for

Big Data by Dr. Barry Devlin in this month's issue)

Augmentation of Traditional DWsI totally agree with Claudia that “not all analytics now belong

inside the BI architecture” and that we are in a “very disrup-

tive period of a lot of new technologies flooding in to busi-

ness intelligence.” I also am not actually that far away from

the position Scott Davis takes: I agree that “Hadoop is a huge-

ly transformational technology.” I just think for the short- to

medium-term Hadoop et al are going to augment, rather than

replace, traditional data warehouses. Will Hadoop replace a

traditional data warehouse database in the long term? Only if

it adds a lot of database like features, and then the argument

becomes a lot less interesting – something akin to the “Will

Ingres/Informix/Sybase replace Oracle?” debate of yesteryear.

My main concern is how customers are going to embrace this

new data landscape, rather than if they are going to. How are

organizations going to build a data landscape that includes

Teradata, Aster, and Hadoop? How are they going to manage

Analysis Services cubes and a smattering of legacy Oracle

data warehouses?

Data warehouses currently take too long to build and are too

hard to change. The new architectural changes are going to

make things worse not better.

Yes, WhereScape does have a stake in the game –although

not in the status quo. Regardless of the platform, design and

technology the need to deliver quickly without compromise

remains the same. Who wants to manually build out a mul-

tiple platform data warehouse? A data warehouse automation

environment (such as WhereScape RED) helps simplify the

approach, and I believe is a key piece of the new architecture.

- Michael Whitehead (Editor's Note: Michael Whitehead is the

CEO and Founder of WhereScape)

Agile and flexible -- those might well be the mantras of Modern Data Platforms. As

organizations look to harness the latest advances in analytics and integration technolo-

gies, the focus turns quite sharply to architecture: the right data platform can empower

companies to harness everything from Big Data to real-time, all without sacrificing data

quality and governance.

Register for this free Webcast to catch a preview of SPARK!: Modern Data Platforms, a

three-day seminar series to be held in Austin, TX, from April 29 - May 1. The seminar

will feature a tag-team of experts from Radiant Advisors and The Bloor Group, who will

provide detailed instruction on the range of activities associated with modernizing and

evolving robust data platforms. John O'Brien of Radiant will focus on Rediscovering BI,

while Dr. Robin Bloor of The Bloor Group will discuss the Event-Driven Architecture.

Attendees of the Webcast will receive a discount code for $150 off the in-person seminar.

rediscoveringBI

shifting gEARs with ModERn bi ARchitEctuREs03MARch 2013

issuE 6

Radiant Advisors Publication

EvEnt-drivEn architEcturEs

thE shifting lAndscAPE

timE ofrEckoning

arE data modEls dEad?

An ARchitEctuRAl

thE REAl dEbAtE

collision couRsEsElEctingthE rightbi solution

tying goAls to REquiREMEnts

http://www.bigdatabootcamp.net

Don't miss Radiant Advisors' John O'Brien as he keynotes the upcom-

ing Big Data Boot Camp.

May 21-22New York

HiltonJohn will offer perspective into the

dynamics and current issues being

encountered in today's Big Data

analytic implementations as well

as the most important and strate-

gic technologies currently emerg-

ing to meet the needs of the "Big

Data Paradigm."

Join John and other Big Data

experts as they converge upon New

York and be sure to save an extra

$100 off the early bird rate by

using this link. Early bird registra-tion ends April 19.

Have something to say? Send your letters to the editor at lindy.ryan@radiantadvisors.com

opInIon

letters tO the editOr

upcoming Webinarinside AnalysisMoDeRn DaTa PLaTFoRMsinside Analysis with dr. robin bloor and John O'brienhosted by eric Kavanagh

April17

3:00pM Cst

RegisteR Now

On: Time for An Architectural Reckoning

yes, we are in a time of Architectural reckoning but continuity of think-ing and a mindset of evolution rather than revolution are vital."

- dr. barry devlin

f o l l o w t h e

c o n v e r s a t i o n # s p a r k e v e n t

IG DATA IS tumbling into the

“Trough of Disillusionment,”

according to Gartner’s Svetlana

Sicular. If you fear that this

means the end of the road for big data,

see Mark Beyer’s (co-lead for Gartner

big data research) remedial education

on the meaning of Gartner’s Hype Cycle

curve, although they might have chosen

a less alarmist phrase!

Let me put it another way: the big data

honeymoon is over. Let’s quickly review

the history of the romance before look-

ing to the future of the relationship.

For commercial computing, big data

“dating” really began in the mid-2000s,

when technical people in the bur-

geoning web business began to con-

sider new ways to handle the exploding

amounts and types of data generated

by web usage. Before then, big data

had been the dream -- or nightmare,

actually -- of the scientific community

where, from genetics to astrophysics,

instrumentation was spewing data. In

early 2008, the commercial romance

of big data really began to get serious

when Hadoop, the yellow poster ele-

phant child of big data, was accepted as

a top-level Apache project. The market-

ing paparazzi began stalking the couple

soon after and, true to paparazzi nature,

have been publishing a stream of outra-

geous claims and pictures ever since. By

2012, a shotgun wedding with business

the is Over fOr big dAtA

spotlIght

dr. bArry devlin

was hastily arranged. By then the gloss had begun to wear off

and the honeymoon was washed out in a brief trip to Atlantic

City at the height of a super storm.

Enough Of The Past: Let’s Look Forward!Big data does offer real and realizable business benefits,

but there is one major issue: what actually is big data? The

“volume, variety, and velocity” nomenclature, claimed by Doug

Laney from a 2001 Meta Group research note, is useful short-

hand at best. In reality, each attribute opens up a question of

how far on any scale must data be in order to be called big

-- how vast, how disparate, how fast? Furthermore, what combi-

nation of these three factors should be used in making a call?

Big data, it turns out, means precisely nothing and imprecisely

anything the Mad Men want it to mean. And, with the various

additional “v-words” vaunted by vendors, the value vanishes.

(Oops, I veered into the v-v-verge there!)

The extent of this terminology problem was made clear in

a big data survey conducted last fall by EMA and myself.

Participants were those who declared they were investigat-

ing or implementing big data projects yet almost a third of

respondents classed the data source for their projects as

“process-mediated data” -- data originating from traditional

operating systems. My conclusion: the term big data has

passed its use-by date.

Big data and “small data” are conceptually one and the same:

just data, all data. Or, to be more semantically correct, all

information, as I’ll explain in a new book later this year. (Editor's

Note: Business Unintelligence: Via Analytics, Big Data and

Collaboration to Innovative Business Insight will be published in

Q3 2013 by Technics Publications).

To be clear, I don’t consider that big data has taken us into

a dead end. Rather, it has usefully exposed the fact that our

traditional business intelligence (BI) view of the information

available to and used by business is woefully inadequate. It

has caused me to revisit many underlying assumptions about

information and I now see that there exist three domains of

information that future business intelligence/analytics must

handle, as shown in in the accompanying figure: human-

sourced information, process-mediated data, and machine-

generated data. These domains are fundamentally different in

their usage characteristics and in the demands they place on

technology. The terms are largely self-explanatory, but more

information can be found in my white paper. (See Barry Devlin’s

The Big Data Zoo - Taming the Beasts: The need for an integrated

platform for enterprise information).

The bottom line is that we need a new architecture for infor-

mation -- all of it and its entire life cycle in business.

The Biz-Tech EcosystemBoth challenges and opportunities emerge as we shift the

view from IT to business.

The biggest challenge in the big data/analytics scene is the

alleged dearth of so-called “data scientists.” How different

are data scientists from the power users we’ve known in BI

for decades? Arguably, the only substantive difference is deep

statistical skill. The other characteristics mentioned -- data

munging, business acumen, and storytelling -- are all com-

mon to power users. Statistics, however, is a very specialized

skill that should, in principle, be tightly supervised to ensure

valid and proper application. The phrase “lies, damn lies, and

statistics” indicates the problem: statistics are far too easy

to misuse -- deliberately or otherwise. Moreover, we seem

to have blindly accepted an assertion that the exponential

growth in data volumes implies a similar growth in hidden

nuggets of useful business knowledge. This is unlikely to be

true. Most of the good examples of business value coming

from big data illustrate this. Real value emerges from a new

type or new combination of data; growth in volumes leads to

incremental increases in value, at best.

These challenges aside, a focus on novel (big) data use does

big data, it turns out, means precisely nothing and imprecisely anything”“[The term big data has passed its use-by date]

drive opportunities for new businesses, business models, or,

simply, ways to compete. A useful, cross-industry categoriza-

tion (courtesy of IBM) of these opportunities is:

• Big Data Exploration: analyze “big data” to identify new

business opportunities

• Enhanced 360° View of the Customer: incorporate human-

sourced information sources, such as call center logs and

social media, into traditional CRM approaches

• Security and Intelligence Extension: lower risk, detect fraud,

and monitor cyber security in real-time, machine-generated

• Operations Analysis: analyze and use machine-generated

data to drive immediate business results

• Data Warehouse Augmentation: increase operational effi-

ciency by integrating big data with BI

This focus on (big) data is but the latest stage in the evolu-

tion of what I call the biz-tech ecosystem -- the symbiotic

relationship between business and IT that drives all suc-

cessful, modern businesses. Every business advance worth

mentioning in the past twenty years has had technology, and

almost always information technology, at its core. On the other

hand, much of the advances in IT have been driven by busi-

ness demands. The relative roles of business and IT people

may change as the process evolves, but that process is set to

continue. And, at its heart are the collection, creation, and use

of information, as opposed to data -- big or small -- as manda-

tory, core competencies of modern business. Share your comments >

...at its heart are the collection, creation, and

use of information, as opposed to data -- big

or small -- as mandatory, core competencies of

modern business.”

Dr. Barry Devlin is Founder and Principal

of 9sight Consulting, and is among the

foremost authorities on business insight

and big data. He is a widely respected

analyst, consultant, lecturer, and author.

HILE IT’S INARGUABLE that the phenomenon

known as “big data” is rapidly reknitting the very

fabric of our lives, what we are just now begin-

ning to see and to understand – to appreciate

– is how.

Yet, so often our conversations about big data focus on these

“how’s” in the abstract – on its benefits, potentials, and oppor-

tunities, and likewise, its risks, challenges, and implications –

that we overlook the simpler, more primordial question: what’s

not changing?

It’s a simple question that requires a simple answer. Us. Sure,

we can assert that we’re becoming more data-dependent. We

generate more data: last month, social media giant Twitter

blogged1 that its over 200-million active users generate

over 400-million tweets per day. We consume more data: a

now-outdated University of California report2 calculated that

American households collectively consumed 3.6 zettabytes of

information in 2008. Are we – the data-generating organisms

that we are – becoming no more than sentient founts of data?

In Big Data: A Revolution That Will Transform How We Live,

Work, and Think, authors Viktor Mayer-Schönberger and

Kenneth Cukier effectively put the pulse back in the Big Data

Conversation: “big data is not an ice-cold world of algorithms

and automatons. . .we [must] carve out a place for the human:

to reserve space for intuition, common sense, and serendipity

to ensure that they are not crowded out by data and machine-

made answers.”

In our brief email exchange, Mayer- Schönberger elaborated

a bit more on this idea. “[We] try to understand the (human)

dimension between input and output,” he noted. “Not through

the jargon-laden sociology of big data, but through what we

believe is the flesh and blood of big data as it is done right

now.”

With the elegance of an Oxford University professor and

The Economist’s data editor – Mayer-Schönberger and Cukier,

respectively – Big Data’s authors remind us that it is our

human traits of “creativity, intuition, and intellectual ambi-

tion” that should be fostered in this brave new world of

big data. That the inevitable “messiness” of big data can

be directly correlated to the inherent “messiness” of being

human. And, most important, that the evolution of big data

as a resource and tool derives from (is a function of) the dis-

tinctly human capacities of instinct, accident, and error, which

manifest, even if unpredictably, in greatness. In that greatness

is progress.

That – progress – is the intrinsic value of big data. It’s what’s

so compelling about Big Data (both the book and the thing

itself): it’s not always about the inputs or outputs, but the

space – or, what Mayer-Schönberger calls the “black box,” of

in-between.

1 http://blog.twitter.com/2013/03/celebrating-twitter7.html2 How Much Information? http://hmi.ucsd.edu/howmuchinfo.php

Lindy Ryan is Editor in Chief

of Radiant Advisors.

lindy ryAn

Big Data is available on Amazon and

the Radiant Advisors eBookshelf

www.radiantadvisors.com/ebookshelf

editOr’s picK

big dAtA

Share your comments >

1. Build the business case and keep it simple

2. Create a data discovery environment that can be used

by line of business experts

3. Identify the data and patterns that are needed to

create a robust foundation for analytics

4. Create the initial analytics based on the data discovery

5. Visualize the data in a mash-up platform using

semantic data integration techniques

6. Get the business users to use the outcomes

7. Gain adoption of the users

8. Create a roadmap for the larger program

ECENT ARTICLES IN leading business publications, a

hype-cycle presentation by Gartner, and a number of

blogs have all startled the world of big data by asking

one “big” question: are we done? Did the big data

bubble burst even quicker than the “dot com” bubble?

Has the big data bubble burst?

The answer is: not really. If anything, the market for infra-

structure is booming with more vendors distributing com-

mercial versions of open source software (like Hadoop and

NoSQL). We are seeing the evolution of new consulting

practices focused on analytics and – perhaps most impor-

tant – traditional database vendors have all either embraced

or announced support for big data platforms. So, what is the

basis of this notion of failure or disappointment around the

big data space?

The Promised LandIn 2004, Google’s announcement of the general availability of

MapReduce and Google File System started a flurry of activ-

ity building platforms aimed at solving scalability problems.

One of these projects was “Nutch,” a parallel search engine

on the open source platform. The team at Nutch succeeded

in building the infrastructure that attracted Yahoo to sponsor

and incubate the project under its commercial name: Hadoop.

Submitted to open source in 2009, Hadoop quickly gained

notoriety as the panacea for all data scalability problems.

Since then it has become a viable platform for large-scale

computing needs and has been adopted as a data storage

and processing platform at many companies across the world.

Subsequently, the last four years have also seen the evolution

of NoSQL databases and multiple other additional technolo-

gies on the Hadoop framework.

The RealityHadoop’s early adopters did not fully understand the com-

plexities of the platform until they began implementing the

technology, and this lack of understanding inevitably has

spurred a sense of failure (or disappointment).

Among the potential gaps not understood clearly by adopters:

One size does not fit all: Big data technologies were devel-

oped to solve the problems of extreme scalability and sus-

tained performance. While these technologies have certainly

overcome the traditional limitations of database-oriented

data processing, the same techniques cannot be directly

extended to solve problems in the same realm.

MapReduce skill availability: To effectively use most of the

big data platforms one has to be able to write some amount

of Map Reduce code; however, this is an area where skills are

evolving and (still) scarce.

Programming dependence: Many corporations are unable to

adjust to the idea of having teams design and develop code

(or data processing) – much like application software devel-

opment. Standardization of programming techniques for big

data are still maturing.

Business case: Most early adopters did not have a robust

business case, or, in many cases, the right business case to

implement on these platforms. The lack of an end-state solu-

tion -- or usage and ROI expectations -- has led to longer

development and implementation cycles.

Hype: Continued hype about the technology has caused

unrest amongst executives, line of business owners, IT, and

business users, leading to often misunderstood capabilities

of the platform as well as incorrect ROI or TCO expectations.

But wait: it is not “all over” when we talk about big data,

rather we have come to the point in time where the reality of

the platform – and how to drive its adoption within corpora-

tions – has started settling down. The big data bubble is well

and alive; in fact, it’s even progressing in the right direction.

How to Integrate Big Data As corporations begin to see beyond the hype of big data,

everyone from the executive sponsor to the implementa-

tion team is beginning to recognize the need to dig a better

foundation for integrating big data. There are a few subtle yet

invaluable pointers in this process:

Features

hAs the Big DaTa BUBBLe burst?[The BI industry is abuzz with one new question: is big data done?]

Krish KrishnAn

While the overall process of big data integration seems

closely aligned to the integration of any other project, there

are key differences that can define the success of the big data

bubble in your corporation: data discovery, data analysis, and

data visualization. These three integral pillars will clearly

identify the basis of how to implement big data and monetize

such an exercise.

The FutureSeveral technology providers have announced their support

of big data platforms, including Datastax (Cassandra), Intel,

Microsoft, EMC and HP (Hadoop), 10Gen (Mongo DB), and

Cray (YARC Graph Analytics DB). These vendors -- along with

existing vendors -- will undoubtedly continue to provide more

options and solution platforms for deploying and integrating

big data technologies within the enterprise platform.

The big data bubble has not busted; it is still only begin-

ning and will be reaching various levels of maturity over the

following years. There are many layers of complexities and

intricacies that need to be defined and formalized, but this is

where the evolution and opportunities exist.

Krish Krishnan is a globally recognized

expert in the strategy, architecture, and

implementation of big data. His new

book Data Warehousing in the Age of Big

Data will be released in August 2013.

the big data bubble is well and alive; in fact, it’s even progressing in the right direction."

OME IN THE INDUSTRY ARE already writing epitaphs

for big data. Others – a prominent market watcher

comes to mind – argue that big data, like so many

technologies or trends before it, is simply conforming

to well-established patterns: following a period of hype, it’s

undergoing a correction. It’s regressing toward a mean.

That was fast.

This doesn’t concern us. Big data is an epistemic shift. It’s

going to transform how we know and understand — how we

perceive — the world. What’s meant by the term “big data” is

a force for destabilizing and reordering existing configura-

tions – much as the Bubonic Plague, or Black Death, was for

the Europe of the late-medieval period. It’s an unsettling anal-

ogy, but it underscores an important point: the phenomenon

of big data, like that of the Black Death, is indifferent to the

hopes, prayers, expectations, or innumerate prognostications of

human actors. It’s inevitable. It’s going to happen. It’s going to

change everything.

Even as the epitaphs are flying, the magic quadrants being

plotted, and the opinions mongering, big data is changing

(chiefly by challenging) the status quo. This is particularly the

case with respect to the domain of data management (DM) and

its status quo. Here, big data is already a disruptive force: at

once democratizing, reconfiguring, and destructive. We’ll con-

sider its reordering effect through the prism of Hadoop, which,

in the software development and data management worlds,

has to a real degree become synonymous with what’s meant

by “big data.”

[Big Data Vs. Data Management]

TwilighT of The (DM) idolsstephen sWOyer

The Citadel of Data ManagementBig data has been described as a wake-up call for data man-

agement (DM) practitioners.

If we’re grasping for analogies, the big data phenomenon

seems less like a wake-up call than.. .a grim tableau straight

out of 14th France.

This was the time of the Black Death, which was to function as

an enormous force for social destabilization and reordering. It

was also the time of the Hundred Years War, which was fought

between England and France on French soil. The manpower

shortage of the invading English was exacerbated by the viru-

lence of the Plague, which historians estimate killed between

one- to two-thirds of the European population. Outmanned

– and outwoman-ed, for that matter, once Joan D’Arc abrupted

onto the scene – the English resorted to a time-tested tactic:

the chevauchée. The logic of the chevauchée is fiendishly

simple: Edward III’s English forces were resource-constrained;

they enjoyed neither the manpower nor the defensive advan-

tages – e.g. , castles, towers, or city walls – that accrued (by

default) to the French. The English achieved their best out-

comes in pitched battle; the French, on the other hand, were

understandably reluctant to relinquish their fortifications,

fixed or otherwise.

The challenge for the English was to draw them out to fight.

Enter the chevauchée. It describes the “tactic” of rampag-

ing and pillaging – among other, far more horrific practices

– in the comparatively defenseless French countryside. Left

unchecked, the depredations of the chevauchée could ulti-

mately comprise a threat to a ruler’s hegemony: fealty counts

for little if it doesn’t at least afford one protection from other

would-be conquerors.

As a tactical tool, the chevauchée succeeded by challenging

the legitimacy of a ruling power.

Hadoop has had a similar effect. For the last two decades,

the data management (DM) or data warehousing (DW) Powers

That Be have been holed up in their fortified castles, dictating

terms of access – dictating terms of ingest; dictating time-

tables and schedules, almost always to the frustration of the

line of business, to say nothing of other IT stakeholders.

Though Hadoop wasn’t conceived tactically, its adoption and

growth have had a tactical aspect.

By running amok in the countryside, pillaging, burning, and

destroying stuff – or, by offering an alternative to the data

warehouse-driven BI model – the Hottentots of Hadoop have

managed to drag the Lords of DM into open battle.

At last year’s Strata + Hadoop World confab in New York, NY,

a representative with a prominent data integration (DI) ven-

dor shared the story of a frustrated customer that it says had

developed – perforce – an especially ambitious project focus-

ing on Hadoop.

The salient point, this vendor representative indicated, was

that the business and IT stakeholders behind the project saw

in Hadoop an opportunity to upend the power and authority of

the rival DM team. “It’s almost like a coup d’etat for them,” he

said, explaining that both business stakeholders and software

developers were exasperated by the glacial pace of the DM

team’s responsiveness. “[T]hey asked how long it would take to

get source connectivity [for a proposed application and] they

were told nine months. Now they just want to go around them

[i.e. , the data management group],” this representative said.

“[T]hey basically want Hadoop to be their new massive data

warehouse.”

The Zero-Sum ScenarioThis zero-sum scenario sets up a struggle for information

management supremacy. It proposes to isolate DM altogether;

eventually it would starve the DM group out of existence. It

views DM not as a potential partner for compromise, but as a

zero-sum adversary.

It’s an extremist position, to be sure; it nevertheless brings into

focus the primary antagonism that exists between software-

development and data-management stakeholders. This antag-

onism must be seen as a factor in the promotion of Hadoop as

a general-purpose platform for enterprise data management.

Hadoop was created to address the unprecedented challenges

associated with developing and managing data-intensive

distributed applications. The impetus and momentum behind

Hadoop originated with Web or distributed application devel-

opers. To some extent, Hadoop and other big data technology

projects are still largely programmer-driven efforts. This has

implications for their use on an enterprise-wide scale, because

software developers and data management practitioners have

very different worldviews. Both groups are accustomed to talk-

ing past one another. Each suspects the other of giving short

shrift to its concerns or requirements.

big data is an epistemic shift. it’s going to transform how we know and understand — how we perceive — the world.”“

Features

John O’Brien

Founder and CEO

Radiant Advisors

Dr. Robin Bloor Co-Founder and Principal Analyst

The Bloor Group

14 • rediscoveringBI Magazine • #rediscoveringBI

Get directions

In short, both groups resent one another. This resentment

isn’t symmetrical, however; there’s a power imbalance. For a

quarter century now, the DM group hasn’t just managed data

-- it’s been able to dictate the terms and conditions of access

to the data that it manages. In this capacity, it’s been able to

impose its will on multiple internal constituencies: not only

on software developers, but on line-of-business stakehold-

ers, too. The irony is that the per-

ceived inflexibility and unrespon-

siveness – the seeming indifference

– of DM stakeholders has helped to

bring together two other nominally

antagonistic camps; in their resent-

ment of DM, software developers

and the line of business have been

able to find common cause.

Few would deny that stakeholders

jealously guard their fiefdoms. This

is as true of software developers

and the line of business as it is of

their counterparts in the DM world.

Part of the problem is that DM

is viewed as an unreasonable or

uncompromising stakeholder: e.g. ,

DM practitioners have been unable

to meaningfully communicate the

logic of their policies; they’ve like-

wise been reluctant – or in some cases, unwilling – to revise

these policies to address changing business requirements. In

addition, they’ve been slow to adopt technologies or meth-

ods that promise to reduce latencies or which propose to

empower line-of-business users. Finally, DM practitioners are

fundamentally uncomfortable with practices – such as ana-

lytic discovery, with its preference for less-than-consistent

data – which don’t comport with data management best

practices.

Hadoop and Big Data in ContextThat’s where the zero-sum animus comes from. It explains

why some in business and IT

champion Hadoop as a technology to replace – or at the very

least, to displace – the DM status quo. There’s a much more

pragmatic way of looking at what’s going on, however.

This is to see Hadoop in context – i.e. , at the nexus of two

related trends: viz. , a decade-plus, bottom-up insurgency,

and a sweeping (if still coalescing) big data epistemic shift.

The two are related. Think back to the Bubonic Plague, which

had a destabilizing effect on the late-Medieval social order.

The depredations of the Plague effectively wiped out many

of the practices, customs, and (not to put too fine a point on

it) human stakeholders that might otherwise have contested

destabilization.

The Plague, then, cleared away the ante-status quo, creating

the conditions for change and transformation. Big data has

had a similar effect in data management – chiefly by raising

questions about the warehouse’s ability to accommodate

disruptions (e.g. , new kinds of data and new analytic use

cases) for which it wasn’t designed. Simply by claiming to

be Something New, big data raised questions about the DM

status quo.

This challenge was exploited by

well-established insurgent cur-

rents inside both the line of busi-

ness and IT. The former has been

fighting an insurgency against IT

for decades; however, in an age

of pervasive mobility, BYOD, social

collaboration, and (specific to the

DM space) analytic discovery, this

insurgency has taken on new force

and urgency.

IT, for its part, has grappled with

insurgency in its own ranks: the

agile movement, which most in

DM associate with project manage-

ment, began as a software develop-

ment initiative; it explicitly bor-

rowed from the language of politi-

cal revolution – the seminal agile

document is Kent Beck’s “Manifesto

for Agile Software Development,” published in 2001 – in

championing an alternative to software development’s top-

down, deterministic status quo.

Agility and insurgency have been slower to catch on in DM.

Nevertheless, insurgent pressure from both the line of busi-

ness and IT is forcing DM stakeholders (and the vendors who

nominally service them) to reassess both their strategies and

their positions.

However far-fetched, the possibility of a Hadoop-led chevau-

chée in the very heart of its enterprise fiefdom – with aid

and comfort from a line-of-business class that DM has too

often treated more as peasants than as enfranchised citizens

– snagged the attention of data management practitioners.

Big time.

ReinventionThe Hadoop chevauchée got the attention of DM practitio-

ners for another reason.

In its current state, Hadoop is no more suited for use as a

general-purpose, all-in-one platform for reporting, discovery,

and analysis than is the data warehouse. (See Sidebar: A Kludge Too Far?)Given the maturity of the DW, Hadoop is arguably much less

suited for this role. For all of its shortcomings, the data ware-

house is an inescapably pragmatic solution; (Contiued p21)DM practitioners learned what works chiefly by figuring out

Day One | Designing Modern Data PlatformsThese sessions provide an approach to confidently assess and make architecture changes, beginning with an understanding

of how data warehouse architectures evolve and mature over time, balancing technical and strategic value delivery. We break

down best practices into principles for creating new data platforms.

Day Two | Modern Data IntegrationThese sessions provide the knowledge needed for understanding and modeling data integration frameworks to make confident

decisions to approach, design, and manage evolving data integration blueprints that leverage agile techniques. We recognize

data integration patterns for refactoring into optimized engines.

Day Three | Databases for AnalyticsThese sessions review several of the most significant trends in analytic databases challenging BI architects today. Cutting through

the definitions and hype of big data in the market, NoSQL databases offer a solution for a variety of data warehouse requirements.

Register now at: http://radiantadiantadvisors.com

CAN'T MAKE IT? Catch us in San Francisco from May 28-30. Registration opens April 22nd. Use the priority code ReBI to save $150

At the Omni Downtown in Austin

AUsTiN, TXApril 29 - MAy 1

#sparkevent

rediscoveringBI - CDM Media[P16]Bringing BI and Big Data Together three things that make a “big”...

Documents

p10 QROPS p16 Funds Aug/Sept/Oct GIBRALTAR Aug-Sept-Oct 2012.pdf · p10 QROPS p16 Funds Aug/Sept/Oct 2012 ... and allied services ... EUROPA TRUST COMPANY LIMITED Tel: + (350) 200

P16, P16 AVC, P20, P24 - Technical Tool Solutions

Sustaining Effective P16 Community Engagement Councils P16 ... · Sustaining Effective P16 Community Engagement Councils October 25, 2018 ... MS Public School Accountability Standards

· ALUMINIUM WINDOWS p10-11 SLIDING SASH WINDOWS p12-13 TILT & TURN WINDOWS p14-15 WINDOW OPTIONS p16-21 ... MTC Qualified Installers Local Family run company

Machine Controller and AC Servo Drive Solutions Catalog · P17 P17 P16 P15 P14 P14 17 P P10 Developers and designers Developers and designers Manufacturers Operators Maintenance staff

Chattanooga State Building & Parking Map · P10 P2 P1 P3 P5 P4 P6 P7 P8 P9 P16 23 21 22 Academic and Administrative Buildings Parking Lots: One-way Traffic Cafeterias/Grills CBIH

p16 Polishasdfed

P4 · 2019. 5. 7. · P4 The Botanical Expertise Pierre Fabre Approach P10 Indicators P6 The four founding principles P6 P8 P12 P14 Innovate Guarantee Preserve Respect P16 Twelve

Strand - Home - Glenmore Park Learning · Web viewP7, P8, P9 Part 2 Recall and use multiplication facts up to 10 × 10 with automaticity MD S2 P10, P15, P16, P17 Relate multiplication

Big Maxiflex P10 brochure

HIGH POINT COMMUNITY FOUNDA TION · SAY YES TO EDUCATION Scholarships & School Success P5 ANNUAL GRANTS Improving Lives Together P10-11 JACK SLANE Three Things I Learned P16 SUMMER

BRANDBOOK 2018 - abrazi.com · 7 Pictured: O9-CRRH-H-SINI Chique Collection Earrings p8 – Fine & Flair p10 – Notable p12 – Uptown p14 – Prime p16 – Stunning p18 – Glimmer,

State of Forest Tree Improvement and related activities in ...nordicforestresearch.org/healgencar/wp-content/uploads/sites/8/201… · p p td18 e15 p16 jm10 a4 p10 v7 11 vr46 v6 jm10

The Leading Reference for technology-based products ... · Playing to win, Blue Ocean strategy and other tools p10 Managing the whole product set p16 More than a vision - it has to

STEWARDSHIP REPORT...China’s green electricity revolution P08 What happens when politicians move fast and break things P10 Plant power P16 Eden bonds P18 Change is a consistent theme

169 P5 P6 P8 8 P8 P10 2 3 P3 P5 P6 P6 P7 P8 P9 P10 P10 P11 P12 P13 P13 P14 P15 P16 P19 P20 P14 P15 P16 P17 P18 P1 P2 P3 P5 P5 P6 P6 P7 P4 P5 P6 P8 P8 P8 P10 P11 P12 P13 P3 P5 P6 P6

Propeller Board of Education 32900 A - Parallax, Inc. · 2012-09-25 · p3 p4 p5 p6 p7 p8 p9 p10 p11 p12 p13 p14 p15 p0 p1 p2 p3 p4 p5 p6 p7 p8 p9 p10 p11 p12 p13 p15 p16 p17 p18

Financial Results for Fiscal 2015, the Year Ended … · Financial Results for Fiscal 2015, ... P6. P7 . P8 . P9 . P10 . P11 . P12 . P13 . P14 . P15 . P16 . P17 . ... Interim Dividend

p10 National Sewa Day Vishwa Sangh Shibir - HSS UKhssuk.org/wp-content/uploads/2014/05/ss_jan_mar2011_web.pdf · p10 National Sewa Day Vishwa Sangh Shibir p16 Experiences Sangh Khel

#13 #14 p1 p2 p3 p4 p5 p6 p7 p8 p9 p10 - RECORDS … p6 p7 p8 p9 p10 p11 p12 p13 p14 p15 p16 p17 p18 p19 p20 Title colors_re Created Date 8/12/2015 5:14:56 PM