The Reality of Bigdata - #Beltech2014

Preview:

Citation preview

The Reality of Big Data

#beltech2014

#1 – What problem are you trying to solve?

Most of SME’s problems aren’t Big Data, it’s just data.

Without a question you are wasting your time.

#2 – Data will need cleaning

Roughly 80% of your data project will be getting the data into shape

before processing.

Btiany Spears

#3 – Hadoop, on it’s own, will NOT give you the answers.

#3 – Hadoop, on it’s own, will NOT give you the answers.

(The Big Data version of “putting it in the cloud”)

If anyone says, “will Hadoop just give us the answers” or “put it in the

cloud”, do this….

Spit on one, or both, of their feet and bite your thumb while shouting:

“The fig of Spain!”.

#4 – Do you actually need Hadoop?

A well crafted algorithm may give you more benefit.

It’s about knowing the right questions.

And refining and refining and refining…..

The first run won't work at allThe second only makes you wonder

The third will have you on your knees.....

#5 – Data changes

…especially when you don’t own it.

If you feel your data has value then retain it.

If your data passes over the “creepy line” then definitely retain it.

#6 – Skills are in short supply

Work with what you have.

Play with data, it’s the best way to learn.

Collaborate with others to fill the skills gaps.

Thank you

http://about.me/jasebell

@hadooping