Hadoop for the disillusioned

Page 1: Hadoop for the disillusioned

@wattsteve

Hadoop for the disillusioned Steve Watt, Red Hat

CC flickr rubenswieringa

Page 2: Hadoop for the disillusioned

Page 3: Hadoop for the disillusioned

Wired Magazine - July 2008

Page 4: Hadoop for the disillusioned

Hadoop in 2013

CC flickr lowfatbrains

Platform Layers and Technologies

Computational Runtimes: YARN, Giraph, MapReduce, HBase, Phoenix, Spark/BDAS, Drill, Impala, Stinger & more

FileSystems: Azure, CassandraFS, CephFS, CleverSafe, GlusterFS, GridGain, HDFS, Lustre, MapR FS, S3, SWIFT, Quantcast FS, Symantec VCFS & more

Infrastructures: System on a Chip, x86, Virtualization and Cloud

Distributions: Cloudera, Hortonworks, IBM, Intel, MapR, WanDisco

Page 5: Hadoop for the disillusioned

Source: Gartner Hype Cycle

Page 6: Hadoop for the disillusioned

CC flickr kakadu

Your data is growing beyond your ability to manage & query it

Page 7: Hadoop for the disillusioned

CC flickr martijnsnels

Save money when asking the same questions of your data

Page 8: Hadoop for the disillusioned

Geoffrey Moore’s Technology Adoption Lifecycle

Innovators | Early Adopters | CHASM | Early Majority | Late Majority | Laggards

Hadoop Customer, “Great, but now what?”

Page 9: Hadoop for the disillusioned

CC flickr cbcastro

Ask new questions and build data products

Page 10: Hadoop for the disillusioned

CC flickr birdwatcher63

• Ask your domain experts and LOB folks what unanswered questions they have.

• Where can you get the data you need to answer that question? (Domain experts should know where to get it.)

• Some of this data may be outside your organization (Social Media, Sensor Data, Data brokerages/Marketplaces, Web Pages) and some of it may be inside.

• If the data for the query doesn’t exist, figure out how to instrument or gather it.

• Pair your domain experts with your data engineers so they can work out how to obtain and massage the data given the types of queries desired.

Page 11: Hadoop for the disillusioned

CC flickr syume

• Building data products is a similar exercise except that it involves typical product planning, such as identifying a market.

• This is also a great way for an organization to explore what assets it has within its data.

Page 12: Hadoop for the disillusioned

Mapping the night sky

CC flickr bobfamiliar

Page 13: Hadoop for the disillusioned

CC flickr oxfam

Analyzing farm soil content to predict human conflict

Page 14: Hadoop for the disillusioned

CC flickr flodigrip

Crisis Management for the Chilean Earthquake

Page 15: Hadoop for the disillusioned

Thanks for listening

Steve Watt [email protected]