Page 1 © Hortonworks Inc. 2011 – 2015. All Rights Reserved

Comprehensive Analytics on the Hortonworks Data Platform

We do Hadoop.

Back to 2005…

Vertical Scaling

[Figure: a single server with RAM, CPU, and Storage, shown across three slides]

Horizontal Scaling

[Figure: a cluster of servers, each with its own RAM, CPU, and Storage, shown across four slides]

Self Healing System
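
One way to read "self healing" in a horizontally scaled cluster: HDFS keeps several replicas of each block (three by default) and re-creates missing replicas on surviving nodes when a node is lost. The following is a minimal Python sketch of that idea, not Hadoop code. The function names, node names, block names, and the random placement policy are all hypothetical and chosen only for illustration.

```python
import random

def place_blocks(blocks, nodes, replication=3, seed=0):
    """Assign each block to `replication` distinct nodes (toy placement policy)."""
    rng = random.Random(seed)
    return {block: set(rng.sample(nodes, replication)) for block in blocks}

def heal(placement, nodes, failed, replication=3, seed=1):
    """Drop the failed node and copy under-replicated blocks to surviving nodes."""
    rng = random.Random(seed)
    live = [n for n in nodes if n != failed]
    healed = {}
    for block, holders in placement.items():
        survivors = holders - {failed}
        # Nodes that are alive and do not yet hold this block.
        candidates = [n for n in live if n not in survivors]
        missing = replication - len(survivors)
        survivors |= set(rng.sample(candidates, min(missing, len(candidates))))
        healed[block] = survivors
    return healed

# Hypothetical five-node cluster holding three blocks.
nodes = [f"node{i}" for i in range(1, 6)]
placement = place_blocks(["blk_a", "blk_b", "blk_c"], nodes)
healed = heal(placement, nodes, failed="node2")
```

After `heal` runs, every block is back at three replicas and none of them lives on the failed node, without any operator action.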

Hadoop 1.0

[Figure: a cluster of nodes 1 to N with two layers: HDFS (Hadoop Distributed File System) and MapReduce]
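
To make the MapReduce layer concrete, here is a minimal single-process sketch of the map, shuffle, and reduce steps using the classic word-count example. It is plain Python rather than the Hadoop API; the function names and sample input are assumptions made for illustration, and a real job would run the map and reduce steps in parallel on the nodes that hold the data in HDFS.

```python
from collections import defaultdict
from itertools import chain

def map_phase(line):
    """Map: emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]

def shuffle(pairs):
    """Shuffle: group all emitted values by key, as the framework would."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups

def reduce_phase(key, values):
    """Reduce: collapse the values for one key into a single count."""
    return key, sum(values)

def word_count(lines):
    """Run map, shuffle, and reduce in sequence over an iterable of lines."""
    mapped = chain.from_iterable(map_phase(line) for line in lines)
    return dict(reduce_phase(k, v) for k, v in shuffle(mapped).items())

counts = word_count(["we do hadoop", "We do analytics"])
```

Because each line is mapped independently and each key is reduced independently, both steps can be spread over many nodes.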

Hadoop 2.0

[Figure: architecture diagram with the following labels]

SOURCES: Clickstream, Web & Social, Geolocation, Sensor & Machine, Server Logs, Unstructured

Existing Systems: ERP, CRM, SCM

ANALYTICS: Data Marts, Business Analytics, Visualization & Dashboards

ANALYTICS: Applications, Business Analytics, Visualization & Dashboards

Workloads: Batch, Interactive, Real-Time, Partner ISV

YARN: Data Operating System

HDFS (Hadoop Distributed File System)

Other labels: EDW, MPP

Hortonworks Data Platform 2.2

[Figure: platform component diagram with the following labels]

GOVERNANCE: Apache Falcon, Apache Sqoop, Apache Flume, Apache Kafka

BATCH, INTERACTIVE & REAL-TIME DATA ACCESS: Apache Pig, Apache Hive, Cascading, Apache HBase, Apache Accumulo, Apache Solr, Apache Spark, Apache Storm

SECURITY: Apache Ranger, Apache Knox, Apache Falcon

OPERATIONS: Apache Ambari, Apache ZooKeeper, Apache Oozie

YARN: Data Operating System (Cluster Resource Management)

HDFS (Hadoop Distributed File System)

Hortonworks: Hadoop for the Enterprise

We Do Hadoop

Winter 2015, Version 1.1

Who we are

2005: Apache Hadoop at Yahoo!

2011: Inception of Hortonworks

24: Developers and Architects

900+: Employees

100%: Renewal Rate

5 out of 5: Support Score*

32,000: Number of Nodes at Yahoo!

30+: Migrations

300+: Customers

600+: Partner

* The Forrester Wave Big Data Hadoop Solutions Q1 2014

Why SAS?

IN-MEMORY

HIGH-PERFORMANCE

ANALYTICS

BUSINESS INTELLIGENCE

DATA VISUALIZATION

DATA MANAGEMENT

SAS can work with Hadoop, lifting data into a purpose-built, in-memory advanced analytics environment

SAS can treat Hadoop like any other data source, pulling data from Hadoop when it is most convenient

SAS can work directly in Hadoop, leveraging the distributed processing capabilities of Hadoop

SAS is the only vendor that supports all of these methods

SAS accesses and extracts data from Hadoop to a SAS server for processing, and writes results back.

Bridge to traditional SAS environments

Hadoop treated as just “another data source”

Performance limited to single pipe bandwidth

DATA MOVEMENT

SAS + from Hortonworks
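
The "single pipe bandwidth" limit can be illustrated with back-of-the-envelope arithmetic. The Python sketch below estimates how long it takes to move a data set over one network link; the data volume and link speeds are hypothetical figures chosen for illustration (they do not come from the presentation), and the calculation ignores protocol overhead, compression, and disk speed.

```python
def transfer_hours(data_gb, link_gbit_per_s):
    """Hours needed to move `data_gb` gigabytes over one link of the given speed."""
    gigabits = data_gb * 8              # 1 byte = 8 bits
    seconds = gigabits / link_gbit_per_s
    return seconds / 3600

# Hypothetical example: 10 TB (10,000 GB) extracted to a single SAS server.
over_1_gbit = transfer_hours(10_000, 1)     # about 22.2 hours
over_10_gbit = transfer_hours(10_000, 10)   # about 2.2 hours
```

Under these assumptions, transfer time grows linearly with data volume regardless of how many nodes hold the data, which is why this pattern suits smaller extracts.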

SAS accesses and processes Hadoop data on SAS servers while keeping the data and computations massively parallel.

Supports advanced analytics via shared computing

Allows the scaling of data storage and analytics separately

Ideal when analytical rigor, sophistication and governance are required

DATA LIFT INTO MEMORY

SAS + with Hortonworks

SAS processes data directly in the Hadoop cluster.

SAS LOGIC

SAS Embedded Process enables scalable SAS compute in Hadoop

SAS compute is orchestrated via Hadoop technology (YARN)

Data manipulation, data quality, and scoring support

Ideal when all data is landing in Hadoop, and Hadoop is the proper place for processing

SAS + in Hadoop

About Rogers Media

- Great Brands

- Media advertising revenue a priority

- Audience Strategy the future

[Figure: 2013 consolidated revenue by segment (%)]

AUDIENCE BUSINESS CHALLENGES

1. UNDERSTAND AUDIENCE: Having the largest volume of data sets and audience segments/profiles in Canada, while leading the Canadian marketplace in privacy and governance

2. FIND AUDIENCE: Being leaders in identifying and targeting audiences across channels, platforms and devices

3. ENGAGE AUDIENCE: Driving engagement across platforms and formats

4. MEASURE AUDIENCE: Exceeding client expectations with transparent reporting and the most accurate attribution models

AUDIENCE PLATFORM – THE DATA LAKE

- Land massive clickstream log files:

- 100+ million records / day

- 30 million unique IDs / month

- Cost effective / competitive

- Lean methodology

- Landed data always available if requirements should change

- Data definition on read

- Adoption of the Data Lake framework
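
"Data definition on read" means the raw log lines are landed unchanged and a schema is applied only when a question is asked. A minimal Python sketch of the idea follows; the JSON layout, field names, and sample records are hypothetical and stand in for whatever the real clickstream files contain.

```python
import json

# Hypothetical raw clickstream lines, landed as-is in the data lake.
RAW_LOG = [
    '{"id": "u1", "url": "/news", "ts": "2015-01-05T10:00:00", "device": "mobile"}',
    '{"id": "u2", "url": "/sports", "ts": "2015-01-05T10:00:05"}',
    '{"id": "u1", "url": "/sports", "ts": "2015-01-05T10:01:00", "device": "mobile"}',
]

def read_with_schema(raw_lines, fields):
    """Apply a schema at read time: keep only `fields`, filling gaps with None."""
    for line in raw_lines:
        record = json.loads(line)
        yield {name: record.get(name) for name in fields}

# Two different "schemas" over the same landed data, no reload required.
page_views = list(read_with_schema(RAW_LOG, ["id", "url"]))
by_device = list(read_with_schema(RAW_LOG, ["id", "device"]))
unique_ids = {row["id"] for row in page_views}
```

Because the raw lines are never rewritten, a new requirement (for example, analysis by device) only needs a new read-time schema, which is the "landed data always available" point above.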

Summary

more data & better algorithms

Hortonworks Jumpstart Package

Proposal for a simple production-ready Hadoop cluster in one week

Hadoop is a Platform Decision

Adoption follows a consistent journey: data architecture efficiencies, new analytic apps, and ultimately a “data lake”.

HDP: A centralized architecture built on YARN. Any application, any data, anywhere.

HDP: A completely open data platform. Platforms are ultimately defined by open communities.

HDP subscription supports the entire lifecycle. World-class experience to ensure success from architecture to production to expansion.

Cautionary Statement Regarding Forward-Looking Statements

This presentation contains forward-looking statements involving risks and uncertainties. Such forward-looking statements in this presentation generally relate to future events, our ability to increase the number of support subscription customers, the growth in usage of the Hadoop framework, our ability to innovate and develop the various open source projects that will enhance the capabilities of the Hortonworks Data Platform, anticipated customer benefits and general business outlook. In some cases, you can identify forward-looking statements because they contain words such as “may,” “will,” “should,” “expects,” “plans,” “anticipates,” “could,” “intends,” “target,” “projects,” “contemplates,” “believes,” “estimates,” “predicts,” “potential” or “continue” or similar terms or expressions that concern our expectations, strategy, plans or intentions. You should not rely upon forward-looking statements as predictions of future events. We have based the forward-looking statements contained in this presentation primarily on our current expectations and projections about future events and trends that we believe may affect our business, financial condition and prospects. We cannot assure you that the results, events and circumstances reflected in the forward-looking statements will be achieved or occur, and actual results, events, or circumstances could differ materially from those described in the forward-looking statements.

The forward-looking statements made in this prospectus relate only to events as of the date on which the statements are made and we undertake no obligation to update any of the information in this presentation.

Trademarks

Hortonworks is a trademark of Hortonworks, Inc. in the United States and other jurisdictions. Other names used herein may be trademarks of their respective owners.