Upload
nguyenkhuong
View
231
Download
2
Embed Size (px)
Citation preview
Page 1 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Comprehensive Analytics on the
Hortonworks Data Platform
We do Hadoop.
Page 2 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Page 3 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Back to 2005…
Page 4 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Vertical Scaling
RAM
CPU
Storage
Page 5 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
RAM
CPU
Storage
Vertical Scaling
Page 6 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
RAM
CPU
Storage
Vertical Scaling
Page 7 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Horizontal Scaling
RAM
CPU
Storage
Page 8 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Horizontal Scaling
RAM
CPU
Storage
RAM
CPU
Storage
RAM
CPU
Storage
RAM
CPU
Storage
RAM
CPU
Storage
Page 9 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
RAM
CPU
Storage
Horizontal Scaling
RAM
CPU
Storage
RAM
CPU
Storage
RAM
CPU
Storage
RAM
CPU
Storage
RAM
CPU
Storage
RAM
CPU
Storage
RAM
CPU
Storage
RAM
CPU
Storage
RAM
CPU
Storage
RAM
CPU
Storage
RAM
CPU
Storage
RAM
CPU
Storage
RAM
CPU
Storage
RAM
CPU
Storage
Page 10 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
RAM
CPU
Storage
RAM
CPU
Storage
RAM
CPU
Storage
RAM
CPU
Storage
RAM
CPU
Storage
RAM
CPU
Storage
RAM
CPU
Storage
RAM
CPU
Storage
RAM
CPU
Storage
RAM
CPU
Storage
RAM
CPU
Storage
RAM
CPU
Storage
RAM
CPU
Storage
RAM
CPU
Storage
Self Healing System
Page 11 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
1 ° ° ° ° °
° ° ° ° ° N
HDFS (Hadoop Distributed File System)
MapReduce
Hadoop 1.0
Page 12 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Page 13 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Page 14 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Hadoop 2.0
Clickstream Web & Social
Geolocation Sensor & Machine
Server Logs
Unstructured
SO
UR
CE
S
Existing Systems
ERP CRM SCM
AN
AL
YT
ICS
Data
Marts
Business
Analytics
Visualization
& Dashboards
AN
AL
YT
ICS
ApplicationsBusiness
Analytics
Visualization
& Dashboards
°
°
°
°
°
°
°
°
°
°
°
°
°
°
°
°
°
°
°
°
°
°
°
°
°
°
°
°
°
°
HDFS (Hadoop Distributed File System)
YARN: Data Operating System
Interactive Real-TimeBatch Partner ISVBatch BatchMPP
EDW
Page 15 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Hortonworks Data Platform 2.2
YARN: Data Operating System(Cluster Resource Management)
1 ° ° ° ° ° ° °
° ° ° ° ° ° ° °
Ap
ach
e P
ig
° °
° °
° ° °
° ° °
HDFS (Hadoop Distributed File System)
GOVERNANCE BATCH, INTERACTIVE & REAL-TIME DATA ACCESS
Apache Falcon
Ap
ach
e H
ive
Ca
scad
ing
Ap
ach
e H
Ba
se
Ap
ach
e A
ccu
mu
lo
Ap
ach
e S
olr
Ap
ach
e S
park
Ap
ach
e S
torm
Apache Sqoop
Apache Flume
Apache Kafka
SECURITY
Apache Ranger
Apache Knox
Apache Falcon
OPERATIONS
Apache Ambari
Apache
Zookeeper
Apache Oozie
Page 16 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Hortonworks: Hadoop for the Enterprise
We Do Hadoop
Winter 2015Version 1.1
Page 17 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Who we are
2005
2011
24
900+
100%
5 out of 5
32.000
Apache Hadoop at Yahoo!
Inception of Hortonworks
Developers and Architects
Employees
Renewal Rate
Support Score*
Number of Nodes at Yahoo!
30+ Migrations
300+ Customers
* The Forrester Wave Big Data Hadoop Solutions Q1 2014
Partner
600+
Page 18 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
IN-MEMORY
HIGH-PERFORMANCE
ANALYTICS
BUSINESS INTELLIGENCE
DATA VISUALIZATION
DATA MANAGEMENT
Why SAS?
Page 19 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
SAS can work with Hadoop, lifting data in a purpose-built advanced analytics
in-memory environment
SAS can treat Hadoop just as any other data source, pulling data from
Hadoop, when it is most convenient
SAS can work directly in Hadoop, leveraging the distributed processing
capabilities of Hadoop
SAS is the only vendor who supports all of these methods
Page 20 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
SAS accesses and extracts data from Hadoop to a
SAS server for processing, and writes results back.
Bridge to traditional SAS environments
Hadoop treated as just “another data source”
Performance limited to single pipe bandwidth
DATA MOVEMENT
SAS + from Hortonworks
Page 21 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
SAS accesses and processes Hadoop data on SAS Servers
while keeping the data and computations massively parallel.
Supports advanced analytics via shared computing
Allows the scaling of data storage and analytics separately
Ideal when analytical rigor, sophistication and governance are required
DATA LIFT INTO MEMORY
SAS + with Hortonworks
Page 22 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
SAS processes data directly in the Hadoop cluster.
SAS LOGIC
SAS Embedded Process enables scalable SAS compute in Hadoop
SAS compute is orchestrated via Hadoop technology (YARN)
Data manipulation, data quality, and scoring support
Ideal when all data is landing in Hadoop, and Hadoop is the proper place for
processing
SAS + in Hadoop
Page 23 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Page 24 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
About Rogers Media
–Great Brands
–Media advertising revenue a priority
–Audience Strategy the future
2013 CONSOLIDATED REVENUE BY SEGMENT (%)
AUDIENCE BUSINESS CHALLENGES
1. UNDERSTAND AUDIENCEHaving the largest volume of data sets, audience
segments/profiles in Canada while leading the Canadian marketplace in privacy and governance
3. ENGAGE AUDIENCEDriving engagement across platforms and formats
2. FIND AUDIENCEBeing leaders in identifying and targeting audiences
across channels, platforms and devices
4. MEASURE AUDIENCEExceeding client expectations with transparent reporting, the most accurate attribution models
AUDIENCE PLATFORM – THE DATA LAKE
- Land massive click stream log files:
- 100+ M records / day;
- 30 million unique IDs / month
- Cost effective / competitive
- Lean methodology
- Landed data always available if requirements should change
- Data definition on read
- Adoption of the Data Lake framework
Page 27 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
more data
&
better algorithms
Summary
Page 28 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Hortonworks Jumpstart Package
Proposal for a simple production-ready
Hadoop cluster in one week
Page 29 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Hadoop is a Platform Decision
Adoption follows a consistent journeyData architecture efficiencies, new analytic apps, and
ultimately to a “data lake”.
HDP: A centralized architecture built on YARNAny application, any data, anywhere.
HDP: A completely open data platformPlatforms are ultimately defined by open communities.
HDP subscription supports entire lifecycleWorld class experience to ensure success from architecture to
production to expansion.
Page 30 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Cautionary Statement Regarding Forward-Looking Statements
This presentation contains forward-looking statements involving risks and uncertainties.
Such forward-looking statements in this presentation generally relate to future events, our
ability to increase the number of support subscription customers, the growth in usage of the
Hadoop framework, our ability to innovate and develop the various open source projects
that will enhance the capabilities of the Hortonworks Data Platform, anticipated customer
benefits and general business outlook. In some cases, you can identify forward-looking
statements because they contain words such as “may,” “will,” “should,” “expects,” “plans,”
“anticipates,” “could,” “intends,” “target,” “projects,” “contemplates,” “believes,” “estimates,”
“predicts,” “potential” or “continue” or similar terms or expressions that concern our
expectations, strategy, plans or intentions. You should not rely upon forward-looking
statements as predictions of future events. We have based the forward-looking statements
contained in this presentation primarily on our current expectations and projections about
future events and trends that we believe may affect our business, financial condition and
prospects. We cannot assure you that the results, events and circumstances reflected in the
forward-looking statements will be achieved or occur, and actual results, events, or
circumstances could differ materially from those described in the forward-looking
statements.
The forward-looking statements made in this prospectus relate only to events as of the date
on which the statements are made and we undertake no obligation to update any of the
information in this presentation.
Trademarks
Hortonworks is a trademark of Hortonworks, Inc. in the United States and other
jurisdictions. Other names used herein may be trademarks of their respective owners.