28
1 Intel and Cloudera: Going beyond Traditional Analytics Gab Gennai Technical Services Director APAC August 2014 Intel and Cloudera: Going Beyond Traditional Analytics Gabriel Gennai Technical Services Director APAC, Cloudera

Gab Genai Cloudera - Going Beyond Traditional Analytic

Embed Size (px)

DESCRIPTION

Cloudera APJ

Citation preview

1

Intel and Cloudera: Going beyond Traditional Analytics Gab Gennai Technical Services Director – APAC August 2014

Intel and Cloudera: Going Beyond Traditional Analytics

Gabriel Gennai Technical Services Director – APAC, Cloudera

2

Cloudera Snapshot

Founded 2008, by former employees of

Employees Today ~ 700

World Class Support 24x7 Global Staff Pro-active & Predictive Support Programs

Mission Critical Thousands of Enterprise Users Over 400 Paying Subscription Customers

The Largest Ecosystem Over 1000+ Partners

Cloudera University Over 50,000+ Trained

Open Source Leaders Cloudera Employees are Leading Developers & Contributors

Total Capital Raised A lot! (from Intel, Google, Dell, T. Rowe Price, Accel, Greylock)

Mission Help Organizations Leverage the Power of All Their Data to Ask Bigger Questions.

3

Expanding Big Data Requires A New Approach

1980s Bring Data to Compute

Now Bring Compute to Data

Relative size & complexity

Data Information-centric

businesses use all data:

Multi-structured, internal & external data

of all types

Compute

Compute

Compute

Process-centric businesses use:

• Structured data mainly • Internal data only • “Important” data only

Compute

Compute

Compute

Data

Data

Data

Data

4

The Old Way: Bringing Data to Compute

Complex Architecture • Many special-purpose

systems • Moving data around • No complete views

4

Missing Data • Leaving data behind • Risk and compliance • High cost of storage

1

Time to Data • Up-front modeling • Transforms slow • Transforms lose data

2

Cost of Analytics • Existing systems strained • No agility • “BI backlog”

3

SERVERS MARTS EDWS DOCUMENTS STORAGE SEARCH ARCHIVE

ERP, CRM, RDBMS, MACHINES FILES, IMAGES, VIDEOS, LOGS, CLICKSTREAMS EXTERNAL DATA SOURCES

5

Why Is This Happening Now? • Consumption • Connections • Activity • Pace & Speed

• Instrumentation • Sensor Data • Location Points, Metrics • Tweets, Images • Fuel band

• Exploration • Ease of Accessibility • Faster research

• Value • Asset • Drive new business Models

6

SERVERS MARTS EDWS DOCUMENTS STORAGE SEARCH ARCHIVE

ERP, CRM, RDBMS, MACHINE FILES, IMAGES, VIDEOS, LOGS, CLICKSTREAMS EXTERNAL DATA SOURCES

The New Way: Bringing Compute to Data

Diverse Analytic Platform • Bring applications to data • Combine different workloads on

common data (i.e. SQL + Search) • True analytic agility

4

1

2

3 4

Active Compliance Archive • Full fidelity original data • Indefinite time, any source • Lowest cost storage

1

Persistent Staging • One source of data for all analytics • Persist state of transformed data • Significantly faster & cheaper

2

Self-Service Exploratory BI • Simple search + BI tools • “Schema on read” agility • Reduce BI user backlog requests

3

7

Open Source Scalable Flexible Cost-Effective

Managed

Open Architecture

Secure and Governed

From Hadoop to an Enterprise Data Hub

BATCH PROCESSING

ANALYTIC SQL

SEARCH ENGINE

MACHINE LEARNING

STREAM PROCESSING

3RD PARTY APPS

WORKLOAD MANAGEMENT

STORAGE FOR ANY TYPE OF DATA UNIFIED, ELASTIC, RESILIENT, SECURE

DA

TA

MA

NA

GEM

ENT

SYSTEM

MA

NA

GEM

ENT

CLOUDERA’S ENTERPRISE DATA HUB

Filesystem Online NoSQL

8

Enabling the App Store of Big Data

Software (BI, Analytics, & Data Integration)

System Integration Cloud & MSP

Hardware Database/Platform

Note: Display Cloudera Connect Platinum and Gold partners only

9

2014 Gartner MQ for Data Warehouse DBMS

“A data warehouse DBMS is now expected to coordinate data virtualization strategies,

and distributed file and/or processing approaches, to address changes in data management and access requirements.”

10

Discover New Use Cases

ON-LINE SERVICES / SOCIAL MEDIA People & career matching Website optimization

HEALTH CARE Patient sensors, monitoring, EHRs Quality of care

FINANCIAL SERVICES Risk & portfolio analysis New products

MEDIA / ENTERTAINMENT Viewers / advertising effectiveness

CONSUMER PACKAGED GOODS Sentiment analysis of what’s hot, customer service

TRAVEL & TRANSPORTATION Sensor analysis for optimal traffic flows Customer sentiment

RETAIL Consumer sentiment Optimized marketing

LAW ENFORCEMENT & DEFENSE Threat analysis, Social media monitoring, Photo analysis

EDUCATION & RESEARCH Experiment sensor analysis

LIFE SCIENCES Clinical trials Genomics

AUTOMOTIVE Auto sensors reporting location, problems

COMMUNICATIONS Location- based advertising

HIGH TECHNOLOGY / INDUSTRIAL MFG. Mfg quality Warranty analysis

UTILITIES Smart Meter analysis for network capacity

OIL & GAS Drilling exploration sensor analysis

11 11

High Res Images

• Multiple times a day • 1TB per day • Require high processing due to

zero light in space • RDBMS could not scale • Now using Cloudera Manager,

Impala, Search to perform Bus Analysis.

• Compare images from today or many years

12 12

R & D Decisions

• 5-10 years for new crop development.

• Data was stored in Silo’s • Could not combine results

• Bring Data together both

internal & external (new data) • Researches now share data • Examine data at speed. • Data Driven decisions and

narrow time to develop

13 13

Oil & Gas Discovery

• Cost reduction of deep water drill-ships

• Analyzing waves & Seismic data & convert to pictures

• Collects X & Y coordinates, wave source & target, way it was collected.

• Importance because it cost $1m per day for Drill ships.

• The more data it collects, the better the search.

14 14

HealthCare

• Clinical, Financial & Operational data kept separate.

• Analysis took days & months • Increasing costs of storage

• Cloudera Search & Impala. • Quicker Analysis – now minutes

& hours. • Quick decisions • 360 degree patient view • What equipment to buy based

on Analysis. • Can patient go to local doctor ?

15 15

Insurance

• Could not predict patterns and merge information.

• Policies were based on minimal information.

- Detailed historical weather patterns by neighborhood

- Actual flooding based on comprehensive data

- Detailed fine-grained topographical maps

- Erosion data from coastal dunes - Detailed construction details - Aerial photos & survey data

16 16

Government

• Problem with Web Access through VPN.

• Tracked data to identify suspicious behavior.

• Prevent Fraud requirement • Logs grew too big.

• Cloudera deployed to manage

Petabytes • Hbase used to detect patterns

and Fraud. • Perform in Real Time. • Find the problem first.

17

Converge on One Open Source Platform

©2014 Cloudera, Inc. All rights reserved.

• Most stable, compatible, and mature Hadoop distribution

• Leading SQL functionality & performance (Impala)

• Deepest management and governance capabilities

• 150 Hadoop developers

• 80 open source committer seats

• The only distribution with performance and security enhanced from the silicon up

• Leading security capabilities including encryption, access control, and auditing

• Long-standing commitment to open source with 1000 developers working on Linux, KVM, Xen, Java, OpenStack, Hadoop

• 50 Hadoop developers and 12 committers

18

A High Level View of the Journey

Data Science

Agile Exploration

ETL Acceleration

Operational Efficiency (Faster, Bigger, Cheaper)

Transformative Applications (New Business Value)

Cheap Storage

Business IT

EDW Optimization

Converged Analytics

19

Thank You! Gab Gennai [email protected] +61 411606050

20

Cloudera Partner Program Overview

Updated Jan 15, 2014

21

Cloudera CONNECT Partner Levels

• Not all partners are created equal

• Partner level determines the benefits and requirements for a partner

Bronze Silver Gold Platinum

22

Partner Program Progression

Visit Cloudera.com

•View website

•Identify areas of synergy

•Explore Cloudera Connect program information

Submit Application

•Register first on Cloudera.com

•Select appropriate program

•Explain partnership goals

•Self-identify

Receive Acceptance

•Start as bronze member

•Receive Welcome

•Access Cloudera Connect portal and partner logo & guidelines

•Request developer license without support

•Self-create assets

Deepen Relationship

•Take online e-learning classes

•Use 20% Discount on Cloudera training

•Work on product certification

•Use Cloudera Connect logo on partner collateral

•Meet in the field on joint opportunities

Explore Silver Level Membership

•Invest in jointly created collateral

•Receive marketing support

•Increase collaboration with Cloudera

• Receive more sales assistance for qualified opportunities

Invitation to Key Partners to become a Gold Partner

•Must drive a certain level of revenue with us/for us.

•Well-defined joint sales play.

Very Select Partners invited to Platinum

•Must drive >$2M+ in revenue for a defined joint sales play and be an industry leader

23

Cloudera Connect Partner programs ClouderaConnect

PartnershipLevelClouderaConnectPartnershipLevel ClouderaConnectPartnershipLevel ClouderaConnectPartnershipLevel

Bronze Silver Gold Platinum

ProgramAgreement

ClouderaResellerAgreement OnlineApplicationonly ü ü ü

ProgramMembershipFee N/A N/A US$12k US$15k

Training&Certification

N/A 2Trained&CertifiedAdmin 4Trained&CertifiedAdmin 6Trained&CertifiedAdmin

N/A 2Trained&CertifiedDeveloper 4Trained&CertifiedDeveloper 6Trained&CertifiedDeveloper

N/A 2TrainedDataAnalysts 4Trained&CertifiedDataAnalysts 6Trained&CertifiedDataAnalysts

SalesTraining[2] ü ü ü ü

N/A N/A 1BusinessDevelopmentManager 1BusinessDevelopmentManager

N/A N/A N/A 2SolutionsArchitects

Sales&Marketing

PartnerProfileinClouderaWeb N/A ü ü ü

Forecasting N/A Quarterly Monthly Weekly

ClouderaFocusedMarketing

InitiativesN/A 1perQuarter 1perQuarter 1perMonth

DefinedBusinessPlan N/A N/A Annually Quaterly

ClouderaIntegratedSolutions N/A N/A Min1ClouderaSolution Min3ClouderaSolutions

BusinessCommitment

Min.QualifiedPipelines N/A US$500k US$2.0m US$4.0m

ACVSalesQuota N/A US$250K US$1.0m US$2.0m

RelationshipManagement

ExecutiveLevelSponsorship N/A ü ü ü

ProgramRequirements

TechnicalTraining[1]

DedicatedResources[3]

25

How to be Cloudera’s Auth. partner?

• Once your application is approved by Cloudera, you will receive an email with links to the Cloudera Connect partner portal. The portal contains marketing, sales and training resources you may use to start working with us.

• Enjoy 20% off List for any public training courses

• Submit Deal Registration for joint sales activities when found sales opp (subject to approval) at https://www.cloudera.com/content/partners/en/company-profile/deal-registration.html

26

Start Your Journey with FREE Online Training Resources @ Cloudera Partner Portal

• Hadoop Essentials Training

• Cloudera Manager Training

• Hadoop Tutorials

• The Hadoop Ecosystem

• Resources for Administrators

• Resources for Developers

• Resources for Data Analysts

• Attend hand-on ILT courses at nearest Cloudera Authorized Training Centers • Complete Cloudera Certification at your nearest Pearson VUE Test Centers http://www.pearsonvue.com/cloudera/activity

28