22
Charles Zedlewski | SVP Products, Cloudera Cloud strategies that work for government 1

Cloud strategies that work for governmentcdn.govexec.com/media/cloud_strategies_that_work... · Kudu Analytic DBMS Data Engineering Operational DBMS Data Science. O P E R A T I O

  • Upload
    others

  • View
    3

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Cloud strategies that work for governmentcdn.govexec.com/media/cloud_strategies_that_work... · Kudu Analytic DBMS Data Engineering Operational DBMS Data Science. O P E R A T I O

Charles Zedlewski | SVP Products, Cloudera

Cloud strategies that work for government

1

Page 2: Cloud strategies that work for governmentcdn.govexec.com/media/cloud_strategies_that_work... · Kudu Analytic DBMS Data Engineering Operational DBMS Data Science. O P E R A T I O

Public cloud adoption is rising

Page 3: Cloud strategies that work for governmentcdn.govexec.com/media/cloud_strategies_that_work... · Kudu Analytic DBMS Data Engineering Operational DBMS Data Science. O P E R A T I O

Public cloud adoption is rising

Page 4: Cloud strategies that work for governmentcdn.govexec.com/media/cloud_strategies_that_work... · Kudu Analytic DBMS Data Engineering Operational DBMS Data Science. O P E R A T I O

Cloudera customers are leading the way

Page 5: Cloud strategies that work for governmentcdn.govexec.com/media/cloud_strategies_that_work... · Kudu Analytic DBMS Data Engineering Operational DBMS Data Science. O P E R A T I O

One platform supports 4 kinds of applications

OPERATIONS

Cloudera Manager

Cloudera Director

DATA MANAGEMENT

Cloudera Navigator

Encrypt and KeyTrustee

Optimizer

STRUCTURED

Sqoop

UNSTRUCTURED

Kafka, Flume

PROCESS, ANALYZE, SERVE

UNIFIED SERVICES

RESOURCE MANAGEMENT

YARN

SECURITY

Sentry, RecordService

STORE

INTEGRATE

BATCH

Spark, Hive, Pig MapReduce

STREAM

Spark

SQL

Impala

SEARCH

Solr

OTHER

Kite

NoSQL

HBaseOTHER

Object Store

FILESYSTEM

HDFSRELATIONAL

Kudu

Analytic DBMS

Data Engineering

Operational DBMS

Data Science

Page 6: Cloud strategies that work for governmentcdn.govexec.com/media/cloud_strategies_that_work... · Kudu Analytic DBMS Data Engineering Operational DBMS Data Science. O P E R A T I O

OPERA

TIONS

DATAM

ANAGEM

ENT

UNIFIEDSERVICES

PROCESS,ANALYZE,SERVE

STORE

INTEGRATE

Store and process unlimited data fast and

cost-effectively.

Data Science & Engineering

Explore, analyze, and understand all your data.

Analytic Database

Build data-driven productsto deliver real-time

insights.

Real-Time Applications

Really, different configurations of the same platform

Page 7: Cloud strategies that work for governmentcdn.govexec.com/media/cloud_strategies_that_work... · Kudu Analytic DBMS Data Engineering Operational DBMS Data Science. O P E R A T I O

In many environments multiple applications share a single, multi-tenant cluster

Page 8: Cloud strategies that work for governmentcdn.govexec.com/media/cloud_strategies_that_work... · Kudu Analytic DBMS Data Engineering Operational DBMS Data Science. O P E R A T I O

In cloud environments, often more separate, specialized clusters tuned for each application

Object Store Object Store HDFSPrivate Cloud

Page 9: Cloud strategies that work for governmentcdn.govexec.com/media/cloud_strategies_that_work... · Kudu Analytic DBMS Data Engineering Operational DBMS Data Science. O P E R A T I O

Open Environment

Run the same platform in

different clouds or on

bare metal, so customers

can move as needed

without migration

or retraining

Open Ecosystem

1000’s of applications that

need to run on the platform with

the assurance of compatibility

across releases and clouds

Open Source

Avoid vendor lock-in, and

leverage components

supported by the committers

who drive the

community roadmap

But openness is even more important in the cloud

Page 10: Cloud strategies that work for governmentcdn.govexec.com/media/cloud_strategies_that_work... · Kudu Analytic DBMS Data Engineering Operational DBMS Data Science. O P E R A T I O

Open Environment

Run the same platform in

different clouds or on

bare metal, so customers

can move as needed

without migration

or retraining

Open Ecosystem

1000’s of applications that

need to run on the platform with

the assurance of compatibility

across releases and clouds

Open Source

Avoid vendor lock-in, and

leverage components

supported by the committers

who drive the

community roadmap

But openness is even more important in the cloud

Page 11: Cloud strategies that work for governmentcdn.govexec.com/media/cloud_strategies_that_work... · Kudu Analytic DBMS Data Engineering Operational DBMS Data Science. O P E R A T I O

Optimizing Cloudera for cost & convenience in the cloud

Page 12: Cloud strategies that work for governmentcdn.govexec.com/media/cloud_strategies_that_work... · Kudu Analytic DBMS Data Engineering Operational DBMS Data Science. O P E R A T I O

Transience for flexibility, lower TCO and risk

Unified platform, from ingest to insight and action

Object Store

Hybrid support formultiple environments

STORE

COMPUTE

Optimizing Data Engineering & Data Science workloads for the cloud

Page 13: Cloud strategies that work for governmentcdn.govexec.com/media/cloud_strategies_that_work... · Kudu Analytic DBMS Data Engineering Operational DBMS Data Science. O P E R A T I O

Benefits of Data Engineering with ClouderaLower TCO and increased flexibility on a trusted enterprise data platform

Increased FlexibilityEnd-to-End

for the EnterpriseLower TCO

Multi-cloud

• Shop across providers:

Amazon, Google, Microsoft

Deliver On-Demand

• Immediate access to large

compute with fast cluster

provisioning

• Self-service for developers

Optimize and Isolate

• Tailor infrastructure for the job

• Run different software versions

• Enable more experimentation

with less opportunity cost

Build Complete Data Apps

• Ingest, stream, process,

explore analyze, model, and

serve on the same platform

• Shared data with object store

integration

• Cluster metadata persistence

• Common compliance-ready

security and governance

frameworks

Manage Costs

• Transience for dev/test,

ETL, and data science

• Usage-based pricing

• Spot instance support

Page 14: Cloud strategies that work for governmentcdn.govexec.com/media/cloud_strategies_that_work... · Kudu Analytic DBMS Data Engineering Operational DBMS Data Science. O P E R A T I O

Optimizing Analytic DBMS workloads for the cloud

Only pay for what you need, when you need it

▪ Transient clusters▪ Object storage centric▪ Cloud-native deployment

ETL

Reduce Operating

Costs

New Insights, New

Revenue

BI/Analytics

Explore and analyze all data, wherever it lives

▪ Long-running clusters▪ Object storage or local

storage▪ Lift-and-shift deployment

Page 15: Cloud strategies that work for governmentcdn.govexec.com/media/cloud_strategies_that_work... · Kudu Analytic DBMS Data Engineering Operational DBMS Data Science. O P E R A T I O

Add Use Cases, Analytics, and

Data On-Demand

• Avoid the IT backlog with instant

access to all data

• On-demand clusters query

directly on shared object storage

Predictable Results Whenever

You Want

• Consistent query performance,

even during peak times

• Multi-tenancy via isolated

clusters on shared data

Just-in-Time Resources

• Real-time capacity for your

needs, as they change

• Elastically grow/shrink your

cluster via decoupled

architecture

Contention-Free ETL

• ETL anytime without impacting

other workloads or risking SLAs

• Separate ETL clusters as-

needed on shared data

Benefits of Cloudera’s Analytic DatabaseETL and BI/Analytics in the cloud

Page 16: Cloud strategies that work for governmentcdn.govexec.com/media/cloud_strategies_that_work... · Kudu Analytic DBMS Data Engineering Operational DBMS Data Science. O P E R A T I O

Operational Database

Durable, low latency storage for web

applications, message stores, and mission

critical operational activities.

Web-Scale Data Depot

Identifying meaningful events based

on multiple data streams and taking

action.

Complex Event Processing

Use data and current/past events to

score and serve the likelihood of

subsequent events.

Model Scoring/Serving

Page 17: Cloud strategies that work for governmentcdn.govexec.com/media/cloud_strategies_that_work... · Kudu Analytic DBMS Data Engineering Operational DBMS Data Science. O P E R A T I O

Operational Database in the CloudPublic Cloud Benefits

Cost Goals

• Low-cost backup and

disaster recovery

• Development and testing

environments easy to

deploy and

decommission

Convenience Goals

• Elastic growth for tightly

provisioned workloads

makes expansion easy,

and enables a lower-cost

steady state

• Fast and easy

provisioning of additional

clusters helps projects

move quickly

Page 18: Cloud strategies that work for governmentcdn.govexec.com/media/cloud_strategies_that_work... · Kudu Analytic DBMS Data Engineering Operational DBMS Data Science. O P E R A T I O

Pulling it together

Page 19: Cloud strategies that work for governmentcdn.govexec.com/media/cloud_strategies_that_work... · Kudu Analytic DBMS Data Engineering Operational DBMS Data Science. O P E R A T I O

The cloud creates more & smaller specialized clusters for each application—which can turn into silos

Object Store Object Store

Page 20: Cloud strategies that work for governmentcdn.govexec.com/media/cloud_strategies_that_work... · Kudu Analytic DBMS Data Engineering Operational DBMS Data Science. O P E R A T I O

And many problems are a combination of SQL & predictive, batch & online

Enterprise Data Warehouse

ApplicationsData Sources Operational Data Stores

Traditional

Architecture Enterprise Data Warehouse

ServeELT

Archive

BI System

Modeling

Reporting

ETL

HPC GRID

Storage #2

Storage #1

Ingest

Pro

cess L

oa

d

Unstructured

Financial

Ledger P&L

Risks

Market,

Counterparty,

Ratings

Payments

Collections

Charges

Ingest

Ingest

Portfolio

Contracts

Portfolio

Page 21: Cloud strategies that work for governmentcdn.govexec.com/media/cloud_strategies_that_work... · Kudu Analytic DBMS Data Engineering Operational DBMS Data Science. O P E R A T I O

Common Operations

Object Store Object Store

Developer

Workbench

Common

Governance

Common Security

An Enterprise Data Hub in the cloud

Common: Operations, Governance, Security, Schema, Catalog

SQL WorkbenchPartner Ecosystem

Page 22: Cloud strategies that work for governmentcdn.govexec.com/media/cloud_strategies_that_work... · Kudu Analytic DBMS Data Engineering Operational DBMS Data Science. O P E R A T I O

Thank you