The DAP - Where YARN, HBase, Kafka and Spark go to Production

Preview:

Citation preview

The DAP – Where YARN, HBase, Kafka and Spark Go to Production

Hadoop Summit - June 30th, 2016

cask.co

Cask, CDAP, Cask Hydrator and Cask Tracker are trademarks or registered trademarks of Cask Data. Apache Spark, Spark, the Spark logo, Apache Hadoop, Hadoop and the Hadoop logo are trademarks or registered trademarks of the Apache Software Foundation. All other trademarks and registered trademarks are the property of their respective owners.

cask.co

About Me

2

cask.co

The Many Faces of Hadoop

3

Developer Data Scientist IT Pro / Ops

LOB Manager

Advanced Programming

Focuses on App Logic

Basic Programming

Focuses on Data

Configuration & Monitoring

Focuses on Operations

Analysis & Decision Making

Focuses on Insights

cask.co

Big Data Challenges

4

cask.co

Building a Big Data App

5

cask.co

Deploying and Operating a Big Data App

6

cask.co

Today’s Integration Solutions are Silo’ed

7

Data Integration App Integration Cloud Integration Governance

cask.co

Introducing the DAP

8

cask.co9

Enter Cask

Key Customers and Partners

Named a Gartner Cool Vendor 2016

Founded in 2011 by early Hadoop engineers from Facebook and Yahoo!

cask.co

Introducing the Cask Data App Platform

10

cask.co

CDAP Overview

11

Open Source, Integrated Framework for Building and Running Data Applications on Hadoop and Spark

cask.co12

● Provides a platform with framework level

correctness

● Dataset abstractions & self-service data

● One framework: Prototype to Production

● Unified approach across all paradigms

○ Metrics & Log collection

○ Lineage, Audit, Access Control

CDAP Consolidates Big Data App Lifecycle

cask.co

CDAP Extensions

13

cask.co

CDAP Architecture

14

● Application Container Architecture

● Reusable Programming

Abstractions

● Global User and Machine Metadata

cask.co

CDAP Application Structure

15

cask.co

CDAP Deployment Architecture

16

cask.co

Hadoop in the Enterprise – Simplified with CDAP

17

cask.co

Common Use Cases

18

cask.co

Summary

19

cask.co

Thank You !

20

Recommended