The DAP – Where YARN, HBase, Kafka and Spark Go to Production

Hadoop Summit - June 30th, 2016

cask.co

Cask, CDAP, Cask Hydrator and Cask Tracker are trademarks or registered trademarks of Cask Data. Apache Spark, Spark, the Spark logo, Apache Hadoop, Hadoop and the Hadoop logo are trademarks or registered trademarks of the Apache Software Foundation. All other trademarks and registered trademarks are the property of their respective owners.

About Me

The Many Faces of Hadoop

● Developer: Advanced Programming; Focuses on App Logic
● Data Scientist: Basic Programming; Focuses on Data
● IT Pro / Ops: Configuration & Monitoring; Focuses on Operations
● LOB Manager: Analysis & Decision Making; Focuses on Insights

Big Data Challenges

Building a Big Data App

Deploying and Operating a Big Data App

Today’s Integration Solutions Are Siloed

● Data Integration
● App Integration
● Cloud Integration
● Governance

Introducing the DAP

Enter Cask

Key Customers and Partners

Named a Gartner Cool Vendor 2016

Founded in 2011 by early Hadoop engineers from Facebook and Yahoo!

Introducing the Cask Data App Platform

CDAP Overview

Open Source, Integrated Framework for Building and Running Data Applications on Hadoop and Spark

CDAP Consolidates Big Data App Lifecycle

● Provides a platform with framework-level correctness
● Dataset abstractions & self-service data
● One framework: Prototype to Production
● Unified approach across all paradigms
  ○ Metrics & Log collection
  ○ Lineage, Audit, Access Control

CDAP Extensions

CDAP Architecture

● Application Container Architecture
● Reusable Programming Abstractions
● Global User and Machine Metadata

CDAP Application Structure

CDAP Deployment Architecture

Hadoop in the Enterprise – Simplified with CDAP

Common Use Cases

Summary

Thank You!
