© 2017 MapR Technologies MapR Confidential 1 MapR 6.0 Powers DataOps: Unleash the Value of Your Data with New Features in the MapR Converged Data Platform Mitesh Shah, Director Product Marketing, MapR Prashant Rathi, Sr. Product Manager, MapR December 5, 2017

MapR 6.0 Powers DataOps

Page 1: MapR 6.0 Powers DataOps

MapR 6.0 Powers DataOps: Unleash the Value of Your Data with New Features in the MapR Converged Data Platform

Mitesh Shah, Director Product Marketing, MapR

Prashant Rathi, Sr. Product Manager, MapR

December 5, 2017

Page 2: MapR 6.0 Powers DataOps

Turning Data into Value is Easy When There is No Friction

(there’s always friction)

[Diagram: Data to Value, labeled "No Friction"]

People Friction

• Organizational silos.

Process Friction

• Waterfall not agile.
• Cumbersome audit and compliance requirements.

Technology Friction

• Systems that are inflexible, hard to manage, insecure, …

Page 3: MapR 6.0 Powers DataOps

What is DataOps?

DevOps + Data Engineers, Data Scientists = DataOps

DataOps helps an organization rapidly deliver value from data by supporting agility and accelerating and enabling the integration of operations and analytics.

DataOps Principles

• Day Zero Operations
• Embrace Data Flows
• Always On
• All Data
• Secure the Data Not Access Method
• Self-service Not Dependency
• Convergence Not Orchestration
• Distributed

Page 4: MapR 6.0 Powers DataOps

MapR Powers DataOps to Unleash Greater Value from All Data

Now available, MapR Converged Data Platform 6.0 adds innovations for security, database, and automated administration, across clouds

• Real-time Data Integration with Innovations in MapR-DB
• Self-service Data Science with Data Science Refinery
• Secure Data with Single-Click Security Enhancements
• Cloud-scale Multi-Tenancy and Edge to Cloud File Migrate
• Automatic Platform Health and Security with the New MCS

Page 5: MapR 6.0 Powers DataOps

Real-time Data Integration with Innovations in MapR-DB

Page 6: MapR 6.0 Powers DataOps

MapR-DB Innovations in 6.0

HIGH PERFORMANCE WIDE COLUMN DATABASE

• Integrated Operational DB for Mission Critical Apps
• Horizontal Scalability
• Extreme Performance
• 24X7 Reliability
• HBase API compatible

GLOBAL DATABASE

• Cross Data Center active/active Replication
• Fine Grained Security controls

MULTI-MODEL DATABASE W/DOCUMENT DATA MODEL

• Native JSON Support
• Comprehensive Datatypes
• Granular & Efficient operations
• Trillions of documents, Millions of tables
• Open & Intuitive OJAI APIs

DATABASE FOR GLOBAL DATA-INTENSIVE APPS

MapR-DB 6.0 release

• Native Secondary indexes
• Rich OJAI 2.0 Query APIs
• Optimized Drill/SQL analytics & BI
• Advanced Analytics w/Native Spark and Hive connectivity
• Global Real-time Change Data Capture

Page 7: MapR 6.0 Powers DataOps

Real-Time Data Integration and Micro-Services w/Global Change Data Capture

Allows arbitrary external systems to consume changes in MapR-DB tables globally

Build scalable real-time data hubs for fast ingestion of big data

Enables real-time, event-driven micro-services app fabrics to create rich experiences

[Diagram: Change Data Capture feeding Machine Learning Models, Microservices, Elastic Search, and Remote MapR-DB]
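
The idea of external systems consuming table changes can be illustrated with a minimal in-process sketch. It is not the MapR-DB API: `ChangeRecord`, `Table`, `SearchIndex`, and `run_demo` are hypothetical names, and handlers are called synchronously here, whereas a real change data capture pipeline would deliver records asynchronously.

```python
from dataclasses import dataclass
from typing import Any, Callable, Dict, List, Optional


@dataclass(frozen=True)
class ChangeRecord:
    """One change event. Field names are hypothetical, not the MapR-DB record format."""
    table: str
    key: str
    op: str  # "upsert" or "delete"
    value: Optional[Dict[str, Any]]


class Table:
    """Toy table that publishes a ChangeRecord for every mutation."""

    def __init__(self, name: str) -> None:
        self.name = name
        self.rows: Dict[str, Dict[str, Any]] = {}
        self._subscribers: List[Callable[[ChangeRecord], None]] = []

    def subscribe(self, handler: Callable[[ChangeRecord], None]) -> None:
        self._subscribers.append(handler)

    def _publish(self, record: ChangeRecord) -> None:
        for handler in self._subscribers:
            handler(record)

    def upsert(self, key: str, value: Dict[str, Any]) -> None:
        self.rows[key] = value
        self._publish(ChangeRecord(self.name, key, "upsert", dict(value)))

    def delete(self, key: str) -> None:
        # Only publish a delete if the row actually existed.
        if self.rows.pop(key, None) is not None:
            self._publish(ChangeRecord(self.name, key, "delete", None))


class SearchIndex:
    """Stand-in for an external consumer such as a search engine."""

    def __init__(self) -> None:
        self.docs: Dict[str, Dict[str, Any]] = {}

    def apply(self, record: ChangeRecord) -> None:
        if record.op == "upsert":
            self.docs[record.key] = record.value
        elif record.op == "delete":
            self.docs.pop(record.key, None)


def run_demo():
    table = Table("orders")
    index = SearchIndex()
    audit_log: List[ChangeRecord] = []
    table.subscribe(index.apply)       # consumer 1: keeps a search index in sync
    table.subscribe(audit_log.append)  # consumer 2: records every change
    table.upsert("o1", {"status": "new"})
    table.upsert("o1", {"status": "shipped"})
    table.upsert("o2", {"status": "new"})
    table.delete("o2")
    return index.docs, [r.op for r in audit_log]
```

Each consumer sees the same ordered stream of changes and decides independently what to do with it, which is why new consumers can be added without touching the writers.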

Page 8: MapR 6.0 Powers DataOps

Self-service Data Science with Data Science Refinery

Page 9: MapR 6.0 Powers DataOps

The MapR Data Science Vision

A Holistic Approach To Self-Service Data Science

MAPR DATA SCIENCE REFINERY

An easy-to-deploy, secure, and extensible data science offering that leverages all existing platform assets

REFINERY DATA SCIENTISTS

Data Scientist led product-and-services offerings including Quick Start Solutions (QSS) & Training

REFINERY PARTNERSHIPS

Expand on what we offer in-product to meet the needs of all data science teams

MAPR CONVERGED DATA PLATFORM

Page 10: MapR 6.0 Powers DataOps

Secure Data with Single Click Security Enhancements

Page 11: MapR 6.0 Powers DataOps

Recent Security Issues Caused by Misconfiguration

Major NoSQL DB

Criminals accessed, copied, and deleted data from unpatched or badly configured databases, and then held the data as ransom.

Major Cloud Storage Provider

Cloud computing misconfiguration resulted in vulnerabilities that exposed information about 200 million voters, including names, dates of birth, home addresses, phone numbers, and voter registration details.

Open Source Search Engine

Over 35,000 servers were found open to the internet on AWS. Hundreds of instances were compromised and the data was held for ransom.

Page 12: MapR 6.0 Powers DataOps

MapR Introduces Single-Click Security Enhancements

• Encryption on the Wire*
• Authentication
• Enforcement

* Some exceptions apply.

[Platform diagram: Event Data Streams, Analytics & Machine Learning Engines, Operational Database, and Cloud-scale Data Store on the Converge-X™ Engine (High Availability, Real Time, Security & Governance, Multi-tenancy, Disaster Recovery, Global Namespace), accessed through HDFS API, POSIX, NFS, HBase API, JSON API, and Kafka API]

Page 13: MapR 6.0 Powers DataOps

Cloud-scale Multi-Tenancy and Edge to Cloud File Migrate

Page 14: MapR 6.0 Powers DataOps

MapR Orbit Cloud Suite

PRIVATE CLOUD

• Cloud-scale Multi-tenancy
• OpenStack Manila Plugin

PUBLIC CLOUD

• Cloud-native Operations
• Cloud Storage Integrations
• Object Tiering
• REST APIs

MULTI CLOUD

• Mirroring
• Replication

EDGE

• Small Footprint
• Edge to Cloud File Migrate
• Data Queueing
• Bandwidth Optimization

Page 15: MapR 6.0 Powers DataOps

Cloud-Scale Multi-Tenancy & OpenStack Manila Plugin (for native file access)

The Challenge:

• Hosting multiple organizations (users, groups) on the same data platform
• Security: Ensuring inter-organization privacy as well as intra-organization policies

How MapR Solves:

• Tenant concept built in to data services – all users identified by (tenant, user, [groups])
• Tenant volumes hidden from other tenants
• Enforce intra-tenant access control within volumes
• Integration with OpenStack Manila for tenant self-service provisioning of data shares (volumes)

Competitive Note: Capability not found in Hadoop competitors (CDH, HDP), NoSQL competitors (MongoDB, Couchbase), or scale-out storage competitors.

[Diagram: two tenants, San Francisco Giants and Oakland Athletics, each with VMs, Users & Groups, and MapR Volumes, connected through the Manila Plugin to the Converge-X Data Fabric (Event Data Streams, Cloud Scale Data Store, Operational Database, Analytical and Machine Learning Engines; High Availability, Real Time, Security & Governance, Multi-tenancy, Disaster Recovery, Global Namespace)]
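
The tenant model on this slide (users identified by tenant, user, and groups; volumes hidden across tenants; access control inside a tenant) can be sketched roughly as follows. This is an illustration only, not MapR's implementation: `Principal`, `Volume`, `visible_volumes`, `can_access`, and the user, group, and volume names are all hypothetical. Only the two tenant names come from the slide.

```python
from dataclasses import dataclass, field
from typing import FrozenSet, List, Set


@dataclass(frozen=True)
class Principal:
    """A user identified by (tenant, user, [groups]), as on the slide."""
    tenant: str
    user: str
    groups: FrozenSet[str] = frozenset()


@dataclass
class Volume:
    """A volume owned by exactly one tenant, with a simple allow list."""
    name: str
    tenant: str
    allowed_users: Set[str] = field(default_factory=set)
    allowed_groups: Set[str] = field(default_factory=set)


def visible_volumes(principal: Principal, volumes: List[Volume]) -> List[str]:
    """Volumes of other tenants are hidden entirely."""
    return [v.name for v in volumes if v.tenant == principal.tenant]


def can_access(principal: Principal, volume: Volume) -> bool:
    """Tenant isolation first, then intra-tenant access control."""
    if volume.tenant != principal.tenant:
        return False
    return (principal.user in volume.allowed_users
            or bool(principal.groups & volume.allowed_groups))


volumes = [
    Volume("giants-stats", "San Francisco Giants", allowed_groups={"analysts"}),
    Volume("giants-payroll", "San Francisco Giants", allowed_users={"carol"}),
    Volume("athletics-stats", "Oakland Athletics", allowed_groups={"analysts"}),
]
alice = Principal("San Francisco Giants", "alice", frozenset({"analysts"}))
```

Because the tenant is part of the identity, a group called `analysts` in one tenant grants nothing in another tenant, even though the group names match.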

Page 16: MapR 6.0 Powers DataOps

Edge to Cloud File Migrate

The Challenge:

• Insufficient local compute for local analytics, need cloud.

• Existing ETL tools don’t meet reliability or time sensitivity requirements.

How MapR Solves:

• The Edge to Cloud File Migrate service deploys to each edge site, watches MapR-XD for new files, and immediately transfers them to the cloud.

• Intelligent use of MapR metadata services to ensure performance and reliability.

Real-time, automatic movement of files from edge to the cloud

Ideal for mixed-processing workloads: some edge, some cloud
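
The watch-and-transfer behaviour described above can be approximated with a small polling sketch. It assumes ordinary local directories standing in for the edge store and the cloud target, and it detects new files by listing the directory, whereas the slide says the actual service relies on MapR metadata services. `migrate_new_files` and `demo` are hypothetical names.

```python
import shutil
import tempfile
from pathlib import Path
from typing import List, Set


def migrate_new_files(edge_dir: Path, cloud_dir: Path, seen: Set[str]) -> List[str]:
    """Copy files not yet seen from edge_dir to cloud_dir; return their names."""
    cloud_dir.mkdir(parents=True, exist_ok=True)
    moved: List[str] = []
    for path in sorted(edge_dir.iterdir()):
        if path.is_file() and path.name not in seen:
            shutil.copy2(path, cloud_dir / path.name)  # copy contents and metadata
            seen.add(path.name)
            moved.append(path.name)
    return moved


def demo():
    """Run two polling passes against temporary directories."""
    with tempfile.TemporaryDirectory() as root:
        edge, cloud = Path(root, "edge"), Path(root, "cloud")
        edge.mkdir()
        seen: Set[str] = set()
        (edge / "a.csv").write_text("1")
        (edge / "b.csv").write_text("2")
        first = migrate_new_files(edge, cloud, seen)
        (edge / "c.csv").write_text("3")
        second = migrate_new_files(edge, cloud, seen)
        return first, second, sorted(p.name for p in cloud.iterdir())
```

Tracking already-transferred names in `seen` keeps each pass incremental, so only files that arrived since the previous pass are copied.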

Page 17: MapR 6.0 Powers DataOps

The New MapR Control System

Page 18: MapR 6.0 Powers DataOps

Benefits of the New MCS

The Other Guys (Crisis of Complexity)

• Greater Administrative Overhead
• Higher OpEx
• Increased Probability of Failures

MapR Converged Data Platform and The New MCS

• Reduced Administrative Overhead through Unified Data Management
• Lower OpEx
• Unparalleled Cluster Stability and Health

Page 19: MapR 6.0 Powers DataOps

MCS Demo

Page 20: MapR 6.0 Powers DataOps

Summary

Page 21: MapR 6.0 Powers DataOps

MapR Converged Data Platform 6.0

• Architected to power DataOps

• Available Now

• Cloud provider marketplaces, such as Microsoft Azure, Amazon Web Services, and Oracle Cloud, will have version 6.0 available before the end of the year

MapR 6.0 delivers:

• Automatic Platform Health and Security
• Real-time Data Integration
• Secure, Discoverable Data
• Self-Service Machine Learning / Artificial Intelligence

Page 22: MapR 6.0 Powers DataOps

Q&A

ENGAGE WITH US

@mapr

Page 23: MapR 6.0 Powers DataOps

Appendix

Page 24: MapR 6.0 Powers DataOps

The Trend from Data Warehousing to Data Science

[Diagram: a progression from Data Warehouses and Data Lakes to Limited Machine Learning and Machine Learning Everywhere, and from Analysts to Data Scientists. Use cases shown: Security, Targeted Offers, Fraud Detection, Predictive Maintenance, Smart Cars]

• Need to Allow for Access to All Data
• Flexibility and Choice in Tools is Critical

Page 25: MapR 6.0 Powers DataOps

ENTRY POINTS IN THE CUSTOMER JOURNEY

McKinsey calls these companies “Adopters”. Gartner estimates they solve between 10 and 100 business problems in three to five years.

McKinsey calls these companies “Partial Adopters & Experimenters”. Gartner estimates they solve between 3 and 20 business problems in three to five years.

McKinsey calls these companies “Contemplators”.

Data Science Curious

Adjacent Data Science Teams

Corporate Data Science Teams

20% 41% 40%