24
Building the Ideal Stack for Machine Learning

Building the Ideal Stack for Machine Learning

  • Upload
    memsql

  • View
    116

  • Download
    3

Embed Size (px)

Citation preview

Building the Ideal Stackfor Machine Learning

2

Agenda

oMachine Learning Overview

o The Need for Real-Time

o Real-Time Machine Learning Use Cases

oDemo

MISSION

Make every company a real-time enterprise

PRODUCT

Database Platform: Real-Time | Scalable | ProvenRanked #1 Operational Data Warehouse

ABOUT

Founders are former Facebook, SQL Server database engineersYC alumni with $85m in funding from top tier investors

MemSQL at a Glance

MACHINE LEARNINGRunning a mathematical model on a data set, where the model is trained from data using some optimization technique

DATA SCIENCEBroader field of training, running, testing, and productizing data models

5

Example Types of Machine Learning

SUPERVISED LEARNINGRegressionClassification

UNSUPERVISED LEARNINGCluster Analysis

6

Example Machine Learning Pipeline

NEW DATA

FEATURES

PREDICTION

7

Example Machine Learning Pipeline

NEW DATA

FEATURES

PREDICTION

Feature Extraction

ModelScoring

8

Example Machine Learning Pipeline

(Mathematical function)

(Arbitrary computation)Feature

Extraction

ModelScoring

NEW DATA

FEATURES

PREDICTION

9

Example Machine Learning Pipeline

Model Accuracy

Feature Extraction

ModelScoring

NEW DATA

FEATURES

PREDICTION

LABELED HISTORICAL DATA

FEATURES

PREDICTION

LABELS

Validation

10

Example Machine Learning Pipeline: Fraud Detection

Model Accuracy

Feature Extraction

ModelScoring

NEW DATA

FEATURES

PREDICTION

LABELED HISTORICAL DATA

FEATURES

PREDICTION

LABELS

Validation

Previously labeled credit card

transactions

X% of predictions are accurately predicted to

be fraud

New credit card transactions

Yes/No fraud prediction

BI DASHBOARD

11

Building your Machine Learning Stack for Experimentation

Model Accuracy

LABELED HISTORICAL DATA

FEATURES

PREDICTION

LABELS

PROGRAMMING STACKPython/NumPy StackROthers: Matlab, Julia

VISUALIZATION TOOLInteractiveCustomizableSharable

12

Building your Machine Learning Stack for Productization

NEW DATA

FEATURES

PREDICTION

o Performance

o Message Delivery Semantics

o Message Queues

o Distributed Computation

o Persistent Database

o Business Intelligence Tool

Going Real-Time is the Next Phase for Big Data

MORE SENSORS

MORE INTERCONNECTIVITY

MOREUSER DEMAND

14

Real-Time Machine Learning

Model Accuracy

Feature Extraction

ModelScoring

NEW DATA

FEATURES

PREDICTION

LABELED HISTORICAL DATA

FEATURES

PREDICTION

LABELS

Validation

15

Real-Time Machine Learning

Model Accuracy

Feature Extraction

ModelScoring

NEW DATA

FEATURES

PREDICTION

LABELED HISTORICAL DATA

FEATURES

PREDICTION

LABELS

Validation

Machine Learning Industry Examples

New O’REILLY BOOK:

The Path to Predictive Analytics and Machine Learning

memsql.com/oreillyml

Real-timewebsite

traffic data

Spark MLlib Predictive Model

Advertising Optimization in AdTech Industry

Determine the best ads to serve to users based on previous behavior and market segmentation

Machine learning model implemented using Spark MLlib

BUSINESSLOGIC

18

Real-TimeAnomaly Alerts

...

Sensor Activity

Anomaly Detection in Energy Industry

Proactively detect non-optimally functioning oil drills based on sensor activity, and correct issues before damage is done

Machine learning model is SAS model exported in PMML format run inside Spark

PMML Predictive Model

Facial imagecontent

Machine Learning

Model

Image recognitiondashboard

Image Detection in Social Media Industry

Find matches for faces found in millions of image files, and determine if the face matches a set of known “friends”

Machine learning model is a vector dot product calculation between image feature vectors, all done in MemSQL

MNISTdata

MemSQLPipeline SQL

Leverage the TensorFlow machine learning library to predict images in the MNIST training data set, and store predictions in MemSQL

MNIST Data Set Demo

DEMO

Q&A

Thank You!Try MemSQL: memsql.com/downloadContact: [email protected]: @memsql