39
Joseph E. Gonzalez [email protected]; Assistant Professor @ UC Berkeley [email protected]; Co-Founder @ Dato Inc. Intelligent Services Serving Machine Learning

Joseph E. Gonzalez [email protected]; Assistant Professor @ UC Berkeley [email protected]; Co-Founder @ Dato Inc. Intelligent Services Serving Machine

Embed Size (px)

Citation preview

Page 1: Joseph E. Gonzalez jegonzal@cs.berkeley.edu; Assistant Professor @ UC Berkeley joseph@dato.com; Co-Founder @ Dato Inc. Intelligent Services Serving Machine

Joseph E. [email protected]; Assistant Professor @ UC Berkeley

[email protected]; Co-Founder @ Dato Inc.

Intelligent ServicesServing Machine Learning

Page 2: Joseph E. Gonzalez jegonzal@cs.berkeley.edu; Assistant Professor @ UC Berkeley joseph@dato.com; Co-Founder @ Dato Inc. Intelligent Services Serving Machine

Contemporary Learning Systems

TrainingData Models

Big Big

Page 3: Joseph E. Gonzalez jegonzal@cs.berkeley.edu; Assistant Professor @ UC Berkeley joseph@dato.com; Co-Founder @ Dato Inc. Intelligent Services Serving Machine

Contemporary Learning Systems

MLlib

Create

MLCLIBSVMVW Oryx 2

BIDMach

Page 4: Joseph E. Gonzalez jegonzal@cs.berkeley.edu; Assistant Professor @ UC Berkeley joseph@dato.com; Co-Founder @ Dato Inc. Intelligent Services Serving Machine

TrainingData Model

What happens after we train a model?

Dashboards and Reports

Conference

PapersDrive Actions

Page 5: Joseph E. Gonzalez jegonzal@cs.berkeley.edu; Assistant Professor @ UC Berkeley joseph@dato.com; Co-Founder @ Dato Inc. Intelligent Services Serving Machine

TrainingData Model

What happens after we train a model?

Dashboards and Reports

Conference

PapersDrive Actions

Page 6: Joseph E. Gonzalez jegonzal@cs.berkeley.edu; Assistant Professor @ UC Berkeley joseph@dato.com; Co-Founder @ Dato Inc. Intelligent Services Serving Machine

Suggesting Itemsat Checkout

Fraud Detection

Cognitive Assistance

Internet ofThings

Low-Latency Personalized Rapidly Changing

Page 7: Joseph E. Gonzalez jegonzal@cs.berkeley.edu; Assistant Professor @ UC Berkeley joseph@dato.com; Co-Founder @ Dato Inc. Intelligent Services Serving Machine

TrainData Model

Page 8: Joseph E. Gonzalez jegonzal@cs.berkeley.edu; Assistant Professor @ UC Berkeley joseph@dato.com; Co-Founder @ Dato Inc. Intelligent Services Serving Machine

TrainData Model

Actions ServeAd

apt

Page 9: Joseph E. Gonzalez jegonzal@cs.berkeley.edu; Assistant Professor @ UC Berkeley joseph@dato.com; Co-Founder @ Dato Inc. Intelligent Services Serving Machine

9

MachineLearning

IntelligentServices

Page 10: Joseph E. Gonzalez jegonzal@cs.berkeley.edu; Assistant Professor @ UC Berkeley joseph@dato.com; Co-Founder @ Dato Inc. Intelligent Services Serving Machine

The Life of a Query in an Intelligent Service

Web

Ser

ving

Tie

rUser

ProductInfo

Inte

llige

nt S

ervi

ce

UserData

ModelInfo

LookupModel

FeatureLookup

FeatureLookup

Top-KQuery

Request:Items like x

New PageImages …

Top Items

Content Request

Feedback:Preferred Item

Feedback

μ σ

ρ∑

∫α

βmath

Page 11: Joseph E. Gonzalez jegonzal@cs.berkeley.edu; Assistant Professor @ UC Berkeley joseph@dato.com; Co-Founder @ Dato Inc. Intelligent Services Serving Machine

Essential Attributes of Intelligent Services

ResponsiveIntelligent applications

are interactive

AdaptiveML models out-of-date the

moment learning is done

ManageableMany models

created by multiple people

Page 12: Joseph E. Gonzalez jegonzal@cs.berkeley.edu; Assistant Professor @ UC Berkeley joseph@dato.com; Co-Founder @ Dato Inc. Intelligent Services Serving Machine

Responsive: Now and AlwaysCompute predictions in < 20ms for complex

under heavy query load with system failures.

Models Queries

Top

K

Features

SELECT * FROMusers JOIN items,click_logs, pagesWHERE …

Page 13: Joseph E. Gonzalez jegonzal@cs.berkeley.edu; Assistant Professor @ UC Berkeley joseph@dato.com; Co-Founder @ Dato Inc. Intelligent Services Serving Machine

Experiment: End-to-end Latency in Spark MLlib

To JSON

HTTP Req. Feature Trans.

Evaluate Model

Encode Prediction

HTTP Response4

Page 14: Joseph E. Gonzalez jegonzal@cs.berkeley.edu; Assistant Professor @ UC Berkeley joseph@dato.com; Co-Founder @ Dato Inc. Intelligent Services Serving Machine

0

100

200

300

400

4.3 21.8 22.4

137.750.5

172.6

418.7Late

ncy

in M

il-lis

eco

nds

Page 15: Joseph E. Gonzalez jegonzal@cs.berkeley.edu; Assistant Professor @ UC Berkeley joseph@dato.com; Co-Founder @ Dato Inc. Intelligent Services Serving Machine

Adaptive to Change at All Scales

Months Rate of Change Minutes

Population Granularity of Data Session

Shoppingfor Mom

Shoppingfor Me

Page 16: Joseph E. Gonzalez jegonzal@cs.berkeley.edu; Assistant Professor @ UC Berkeley joseph@dato.com; Co-Founder @ Dato Inc. Intelligent Services Serving Machine

Adaptive to Change at All Scales

Months Rate of Change Minutes

Population Granularity of Data Session

Shoppingfor Mom

Shoppingfor Me

Population Law of Large Numbers Change Slow

Rely on efficient offline retraining

High-throughput Systems

Months

Page 17: Joseph E. Gonzalez jegonzal@cs.berkeley.edu; Assistant Professor @ UC Berkeley joseph@dato.com; Co-Founder @ Dato Inc. Intelligent Services Serving Machine

Adaptive to Change at All Scales

Months Rate of Change Minutes

Population Granularity of Data Session

Small Data Rapidly

Changing

Low Latency Online

Learning

Sensitive to feedback bias

Shoppingfor Mom

Shoppingfor Me

Page 18: Joseph E. Gonzalez jegonzal@cs.berkeley.edu; Assistant Professor @ UC Berkeley joseph@dato.com; Co-Founder @ Dato Inc. Intelligent Services Serving Machine

The Feedback LoopI once looked at cameras on Amazon …

My Amazon Homepage

Similar cameras and

accessoriesOpportunity for Bandit Algorithms

Bandits present new challenges:• computation overhead• complicates caching + indexing

Page 19: Joseph E. Gonzalez jegonzal@cs.berkeley.edu; Assistant Professor @ UC Berkeley joseph@dato.com; Co-Founder @ Dato Inc. Intelligent Services Serving Machine

Management: Collaborative Development

Teams of data-scientists working on similar tasks“competing” features and modelsComplex model dependencies:

Cat PhotoisCat

CutenessPredictor

Cat Classifier AnimalClassifier Cute!

isAnimal

Page 20: Joseph E. Gonzalez jegonzal@cs.berkeley.edu; Assistant Professor @ UC Berkeley joseph@dato.com; Co-Founder @ Dato Inc. Intelligent Services Serving Machine

Predictive Services

UC Berkeley AMPLabDaniel Crankshaw, Xin Wang, Joseph Gonzalez

Peter Bailis, Haoyuan, Zhao Zhang, Michael J. Franklin, Ali Ghodsi,

and Michael I. Jordan

Page 21: Joseph E. Gonzalez jegonzal@cs.berkeley.edu; Assistant Professor @ UC Berkeley joseph@dato.com; Co-Founder @ Dato Inc. Intelligent Services Serving Machine

Predictive Services

UC Berkeley AMPLabDaniel Crankshaw, Xin Wang, Joseph Gonzalez

Peter Bailis, Haoyuan, Zhao Zhang, Michael J. Franklin, Ali Ghodsi,

and Michael I. Jordan

Active Research Project

Page 22: Joseph E. Gonzalez jegonzal@cs.berkeley.edu; Assistant Professor @ UC Berkeley joseph@dato.com; Co-Founder @ Dato Inc. Intelligent Services Serving Machine

Velox Model Serving System

Focuses on the multi-task learning (MTL) domain

[CIDR’15, LearningSys’15]

SpamClassification

Content Rec.Scoring

Session 1:

Session 2:

LocalizedAnomaly Detection

Page 23: Joseph E. Gonzalez jegonzal@cs.berkeley.edu; Assistant Professor @ UC Berkeley joseph@dato.com; Co-Founder @ Dato Inc. Intelligent Services Serving Machine

Velox Model Serving SystemPersonalized Models (Mulit-task Learning)

[CIDR’15, LearningSys’15]

Input Output

“Separate” model for each user/context.

Page 24: Joseph E. Gonzalez jegonzal@cs.berkeley.edu; Assistant Professor @ UC Berkeley joseph@dato.com; Co-Founder @ Dato Inc. Intelligent Services Serving Machine

Personalized Models (Mulit-task Learning)

Split

PersonalizationModel

FeatureModel

Velox Model Serving System[CIDR’15, LearningSys’15]

Page 25: Joseph E. Gonzalez jegonzal@cs.berkeley.edu; Assistant Professor @ UC Berkeley joseph@dato.com; Co-Founder @ Dato Inc. Intelligent Services Serving Machine

Hybrid Offline + Online Learning

Split

PersonalizationModel

FeatureModel

Update the user weights online:• Simple to train + more robust model• Address rapidly changing user

statistics

Update feature functions offline using batch solvers• Leverage high-throughput systems

(Apache Spark)• Exploit slow change in population statistics

Page 26: Joseph E. Gonzalez jegonzal@cs.berkeley.edu; Assistant Professor @ UC Berkeley joseph@dato.com; Co-Founder @ Dato Inc. Intelligent Services Serving Machine

Hybrid Online + Offline Learning Results

Similar Test Error Substantially Faster Training

User Pref. Change

Hybrid

Offline Full

HybridOffline Full

Page 27: Joseph E. Gonzalez jegonzal@cs.berkeley.edu; Assistant Professor @ UC Berkeley joseph@dato.com; Co-Founder @ Dato Inc. Intelligent Services Serving Machine

Evaluating the Model

Split

CacheFeature Evaluation

Input

Page 28: Joseph E. Gonzalez jegonzal@cs.berkeley.edu; Assistant Professor @ UC Berkeley joseph@dato.com; Co-Founder @ Dato Inc. Intelligent Services Serving Machine

Evaluating the Model

Split

CacheFeature Evaluation

Input

Feature Caching

Across Users

Anytime Feature

Evaluation

ApproximateFeature Hashing

Page 29: Joseph E. Gonzalez jegonzal@cs.berkeley.edu; Assistant Professor @ UC Berkeley joseph@dato.com; Co-Founder @ Dato Inc. Intelligent Services Serving Machine

Feature Caching

Feature Hash Table

Hash input:

Compute feature:

New input:

Page 30: Joseph E. Gonzalez jegonzal@cs.berkeley.edu; Assistant Professor @ UC Berkeley joseph@dato.com; Co-Founder @ Dato Inc. Intelligent Services Serving Machine

LSH Cache Coarsening

Hash new input: Use Wrong Value!

LSH hash fn.

New input

Feature Hash Table

Page 31: Joseph E. Gonzalez jegonzal@cs.berkeley.edu; Assistant Professor @ UC Berkeley joseph@dato.com; Co-Founder @ Dato Inc. Intelligent Services Serving Machine

LSH Cache Coarsening

Feature Hash Table

Hash new input: Use Value Anyways!

Req. LSH

Locality-Sensitive Hashing:

Locality-Sensitive Caching:

Page 32: Joseph E. Gonzalez jegonzal@cs.berkeley.edu; Assistant Professor @ UC Berkeley joseph@dato.com; Co-Founder @ Dato Inc. Intelligent Services Serving Machine

Anytime PredictionsCompute features asynchronously:

if a particular element does not arrive use estimator instead

Always able to render a prediction by the latency deadline

__ __ __

Page 33: Joseph E. Gonzalez jegonzal@cs.berkeley.edu; Assistant Professor @ UC Berkeley joseph@dato.com; Co-Founder @ Dato Inc. Intelligent Services Serving Machine

No C

oarse

nin

g

Coarsening + Anytime Predictions

Overl

y C

oars

en

ed More Features

Approx. Expectation

Better

Best

Coarser Hash

Checkout our poster!

Page 34: Joseph E. Gonzalez jegonzal@cs.berkeley.edu; Assistant Professor @ UC Berkeley joseph@dato.com; Co-Founder @ Dato Inc. Intelligent Services Serving Machine

Spark Streami

ng SparkSQL

Graph X ML

library

BlinkDB

MLbase Velox

Training Management + Serving

Spark

HDFS, S3, … Tachyon

ModelManag

er

Prediction

ServiceMesos

Part of Berkeley Data Analytics Stack

Page 35: Joseph E. Gonzalez jegonzal@cs.berkeley.edu; Assistant Professor @ UC Berkeley joseph@dato.com; Co-Founder @ Dato Inc. Intelligent Services Serving Machine

Predictive Services

UC Berkeley AMPLabDaniel Crankshaw, Xin Wang,

Peter Bailis, Haoyuan, Zhao Zhang, Michael J. Franklin, Ali Ghodsi,

and Michael I. Jordan

Page 36: Joseph E. Gonzalez jegonzal@cs.berkeley.edu; Assistant Professor @ UC Berkeley joseph@dato.com; Co-Founder @ Dato Inc. Intelligent Services Serving Machine

Dato Predictive Services

Elastic scaling and load balancing of docker.io containers

AWS Cloudwatch Metrics and Reporting

Serves Dato Create models, scikit-learn, and custom python

Distributed shared caching: scale-out to address latency

REST management API: Demo?

Production ready model serving and management system

Page 37: Joseph E. Gonzalez jegonzal@cs.berkeley.edu; Assistant Professor @ UC Berkeley joseph@dato.com; Co-Founder @ Dato Inc. Intelligent Services Serving Machine

Predictive Services

UC Berkeley AMPLabDaniel Crankshaw, Xin Wang, Joseph Gonzalez

Peter Bailis, Haoyuan, Zhao Zhang, Michael J. Franklin, Ali Ghodsi,

and Michael I. Jordan

Caching, Bandits, &Management

Online/Offline LearningLatency vs. Accuracy

Key Insights:

Responsive

Adaptive

Manageable

Page 38: Joseph E. Gonzalez jegonzal@cs.berkeley.edu; Assistant Professor @ UC Berkeley joseph@dato.com; Co-Founder @ Dato Inc. Intelligent Services Serving Machine

TrainData

Model

ActionsServeAd

apt

Future of Learning Systems

Page 39: Joseph E. Gonzalez jegonzal@cs.berkeley.edu; Assistant Professor @ UC Berkeley joseph@dato.com; Co-Founder @ Dato Inc. Intelligent Services Serving Machine

Thank You

Joseph E. [email protected], Assistant Professor @ UC Berkeley

[email protected], Co-Founder @ Dato