Agile Machine Learning for Real-time Recommender Systems

johann@ifwe.co @jssmith github.com/ifwe

Johann Schleier-Smith CTO, if(we)

what it should look like

1. Gain understanding of machine learning

2. Gain understanding of the product usage

3. See opportunity to make the product better

4. Create training data

5. Train predictive models

6. Put models in production

7. See improvements

what it often looks like

4. Pull records from database to create interesting features (usually aggregates)

6. Go implement models for production

7. See improvements

3-6 months

7. See improvements

Cool! Was it worth it?

• Profitable startup actively pursuing big opportunities in social apps

• Millions of users of existing brands

• Thousands of social contacts per second

real-time recommendations

challenges

• >10 million candidates to select from

• >1000 updates/sec

• Must be responsive to current activity

• Users expect instant query results

Tagged dating feature

implementation pain points

• Data scientist hands model description to software engineer

• May need to translate features from SQL to Java

• Aggregate features require batch processing

• May need to adjust features and model to achieve real-time updates

• Fast scoring requires high-performance in-memory data structures

time for new thinking

one way that works better

Create interesting features

Train predictive models

Put models in production

event history

one right way to data

History.
  filterTime(start, PLUS_INFINITY).
  foreach { e: Event => model.update(e) }
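The replay loop above can be sketched end to end as follows. This is a minimal illustration, not the Antelope implementation: the `Event` fields, the in-memory `History` class, the half-open time range, and `CountingModel` are all assumptions made for the example.

```scala
// Hypothetical event type; the slides do not define its fields.
final case class Event(timestamp: Long, eventType: String, properties: Map[String, String])

// Hypothetical in-memory history, kept sorted by timestamp.
final class History(events: Seq[Event]) {
  private val sorted = events.sortBy(_.timestamp)

  // Keep events with start <= timestamp < end (assumed semantics).
  def filterTime(start: Long, end: Long): History =
    new History(sorted.filter(e => e.timestamp >= start && e.timestamp < end))

  // Visit events in timestamp order.
  def foreach(f: Event => Unit): Unit = sorted.foreach(f)
}

object History {
  val PLUS_INFINITY: Long = Long.MaxValue
}

// Toy model that only counts the events it has been shown.
final class CountingModel {
  private var seen = 0L
  def update(e: Event): Unit = seen += 1
  def eventsSeen: Long = seen
}

object ReplayDemo {
  def run(): Long = {
    val history = new History(Seq(
      Event(30L, "ProductView", Map("skuSelected" -> "2670133")),
      Event(10L, "ProductUpdate", Map("sku" -> "1032361")),
      Event(20L, "ProductView", Map("skuSelected" -> "1032361"))
    ))
    val model = new CountingModel
    // Replay everything from t = 20 onward, oldest first.
    history.filterTime(20L, History.PLUS_INFINITY).foreach(e => model.update(e))
    model.eventsSeen
  }
}
```

The same loop serves both offline replay and live updates, because the model only ever sees one event at a time.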

everything is an event

• Bob registers

• Alice registers

• Alice updates profile

• Bob opens app

• Bob sees Alice in recommendations

• Bob swipes yes on Alice

• Alice receives push notification

• Alice sees Bob swiped yes

• Alice swipes yes

• Alice sends message to Bob

writing the model

class MyModel {
  def update(e: Event) { … }
  def topN(ctx: Context, n: Int) = { … }
}
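One way to fill in `update` and `topN` is a linear model with fixed weights over live features. The sketch below is an assumption-laden illustration: `Feature`, `ViewCount`, `LinearModel`, the event fields, and the rule that `ProductUpdate` events introduce candidates are all hypothetical, and the brute-force scan over candidates would not meet the >10 million candidate requirement.

```scala
import scala.collection.mutable

// Hypothetical types; the slides leave Event and Context undefined.
final case class Event(timestamp: Long, eventType: String, properties: Map[String, String])
final case class Context(query: String)

trait Feature {
  def update(e: Event): Unit
  def score(ctx: Context, candidateId: Long): Double
}

// Toy feature: number of times a SKU has been viewed so far.
// Assumes numeric SKUs, as in the demo data.
final class ViewCount extends Feature {
  private val counts = mutable.Map.empty[Long, Long]
  def update(e: Event): Unit =
    if (e.eventType == "ProductView")
      e.properties.get("skuSelected").foreach { sku =>
        counts(sku.toLong) = counts.getOrElse(sku.toLong, 0L) + 1L
      }
  def score(ctx: Context, candidateId: Long): Double =
    counts.getOrElse(candidateId, 0L).toDouble
}

// Weights are fixed at training time; feature state changes with every event.
final class LinearModel(weighted: Seq[(Feature, Double)]) {
  private val candidates = mutable.Set.empty[Long]

  def update(e: Event): Unit = {
    // Assumption: ProductUpdate events introduce new candidates.
    if (e.eventType == "ProductUpdate")
      e.properties.get("sku").foreach(sku => candidates += sku.toLong)
    weighted.foreach { case (f, _) => f.update(e) }
  }

  // Brute-force scoring of every candidate; highest score first, ties by id.
  def topN(ctx: Context, n: Int): Seq[(Long, Double)] =
    candidates.toSeq
      .map(id => id -> weighted.map { case (f, w) => w * f.score(ctx, id) }.sum)
      .sortWith((a, b) => a._2 > b._2 || (a._2 == b._2 && a._1 < b._1))
      .take(n)
}

object TopNDemo {
  def run(): Seq[(Long, Double)] = {
    val model = new LinearModel(Seq((new ViewCount, 1.0)))
    Seq(
      Event(1L, "ProductUpdate", Map("sku" -> "1")),
      Event(2L, "ProductUpdate", Map("sku" -> "2")),
      Event(3L, "ProductUpdate", Map("sku" -> "3")),
      Event(4L, "ProductView", Map("skuSelected" -> "2")),
      Event(5L, "ProductView", Map("skuSelected" -> "2")),
      Event(6L, "ProductView", Map("skuSelected" -> "3"))
    ).foreach(model.update)
    model.topN(Context("any query"), 2)
  }
}
```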

models are all about features

class MyFeature {
  def update(e: Event) { … }
  def score(ctx: Context, candidateId: Long): Double = { … }
}
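As a concrete instance of the `update`/`score` pair, here is a sketch of a query-dependent feature for product-search data like the demo's: it counts how often a candidate SKU was selected for the same query. The class name `QueryAffinity`, the event fields, and the query normalization are assumptions for illustration only.

```scala
import scala.collection.mutable

// Hypothetical types; the slides leave Event and Context undefined.
final case class Event(timestamp: Long, eventType: String, properties: Map[String, String])
final case class Context(query: String)

// Counts (normalized query, SKU) selections seen so far.
final class QueryAffinity {
  private val counts = mutable.Map.empty[(String, Long), Long]

  // Assumed normalization: trim whitespace and ignore case.
  private def normalize(q: String): String = q.trim.toLowerCase

  def update(e: Event): Unit = {
    if (e.eventType == "ProductView") {
      for {
        sku   <- e.properties.get("skuSelected")
        query <- e.properties.get("query")
      } {
        val key = (normalize(query), sku.toLong) // assumes numeric SKUs
        counts(key) = counts.getOrElse(key, 0L) + 1L
      }
    }
  }

  // Score is the number of past selections of this candidate for this query.
  def score(ctx: Context, candidateId: Long): Double =
    counts.getOrElse((normalize(ctx.query), candidateId), 0L).toDouble
}

object FeatureDemo {
  def scores(): (Double, Double) = {
    val feature = new QueryAffinity
    Seq(
      Event(1L, "ProductView", Map("skuSelected" -> "2670133", "query" -> "Modern warfare")),
      Event(2L, "ProductView", Map("skuSelected" -> "2670133", "query" -> "modern warfare ")),
      Event(3L, "ProductView", Map("skuSelected" -> "1032361", "query" -> "need for speed"))
    ).foreach(feature.update)
    val ctx = Context("MODERN WARFARE")
    (feature.score(ctx, 2670133L), feature.score(ctx, 1032361L))
  }
}
```

Because the feature keeps its own state and is updated one event at a time, the same code can run during historical replay and in production.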

model training

History.
  filterTime(start, PLUS_INFINITY).
  foreach { e: Event =>
    writeTrainingData(outcome(e), model.features(context(e)))
    model.update(e)
  }
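The ordering in the loop above matters: features are written out before the model sees the event, so each training row reflects only what was known at that moment. A minimal sketch under stated assumptions follows. `ToyModel`, `TrainingRow`, and the event fields are hypothetical, `features` takes a candidate id for simplicity (unlike the slide's signature), and negative examples are omitted for brevity.

```scala
import scala.collection.mutable

// Hypothetical types; the slides leave Event and Context undefined.
final case class Event(timestamp: Long, eventType: String, properties: Map[String, String])
final case class Context(query: String)
final case class TrainingRow(label: Double, features: Vector[Double])

// Toy model with a single feature: prior view count of the candidate SKU.
final class ToyModel {
  private val views = mutable.Map.empty[Long, Long]

  def update(e: Event): Unit =
    if (e.eventType == "ProductView")
      e.properties.get("skuSelected").foreach { sku =>
        val id = sku.toLong // assumes numeric SKUs
        views(id) = views.getOrElse(id, 0L) + 1L
      }

  def features(ctx: Context, candidateId: Long): Vector[Double] =
    Vector(views.getOrElse(candidateId, 0L).toDouble)
}

object TrainingDemo {
  def generate(events: Seq[Event]): Vector[TrainingRow] = {
    val model = new ToyModel
    val rows = Vector.newBuilder[TrainingRow]
    events.sortBy(_.timestamp).foreach { e =>
      if (e.eventType == "ProductView") {
        for (sku <- e.properties.get("skuSelected")) {
          val ctx = Context(e.properties.getOrElse("query", ""))
          // Computed BEFORE update(e): only earlier events are reflected.
          rows += TrainingRow(1.0, model.features(ctx, sku.toLong))
        }
      }
      // Only now does the model learn about e.
      model.update(e)
    }
    rows.result()
  }

  def run(): Vector[Double] =
    generate(Seq(
      Event(1L, "ProductView", Map("skuSelected" -> "1", "query" -> "a")),
      Event(2L, "ProductView", Map("skuSelected" -> "1", "query" -> "a")),
      Event(3L, "ProductView", Map("skuSelected" -> "2", "query" -> "b"))
    )).map(_.features.head)
}
```

Swapping the two steps would leak the outcome into its own features: the first view of SKU 1 would be written with a view count of 1 instead of 0.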

live demo

Kaggle competition with Best Buy data

https://www.kaggle.com/c/acm-sf-chapter-hackathon-small

product update events

{
  "timestamp" : "2012-05-03 6:43:15",
  "eventType" : "ProductUpdate",
  "eventProperties" : {
    "sku" : "1032361",
    "regularPrice" : "19.99",
    "name" : "Need for Speed: Hot Pursuit",
    "description" : "Fasten your seatbelt and get ready to drive like your life depends on it..."
    ...
  }
}

product view events

{
  "timestamp" : "2011-10-31 09:48:46",
  "eventType" : "ProductView",
  "eventProperties" : {
    "skuSelected" : "2670133",
    "query" : "Modern warfare"
  }
}

Try it yourself, code and instructions at: https://github.com/ifweco/antelope/blob/master/doc/demo.md

4. Create training data

6. Put models in production

7. See improvements

Fast cycles!!

• All data in form of events – no exceptions!

• Roll through history to generate training examples

• Sample training data carefully to avoid feedback

• Model is static while features are live and personal

• Use interesting features with boring algorithms

• Expressiveness > performance > scalability

github.com/ifwe/antelope @jssmith
