Nearline systems to improve Netflix recommendations


Gopal Krishnan

Feb 2015

About me

Gopal Krishnan

Director, Consumer Science Engineering

Netflix, Inc.

Driving innovation through AB testing the member experience.

Twitter: @sgkrishnan

LinkedIn: https://www.linkedin.com/pub/gopal-krishnan/0/7a7/905

Netflix: global streaming video service for TV and movies

Netflix is available on 1000+ devices

More than 57M members globally

• In more than 50 countries

• Planning to launch in all (200+) countries in 2 years.

Netflix consumes 34% of peak downstream bandwidth in North America

Netflix consumes 6% of peak upstream bandwidth in North America

What does my team do?

• Help improve the rate of innovation through A/B testing to improve the member experience

• Infrastructure for algorithmic support

– Feature value store to help model training

– Services to store and serve explicit data sources

– Services to collect, process, validate, and serve implicit data sources

– Caching services

• Data improves our understanding of end to end user behavior

Every part of Netflix is personalized

NETFLIX RECOMMENDATIONS WITH ONLINE MICRO SERVICES

Life Cycle of Netflix Recommendation Data

Devices

Data Collection

Offline Big Data Analysis

Netflix recommendation:

online services

Netflix API Netflix beacon telemetry

Data Collection: explicit inputs

Star ratings

Data Collection: explicit inputs

Virtual plays from new user on-boarding

Outputs from offline analysis

Devices

Data Collection

online services

“Implicit” Data Services

Popularity Targeting

User clustering

Recommendations combine both online and aggregated offline data

Devices

Data Collection

online services

“Explicit” Data Services

My List On Ramp

Taste pref

Popularity Targeting

User clustering

WHY BOTHER WITH NEAR LINE SYSTEMS THEN?

Our algorithms became too complex to compute online without adding latency.

Near line systems improve our availability story.

Near line systems allow us to innovate at a greater velocity.

Near line systems improve agility and availability

Devices

Data Collection

Big Data Analysis (Hadoop, Teradata)

online services

Pre-computed recommendations

Post-process at run time

Manhattan pre-compute engine

Manhattan: Netflix pre-compute engine

Video Ranker

Row selection

Similars

Top picks
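The split between pre-computed recommendations and run-time post-processing can be sketched as below. This is a minimal illustration, not the Manhattan design: the `PRECOMPUTED` store, the function names, and the filtering rules (dropping recently played or unavailable videos) are all assumptions made for the example. The slides only state that results are pre-computed and then post-processed at run time.

```python
from typing import Dict, List, Set

# Hypothetical pre-computed store: member id -> ranked video ids.
# Assumed to be filled ahead of time by an offline or near line job.
PRECOMPUTED: Dict[str, List[str]] = {
    "member-1": ["v10", "v42", "v7", "v99", "v3"],
}


def post_process(ranked: List[str], recently_played: Set[str],
                 unavailable: Set[str], limit: int) -> List[str]:
    """Cheap run-time step: filter the pre-computed ranking, keep its order."""
    excluded = recently_played | unavailable
    return [video for video in ranked if video not in excluded][:limit]


def recommendations_for(member_id: str, recently_played: Set[str],
                        unavailable: Set[str], limit: int = 3) -> List[str]:
    """Look up the pre-computed ranking and apply the run-time filter."""
    ranked = PRECOMPUTED.get(member_id, [])  # unknown member -> empty list
    return post_process(ranked, recently_played, unavailable, limit)


if __name__ == "__main__":
    print(recommendations_for("member-1", {"v42"}, {"v99"}))
```

The expensive ranking happens before the request arrives; the request path only does a lookup and a filter, which is what keeps latency low in this kind of arrangement.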

What data would improve recommendations even further?

All UI Events from all key platforms

• Moving beyond explicit inputs from users, we would like to track all member activity to derive deeper insights.

• Challenges include:

– 1000s of device platforms

– Non-standardized UIs across different platforms

– Lack of earlier focus on tracking the browse experience

Patterns arise in aggregate

Challenges with collecting UI Events

• Consistent data semantics across lots of device and UI platforms.

• Scaling to handle billions of events.

• Near real-time semantic data quality and validation

• Dealing with data loss (low power devices, loss at the network, etc.)

Canaries for data quality

Near real time feedback and validation on data quality.
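A data-quality canary of this kind can be sketched as a validator plus a health check over a small batch of events. The field names in `REQUIRED_FIELDS`, the 5% threshold, and the report shape are hypothetical; the slides do not describe the actual event schema or the checks Netflix runs.

```python
from typing import Any, Dict, Iterable, List

# Hypothetical schema: field name -> expected Python type.
REQUIRED_FIELDS = {
    "event_type": str,
    "member_id": str,
    "video_id": str,
    "timestamp_ms": int,
}


def validate_event(event: Dict[str, Any]) -> List[str]:
    """Return a list of problems found in one event (empty if valid)."""
    problems = []
    for name, expected_type in REQUIRED_FIELDS.items():
        if name not in event:
            problems.append(f"missing {name}")
        elif not isinstance(event[name], expected_type):
            problems.append(f"bad type for {name}")
    return problems


def canary_report(events: Iterable[Dict[str, Any]],
                  max_invalid_ratio: float = 0.05) -> Dict[str, Any]:
    """Summarize a batch and flag it unhealthy above the assumed threshold."""
    total = 0
    invalid = 0
    for event in events:
        total += 1
        if validate_event(event):
            invalid += 1
    ratio = invalid / total if total else 0.0
    return {"total": total, "invalid": invalid, "ratio": ratio,
            "healthy": ratio <= max_invalid_ratio}


if __name__ == "__main__":
    batch = [
        {"event_type": "impression", "member_id": "m1",
         "video_id": "v1", "timestamp_ms": 1424000000000},
        {"event_type": "impression", "member_id": "m1",
         "timestamp_ms": 1424000000500},  # video_id is missing
    ]
    print(canary_report(batch))
```

Running such a check on a small slice of traffic from a new device or UI build gives feedback within minutes, before bad events reach downstream aggregates.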

“Trending” on Netflix

Now being AB tested

Near line systems for Netflix recommendations

Devices

Data Collection

Big Data Analysis (Hadoop, Teradata)

online services

Pre-computed recommendations

Post-process at run time

Near line data processing and serving systems

“Trending on Netflix” near line system

Take rates (play/impression) (Kafka stream)

Cassandra

dashboards

Stream Processing (ETA: low # of minutes)

Play start (Kafka stream)

1000s / sec

Impressions (Kafka stream)

millions / sec

“Trending on Netflix” near line system

Play start (Kafka stream)

1000s / sec

Impressions (Kafka stream)

millions / sec

Stream Processing

• Windowed operations.

• Small batches.

• Merging streams.

• Flexibility.

Take rates

Impressions rollup

Personalized Ranked videos

Merged to generate “Trending on Netflix”
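The windowed roll-up and stream merge above can be sketched in plain Python. This is only an illustration of the arithmetic, with take rate = plays / impressions per video per window. It is not the Spark Streaming job, and the event shape, the 60-second tumbling window, and the function names are assumptions. The final merge with personalized rankings is left out.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, Tuple

Event = Tuple[int, str]   # (timestamp in seconds, video id), assumed shape
Key = Tuple[int, str]     # (window index, video id)


def rollup(events: Iterable[Event], window_s: int) -> Dict[Key, int]:
    """Count events per (tumbling window, video)."""
    counts: Dict[Key, int] = defaultdict(int)
    for timestamp, video in events:
        counts[(timestamp // window_s, video)] += 1
    return counts


def take_rates(plays: Iterable[Event], impressions: Iterable[Event],
               window_s: int = 60) -> Dict[Key, float]:
    """Merge the two roll-ups: plays / impressions for each window and video."""
    play_counts = rollup(plays, window_s)
    impression_counts = rollup(impressions, window_s)
    return {key: play_counts.get(key, 0) / shown
            for key, shown in impression_counts.items()}


def trending(rates: Dict[Key, float], window: int) -> List[str]:
    """Videos in one window, highest take rate first (ties broken by id)."""
    in_window = [(video, rate) for (w, video), rate in rates.items()
                 if w == window]
    return [video for video, _ in
            sorted(in_window, key=lambda item: (-item[1], item[0]))]


if __name__ == "__main__":
    plays = [(5, "a"), (10, "a"), (70, "b")]
    impressions = [(1, "a"), (2, "a"), (3, "a"), (4, "a"),
                   (6, "b"), (65, "b"), (66, "b")]
    rates = take_rates(plays, impressions)
    print(rates)
    print(trending(rates, window=0))
```

Videos with impressions but no plays in a window get a take rate of 0, and videos with no impressions in a window are skipped, which avoids dividing by zero.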

Spark Streaming at Netflix

• Collaborating with Databricks to make sure Spark (batch and streaming) works well in a cloud environment

– Resiliency and scalability testing

• Actively studying what our algorithms need from Spark batch and Spark streaming at scale.

Spark at Netflix

• Several different use cases where we are interested in Spark – both batch and streaming.

• Largest Spark batch production cluster is 150 m3.2xl instances for personalization.

• Netflix has both Spark batch and Spark streaming in production.

Spark at Netflix

• Integrating with Spark via Scala (mostly), Python, and some SQL.

• Python typically via IPython notebook integration.

• Running in standalone mode or on Mesos.

Spark: areas to watch for.

• We have not really tested the multi-tenancy boundaries yet. Mostly spinning up custom-purpose clusters for now.

• Tuning the jobs and optimizing performance of jobs remains a challenge as we make steady inroads.

• Incrementally getting better with stability and scale as we tackle larger use cases this year.

Netflix Tech Blog

• Tech blog about the “Trending on Netflix” row published today.

• Watch for an upcoming Netflix tech blog post on near line systems, and another about Spark, in the coming weeks.

Now Hiring leaders and engineers!

Talk to me in person or at

Twitter: @sgkrishnan

LinkedIn: https://www.linkedin.com/pub/gopal-krishnan/0/7a7/905
