DevOps in the Amazon Cloud – Learn from the pioneersNetflix suro

Netflix Data Pipeline

Sudhir Tonse (@stonse)Danny Yuan (@g9yuayon)

photo credit: http://www.flickr.com/photos/decade_null/142235888/sizes/o/in/photostream/!

Netflix is a log generating company that also happens to stream movies

- Adrian Cockroft

Data Is the most important asset at Netflix

If all the data is easily available to all teams, it can be leveraged in new and

exciting ways

Dashboard

~1000 Device Types

Dashboard

~1000 Device Types

~500 Apps/Web Services

Dashboard

~1000 Device Types

~500 Apps/Web Services

~100 Billion Events/Day !3.2M messages per second at peak time !3GB per second at peak time

Dashboard

Type of Events• User Interface Events • Search Event (‘Matrix’ using PS3 …) • Star Ra>ng Event (HoC : 5 stars, Xbox, US, …)

• Infrastructural Events • RPC Call (API -‐> Billing Service, ‘/bill/..’, 200, …) • Log Errors (NPE, “Movie is null”, …, …)

• Other Events … !!

Making Sense of Billions of Events

http://netflix.github.io+

ElasticSearchDruid

A Humble Beginning

Evolution …Scale!

ApplicationApplication

Application Application

Application

ApplicationApplication

We Want to Process App Data in Hadoop

Our Hadoop Ecosystem

@NetflixOSS Big Data Tools

Hadoop as a Service

Pig Scripting on Steroids

Pig Married to Clojure

S3MPER

S3mper is a library that provides an additional layer of consistency checking on top of Amazon's S3 index through use of a consistent, secondary index.

S3mper is a library that provides an additional layer of consistency

checking on top of Amazon's S3 index through use of a consistent, secondary index.

Efficient ETL with Cassandra

Cassandra

Offline Analysis

Evolution … Speed!

hgrep -C 10 -k 5,2,3 'users.*[1-9]{3}' *catalina.out s3//bucket

We Want to Aggregate, Index, and Query Data in Real Time

Interactive Exploration

Let’s walk through some use cases

client activity event

*/name = “movieStarts”

Pipeline Challenges

• App owners: send and forget

Pipeline Challenges

• Data scientists: validation, ETL, batch processing

Pipeline Challenges

• Data scientists: validation, ETL, batch processing

• DevOps: stream processing, targeted search

Message Routing

We Want to Consume Data Selectively in Different Ways

• Message broker!

• High-throughput!

• Persistent and replicated

There Is More

Intelligent Alerts

Guided Debugging in the Right Context

What We Need

• Ad-hoc query with different dimensions

What We Need

• Quick aggregations and Top-N queries

What We Need

• Quick aggregations and Top-N queries• Time series with flexible filters

What We Need

• Quick aggregations and Top-N queries• Time series with flexible filters• Quick access to raw data using boolean queries

What We Need

• Rapid exploration of high dimensional data!

• Fast ingestion and querying!

• Time series

• Real-time indexing of event streams!

• Killer feature: boolean search!

• Great UI: Kibana

The Old Pipeline

The New Pipeline

There Is More

It’s Not All About Counters and Time Series

RequestId Parent Id Node Id Service Name Status

4965-4a74 0 123 Edge Service 200

4965-4a74 123 456 Gateway 200

4965-4a74 456 789 Service A 200

4965-4a74e 456 abc Service B 200

Status:200

Distributed Tracing

A System that Supports All These

A Data Pipeline To Glue Them All

Make It Simple

Message Producing

• Simple and Uniform API

• messageBus.publish(event)

Consumption Is Simple Too consumer.observe().subscribe(new Subscriber<>() { @Override public void onNext(Ackable<IncomingMessage> ackable) { process(ackable.getEntity(MyEventType.class)); ackable.ack(); } }); !consumer.pause(); consumer.resume()

RxJava

• Functional reactive programming model!

• Powerful streaming API!

• Separation of logic and threading model

Design Decisions

• Top Priority: app stability and throughput

Design Decisions

• Asynchronous operations

Design Decisions

• Aggressive buffering

Design Decisions

• Aggressive buffering

• Drops messages if necessary

Anything Can Fail

Cloud Resiliency

Fault Tolerance Features

• Write and forward with auto-reattached EBS (Amazon’s Elastic Block Storage)

• disk-backed queue: big-queue

• Customized scaling down

There’s More to Do

• Contribute to @NetflixOSS !

• Join us :-)

Summaryhttp://netflix.github.io

+ElasticSearchDruid

You can build your own web-scale data pipeline using open source components

Thank You!Sudhir Tonse http://www.linkedin.com/in/sudhirtonse Twitter: @stonse

Danny Yuan http://www.linkedin.com/pub/danny-yuan/4/374/862 Twitter: @g9yuayon

DevOps in the Amazon Cloud – Learn from the pioneersNetflix suro

Technology

DEVOPS AND CONTINUOUS DELIVERY ENTERPRISE DEVOPS …...DEVOPS AND CONTINUOUS DELIVERY ENTERPRISE DEVOPS STRATEGY ACCELERATOR 2 DevOps and Continuous Delivery are areas of best practice

OpenStack and DevOps - DevOps Meetup

Dev Programming Ops DevOps Success...DevOps Success Damon Edwards @damonedwards dev2ops.org DevOps Cafe Disclosure: DevOps (to me) Disclosure: DevOps (to me) DevOps is not •a speciﬁc

devops, platforms and devops platforms

Learn how ZolonTech bridges the gap between DevOps and Security DevSecOps Whit… · With the evolution of DevOps, automation of developing, testing and deploying the code has become

AWS Certified DevOps · 2019-08-28 · Introduction To DevOps On Cloud 1. Understanding DevOps 2. Lifecycle of DevOps 3. Why DevOps on Cloud? 4. Introduction to AWS 5. DevOps using

DevOps for Software Engineering · DevOps combines Development teams and operations team to bridge gap and enhances both DevOps Key Aspects We learn how a DevOp tool would work More

DevOps Oxford- DevOps + BigData @ RealTime

How DevOps engineer can advance real DevOps...Influencing DevOps without Authority How "DevOps engineer" can advance real DevOps

1 suro presentasi

Collaborative DevOps Learn the magic of Continuous Deliverypublic.dhe.ibm.com/...DevOps...Continuous-Delivery.pdf · Software Delivery Challenges: what we hear from customers 4 Simplified

Adjusted SuRo Capital Corporation - Further Adjustment

InTech-Development of a Hovering Type Intelligent Autonomous Underwater Vehicle p Suro

DevOps @ Enterprise - DevOps Meetup Zurich

Molsa, suro paper de plata - Boileau Editorial de Música...4 Molsa, suro, paper de plata– Marc Timón (Francesc Parcerisas) Editorial BOILEAU Fitxa tècnica de l’estrena Els tres

Splunk Delivering the Promise of DevOps · The DevOps future state workshop is an opportunity for customers to learn how Splunk is being used in the market to provide actionable insights

Netflix suro begins

FOSTERING THE THIRD WAY - YOUR DEVOPS DOJO · WHAT IS A DEVOPS DOJO? DevOps Dojos are immersive learning experiences where full-stack product teams learn new practices and processes,

Agile & DevOps The Past, Present & Future of · The Past, Present & Future of Agile & DevOps Jen Krieger & Stef Walter Products & Technologies May 8, 2018 “Learn from yesterday,

DEVOPS - Agile Leadership Network Houstonalnhouston.org/.../DevOps-Changing-Culture-October-2015.pdf · 2015-10-22 · 1. LEARN TO TRUST • Core to DevOps culture • Dev / Ops