87
Augury and Omens Aside, Part 1: The Business Case for Apache Mesos Apache Mesos NYC Meetup @Shutterstock 2014-06-11 Paco Nathan http://liber118.com/pxn/ @pacoid meetup.com/Apache-Mesos-NYC-Meetup/events/187583352/

Augury and Omens Aside, Part 1: The Business Case for Apache Mesos

Embed Size (px)

Citation preview

Page 1: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos

Augury and Omens Aside, Part 1: The Business Case for Apache Mesos Apache Mesos NYC Meetup @Shutterstock 2014-06-11

Paco Nathan http://liber118.com/pxn/@pacoid

meetup.com/Apache-Mesos-NYC-Meetup/events/187583352/

Page 2: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos

Disclaimer

The following content results from research, use case analysis, industry observations, plus personal perspectives and opinions – presented by a speaker who is an independent author/consultant.

The following content does not in any way represent the opinions or official messaging for any clients of Liber 118, Apache Foundation, United Nations, Area 51, S.P.E.C.T.R.E., etc.

Except, perhaps, for the smarter ones who nurture an ample sense of humor, which unfortunately may disqualify much of Silicon Valley…

Page 3: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos

Recent News

Apache releases Mesos 0.19mesos.apache.org/blog/mesos-0-19-0-released/

Program announced for inaugural #MesosConevents.linuxfoundation.org/events/mesoscon

Mesosphere takes $10.5M in fundingtechcrunch.com/2014/06/09/mesosphere-grabs-10m-in-series-a-funding-to-transform-datacenters/

Google releases part of Borg/Omega as OSSwired.com/2014/06/google-kubernetes/

Page 4: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos

Recent News

Apache releases Mesos 0.19mesos.apache.org/blog/mesos-0-19-0-released/

Program announced for inaugural #MesosConevents.linuxfoundation.org/events/mesoscon

Mesosphere takes $10.5M in fundingtechcrunch.com/2014/06/09/mesosphere-grabs-10m-in-series-a-funding-to-transform-datacenters/

Google releases part of Borg/Omega as OSSwired.com/2014/06/google-kubernetes/

seriously, can’t top that

Page 5: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos

A Big Idea

Page 6: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos

From Business Use Cases To Bare Metal

Datacenter Computing

Data WorkflowAbstractions

FunctionalProgramming

Paradigm shifts can be observed at three levels of the tech stack for cluster computing. Each implies orders of magnitude in cost savings over prior best results, based on substantive changes in software engineering practices…

Page 7: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos

From Business Use Cases To Bare Metal

Datacenter Computing

Data WorkflowAbstractions

FunctionalProgramming

In other words, now that we have Mesos, Docker, and Spark, why do we need Hadoop legacy software?

Page 8: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos

From Business Use Cases To Bare Metal

Datacenter Computing

Data WorkflowAbstractions

FunctionalProgramming

hard problems?

• latency • aggregation • parallelism • data rates

Countdown: Augury and Omens Aside, Part 3…

Page 9: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos

From Business Use Cases To Bare Metal

Datacenter Computing

Data WorkflowAbstractions

FunctionalProgramming

hard problems => solutions

• applicative systems • leveraging semigroup structure • lazy evaluation aka combinator graph reduction • probabilistic data structures

Countdown: Augury and Omens Aside, Part 3…

Page 10: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos

From Business Use Cases To Bare Metal

Datacenter Computing

Data WorkflowAbstractions

FunctionalProgramming

hard problems?

• process, data, and metadata in silos • BI + data modeling legacy culture • CAP theorem vs. ACID • accidental complexity • propagating schema and lineage • learning curve inertia • managing risk vs. innovation

Countdown: Augury and Omens Aside, Part 2…

Page 11: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos

From Business Use Cases To Bare Metal

Datacenter Computing

Data WorkflowAbstractions

FunctionalProgramming

hard problems => solutions

• interdisciplinary teams • generalize across batch + real-time + etc. • separation of concerns • pattern language • compiler => query planner

Countdown: Augury and Omens Aside, Part 2…

Page 12: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos

From Business Use Cases To Bare Metal

Datacenter Computing

Data WorkflowAbstractions

FunctionalProgramming

hard problems?

• commodity hardware failure rates • sched. batch is simple; sched. services is expensive • no getting around it: building a distrib system • static partitioning => cost of cluster computing • monolithic controllers vs. shared state • low util rates => upsidedown in power availability

Countdown: Augury and Omens Aside, Part 1…

Page 13: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos

From Business Use Cases To Bare Metal

Datacenter Computing

Data WorkflowAbstractions

FunctionalProgramming

hard problems => solutions

• isolation • containerization • mixed workloads • data locality • service+framework architecture • predictive scheduling

Countdown: Augury and Omens Aside, Part 1…

Page 14: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos

Why Does This Matter?

Page 15: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos

IoT Data Rates:

technologyreview.com/...

Page 16: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos

IoT Data Rates:

technologyreview.com/...

Tools and techniques that served well for ad-tech will not necessarily apply for “Industrial Internet” data rates … we must retool; power requirements alone would boil the oceans

Page 17: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos

Some History,Part 3

Page 18: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos

Theory, Eight Decades Ago:

Haskell Curry, known for seminal work on combinatory logic (1927)

Alonzo Church, known for lambda calculus (1936) and much more!

!Both sought formal answers to the question, “What can be computed?”

Narrative Arc: Lambda Somethingorother

Haskell Curry haskell.org

Alonso Churchwikipedia.org

Page 19: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos

Praxis, Four Decades Ago:

Leveraging lambda calculus, combinators, etc., to increase parallelism of apps as applicative systems

John Backusacm.org

Narrative Arc: Lambda Somethingorother

David Tunerwikipedia.org

“Can Programming Be Liberated from the von Neumann Style? A Functional Style and Its Algebra of Programs”ACM Turing Award (1977) stanford.edu/class/cs242/readings/backus.pdf

“A new implementation technique for applicative languages”Turner, D. A. (1979) Softw: Pract. Exper., 9: 31–49. doi: 10.1002/spe.4380090105

Page 20: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos

Today:

Add ALL the Things: Abstract Algebra Meets Analyticsinfoq.com/presentations/abstract-algebra-analyticsAvi Bryant, Strange Loop (2013)

• grouping doesn’t matter (associativity)

• ordering doesn’t matter (commutativity)

• zeros get ignored

In other words, while partitioning data at scale is quite difficult, you can let the math allow your code to be flexible at scale

Avi Bryant@avibryant

Narrative Arc: Lambda Somethingorother

Page 21: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos

Algebra for Analyticsspeakerdeck.com/johnynek/algebra-for-analyticsOscar Boykin, Strata SC (2014)

Oscar Boykin @posco

A + B + C + D + E + F + G + H + I + J + K + L + M + N + O + P

+

+ +

+

+ +

+

(A + B) (C + D) (E + F) (G + H) (I + J) (K + L) (M + N) (O + P)

(A + B) + C + D + E + F + G + H + I + J + K + L + M + N + O + P

• “Associativity allows parallelism in reducing” by letting you put the () where you want

• “Lack of associativity increases latency exponentially”

Narrative Arc: Lambda Somethingorother

???

Page 22: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos

That, plus oh so much more math fun in store!

Narrative Arc: Lambda Somethingorother

The Prior(past decisions)

The Evidence(the data)

The Posterior(current decision)

vu

w

x

M UΣ

VH

n r nr

=

r

m

A

z - cT x'0

x=

b

0

I

input hidden output

Page 23: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos

Some History,Part 2

Page 24: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos

wikipedia.org/wiki/Firefly businessweek.com/1996/41/b349690.htm pubs.media.mit.edu/pubs/papers/32paper.ps

• Firefly, an early commercial recommender system

• intent: the volume of data about things is more than any person can digest

• leveraged similarity within a network

• an evolution of intelligent agents into web apps

• collect machine data about consumer interests

• people communicating with each other and with machines

Narrative Arc: Data Workflow Abstractions

Pattie MaesMIT Media Lab

machine data about cognitive social systems

Page 25: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos

Q3 1997 inflection point: four independent teams working toward horizontal scale-out of workflows based on commodity hardware

This effort prepared the way for huge Internet successes duringthe 1997 holiday season…

AMZN, EBAY, Inktomi (YHOO Search), then GOOG

MapReduce on clusters of commodity hardware and the Apache Hadoop open source stack emerged from this context

Narrative Arc: Data Workflow Abstractions

Page 26: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos

Amazon “Early Amazon: Splitting the website” – Greg Linden glinden.blogspot.com/2006/02/early-amazon-splitting-website.html !eBay “The eBay Architecture” – Randy Shoup, Dan Pritchett addsimplicity.com/adding_simplicity_an_engi/2006/11/you_scaled_your.html addsimplicity.com.nyud.net:8080/downloads/eBaySDForum2006-11-29.pdf !Inktomi (YHOO Search) “Inktomi’s Wild Ride” – Erik Brewer (0:05:31 ff) youtu.be/E91oEn1bnXM !Google “Underneath the Covers at Google” – Jeff Dean (0:06:54 ff) youtu.be/qsan-GQaeyk perspectives.mvdirona.com/2008/06/11/JeffDeanOnGoogleInfrastructure.aspx

Narrative Arc: Data Workflow Abstractions

Page 27: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos

RDBMS

SQL Queryresult sets

recommenders+

classifiersWeb Apps

customertransactions

AlgorithmicModeling

Logs

eventhistory

aggregation

dashboards

Product

EngineeringUX

Stakeholder Customers

DW ETL

Middleware

servletsmodels

Narrative Arc: Data Workflow Abstractions

Page 28: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos

RDBMS

SQL Queryresult sets

recommenders+

classifiersWeb Apps

customertransactions

AlgorithmicModeling

Logs

eventhistory

aggregation

dashboards

Product

EngineeringUX

Stakeholder Customers

DW ETL

Middleware

servletsmodels

“data products”

Narrative Arc: Data Workflow Abstractions

Page 29: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos

See extended discussion + scorecard: www.slideshare.net/pacoid/data-workflows-for-machine-learning-33341183

Page 30: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos

MapReduce

General Batch Processing

Pregel Giraph

Dremel Drill Tez

Impala GraphLab

Storm S4

Specialized Systems: iterative, interactive, streaming, graph, etc.

Narrative Arc: Data Workflow Abstractions

Page 31: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos

2002

2002MapReduce @ Google

2004MapReduce paper

2006Hadoop @ Yahoo!

2004 2006 2008 2010 2012 2014

2014Apache Spark top-level

2010Spark paper

2008Hadoop Summit

The State of Spark, and Where We're Going Next Matei Zaharia Spark Summit (2013) youtu.be/nU6vO2EJAb4

action value

RDDRDDRDD

transformations RDD

How about a generalized engine for distributed, applicative systems – apps sharing code across multiple use cases: batch, iterative, streaming, etc.

Narrative Arc: Data Workflow Abstractions

Page 32: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos

Some History,Part 1

Page 33: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos

Lessons from Google

Page 34: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos

Datacenter Computing

Google has been doing datacenter computing for years, to address the complexities of large-scale data workflows:

• leveraging the modern kernel: isolation in lieu of VMs

• “most (>80%) jobs are batch jobs, but the majority of resources (55–80%) are allocated to service jobs”

• mixed workloads, multi-tenancy

• relatively high utilization rates

• JVM FTW? not so much…

• reality: scheduling batch is simple; scheduling services is hard/expensive

Page 36: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos

“Return of the Borg”

Return of the Borg: How Twitter Rebuilt Google’s Secret Weapon Cade Metzwired.com/wiredenterprise/2013/03/google-borg-twitter-mesos

!The Datacenter as a Computer: An Introduction to the Design of Warehouse-Scale Machines Luiz André Barroso, Urs Hölzle research.google.com/pubs/pub35290.html !!2011 GAFS Omega John Wilkes, et al. youtu.be/0ZFMlO98Jkc

Page 37: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos

Google describes the technology…

Omega: flexible, scalable schedulers for large compute clusters Malte Schwarzkopf, Andy Konwinski, Michael Abd-El-Malek, John Wilkes eurosys2013.tudos.org/wp-content/uploads/2013/paper/Schwarzkopf.pdf

Page 38: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos

Google describes the business case…

Taming Latency Variability Jeff Deanplus.google.com/u/0/+ResearchatGoogle/posts/C1dPhQhcDRv

Page 39: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos

Commercial OS Cluster Schedulers

!• IBM Platform Symphony

• Microsoft Autopilot

!Arguably, some grid controllers are quite notable in-category:

• Univa Grid Engine (formerly SGE)

• Condor

• etc.

Page 40: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos

Emergingat Berkeley

Page 41: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos

Beyond Hadoop

Hadoop – an open source solution for fault-tolerant parallel processing of batch jobs at scale, based on commodity hardware… however, other priorities have emerged for the analytics lifecycle:

• apps require integration beyond Hadoop

• multiple topologies, mixed workloads, multi-tenancy

• significant disruptions in h/w cost/performance curves

• higher utilization

• lower latency

• highly-available, long running services

• more than “Just JVM” – e.g., Py adoption, etc.

Page 42: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos

Just No Getting Around It

“There's Just No Getting Around It: You're Building a Distributed System”Mark CavageACM Queue (2013-05-03) queue.acm.org/detail.cfm?id=2482856

key takeaways on architecture:

• decompose the business application into discrete services on the boundaries of fault domains, scaling, and data workload

• make as many things as possible stateless

• when dealing with state, deeply understand CAP, latency, throughput, and durability requirements

“Without practical experience working on successful—and failed—systems, most engineers take a "hopefully it works" approach and attempt to string together off-the-shelf software, whether open source or commercial, and often are unsuccessful at building a resilient, performant system. In reality, building a distributed system requires a methodical approach to requirements along the boundaries of failure domains, latency, throughput, durability, consistency, and desired SLAs for the business application at all aspects of the application.”

Page 43: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos
Page 44: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos

Mesos – open source datacenter computing

a common substrate for cluster computing

mesos.apache.org

heterogenous assets in your datacenter or cloud made available as a homogenous set of resources

• top-level Apache project

• scalability to 10,000s of nodes

• obviates the need for virtual machines

• isolation (pluggable) for CPU, RAM, I/O, FS, etc.

• fault-tolerant leader election based on Zookeeper

• APIs in C++, Java/Scala, Python, Go, Erlang, Haskell

• web UI for inspecting cluster state

• available for Linux, OpenSolaris, Mac OSX

Page 45: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos

What are the costs of Virtualization?

benchmark type

OpenVZ improvement

mixed workloads 210%-300%

LAMP (related) 38%-200%

I/O throughput 200%-500%

response time order magnitude

more pronounced at higher loads

Page 46: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos

What are the costs of Single Tenancy?

0%

25%

50%

75%

100%

RAILS CPU LOAD

MEMCACHED CPU LOAD

0%

25%

50%

75%

100%

HADOOP CPU LOAD

0%

25%

50%

75%

100%

t t

0%

25%

50%

75%

100%

Rails MemcachedHadoop

COMBINED CPU LOAD (RAILS, MEMCACHED, HADOOP)

Page 47: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos

Arguments for Datacenter Computing

rather than running several specialized clusters, each at relatively low utilization rates, instead run many mixed workloads

obvious benefits are realized in terms of:

• scalability, elasticity, fault tolerance, performance, utilization

• reduced equipment capex, Ops overhead, etc.

• reduced licensing, eliminating need for VMs or potential vendor lock-in

subtle benefits – arguably, more important for Enterprise IT:

• reduced time for engineers to ramp up new services at scale

• reduced latency between batch and services, enabling new high ROI use cases

• enables Dev/Test apps to run safely on a Production cluster

Page 48: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos

Analogies and Architecture

Page 49: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos

Prior Practice: Dedicated Servers

• low utilization rates

• longer time to ramp up new services

DATACENTER

Page 50: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos

Prior Practice: Virtualization

DATACENTER PROVISIONED VMS

• even more machines to manage

• substantial performance decrease due to virtualization

• VM licensing costs

Page 51: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos

Prior Practice: Static Partitioning

STATIC PARTITIONING

• even more machines to manage

• substantial performance decrease due to virtualization

• VM licensing costs

• failures make static partitioning more complex to manage

DATACENTER

Page 52: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos

MESOS

Mesos: One Large Pool of Resources

“We wanted people to be able to program for the datacenter just like they program for their laptop." !

Ben Hindman

DATACENTER

Page 53: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos

!

Fault-tolerant distributed systems…

…written in 100-300 lines of C++, Java/Scala, Python, Go, etc.

…building blocks, if you will

!

Q: required lines of network code?

A: probably none

Page 54: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos

Mesos – architecture

HDFS, distrib file system

Mesos, distrib kernel

meta-frameworks: Aurora, Marathon

frameworks: Spark, Storm, MPI, Jenkins, etc.

task schedulers: Chronos, etc.

APIs: C++, JVM, Py, Go

apps: HA services, web apps, batch jobs, scripts, etc.

Linux: libcgroup, libprocess, libev, etc.

Page 55: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos

Mesos – dynamics

Mesosdistrib kernel

Marathondistrib init.d

Chronosdistrib cron

distribframeworks

HAservices

scheduledapps

Page 56: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos

Mesos – dynamics

resourceoffers

distributedframework Scheduler Executor Executor Executor

Mesosslave

Mesosslave

Mesosslave

distributedkernel

available resources

Mesosslave

Mesosslave

Mesosslave

MesosmasterMesosmaster

Page 57: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos

Example: Resource Offer in a Two-Level Scheduler

mesos.apache.org/documentation/latest/mesos-architecture/

Page 59: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos

Looking Ahead…

Page 60: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos

Quasar+Mesos @ Stanford, Twitter, etc.…

Quasar: Resource-Efficient and QoS-Aware Cluster Management Christina Delimitrou, Christos Kozyrakisstanford.edu/~cdel/2014.asplos.quasar.pdf

Page 61: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos

Quasar+Mesos @ Stanford, Twitter, etc.…

Improving Resource Efficiency with Apache Mesos Christina Delimitrouyoutu.be/YpmElyi94AA

Page 62: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos

Quasar+Mesos @ Stanford, Twitter, etc.…

Consider that for datacenter computing at scale, a surge in workloads implies:

• large cap-ex investment, long lead-time to build

• utilities cannot supply the power requirements

Even for large players that achieve 2x beyond typical industry DC util rates, those factors become show-stoppers. Even so, high rates of over-provisioning are typical, so there’s much room to improve.

Experiences with Quasar+Mesos showed:

• 88% apps get >95% performance

• ~10% overprovisioning instead of 500%

• up to 70% cluster util at steady state

• 23% shorter scenario completion

Page 63: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos

Because… Use Cases

Page 64: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos

Production Deployments (public)

Page 65: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos

Built-in /bare metal

Hypervisors

Solaris Zones

Linux CGroups

Opposite Ends of the Spectrum, One Common Substrate

Page 66: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos

Opposite Ends of the Spectrum, One Common Substrate

Request /Response Batch

Page 67: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos

Case Study: Twitter (bare metal / on premise)

“Mesos is the cornerstone of our elastic compute infrastructure – it’s how we build all our new services and is critical for Twitter’s continued success at scale. It's one of the primary keys to our data center efficiency."

Chris Fry, SVP Engineering blog.twitter.com/2013/mesos-graduates-from-apache-incubation wired.com/gadgetlab/2013/11/qa-with-chris-fry/ !

• key services run in production: analytics, typeahead, ads

• Twitter engineers rely on Mesos to build all new services

• instead of thinking about static machines, engineers think about resources like CPU, memory and disk

• allows services to scale and leverage a shared pool of servers across datacenters efficiently

• reduces the time between prototyping and launching

Page 68: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos

Case Study: Airbnb (fungible cloud infrastructure)

“We think we might be pushing data science in the field of travel more so than anyone has ever done before… a smaller number of engineers can have higher impact through automation on Mesos."

Mike Curtis, VP Engineeringgigaom.com/2013/07/29/airbnb-is-engineering-itself-into-a-data...

• improves resource management and efficiency

• helps advance engineering strategy of building small teams that can move fast

• key to letting engineers make the most of AWS-based infrastructure beyond just Hadoop

• allowed company to migrate off Elastic MapReduce

• enables use of Hadoop along with Chronos, Spark, Storm, etc.

Page 69: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos

Case Study: eBay (continuous integration)

eBay PaaS Team ebaytechblog.com/2014/04/04/delivering-ebays-ci-solution-with-apache-mesos-part-i/

• cluster management (PaaS core framework services) for CI

• integration of: OpenStack, Jenkins, Zookeeper, Mesos, Marathon, Ansible

In eBay’s existing CI model, each developer gets a personal CI/Jenkins Master instance. This Jenkins instance runs within a dedicated VM, and over time the result has been VM sprawl and poor resource utilization. We started looking at solutions to maximize our resource utilization and reduce the VM footprint while still preserving the individual CI instance model. After much deliberation, we chose Apache Mesos for a POC. This post shares the journey of how we approached this challenge and accomplished our goal.

Page 70: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos

Case Study: HubSpot (cluster management)

Tom Petr youtu.be/ROn14csiikw

mesosphere.io/resources/mesos-case-study-hubspot/

• 500 deployable objects; 100 deploys/day to production; 90 engineers; 3 devops on Mesos cluster

• “Our QA cluster is now a fixed $10K/month — that used to fluctuate”

Page 71: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos

DIY

Page 72: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos

!

!

http://elastic.mesosphere.io

!

http://mesosphere.io/learn !

Page 73: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos
Page 74: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos
Page 75: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos
Page 76: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos
Page 77: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos
Page 78: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos
Page 79: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos
Page 80: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos
Page 81: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos
Page 82: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos

SummaryQuestion

Page 83: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos

Given the points about Part 3, Part 2, Part 1…Given the history from Church and Curry to BDAS and Twitter OSS… Given the needs, e.g., IoT preferably not boiling the oceans…

Why do we still see proto-legacy systems like Tez? Or, for that matter, why do we find notable experts stating that “Hadoop is an OS” ?

It’s time to set the legacy of YHOO circa 2009 aside, to step up to contemporary challenges with better understanding of the underlying math and CS theory => solving business use cases at scale

To paraphrase author William Gibson, the future is already here – it’s just not very evenly distributed, nor is it google-able

Summary Question:

Page 84: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos

IoT Data Rates:

???

Page 85: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos

ありがとうございました

Page 86: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos

monthly newsletter for updates, events, conf summaries, etc.:

liber118.com/pxn/

Enterprise Data Workflows with Cascading O’Reilly, 2013

shop.oreilly.com/product/0636920028536.do

Just Enough Math O’Reilly, 2014

oreilly.com/go/enough_math/preview: youtu.be/TQ58cWgdCpA

Page 87: Augury and Omens Aside, Part 1: The Business Case for Apache Mesos

Spark SummitSF, Jun 30 15% code: Paco2014 spark-summit.org/2014

OSCON 2014 PDX, Jul 20 20% code: PACOID oscon.com/oscon2014/

#MesosCon Chicago, Aug 21 events.linuxfoundation.org/events/mesoscon

Strata NYC + Hadoop World NYC, Oct 15 strataconf.com/stratany2014

Data Day Texas Austin, Jan 10 datadaytexas.com

calendar: