How Appboy’s Marketing Automation for Apps Platform Grew 40x on the ObjectRocket MongoDB Platform


Scaling 40x on the ObjectRocket MongoDB Platform

Jon Hyman & Kenny Gorman

MongoDB World, June 25, 2014, NYC

@appboy @objectrocket @jon_hyman @kennygorman

A LITTLE BIT ABOUT JON & APPBOY

Jon Hyman, CIO :: @jon_hyman

Appboy is a marketing automation platform for apps

Background: Harvard, Bridgewater

A LITTLE BIT ABOUT KENNY & OBJECTROCKET

Kenny Gorman, Co-Founder & Chief Architect :: @kennygorman

ObjectRocket is a highly available, sharded, unbelievably fast MongoDB-as-a-service

Background: ObjectRocket, eBay, Shutterfly

Agenda

• Evolution of Appboy’s MongoDB installation as we grew to handle billions of data points per month


• Operational MongoDB issues we worked through

MongoDB Evolution:

March, 2013

[Timeline: March 2013 to March 2014]

What did Appboy look like in March, 2013?

• ~2.5 million events per day tracking 8 million users

• Event storage: every data point as a new document

• Single, unsharded replica set on AWS (m2.xlarge)

• Mostly long-tail customers; biggest app had 2M users

• Growing a lot on disk :-(

• Started running into locking issues (30-40%) :-(
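To make the storage model concrete, here is a minimal sketch of the "every data point as a new document" pattern, assuming pymongo; the collection and field names are illustrative, not Appboy's actual schema.

```python
# A minimal sketch of "every data point as a new document", assuming
# pymongo; collection and field names are illustrative, not Appboy's.
from datetime import datetime, timezone
from pymongo import MongoClient

events = MongoClient("mongodb://localhost:27017").appboy.events

# Every tracked event becomes its own document, so the collection and its
# indexes grow linearly with event volume: the root of the disk growth
# and write-lock contention noted above.
events.insert_one({
    "app_id": "example-app",
    "user_id": "device-1234",
    "event": "session_start",
    "at": datetime.now(timezone.utc),
})
```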

MongoDB Evolution:

April, 2013

[Timeline: milestone so far: scaled vertically]

What happened in April, 2013?

• First enterprise client signs

• More than 50 million users

• They estimated sending us over 1 billion data points per month

• “Btw, we’re going live next month”

MongoDB Evolution:

April, 2013: holy crap!

ObjectRocket: Getting Started

• The landscape of a simple configuration

• It’s all about choosing shard keys

• Locks - you know you love them


What are we going to do?

• Contain growth from data points:

• Shifted to Amazon Redshift for “raw data”

• Moved MongoDB to storing pre-aggregated analytics for time series data

• Figure out sharding ASAP

• Moved to ObjectRocket, worked on shard key selection

• Sharding was hard:

• Tough to figure out the right shard key and make the tradeoffs

• Had to rewrite a lot of application code to include shard keys in queries and inserts, and adjust to life without unique indexes

Shard key selections: Users

• Had multiple ways to identify a user

• Device identifier, “external user id”, BSON ID

• Often performed large scans of user bases

Chosen shard key: {_id: “hashed”}

• Cache secondary identifiers to BSON ID to reduce scatter-gather queries

• Doing scatter-gathers goes against conventional wisdom (see the sketch below)
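Below is a hedged sketch of what this can look like, assuming pymongo and redis-py; the database, collection, and cache key names are illustrative, not Appboy's. Hashing the BSON _id spreads writes evenly across shards, and a small cache from secondary identifiers to _id lets most reads include the shard key and hit a single shard.

```python
# A hedged sketch of the users shard key plus an identifier cache,
# assuming pymongo and redis-py; names are illustrative, not Appboy's.
import redis
from bson import ObjectId
from pymongo import MongoClient

client = MongoClient("mongodb://mongos.example:27017")
cache = redis.Redis()

# One-time setup: hash the BSON _id so writes spread evenly across shards.
client.admin.command("enableSharding", "appboy")
client.admin.command("shardCollection", "appboy.users", key={"_id": "hashed"})

def find_user_by_device(device_id: str):
    # If the cache already maps this device to a BSON _id, the query
    # carries the shard key and mongos routes it to a single shard.
    cached = cache.get(f"device:{device_id}")
    if cached is not None:
        return client.appboy.users.find_one({"_id": ObjectId(cached.decode())})
    # Cache miss: one scatter-gather across all shards, then remember
    # the mapping so subsequent lookups are targeted.
    user = client.appboy.users.find_one({"device_id": device_id})
    if user is not None:
        cache.set(f"device:{device_id}", str(user["_id"]))
    return user
```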

Shard key selections: Pre-aggregated analytics

• Always query history for a single app

• 1 document per day per app per metric

Chosen shard key: {app_id: 1}
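As an illustration of the pre-aggregated pattern (the field names are assumptions, not Appboy's schema), each incoming event bumps counters in a single per-app, per-metric, per-day document via $inc with an upsert:

```python
# An illustration of pre-aggregated time-series analytics: one document
# per app per metric per day, with counters bumped in place rather than
# one document per event. Field names are assumptions, not Appboy's.
from datetime import datetime, timezone
from pymongo import MongoClient

daily_metrics = MongoClient("mongodb://mongos.example:27017").appboy.daily_metrics

def record(app_id: str, metric: str) -> None:
    now = datetime.now(timezone.utc)
    daily_metrics.update_one(
        {"app_id": app_id, "metric": metric, "day": now.strftime("%Y-%m-%d")},
        {"$inc": {f"hours.{now.hour}": 1, "total": 1}},
        upsert=True,  # the first event of the day creates the document
    )
```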

MongoDB Evolution:

May - October, 2013

[Timeline: milestones so far: scaled vertically → started sharding → everything sharded]

What did Appboy look like in May - October, 2013?

• textPlus goes live, as do other customers

• > 1 billion events per month, doing great!

• 4 × 100GB shards on ObjectRocket

MongoDB Evolution:

November, 2013

[Timeline: milestones so far: scaled vertically → started sharding → everything sharded → various customer launches]

What happened in November, 2013?

• One of the largest European soccer apps

• Soccer games crushed us: 15 million data points per hour just from this app!

• Lock percentage ran high; a single shard was pegged

• Real-time analytics processing got severely delayed, and adding more servers did not help (in fact, it made things worse)

Why a single shard?

Shard key selections (revisited): Pre-aggregated analytics

• Always query history for a single app

• 1 document per day per app per metric

Chosen shard key: {app_id: 1}

With {app_id: 1}, every pre-aggregated write for a given app routes to the same shard, and each day’s counters live in a single hot document.

ObjectRocket: Capacity, Growth

• Concurrency

• Did I mention locks?

• Cache management

• Compaction

• The shell game

• Indexing at scale

How to fix this?

• Fundamentally, all updates are going to a single document

• Can’t shard out a single document

• Asked ObjectRocket for their suggestions

Introduce write buffering

Write buffering

• Buffer writes to something that can be sharded out, then flush to MongoDB

• Need something transactional, so MongoDB was out for this

• Decided on multiple Redis instances:

• Redis has a native hash data structure with atomic hash increments, which pairs nicely with MongoDB in this use case

Write buffering

[Diagram: incoming data → sharded Redis buffers → periodic flush to MongoDB]

• Built write buffering over a weekend; buffered writes flush to MongoDB every 3 seconds

Pre-aggregated analytics bottleneck was solved!
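Here is a minimal sketch of the write-buffering pattern described above, assuming redis-py and pymongo; the key layout and field names are illustrative, not Appboy's. Increments land in Redis hashes (HINCRBY is atomic), and a background job periodically collapses them into a single $inc per document.

```python
# A minimal sketch of write buffering, assuming redis-py and pymongo;
# key layout and field names are illustrative, not Appboy's.
import time

import redis
from pymongo import MongoClient

cache = redis.Redis()
analytics = MongoClient("mongodb://mongos.example:27017").appboy.daily_metrics

def buffer_increment(app_id: str, metric: str, day: str, hour: int) -> None:
    # HINCRBY is atomic, and the keyspace can be sharded across many
    # Redis instances (e.g. by app_id), unlike a single hot document.
    cache.hincrby(f"buf:{app_id}:{metric}:{day}", f"hours.{hour}", 1)

def flush_once() -> None:
    for key in cache.scan_iter(match="buf:*"):
        # Read and clear the hash in one MULTI/EXEC transaction so no
        # concurrent increments are lost between the two steps.
        pipe = cache.pipeline(transaction=True)
        pipe.hgetall(key)
        pipe.delete(key)
        counters, _ = pipe.execute()
        if not counters:
            continue
        _, app_id, metric, day = key.decode().split(":")
        # Thousands of buffered increments collapse into one $inc update.
        analytics.update_one(
            {"app_id": app_id, "metric": metric, "day": day},
            {"$inc": {f.decode(): int(v) for f, v in counters.items()}},
            upsert=True,
        )

if __name__ == "__main__":
    while True:  # flush loop, mirroring the ~3-second cadence from the talk
        flush_once()
        time.sleep(3)
```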

MongoDB Evolution:

January, 2014

[Timeline: milestones so far: scaled vertically → started sharding → everything sharded → various customer launches → bad shard key hit its upper limit → added write buffering]

What did Appboy look like in January, 2014?

• > 3 billion events per month

• 4 × 100GB shards on ObjectRocket

• Performance started to show really bad bursty behavior: sometimes the user experience would slow down to a level we thought was unacceptable for our customers

Why was performance getting worse?

• Appboy customers send millions of messages in a single campaign; most send hundreds of thousands to millions of messages each week

• Campaign times tend to cluster together across all Appboy customers: evenings, Saturday/Sunday afternoons, etc.

• A lot of enormous read activity: reads and writes and more reads start conflicting :-(

• Users visiting our dashboard during simultaneous large campaign sends would see sporadic poor performance

ObjectRocket: Splits

• Split out collections to different MongoDB clusters

[Diagram: before vs. after splitting collections across clusters]
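A minimal sketch of what such a split can look like at the application layer, assuming pymongo; the cluster URLs and workload names are hypothetical. Each group of collections gets its own client, so heavy campaign reads stop contending with analytics writes.

```python
# A minimal sketch of collection splits at the application layer,
# assuming pymongo; cluster URLs and workload names are hypothetical.
from pymongo import MongoClient

# Each workload talks to its own MongoDB cluster, so heavy campaign reads
# no longer contend with pre-aggregated analytics writes.
CLUSTERS = {
    "users": MongoClient("mongodb://users-cluster.example:27017"),
    "analytics": MongoClient("mongodb://analytics-cluster.example:27017"),
}

def collection(workload: str, name: str):
    return CLUSTERS[workload].appboy[name]

users = collection("users", "users")
daily_metrics = collection("analytics", "daily_metrics")
```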

What did Appboy look like in February, 2014?

• Splits helped

• > 4 billion events per month

• We needed more: isolation

ObjectRocket: Isolation

• Isolate large enterprise customers on their own MongoDB databases/clusters

• Appboy built this in March, 2014

[Diagram: enterprise customers on dedicated clusters; long-tail customers on shared clusters]
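Here is a hedged sketch of tenant-level routing under that design, assuming pymongo; the URLs and the lookup table are hypothetical stand-ins for real routing configuration.

```python
# A hedged sketch of tenant isolation, assuming pymongo; the URLs and
# lookup table are hypothetical stand-ins for real routing configuration.
from pymongo import MongoClient

# Large enterprise customers resolve to dedicated clusters; everyone else
# shares a default cluster.
DEDICATED = {
    "big-enterprise-app-id": MongoClient("mongodb://dedicated-1.example:27017"),
}
SHARED = MongoClient("mongodb://shared.example:27017")

def db_for(app_id: str):
    client = DEDICATED.get(app_id, SHARED)
    # A database per customer keeps data easy to move between clusters.
    return client[f"appboy_{app_id}"]
```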

[Timeline recap, March 2013 to March 2014: scaled vertically → started sharding → everything sharded → various customer launches → bad shard key hit its upper limit → added write buffering → started splitting DBs → isolation]

Summary

What’s next?

• Figure out capacity planning

• Continue down isolation path

[Chart: event volume growth, y-axis 0 to 60,000,000]

Thanks!

jon@appboy.com | kgorman@objectrocket.com

@appboy @objectrocket @jon_hyman @kennygorman
