Upper and Lower Bounds on the Cost of a Map-Reduce Computation

38th International Conference on Very Large Data Bases (VLDB 2012)

Tzu-Li Tai, National Cheng Kung University, Dept. of Electrical Engineering, HPDS Laboratory

Foto N. Afrati, National Technical University of Athens

Anish Das Sarma, Google Research

Semih Salihoglu, Jeffrey D. Ullman, Stanford University

Agenda

A. Background
B. A Motivating Example
C. Tradeoff: Parallelism & Communication
D. Problem Model and Assumptions
E. The Hamming-Distance-1 Problem
F. Conclusion

Background

The MapReduce Paradigm

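[Figure: MapReduce data flow. Map turns each input pair (k1, v1) into pairs (k2, v2); pairs are grouped by key into (k2, [v2]); Reduce turns each group into (k3, v3).]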

Background

Distributed/Parallel Computing in Clusters

• Often uses MapReduce to express applications (Hadoop)
- This paper focuses on single-round MR applications

• Limited bandwidth

• Limited resources (memory, processing units, etc.)

• For public clouds, you “pay as you go” for these resources
- Amazon EC2 charges for both bandwidth usage & processing units

A Motivating Example

The Drug Interaction Problem

• 3000 sets of drug data (patients taking, dates, diagnoses)

• About 1 MB of data per drug

• Problem: Find 2 drugs that, when taken together, increase the risk of heart attack

• Requires cross-referencing every pair of drugs across the whole set of drugs

[Figure: naive approach, shown for 4 drugs. The mapper sends the data of drug i to the reducer for every pair {i, j} with j ≠ i, producing 12 key-value pairs such as ({1,2}, data 1). Each of the six reducers, for {1,2}, {1,3}, {1,4}, {2,3}, {2,4} and {3,4}, receives the data of its two drugs, e.g. ({1,2}, data 1+2).]
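The naive scheme in the figure can be sketched in a few lines of Python. This is only an illustrative in-memory simulation, not Hadoop code: the names `naive_map` and `shuffle` and the string payloads are assumptions made for this example.

```python
from collections import defaultdict


def naive_map(drug_id, data, num_drugs):
    """Emit the drug's data once per partner drug, keyed by the unordered pair."""
    for other in range(1, num_drugs + 1):
        if other != drug_id:
            key = (min(drug_id, other), max(drug_id, other))
            yield key, (drug_id, data)


def shuffle(pairs):
    """Group emitted values by key, as the MapReduce shuffle phase would."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return dict(groups)


num_drugs = 4
records = {i: f"data {i}" for i in range(1, num_drugs + 1)}  # hypothetical payloads
emitted = [kv for i, d in records.items() for kv in naive_map(i, d, num_drugs)]
reducers = shuffle(emitted)

print(len(emitted), "key-value pairs,", len(reducers), "reducers")
```

Every record is copied `num_drugs - 1` times, which is what makes this scheme expensive at 3000 drugs.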

What Went Wrong?

• For 3000 drugs, each set of drug data is replicated 2999 times

• Each set of data is 1 MB
= about 9 terabytes of communication (3000 × 2999 × 1 MB)
= 90,000 sec on a 1 Gigabit network

• Communication cost is too high!

Different Approach: Grouping Drugs
• G1: Drugs 1-2
• G2: Drugs 3-4
• G3: Drugs 5-6

Key: Own Group + Other Groups

[Figure: each of the 6 drugs emits its data once for every other group. For example, drug 1 (in G1) emits ({G1,G2}, data 1) and ({G1,G3}, data 1), for 12 key-value pairs in total.]

[Figure: the 12 key-value pairs of the grouped approach, collected by key. Reduce for {G1,G2} receives data 1+2+3+4, Reduce for {G1,G3} receives data 1+2+5+6, and Reduce for {G2,G3} receives data 3+4+5+6.]

• Therefore, if we group 3000 drugs into 30 groups
- G1: 1-100, G2: 101-200, ……, G30: 2901-3000

• Each set of drug data is only replicated 29 times
= 87 GB vs. 9 TB communication cost

• But lower parallelism, higher processing cost!
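The arithmetic behind the two approaches can be checked with a small sketch. The function name `grouping_cost` and its return keys are hypothetical; the sketch assumes equal-sized groups and 1 MB per drug, and treats the naive approach as the special case of 3000 groups of one drug each.

```python
def grouping_cost(num_drugs, num_groups, mb_per_drug=1):
    """Cost figures for the 'own group + other group' keying scheme.

    Assumes num_groups divides num_drugs evenly.
    """
    group_size = num_drugs // num_groups
    replication = num_groups - 1                  # one copy per other group
    reducer_size = 2 * group_size                 # drugs seen by one reducer
    num_reducers = num_groups * (num_groups - 1) // 2
    communication_mb = num_drugs * replication * mb_per_drug
    return {
        "replication": replication,
        "reducer_size": reducer_size,
        "num_reducers": num_reducers,
        "communication_mb": communication_mb,
    }


naive = grouping_cost(3000, 3000)   # every drug is its own group
grouped = grouping_cost(3000, 30)   # 30 groups of 100 drugs

print(naive["communication_mb"] / 1e6, "TB vs.", grouped["communication_mb"] / 1e3, "GB")
```

Fewer groups means less communication, but each reducer then holds more drugs and there are fewer reducers to run in parallel.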

Tradeoff: Parallelism & Communication

[Figure: parallelism weighed against communication]

• To evaluate communication cost, define the replication rate r: the average number of key-value pairs created from a single map input

• To evaluate processing cost, define the reducer size q: the maximum number of values for a single key

[Figure: the six-drug, three-group example. Each drug's data is emitted under 2 keys, and each of the 3 reducers receives 4 data sets.]

r = 2, q = 4

How the Tradeoff can be Used

𝑟 = 𝑓(𝑞)

• Communication cost: $ar$, where $a$ is a constant

• Processing cost: some function of $q$
- Take for example the previous drug interaction problem
- The work for each reducer is $O(q^2)$, so $Cost_{each} = bq^2$, where $b$ is a constant
- The number of reducers is proportional to $\frac{1}{q}$
- $Cost_{total} = bq^2 \times \frac{1}{q} = bq$

How the Tradeoff can be Used

Combined cost $= ar + bq = a f(q) + bq$

• Solve for 𝑞 for minimal combined cost

• Determine 𝑟 with 𝑟 = 𝑓(𝑞)

• Decide appropriate algorithm implementation
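As a rough illustration of these three steps, the sketch below searches candidate reducer sizes for the one with minimal combined cost. The constants `a = 20` and `b = 3` are made up for the example, and the tradeoff curve `f(q) = 2n/q - 1` is an assumption taken from the drug grouping scheme (n drugs in g equal groups give q = 2n/g and r = g - 1); none of these values come from the paper.

```python
def combined_cost(q, f, a, b):
    """Combined cost a*r + b*q with r = f(q)."""
    return a * f(q) + b * q


def best_reducer_size(candidates, f, a, b):
    """Return the candidate q with the smallest combined cost."""
    return min(candidates, key=lambda q: combined_cost(q, f, a, b))


n = 3000                      # number of drugs


def f(q):
    """Assumed tradeoff for the grouping scheme: r = 2n/q - 1."""
    return 2 * n / q - 1


a, b = 20, 3                  # hypothetical cost constants
q_best = best_reducer_size(range(2, 2 * n + 1), f, a, b)
r_best = f(q_best)

print("q =", q_best, "r =", r_best)
```

With these made-up constants the minimum falls at q = 200 and r = 29, i.e. the 30-group configuration; different constants for a given cluster would shift the choice.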

Problem Model & Assumptions

[Figure: a finite domain N; the hypothetical set of all inputs constructed from domain N; all possible outputs corresponding to the inputs; and a mapping schema with parameters (r, q).]

Example: Hamming Distance 1

10110 vs. 10011: distance 2

10110 vs. 10010: distance 1

Example: Hamming Distance 1

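Domain: bit strings of length $b$

000…00, 000…01, 000…10, …, 111…00, 111…01, 111…10, 111…11

$2^b$ hypothetical inputs

Mapping schema (r, q)

No. of outputs = $\frac{b}{2} \times 2^b$ (each of the $2^b$ strings has $b$ neighbors at distance 1, and each pair is counted twice)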
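The input and output counts can be checked by brute force for small $b$. The sketch below represents bit strings as integers; the helper names are hypothetical.

```python
from itertools import combinations


def hamming_distance(x, y):
    """Number of bit positions in which integers x and y differ."""
    return bin(x ^ y).count("1")


def distance_one_pairs(b):
    """All unordered pairs of b-bit strings at Hamming distance exactly 1."""
    strings = range(2 ** b)
    return [(x, y) for x, y in combinations(strings, 2)
            if hamming_distance(x, y) == 1]


b = 4
pairs = distance_one_pairs(b)
print(2 ** b, "inputs,", len(pairs), "outputs")  # compare with (b/2) * 2**b
```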

The Mapping Schema Tradeoff Derivation

Given the maximum reducer size $q$, and assuming there are $p$ reducers,

$r = \sum_{i=1}^{p} \frac{q_i}{I}$

$q_i$: reducer size of reducer $i$ ($q_i \le q$)
$I$: total input size

[Figure: the six-drug, three-group example. The reducers for {G1,G2}, {G1,G3} and {G2,G3} each receive 4 data sets.]

$q_1 = 4,\ q_2 = 4,\ q_3 = 4,\ I = 6$

$\Rightarrow r = \sum_{i=1}^{3} \frac{q_i}{I} = \frac{4 + 4 + 4}{6} = 2$
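The same numbers can be recomputed from a simulated mapping schema. This is a minimal sketch, assuming the six-drug, three-group assignment from the example; `grouped_map` and `group_of` are names invented for the illustration.

```python
from collections import defaultdict


def grouped_map(drug_id, group_of, num_groups):
    """Emit the drug under one key per other group: {own group, other group}."""
    own = group_of[drug_id]
    for other in range(1, num_groups + 1):
        if other != own:
            yield (min(own, other), max(own, other)), drug_id


group_of = {1: 1, 2: 1, 3: 2, 4: 2, 5: 3, 6: 3}   # drug -> group
reducer_inputs = defaultdict(list)
for drug in group_of:
    for key, value in grouped_map(drug, group_of, num_groups=3):
        reducer_inputs[key].append(value)

sizes = {key: len(values) for key, values in reducer_inputs.items()}  # the q_i
I = len(group_of)                  # total input size
r = sum(sizes.values()) / I        # replication rate
q = max(sizes.values())            # reducer size

print("r =", r, "q =", q)
```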

Finding the lower bound of r with given q

1. Deriving $g(q)$: upper bound on the number of outputs a reducer of size $q$ covers

[Figure: the six-drug, three-group example, in which each reducer receives 4 data sets.]

$q = 4 \Rightarrow$ covers $\binom{4}{2}$ outputs

$\Longrightarrow g(q) = \binom{q}{2} = \frac{q(q-1)}{2} \approx \frac{q^2}{2}$

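Finding the lower bound of r with given q

1. Deriving $g(q)$: upper bound on the number of outputs a reducer of size $q$ covers
2. Determine the number of inputs $I$ and outputs $O$
3. Establish inequality: $\sum_{i=1}^{p} g(q_i) \ge O$
4. Manipulate inequality (using $q_i \le q$ and assuming $g(q)/q$ does not decrease as $q$ grows):

$\sum_{i=1}^{p} q_i \frac{g(q_i)}{q_i} \ge O \Rightarrow \sum_{i=1}^{p} q_i \frac{g(q)}{q} \ge O$

$\Rightarrow r = \sum_{i=1}^{p} \frac{q_i}{I} \ge \frac{q \times O}{g(q) \times I}$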
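As a hedged illustration, the final inequality can be evaluated for the drug interaction problem, where there are $n$ inputs, $\binom{n}{2}$ outputs, and $g(q) = \binom{q}{2}$. The function names below are hypothetical.

```python
from math import comb


def replication_lower_bound(q, num_inputs, num_outputs, g):
    """Evaluate r >= q * O / (g(q) * I) for a given reducer size q."""
    return q * num_outputs / (g(q) * num_inputs)


def g_all_pairs(q):
    """A reducer holding q inputs can cover at most C(q, 2) pairs."""
    return comb(q, 2)


# Six-drug example: reducers of size 4.
small = replication_lower_bound(4, 6, comb(6, 2), g_all_pairs)

# 3000 drugs with reducers of size 200 (the 30-group configuration).
large = replication_lower_bound(200, 3000, comb(3000, 2), g_all_pairs)

print(round(small, 3), round(large, 3))
```

For this problem the bound simplifies to $(n-1)/(q-1)$, so the grouping scheme's $r = 2$ (six drugs) and $r = 29$ (3000 drugs) sit above the bound rather than exactly on it.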

The Hamming-Distance-1 Problem

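1. $g(q) = \frac{q}{2} \log_2 q$ (by mathematical induction)

2. $I = 2^b$, $O = \frac{b}{2} 2^b$

3. Inequality:

$\sum_{i=1}^{p} g(q_i) = \sum_{i=1}^{p} \frac{q_i}{2} \log_2 q_i \ge \frac{b}{2} 2^b$

$\Rightarrow \sum_{i=1}^{p} \frac{q_i}{2} \log_2 q \ge \frac{b}{2} 2^b$

$\Rightarrow r = \sum_{i=1}^{p} \frac{q_i}{2^b} \ge \frac{b}{\log_2 q}$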
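The bound on $g(q)$ can be sanity-checked by brute force for a very small $b$. This is a check on one tiny case, not a proof, and it is only practical for $b \le 4$ because it enumerates every subset of the $2^b$ strings. All function names are hypothetical.

```python
import math


def max_covered_pairs(b):
    """best[q] = most distance-1 pairs found inside any set of q b-bit strings."""
    n = 2 ** b
    best = [0] * (n + 1)
    for subset in range(1, 2 ** n):            # each bitmask encodes one set of strings
        members = [s for s in range(n) if (subset >> s) & 1]
        covered = 0
        for s in members:
            for j in range(b):
                t = s ^ (1 << j)               # neighbor of s at distance 1
                if s < t and (subset >> t) & 1:
                    covered += 1               # count each pair once
        best[len(members)] = max(best[len(members)], covered)
    return best


def g(q):
    """Claimed upper bound on outputs covered by a reducer of size q."""
    return (q / 2) * math.log2(q)


def hamming_replication_lower_bound(b, q):
    """Lower bound r >= b / log2(q) from the derivation."""
    return b / math.log2(q)


b = 3
best = max_covered_pairs(b)
for q in range(1, 2 ** b + 1):
    print(q, best[q], round(g(q), 3))

print(hamming_replication_lower_bound(20, 2 ** 10))
```

In this small case the bound is met with equality when $q$ is a power of two, and it is never exceeded for other sizes.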

Conclusion

• Presents a new approach to studying optimal Map-Reduce algorithms

• Establishes a unified model with two parameters, replication rate and reducer size, to study performance over a spectrum of possible computing clusters

• For several problems, the two parameters are shown to be related by a tradeoff formula
