Accelerating MapReduce with Distributed Memory Cache


[Paper Study Report] Presentation Date: 2013/08/11
Original IEEE Xplore link: http://ieeexplore.ieee.org/xpl/articleDetails.jsp?tp=&arnumber=5395321&queryText%3DAccelerating+MapReduce+with+Distributed


Authors: Shubin Zhang et al.

Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China

Published in: 2009 IEEE 15th International Conference on Parallel and Distributed Systems (ICPADS)

Reported by: Tzu-Li Tai

High Performance Parallel and Distributed Systems (HPDS) Lab, National Cheng Kung University, Taiwan

A. Background and Motivation

B. Goals and Design Decisions

C. System Overview

D. System Details

E. Experimental Results and Analysis

F. Conclusion and Future Work

G. Future Studies for Topic

H. Discussion: Our Chances


Background and Motivation

Pre-Notes:

- Published in 2009 (the first paper on this topic)

- Hardware/software and data sizes are outdated by today's standards

- Focus on the methodology and reasoning behind using a distributed cache in Hadoop

- Learn possible tackle points and what to avoid

Background and Motivation

• Shuffle time becomes the bottleneck

[Figure: each map task (M) writes its intermediate data to local disk (1: local write); the reduce task (R) then fetches it over the network (2: remote read). HDFS pipeline replication across three HDFS nodes is shown alongside. GOAL: cut this shuffle cost.]

Goals and Design Decisions

• Target clusters are small-scale

- Bandwidth is not scarce

- Node failures are uncommon

- Commodity machines

- Heterogeneous hardware

- Gigabit Ethernet

Goals and Design Decisions

• Stay close to the original Hadoop design

• Retain fault tolerance (!)

• Local decision-making

Goals and Design Decisions

• Low-latency, high-throughput access to map outputs calls for a global storage system with:

- No central coordinator

- A uniform global namespace

- Low-latency, high-throughput data access

- Concurrent access

- Large capacity

- Scalability

⇒ Distributed Memory Cache

System Overview

• Use Memcached: http://memcached.org/

- Open-source distributed memory caching system

- Daemon processes run on the servers

- A global key-value (K-V) store
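A minimal sketch of this K-V interface using the spymemcached Java client (one common "memcached client for Java"); the host name and the key scheme are hypothetical choices of this report, not the paper's:

```java
import java.net.InetSocketAddress;
import net.spy.memcached.MemcachedClient;

public class MemcachedKVDemo {
    public static void main(String[] args) throws Exception {
        // Connect to one memcached daemon (host/port are hypothetical).
        MemcachedClient client = new MemcachedClient(
                new InetSocketAddress("memcached-node1", 11211));

        // A map task could store one serialized output partition under a
        // structured key (this key layout is illustrative only).
        byte[] partition = "apple\t3\nbanana\t1\n".getBytes("UTF-8");
        client.set("job_0001/map_0003/reduce_0", 0, partition).get();

        // A reduce task on any node can fetch it through the same
        // global namespace.
        byte[] fetched = (byte[]) client.get("job_0001/map_0003/reduce_0");
        System.out.println(new String(fetched, "UTF-8"));

        client.shutdown();
    }
}
```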


System Overview

• Map side

[Figure: map-side data flow in the modified system. Annotation: note the buffer details.]

System Overview

[Extra figure: map-side shuffle, from O’Reilly's Hadoop: The Definitive Guide.]

System Overview

• Reduce side


System Overview

[Extra figure: reduce-side shuffle, from O’Reilly's Hadoop: The Definitive Guide.]

System Details

1. Memory Cache Capacity

[Figure: outputs of many completed map tasks (M) accumulate in the memory cache while only a few reduce tasks (R) are running to consume them.]

System Details

1. Memory Cache Capacity

Size_memcached = m_c × s × (r − r_a)

- m_c: number of completed map tasks

- s: average map output size

- r: total number of reduce tasks

- r_a: number of early-scheduled reduce tasks

The minimum capacity needed to hold all map outputs:

Size_memcachedMin = m × s × (r − r_a)

- m: total number of map tasks
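A hypothetical worked example (the numbers, and the assumption that s is measured per reduce partition, are this report's, not the paper's): with m = 100 map tasks, s = 10 MB, r = 12 reduce tasks, and r_a = 2 reduce tasks scheduled early, Size_memcachedMin = 100 × 10 MB × (12 − 2) = 10 GB.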


System Details

2. Network Traffic Demand

Shuffled Data = 2 × Size_memcached

[Figure: each map output crosses the network twice: map task (M) → memcached server, then memcached server → reduce task (R).]

System Details

2. Network Traffic Demand

- Twice the amount of data is shuffled (!)

- A compression algorithm is applied to map outputs to reduce network traffic (sketched below)
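A minimal sketch of enabling map-output compression with the Hadoop 0.19-era JobConf API (the codec choice is illustrative; the paper does not say which algorithm it uses):

```java
import org.apache.hadoop.io.compress.GzipCodec;
import org.apache.hadoop.mapred.JobConf;

public class MapOutputCompression {
    public static JobConf withCompressedMapOutput(JobConf conf) {
        // Compress intermediate map outputs before they cross the network.
        conf.setCompressMapOutput(true);
        // Illustrative codec; any installed CompressionCodec would do.
        conf.setMapOutputCompressorClass(GzipCodec.class);
        return conf;
    }
}
```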


System Overview (revisited)

• Map side: which hash function decides the memcached server for each map output?

System Details

3. Fault Tolerance

• Map task failure

- Rerun the task; outputs already in memcached or on disk need not be regenerated

• Reduce task failure (!)

- Inputs not yet deleted from memcached: copy them to a new attempt and execute (sketched below)

- Inputs already deleted from memcached: rerun the corresponding map tasks

• Memcached server failure (!)

- Reinitialize all map tasks whose outputs were stored on the failed server

• TaskTracker failure

- All map and reduce tasks currently running on it need to be reinitialized

- Its memcached data is still valid, so reduce tasks can still access it
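A minimal, self-contained sketch of the reduce-failure rule above (this report's own illustration, not the paper's code; a plain in-process map stands in for memcached):

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;

public class ReduceRecoverySketch {
    // Stand-in for the distributed memory cache (hypothetical).
    static final Map<String, byte[]> cache = new ConcurrentHashMap<>();

    static void recoverReduceInput(String mapOutputKey, int mapTaskId) {
        byte[] partition = cache.get(mapOutputKey);
        if (partition != null) {
            // Input not yet deleted from memcached:
            // copy it to a new reduce attempt and execute.
            System.out.println("re-fetch " + mapOutputKey + " from cache");
        } else {
            // Input already deleted: the map task must be rerun
            // to regenerate the lost partition.
            System.out.println("rerun map task " + mapTaskId);
        }
    }

    public static void main(String[] args) {
        cache.put("map_0003/reduce_0", new byte[]{1, 2, 3});
        recoverReduceInput("map_0003/reduce_0", 3); // still cached -> re-fetch
        recoverReduceInput("map_0007/reduce_0", 7); // evicted -> rerun map
    }
}
```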


System Details

3. Fault Tolerance

Reduce task failure

[Figure: the re-executed reduce task (R) copies its still-cached inputs from memcached rather than re-running the map tasks (M).]

System Details

3. Fault Tolerance

Memcached Server Failure

[Figure: the map tasks (M) whose outputs were stored on the failed memcached server are reinitialized.]

Experimental Results and Analysis

Environment

• Hardware

- Intel Pentium 4, 2.8 GHz processor

- 2 GB RAM

- 80 GB 7200 RPM SATA disk

• Software

- RedHat AS 4.4 (kernel 2.6.9)

- Hadoop 0.19.1

- Memcached 1.2.8

- Memcached client for Java 2.5.1


Experimental Results and Analysis

Hadoop+Memcached Setup

• 1 node: NameNode + JobTracker + Memcached server (1 GB RAM)

• 1~6 nodes: DataNode + TaskTracker

• 2 map slots + 2 reduce slots per TaskTracker

• 4 MB HDFS file blocks

• 5 shuffle threads per reduce task (configuration sketch below)
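For reference, these settings map onto the following hadoop-site.xml entries (Hadoop 0.19-era property names; a sketch, not the authors' published configuration):

```xml
<property>
  <name>dfs.block.size</name>
  <value>4194304</value>   <!-- 4 MB HDFS file blocks -->
</property>
<property>
  <name>mapred.tasktracker.map.tasks.maximum</name>
  <value>2</value>         <!-- 2 map slots per TaskTracker -->
</property>
<property>
  <name>mapred.tasktracker.reduce.tasks.maximum</name>
  <value>2</value>         <!-- 2 reduce slots per TaskTracker -->
</property>
<property>
  <name>mapred.reduce.parallel.copies</name>
  <value>5</value>         <!-- 5 shuffle (copier) threads per reduce task -->
</property>
```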


Experimental Results and Analysis

Benchmark Applications

• Wordcount

- 491.4 MB of English text

• Spatial Join Algorithm

- Two datasets from TIGER/Line files


Experimental Results and Analysis

1. Impact of different node numbers

• No. of reduce tasks: 2n (n = number of worker nodes)

• WordCount improvement: 43.1%

• Spatial Join improvement: 32.9%


Experimental Results and Analysis

2. Impact on job progress

*Note: Hadoop job progress calculation

- Map tasks: % of input processed

- Reduce tasks: 1/3 (copy) + 1/3 (sort) + 1/3 (actual reduce processing)
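For example, a reduce task that has finished its copy phase and is halfway through sorting reports 1/3 + (1/2 × 1/3) = 50% progress.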


Experimental Results and Analysis

2. Impact on job progress - WordCount

[Figure: WordCount job progress over time, original Hadoop vs. the memcached prototype.]

Experimental Results and Analysis

2. Impact on job progress – Spatial Join

[Figure: Spatial Join job progress over time, original Hadoop vs. the memcached prototype.]

Experimental Results and Analysis

2. Impact on job progress - Extra

[Figure: reduce progress phases on a 0-100% bar: copy completes at 33%, sort at 66%, and the reduce function runs thereafter.]

Conclusion and Future Work

• Enhanced Hadoop to accelerate data shuffling using a distributed memory cache (memcached)

• The prototype performs much better than original Hadoop under moderate load

• Future work: modify the task scheduling algorithm to launch reduce tasks earlier


Future Studies for Topic

• Dache: A Data Aware Caching for Big Data Applications Using the MapReduce Framework, IEEE INFOCOM 2013

• A Distributed Cache for Hadoop Distributed File System in Real-Time Cloud Services, ACM/IEEE GRID 2012


Discussion: Our Chances

1. Necessity of using Memcached?


Discussion: Our Chances

1. Necessity of using Memcached?

Map-task buffer properties (Hadoop 0.19-era names):

- io.sort.mb: in-memory map-output buffer size

- io.sort.spill.percent: buffer-fill fraction that triggers a spill to disk
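A minimal sketch of setting these two properties programmatically (the values are illustrative; the Hadoop 0.19-era defaults are io.sort.mb = 100 and io.sort.spill.percent = 0.80):

```java
import org.apache.hadoop.mapred.JobConf;

public class SortBufferTuning {
    public static JobConf withLargerBuffer(JobConf conf) {
        // In-memory map-output buffer size, in MB (default: 100).
        conf.setInt("io.sort.mb", 200);
        // Fraction of the buffer filled before a background
        // spill-to-disk begins (default: 0.80).
        conf.setFloat("io.sort.spill.percent", 0.90f);
        return conf;
    }
}
```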

Hypothesis:

• Keep map intermediate outputs in a local in-memory cache

• Modify the reduce-side shuffle threads and the TaskTracker to serve data from that cache

• RDD-style lineage for fault tolerance?

Discussion: Our Chances

2. Moving the idea to YARN


Discussion: Our Chances

2. Moving the idea to YARN

[Figure: YARN architecture. The MR ApplicationMaster runs on one NodeManager and launches MAP and REDUCE task containers on other NodeManagers; the "shuffle and sort" between them is provided by a NodeManager auxiliary service.]

Discussion: Our Chances

2. Moving the idea to YARN

• The entire shuffle-and-sort phase is implemented as a pluggable auxiliary service in YARN, registered in yarn-site.xml (see the sketch below)
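For example, stock MapReduce on YARN registers its shuffle like this in yarn-site.xml (the service name is "mapreduce_shuffle" in Hadoop 2.2+; early 2.x alphas used "mapreduce.shuffle"):

```xml
<property>
  <name>yarn.nodemanager.aux-services</name>
  <value>mapreduce_shuffle</value>
</property>
<property>
  <name>yarn.nodemanager.aux-services.mapreduce_shuffle.class</name>
  <value>org.apache.hadoop.mapred.ShuffleHandler</value>
</property>
```

A memcached-backed shuffle could, in principle, be plugged in by registering its own handler class the same way.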


Discussion: Our Chances

3. Iterative applications

[Figure: iterative MR jobs on YARN. Each iteration's MR ApplicationMaster runs on a NodeManager; a NodeManager auxiliary service provides "result caching + reuse", so the map tasks (M) of a later iteration can read the cached outputs of the previous iteration's reduce tasks (R).]
