10/11/11 © MapR Confidential
MapR, Implications for Integration
CMU – September 2011
Outline
• MapR system overview
• Map-reduce review
• MapR architecture
• Performance results
• Map-reduce on MapR
• Architectural implications
• Search indexing / deployment
• EM algorithm for machine learning
• … and more …
Map-Reduce
[Diagram: the map-reduce data flow. Input splits feed map functions, a shuffle phase groups intermediate values by key, and reduce functions write the output.]
Bottlenecks and Issues
• Read-only files
• Many copies in I/O path
• Shuffle based on HTTP
  • Can't use new technologies
  • Eats file descriptors
• Spills go to local file space
  • Bad for skewed distribution of sizes
MapR Areas of Development
• Map-reduce
• Storage services
• Ecosystem
• HBase
• Management
MapR Improvements
• Faster file system
  • Fewer copies
  • Multiple NICs
  • No file descriptor or page-buf competition
• Faster map-reduce
  • Uses distributed file system
  • Direct RPC to receiver
  • Very wide merges
MapR Innovations
• Volumes
  • Distributed management
  • Data placement
• Read/write random-access file system
  • Allows distributed metadata
  • Improved scaling
  • Enables NFS access
• Application-level NIC bonding
• Transactionally correct snapshots and mirrors
MapR's Containers
• Each container contains
  • Directories & files
  • Data blocks
• Replicated on servers
• No need to manage directly
Files and directories are sharded into blocks, which are placed into containers (mini name nodes) on disks.
Containers are 16-32 GB segments of disk, placed on nodes.
MapR's Containers
• Each container has a replication chain
• Updates are transactional
• Failures are handled by rearranging replication
Container locations and replication
[Diagram: three nodes (N1, N2, N3), each hosting several containers; the CLDB records for each container the nodes that host it, e.g. N1, N2 or N3, N2.]
The container location database (CLDB) keeps track of the nodes hosting each container and of the replication chain order.
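As a sketch of the idea (names and structures are illustrative, not MapR's actual data structures or API), the CLDB can be thought of as a map from container id to its ordered replication chain, with the head of the chain serving I/O:

```python
# Hypothetical CLDB-style lookup table: container id -> ordered
# replication chain (head of chain is the serving replica).
container_locations = {
    1: ["N1", "N2"],
    2: ["N3", "N2"],
    3: ["N1", "N3"],
}

def master_for(container_id):
    """The first node in the chain serves reads and writes."""
    return container_locations[container_id][0]

def on_node_failure(failed):
    """Failures are handled by rearranging replication:
    drop the failed node from every chain it appears in."""
    for cid, chain in container_locations.items():
        container_locations[cid] = [n for n in chain if n != failed]
```

When a node fails, the next replica in each affected chain simply becomes the new head; re-replication to restore the chain length would follow asynchronously.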
MapR Scaling
Containers represent 16-32 GB of data
• Each can hold up to 1 billion files and directories
• 100M containers = ~2 exabytes (a very large cluster)
250 bytes of DRAM to cache a container
• 25 GB to cache all containers for a 2 EB cluster
  • But not necessary; can page to disk
• Typical large 10 PB cluster needs 2 GB
Container reports are 100x-1000x smaller than HDFS block reports
• Serve 100x more data nodes
• Increase container size to 64 GB to serve a 4 EB cluster
• Map-reduce not affected
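The headline arithmetic above can be checked directly (decimal units; a mid-range 20 GB container size is assumed):

```python
# Back-of-the-envelope check of the container scaling numbers.
containers = 100_000_000        # 100M containers
bytes_per_entry = 250           # DRAM to cache one container's metadata
container_size = 20e9           # ~16-32 GB per container; take ~20 GB

cache_bytes = containers * bytes_per_entry   # DRAM to cache them all
total_data = containers * container_size     # total addressable data

print(cache_bytes / 1e9)   # ~25 GB of DRAM
print(total_data / 1e18)   # ~2 exabytes
```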
MapR's Streaming Performance
[Bar charts: read and write throughput in MB per second (0-2250 scale) for the hardware limit, MapR, and Hadoop, on two disk configurations. Higher is better.]
Tests: (i) 16 streams x 120 GB; (ii) 2000 streams x 1 GB
Hardware: 11 x 7200 rpm SATA; 11 x 15K rpm SAS
Terasort on MapR
[Bar charts: elapsed time in minutes for MapR vs. Hadoop on a 1.0 TB sort (0-60 min scale) and a 3.5 TB sort (0-300 min scale). Lower is better.]
10+1 nodes: 8 core, 24 GB DRAM, 11 x 1 TB SATA 7200 rpm
HBase on MapR
[Bar chart: records per second (0-25000 scale) for Zipfian and uniform key distributions, MapR vs. Apache. Higher is better.]
YCSB random read with 1 billion 1K records
10+1 node cluster: 8 core, 24 GB DRAM, 11 x 1 TB 7200 RPM
Small Files (Apache Hadoop, 10 nodes)
[Plot: create rate in files/sec vs. number of files in millions, for out-of-box and tuned configurations.]
Op: create file, write 100 bytes, close
Notes:
- NN not replicated
- NN uses 20G DRAM
- DN uses 2G DRAM
MUCH faster for some operations
[Plot: create rate vs. number of files in millions, on the same 10 nodes.]
What MapR is not
• Volumes != federation
  • MapR supports > 10,000 volumes, all with independent placement and defaults
  • Volumes support snapshots and mirroring
• NFS != FUSE
  • Checksum and compress at gateway
  • IP fail-over
  • Read/write/update semantics at full speed
• MapR != maprfs
New Capabilities
Alternative NFS mounting models
• Export to the world
  • NFS gateway runs on selected gateway hosts
• Local server
  • NFS gateway runs on local host
  • Enables local compression and checksumming
• Export to self
  • NFS gateway runs on all data nodes, mounted from localhost
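As a hedged illustration of how the models differ on the client side (hostnames, paths, and options here are assumptions, not a prescribed invocation; consult the MapR documentation for the exact procedure), each model is just an NFS mount against a different gateway:

```shell
# Export to the world: a remote client mounts a designated gateway host
mount -o hard,nolock gateway1:/mapr /mapr

# Local server / export to self: mount the gateway running on this host
mount -o hard,nolock localhost:/mapr /mapr
```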
Export to the world
[Diagram: an external NFS client mounts the cluster through one of several NFS server gateways.]
Local server
[Diagram: the application and an NFS server run together on the client host, which talks directly to the cluster nodes.]
Universal export to self
[Diagram: a cluster node runs its own NFS server; tasks on the node mount from localhost.]
[Diagram: three identical cluster nodes, each running its own NFS server and its own tasks.]
Nodes are identical
Application architecture
• High-performance map-reduce is nice
• But algorithmic flexibility is even nicer
Sharded text indexing
[Diagram: input documents pass through a map phase that assigns documents to shards, then to reducers that index text to local disk; indexes are copied to clustered index storage and from there to the search engine's local disk.]
Index text to local disk and then copy the index to the distributed file store.
Copy to local disk is typically required before the index can be loaded.
Sharded text indexing
• Mapper assigns document to shard
  • Shard is usually a hash of the document id
• Reducer indexes all documents for a shard
  • Indexes created on local disk
  • On success, copy index to DFS
  • On failure, delete local files
• Must avoid directory collisions
  • Can't use shard id!
• Must manage and reclaim local disk space
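A minimal sketch of the two sides, assuming MD5-based shard assignment and per-attempt temporary directories (both illustrative choices, not a specific indexing API):

```python
import hashlib
import tempfile

NUM_SHARDS = 16  # illustrative shard count

def assign_shard(doc_id):
    """Mapper side: shard = hash of the document id, so assignment
    is deterministic and spreads documents evenly across shards."""
    return int(hashlib.md5(doc_id.encode()).hexdigest(), 16) % NUM_SHARDS

def shard_workdir(shard):
    """Reducer side: the local index directory must be unique per
    task attempt, not just per shard id, so a retried reducer never
    collides with a failed attempt's leftover files."""
    return tempfile.mkdtemp(prefix=f"index-shard-{shard}-")
```

Making the work directory unique per attempt is exactly why the shard id alone can't be used as a directory name: two attempts at the same shard would collide.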
Conventional data flow
[Diagram: the same map/reduce/search-engine pipeline as above.]
Failure of a reducer causes garbage to accumulate on the local disk.
Failure of the search engine requires another download of the index from clustered storage.
Simplified NFS data flows
[Diagram: input documents flow through map and reduce straight into clustered index storage, which the search engine reads directly.]
Index to task work directory via NFS.
Failure of a reducer is cleaned up by the map-reduce framework.
The search engine reads the mirrored index directly.
Simplified NFS data flows
[Diagram: reducer output is mirrored to several search engines.]
Mirroring allows exact placement of index data.
Arbitrary levels of replication are also possible.
How about another one?
K-means
• Classic E-M based algorithm
• Given cluster centroids:
  • Assign each data point to the nearest centroid
  • Accumulate new centroids
  • Rinse, lather, repeat
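The loop body above can be sketched in a few lines of plain Python (illustrative, not tied to any particular framework):

```python
def kmeans_step(points, centroids):
    """One E-M pass: assign each point to its nearest centroid (E),
    then recompute each centroid as the mean of its points (M)."""
    k = len(centroids)
    dim = len(centroids[0])
    sums = [[0.0] * dim for _ in range(k)]
    counts = [0] * k
    assign = []
    for p in points:
        # E-step: nearest centroid by squared Euclidean distance
        c = min(range(k), key=lambda j: sum((a - b) ** 2
                                            for a, b in zip(p, centroids[j])))
        assign.append(c)
        counts[c] += 1
        for i in range(dim):
            sums[c][i] += p[i]
    # M-step: accumulate new centroids (keep old one if a cluster is empty)
    new = [[s / counts[j] for s in sums[j]] if counts[j] else list(centroids[j])
           for j in range(k)]
    return assign, new
```

"Rinse, lather, repeat" is then just calling `kmeans_step` until the centroids stop moving.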
K-means, the movie
[Diagram: input points are assigned to the nearest centroid; new centroids are aggregated and fed back for the next iteration.]
But …
Parallel Stochastic Gradient Descent
[Diagram: each input split trains a sub-model; the sub-models are averaged into the final model.]
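The averaging pattern can be sketched as follows (the toy SGD update and learning rate are illustrative stand-ins for a real sub-model trainer):

```python
def train_sub_model(split, w0, lr=0.1):
    """Toy SGD on one input split: pull the weights toward each example."""
    w = list(w0)
    for x in split:
        w = [wi + lr * (xi - wi) for wi, xi in zip(w, x)]
    return w

def average_models(sub_models):
    """Final model = element-wise average of the sub-models' parameters."""
    n = len(sub_models)
    return [sum(ws) / n for ws in zip(*sub_models)]
```

Each worker sees only its own split, so the only communication is the final averaging step, which is what makes the scheme map-reduce friendly.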
Variational Dirichlet Assignment
[Diagram: each input split gathers sufficient statistics; these update the model, which feeds back to the workers.]
Old tricks, new dogs
• Mapper
  • Assign point to cluster
  • Emit cluster id, (1, point)
• Combiner and reducer
  • Sum counts, weighted sum of points
  • Emit cluster id, (n, sum/n)
• Output to HDFS
Centroids are read from HDFS to local disk by the distributed cache; mappers read them from local disk via the distributed cache; output is written by map-reduce.
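The emission format matters: because each output is (n, sum/n), partial results compose associatively, so the same function can serve as both combiner and reducer. A plain-Python sketch (illustrative, not a specific framework API):

```python
def nearest(point, centroids):
    """Mapper: index of the nearest centroid (squared distance)."""
    return min(range(len(centroids)),
               key=lambda k: sum((p - c) ** 2
                                 for p, c in zip(point, centroids[k])))

def combine(values):
    """Combiner/reducer: values are (count, mean-point) pairs.
    Sum the counts and the count-weighted sums of points, then
    emit (n, sum/n) so outputs stay composable across stages."""
    n = sum(c for c, _ in values)
    dim = len(values[0][1])
    total = [sum(c * pt[i] for c, pt in values) for i in range(dim)]
    return n, [t / n for t in total]
```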
Old tricks, new dogs
• Mapper
  • Assign point to cluster
  • Emit cluster id, (1, point)
• Combiner and reducer
  • Sum counts, weighted sum of points
  • Emit cluster id, (n, sum/n)
• Output to MapR FS (instead of HDFS)
Centroids are read via NFS; output is written by map-reduce.
Poor man’s Pregel
• Mapper

    while not done:
        read and accumulate input models
        for each input:
            accumulate model
        write model
        synchronize
        reset input format
    emit summary

• Lines in bold can use conventional I/O via NFS
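A runnable toy version of that loop (the model "file" and the synchronize barrier are simulated in-process; names are illustrative):

```python
def poor_mans_pregel(inputs, accumulate, initial):
    """Each superstep reads the model written by the previous one,
    folds every input into it, writes it back, and stops once
    nothing changed."""
    store = initial                  # stands in for the NFS-visible model
    while True:                      # while not done
        model = store                # read previous superstep's model
        changed = False
        for item in inputs:          # for each input: accumulate model
            model, delta = accumulate(model, item)
            changed = changed or delta
        store = model                # write model
        # synchronize: a real job would barrier across tasks here
        if not changed:
            return store             # emit summary

def acc_max(model, item):
    """Example accumulator: propagate the maximum to a fixed point."""
    new = max(model, item)
    return new, new != model
```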
Click modeling architecture
[Diagram: input flows through feature extraction and down-sampling, a data join with side data, and then sequential SGD learning. The first two stages are map-reduce; the learning stage is now reachable via NFS.]
Click modeling architecture
[Diagram: the same pipeline, with map-reduce cooperating with NFS to feed several sequential SGD learners in parallel.]
And another…
??

Hybrid model flow
[Diagram: feature extraction and down-sampling feeds SVD (PageRank, spectral); downstream modeling produces the deployed model. The first two stages are map-reduce.]
Hybrid model flow
[Diagram: the same pipeline, with map-reduce stages distinguished from sequential ones.]
And visualization…
Trivial visualization interface
• Map-reduce output is visible via NFS
• Legacy visualization just works

    $ R
    > x <- read.csv("/mapr/my.cluster/home/ted/data/foo.out")
    > plot(error ~ t, x)
    > q(save='n')
Conclusions
• We used to know all this
  • Tab completion used to work
  • 5 years of work-arounds have clouded our memories
• We just have to remember the future