
A Self-Organizing Flock of Condors

Ali Raza Butt, Rongmei Zhang, Y. Charlie Hu

{butta,rongmei,ychu}@purdue.edu

2

The need for sharing compute-cycles

• Scientific applications

– Complex, large data sets

• Specialized hardware
– Expensive

• Modern workstation
– Powerful resource
– Available in large numbers
– Underutilized

Harness the idle cycles of networks of workstations

3

Condor: High throughput computing

• Cost-effective idle-cycle sharing

• Job management facilities
– Scheduling, checkpointing, migration

• Resource management
– Policy specification/enforcement

• Solves real problems world-wide
– Condor pools with 1200+ machines and 100+ researchers @ Purdue

4

Sharing across pools: Flocking

[Figure: two Condor pools, each with its own central manager, joined by pre-configured resource sharing (flocking)]

5

Flocking

• Static flocking requires
– Pre-configuration
– A priori knowledge of all remote pools

• Does not support dynamic resources

6

Our contribution: Peer-to-peer based dynamic flocking

• Automated remote Condor pool discovery

• Dynamic resource management
– Support dynamic membership
– Support changing local policies

7

Agenda

• Background: peer-to-peer networks
• Proposed scheme
• Implementation
• Evaluation
• Conclusions

8

Overlay Networks

P2P networks are self-organizing overlay networks without central control

[Figure: overlay nodes (N) at Sites 1-4; the overlay links span the underlying topology of ISP1, ISP2, and ISP3]

9

Advantages of structured p2p networks

• Scalable
• Self-organizing
• Fault-tolerant
• Locality-aware
• Simple to deploy

• Many implementations available– E.g. Pastry, Tapestry, Chord, CAN…

10

Pastry: locality-aware p2p substrate

• 128-bit circular identifier space
– Unique random nodeIds
– Message keys

• Routing: a message is routed reliably to the node whose nodeId is numerically closest to the key

• Overlay route cost is typically < 2 × the cost of direct IP routing

[Figure: circular identifier space]
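As a toy illustration of the routing rule above (not the FreePastry API), the Java sketch below shows the delivery invariant only: a key is handled by the live node whose nodeId is numerically closest on the circular identifier space. All names here are illustrative.

import java.math.BigInteger;
import java.util.List;

// Toy illustration of Pastry's delivery rule: a message keyed by `key`
// ends up at the node whose nodeId is numerically closest to the key on
// the circular identifier space. (Real Pastry reaches that node in
// O(log N) hops via prefix-matching routing tables; this sketch only
// shows the final "numerically closest" invariant.)
public class ClosestNodeDemo {
    // 2^128, the size of the circular identifier space.
    static final BigInteger RING = BigInteger.ONE.shiftLeft(128);

    // Circular distance between two 128-bit identifiers.
    static BigInteger distance(BigInteger a, BigInteger b) {
        BigInteger d = a.subtract(b).mod(RING);
        return d.min(RING.subtract(d));
    }

    // The node that would receive a message routed with this key.
    static BigInteger closestNode(List<BigInteger> nodeIds, BigInteger key) {
        BigInteger best = nodeIds.get(0);
        for (BigInteger id : nodeIds) {
            if (distance(id, key).compareTo(distance(best, key)) < 0) {
                best = id;
            }
        }
        return best;
    }

    public static void main(String[] args) {
        List<BigInteger> nodeIds = List.of(
                new BigInteger("10", 16),
                new BigInteger("7f000000000000000000000000000000", 16),
                new BigInteger("f0000000000000000000000000000000", 16));
        BigInteger key = new BigInteger("7e123456789abcdef000000000000000", 16);
        System.out.println("key routes to nodeId " + closestNode(nodeIds, key).toString(16));
    }
}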

11

Agenda

• Background: peer-to-peer networks
• Proposed scheme
• Implementation
• Evaluation
• Conclusions

12

Step 1: P2p organization of Condor pools

• Participating central managers join an overlay
– Just need to know a single remote pool

• P2p provides self-organization
– Pools can reach each other through the overlay
– Pools can join/leave at any time (see the sketch below)
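A minimal sketch of what joining looks like from a central manager's point of view, assuming a hypothetical Overlay abstraction in place of the actual poolD/FreePastry 1.3 interfaces:

import java.net.InetSocketAddress;

// Hypothetical sketch of step 1: a central manager joins the overlay by
// contacting any single pool that is already a member (the "bootstrap").
// The Overlay type stands in for the p2p substrate (FreePastry in the
// actual poolD implementation); its methods here are illustrative only.
interface Overlay {
    // Join an existing overlay via one known member, or start a new one
    // if bootstrap is null.
    void join(InetSocketAddress bootstrap);

    // Leave gracefully; the overlay self-repairs either way.
    void leave();
}

public class CentralManagerJoin {
    public static void main(String[] args) {
        Overlay overlay = createOverlay();           // assumed factory
        if (args.length > 0) {
            // The only configuration needed: one remote pool's address.
            overlay.join(new InetSocketAddress(args[0], 9000));
        } else {
            overlay.join(null);                      // first pool bootstraps itself
        }
        // From here on the pool can discover, and be discovered by,
        // every other participating pool through the overlay.
    }

    static Overlay createOverlay() {
        throw new UnsupportedOperationException("placeholder for a real p2p substrate");
    }
}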

13

P2p organized central managers

[Figure: central managers organized in a p2p overlay, each managing its own pool's resources]

14

Step 2: Disseminating resource information

• Announcements to nearby pools
– Contain pool status information
– Leverage locality-aware routing table

• The routing table has O(log N) entries matching increasingly long prefixes of the local nodeId

– Soft state
• Periodically refreshed
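A hedged sketch of how such soft-state announcements could be maintained, with assumed refresh and expiry periods and hypothetical names; the actual poolD internals may differ.

import java.time.Duration;
import java.time.Instant;
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.Executors;
import java.util.concurrent.ScheduledExecutorService;
import java.util.concurrent.TimeUnit;

// Hypothetical sketch of step 2: each central manager periodically sends its
// pool status to the pools in its (locality-aware) routing table, and treats
// announcements it receives as soft state that expires unless refreshed.
public class AnnouncementManager {
    record PoolStatus(String centralManager, int totalMachines, int idleMachines,
                      Instant receivedAt) {}

    static final Duration REFRESH = Duration.ofMinutes(1);   // assumed period
    static final Duration LIFETIME = Duration.ofMinutes(3);  // assumed expiry

    // Known nearby pools, keyed by their central manager's address.
    final Map<String, PoolStatus> knownPools = new ConcurrentHashMap<>();
    final ScheduledExecutorService timer = Executors.newScheduledThreadPool(1);

    void start() {
        timer.scheduleAtFixedRate(this::announce, 0, REFRESH.toSeconds(), TimeUnit.SECONDS);
        timer.scheduleAtFixedRate(this::expire, 0, REFRESH.toSeconds(), TimeUnit.SECONDS);
    }

    // Send our own status to the routing-table entries (sending is a placeholder).
    void announce() {
        PoolStatus mine = new PoolStatus("cm.local.pool", 3, idleMachineCount(), Instant.now());
        // for (String neighbor : routingTableEntries()) send(neighbor, mine);
    }

    // Called when an announcement arrives from another pool.
    void onAnnouncement(PoolStatus status) {
        knownPools.put(status.centralManager(), status);
    }

    // Soft state: forget pools whose announcements have not been refreshed.
    void expire() {
        Instant cutoff = Instant.now().minus(LIFETIME);
        knownPools.values().removeIf(s -> s.receivedAt().isBefore(cutoff));
    }

    int idleMachineCount() { return 0; }  // placeholder
}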

15

Resource announcements

[Figure: announcements travel through the overlay to pools whose central managers are physically close to the announcing pool]

16

Step 3: Enable dynamic flocking

• Central managers flock with nearby pools
– Use knowledge gained from resource announcements
– Implement local policies
– Support dynamic reconfiguration
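A possible shape of the flocking decision, sketched with hypothetical names and an illustrative local policy (idle machines required, a deny list honored); this is not the actual poolD policy language.

import java.util.Comparator;
import java.util.List;
import java.util.Set;
import java.util.function.Predicate;
import java.util.stream.Collectors;

// Hypothetical sketch of step 3: pick flocking targets from the pools learned
// through announcements, honoring a locally specified policy, closest first.
public class FlockingManager {
    record KnownPool(String centralManager, int idleMachines, double rttMillis) {}

    // Example local policy: only flock with pools that currently advertise
    // idle machines and are not on a locally maintained deny list.
    static Predicate<KnownPool> localPolicy(Set<String> denied) {
        return p -> p.idleMachines() > 0 && !denied.contains(p.centralManager());
    }

    // Choose up to `limit` acceptable pools, closest (lowest RTT) first.
    static List<String> chooseFlockTargets(List<KnownPool> known, Set<String> denied, int limit) {
        return known.stream()
                .filter(localPolicy(denied))
                .sorted(Comparator.comparingDouble(KnownPool::rttMillis))
                .limit(limit)
                .map(KnownPool::centralManager)
                .collect(Collectors.toList());
    }

    public static void main(String[] args) {
        List<KnownPool> known = List.of(
                new KnownPool("cm-a.example.edu", 2, 5.0),
                new KnownPool("cm-b.example.edu", 0, 3.0),   // no idle machines: filtered out
                new KnownPool("cm-c.example.edu", 5, 40.0));
        // The resulting list is what would be handed to Condor as flock targets.
        System.out.println(chooseFlockTargets(known, Set.of(), 2));
    }
}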

17

Interactions between central managers

[Figure: central managers and their pools' resources; locality-aware flocking links nearby pools]

18

Matchmaking

• Orthogonal to flocking
• Condor matchmaking within a pool

• The p2p approach affects only the flocking decisions

19

Are we discovering enough pools?

• Only a subset of nearby pools is reached using the Pastry routing table alone

• Multi-hop, TTL-based announcement forwarding reaches additional pools (sketched below)
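One plausible realization of the TTL-based forwarding, with hypothetical types and duplicate suppression added for safety; the actual forwarding rule in poolD may differ.

import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Hypothetical sketch: because the routing table alone reaches only a subset
// of nearby pools, an announcement carries a hop budget (TTL) and is
// re-forwarded to routing-table neighbors until the budget runs out.
// A seen-set suppresses duplicates that arrive over multiple paths.
public class AnnouncementForwarder {
    record Announcement(String id, String originPool, int ttl) {}

    final Set<String> seen = new HashSet<>();

    void onReceive(Announcement a, List<String> routingTableNeighbors) {
        if (!seen.add(a.id())) {
            return;                       // already processed this announcement
        }
        recordPoolStatus(a);              // use it locally (see step 2)
        if (a.ttl() > 0) {
            Announcement next = new Announcement(a.id(), a.originPool(), a.ttl() - 1);
            for (String neighbor : routingTableNeighbors) {
                send(neighbor, next);     // one more overlay hop outward
            }
        }
    }

    void recordPoolStatus(Announcement a) { /* placeholder */ }
    void send(String neighbor, Announcement a) { /* placeholder */ }
}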

20

Agenda

• Background: peer-to-peer networks
• Proposed scheme
• Implementation
• Evaluation
• Conclusions

21

Software

• Implemented as a daemon: poolD
– Leverages FreePastry 1.3 from Rice
– Runs on central managers
– Manages self-organized Condor pools

• Condor version 6.4.7

• Interfaced to Condor configuration control
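One way this interface could look: the dynamically chosen flock targets are written into a configuration file included by condor_config, and condor_reconfig tells the running daemons to re-read it. FLOCK_TO and condor_reconfig are standard Condor mechanisms, but the file path and the helper below are assumptions, not poolD's actual code.

import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.List;

// Hypothetical sketch of the "interfaced to Condor configuration control"
// step: write the chosen flock targets to a local config fragment, then ask
// the running Condor daemons to pick up the new configuration.
public class CondorConfigUpdater {
    // Assumed path; in practice this file would be included by condor_config.
    static final Path DYNAMIC_CONFIG = Path.of("/etc/condor/condor_config.flock");

    static void updateFlockTargets(List<String> centralManagers)
            throws IOException, InterruptedException {
        // e.g. "FLOCK_TO = cm-a.example.edu, cm-c.example.edu"
        String line = "FLOCK_TO = " + String.join(", ", centralManagers) + "\n";
        Files.writeString(DYNAMIC_CONFIG, line);

        // Trigger a reconfiguration of the local Condor daemons.
        new ProcessBuilder("condor_reconfig").inheritIO().start().waitFor();
    }
}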

22

Software architecture

[Figure: the Condor p2p extension sits between the p2p network and Condor. It comprises a p2p module (with the Announcement Manager) and a Condor module (with the Policy Manager and Flocking Manager), exposing query services and configuration interfaces.]

23

Agenda

• Background: peer-to-peer networks
• Proposed scheme
• Implementation
• Evaluation
• Conclusions

24

Evaluation

• Measured results
– Effect of flocking on job throughput
• Time spent in queue
– Four pools, three compute machines each
– Synthetic job trace

25

Job trace

• Sequence
– 100 (issue time T, job length L) pairs
– Inter-arrival interval (T_n − T_{n−1}) and L drawn from a uniform distribution over [1, 17]
– Designed to keep a single machine busy
– Random overload/idle periods

• Trace
– One or more job sequences merged together
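A sketch of how such a trace could be generated; the time unit is left unspecified and the exact randomization used for the original traces is not reproduced here.

import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;
import java.util.Random;

// Hypothetical sketch of the synthetic workload: a sequence is 100
// (issue time, job length) pairs where both the inter-arrival gap and the
// job length are drawn uniformly from [1, 17], so on average the sequence
// keeps one machine busy, with random overload and idle periods. A trace
// merges one or more sequences, ordered by issue time.
public class JobTrace {
    record Job(double issueTime, double length) {}

    static final Random rng = new Random();

    static double uniform1to17() {
        return 1 + rng.nextDouble() * 16;
    }

    // One job sequence: 100 jobs with uniformly distributed gaps and lengths.
    static List<Job> sequence() {
        List<Job> jobs = new ArrayList<>();
        double t = 0;
        for (int i = 0; i < 100; i++) {
            t += uniform1to17();                       // T_n = T_{n-1} + gap
            jobs.add(new Job(t, uniform1to17()));      // L ~ uniform over [1, 17]
        }
        return jobs;
    }

    // A trace: several sequences merged into one stream, ordered by issue time.
    static List<Job> trace(int numSequences) {
        List<Job> all = new ArrayList<>();
        for (int i = 0; i < numSequences; i++) {
            all.addAll(sequence());
        }
        all.sort(Comparator.comparingDouble(Job::issueTime));
        return all;
    }

    public static void main(String[] args) {
        System.out.println("jobs in a 3-sequence trace: " + trace(3).size());
    }
}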

26

PlanetLab experimental setup

[Figure: four Condor pools (A, B, C, D) on PlanetLab nodes at U.C. Berkeley, Interxion (Germany), Rice, and Columbia, with dynamic flocking between them]

27

Time spent in queue

Pool      Sequences   Without flocking             With flocking
          in trace    mean     min    max          mean    min    max
A         2           1.76     0.03   14.32        20.15   0.03   72.10
B         2           3.30     0.08   19.85        32.68   0.13   63.70
C         3           46.58    0.03   97.17        38.68   0.10   64.48
D         5           284.91   0.25   557.55       28.37   0.10   58.38
Overall   12          131.20   0.03   557.55       30.30   0.03   72.10

28

Simulations

• 1000 Condor pools

• GT-ITM transit-stub model
– 50 transit domains
– 1000 stub domains

• Size of pool: uniform distribution [25,225]

• Number of sequences in trace: uniform distribution [25,225]

29

Cumulative distribution of locality

30

Total job completion time: without flocking

31

Total job completion time: with flocking

32

Agenda

• Background: peer-to-peer networks
• Proposed scheme
• Implementation
• Evaluation
• Conclusions

33

Conclusions

• Design and implementation of a self-organizing flock of Condors
– Scalability
– Fault-tolerance
– Locality-awareness, which yields flocking with nearby resources
– Local sharing policy enforced

• P2p mechanisms provide an effective substrate for discovery and management of dynamic resources over the wide-area network

34

Questions?

35

What about security?

• Authenticated pools / users
– Enforced by the policy manager
– Accountability
• Restricted access
– Limited privileges, e.g. the UNIX user "nobody"
– Condor libraries
• Controlled execution environment
– Sandboxing
– Process cleanup on job completion

• Intrusion detection
