48
1 Challenge the future KOALA-C: A Task Allocator for Integrated Multicluster and Multicloud Environments Presenter: Lipu Fei Authors: Lipu Fei, Bogdan Ghiţ, Alexandru Iosup, Dick Epema Institute: TU Delft, the Netherlands

1 Challenge the future KOALA-C: A Task Allocator for Integrated Multicluster and Multicloud Environments Presenter: Lipu Fei Authors: Lipu Fei, Bogdan

Embed Size (px)

Citation preview

Page 1: 1 Challenge the future KOALA-C: A Task Allocator for Integrated Multicluster and Multicloud Environments Presenter: Lipu Fei Authors: Lipu Fei, Bogdan

1Challenge the future

KOALA-C: A Task Allocator for Integrated Multicluster and Multicloud EnvironmentsPresenter: Lipu FeiAuthors: Lipu Fei, Bogdan Ghiţ, Alexandru Iosup,

Dick EpemaInstitute: TU Delft, the Netherlands

Page 2: 1 Challenge the future KOALA-C: A Task Allocator for Integrated Multicluster and Multicloud Environments Presenter: Lipu Fei Authors: Lipu Fei, Bogdan

2Challenge the future

A Multicluster and Multicloud Environment

• Heterogeneous resources

• Efficient resource management

Page 3: 1 Challenge the future KOALA-C: A Task Allocator for Integrated Multicluster and Multicloud Environments Presenter: Lipu Fei Authors: Lipu Fei, Bogdan

3Challenge the future

A Multicluster and Multicloud Environment

• Increasing number of short jobs[1][2]

• Goal: optimize global job slowdown by optimizing short jobs

[1] Iosup, A; Epema, D., “Grid Computing Workloads”,Internet Computing, IEEE, vol.15, no.2, pp.19,26, March-April 2011[2] Yanpei Chen; Ganapathi, A; Griffith, R.; Katz, R., “The Case for Evaluating MapReduce Performance Using Workload Suites”, MASCOTS, 2011 IEEE 19th International Symposium, 25-27 July 2011

Page 4: 1 Challenge the future KOALA-C: A Task Allocator for Integrated Multicluster and Multicloud Environments Presenter: Lipu Fei Authors: Lipu Fei, Bogdan

4Challenge the future

Introduction

• KOALA-C scheduler for integrated multicluster and multicloud environments.

• Two new scheduling policies for the new environment.

• A comprehensive experimental evaluation of our architecture and policies.

Contribution

Page 5: 1 Challenge the future KOALA-C: A Task Allocator for Integrated Multicluster and Multicloud Environments Presenter: Lipu Fei Authors: Lipu Fei, Bogdan

5Challenge the future

KOALA: A Co-allocating Grid Scheduler

• Original goals:1. processor co-allocation: parallel applications2. data co-allocation: job affinity based on data locations3. load sharing: in the absence of co-allocation

• Additional goals:• research vehicle for grid and cloud research• support for (other) popular application types(e.g. MapReduce)

• KOALA:• is written in Java• is middleware independent (initially Globus-based)• has been deployed on the DAS2 – DAS4 since Sept. 2004

Page 6: 1 Challenge the future KOALA-C: A Task Allocator for Integrated Multicluster and Multicloud Environments Presenter: Lipu Fei Authors: Lipu Fei, Bogdan

6Challenge the future

DAS-4: The Distributed ASCI Supercomputer 4

• System purely for CS research• OpenNebula installed• Operational since Oct. 2010• Institutions and organizations:

• VU University, Amsterdam (VU)• Leiden University (LU)• University of Amsterdam (UvA)• Delft University of Technology (TUD)• The MultimediaN Consortium (UvA-MN)• Netherlands Institute for Radio Astronomy (ASTRON)

Page 7: 1 Challenge the future KOALA-C: A Task Allocator for Integrated Multicluster and Multicloud Environments Presenter: Lipu Fei Authors: Lipu Fei, Bogdan

7Challenge the future

System Model

• Global and Local resource managers (GRM and LRMs).

Page 8: 1 Challenge the future KOALA-C: A Task Allocator for Integrated Multicluster and Multicloud Environments Presenter: Lipu Fei Authors: Lipu Fei, Bogdan

8Challenge the future

System Model

• Global and Local resourcemanagers.

Page 9: 1 Challenge the future KOALA-C: A Task Allocator for Integrated Multicluster and Multicloud Environments Presenter: Lipu Fei Authors: Lipu Fei, Bogdan

9Challenge the future

System Model

• Global and Local resourcemanagers.

• Compute-intensive jobs, sequential or parallel high-performance applications.

• Rigid jobs that do not change size.

Page 10: 1 Challenge the future KOALA-C: A Task Allocator for Integrated Multicluster and Multicloud Environments Presenter: Lipu Fei Authors: Lipu Fei, Bogdan

10Challenge the future

System Model

• Global and Local resourcemanagers.

• Compute-intensive jobs, sequential or parallel high-performance applications.

• Rigid jobs that do not change size.

• Preemptive jobs (restart instead of resume).

Page 11: 1 Challenge the future KOALA-C: A Task Allocator for Integrated Multicluster and Multicloud Environments Presenter: Lipu Fei Authors: Lipu Fei, Bogdan

11Challenge the future

Outline

1. A Multicluster and Multicloud Environment2. KOALA-C: a System for MC + MC

Environments3. Design for Policies without prediction4. Simulation and Real-World Experiments5. Conclusion

Page 12: 1 Challenge the future KOALA-C: A Task Allocator for Integrated Multicluster and Multicloud Environments Presenter: Lipu Fei Authors: Lipu Fei, Bogdan

12Challenge the future

KOALA-C

Page 13: 1 Challenge the future KOALA-C: A Task Allocator for Integrated Multicluster and Multicloud Environments Presenter: Lipu Fei Authors: Lipu Fei, Bogdan

13Challenge the future

KOALA-C

Page 14: 1 Challenge the future KOALA-C: A Task Allocator for Integrated Multicluster and Multicloud Environments Presenter: Lipu Fei Authors: Lipu Fei, Bogdan

14Challenge the future

KOALA-C

Page 15: 1 Challenge the future KOALA-C: A Task Allocator for Integrated Multicluster and Multicloud Environments Presenter: Lipu Fei Authors: Lipu Fei, Bogdan

15Challenge the future

KOALA-C

Page 16: 1 Challenge the future KOALA-C: A Task Allocator for Integrated Multicluster and Multicloud Environments Presenter: Lipu Fei Authors: Lipu Fei, Bogdan

16Challenge the future

KOALA-C

Page 17: 1 Challenge the future KOALA-C: A Task Allocator for Integrated Multicluster and Multicloud Environments Presenter: Lipu Fei Authors: Lipu Fei, Bogdan

17Challenge the future

KOALA-C

Logical Partitioning

Page 18: 1 Challenge the future KOALA-C: A Task Allocator for Integrated Multicluster and Multicloud Environments Presenter: Lipu Fei Authors: Lipu Fei, Bogdan

18Challenge the future

Outline

1. A Multicluster and Multicloud Environment2. KOALA-C: a System for MC + MC Environments3. Design for Policies without prediction4. Simulation and Real-World Experiments5. Conclusion

Page 19: 1 Challenge the future KOALA-C: A Task Allocator for Integrated Multicluster and Multicloud Environments Presenter: Lipu Fei Authors: Lipu Fei, Bogdan

19Challenge the future

Policy Design Space

Prediction Slowdown Preemption MC+MC support

Uninformed(FF, RR)

Informed(SJF, SJF-ideal, HSDF)

TraditionalTAGS

TAGS extensions(TAGS-chain, TAGS-sets)

new

We propose TAGS-chain and TAGS-sets, which are new in this work.

Page 20: 1 Challenge the future KOALA-C: A Task Allocator for Integrated Multicluster and Multicloud Environments Presenter: Lipu Fei Authors: Lipu Fei, Bogdan

20Challenge the future

• Task Assignment by Guessing Size (TAGS)[3]:• use no prior information and use no prediction• jobs can be aborted• optimize short jobs

Traditional TAGS Policy

[3] Mor Harchol-Balter, “Task Assignment with Unknown Duration”, ACM 49, 2002

Page 21: 1 Challenge the future KOALA-C: A Task Allocator for Integrated Multicluster and Multicloud Environments Presenter: Lipu Fei Authors: Lipu Fei, Bogdan

21Challenge the future

TAGS-based Policy Design

• Goal: to achieve low slowdown without prediction• Method: to partition the sites into sets to serve jobs of

different runtime ranges

• A number of sets of sites• Set i allows jobs to run for Ti amount of time (Ti< Ti+1)• The last set has a T of ∞ (all jobs will finish without

being killed)

T1< T2< T3

T3= ∞

Page 22: 1 Challenge the future KOALA-C: A Task Allocator for Integrated Multicluster and Multicloud Environments Presenter: Lipu Fei Authors: Lipu Fei, Bogdan

22Challenge the future

TAGS-based Policy Design

Page 23: 1 Challenge the future KOALA-C: A Task Allocator for Integrated Multicluster and Multicloud Environments Presenter: Lipu Fei Authors: Lipu Fei, Bogdan

23Challenge the future

Policy DesignTAGS-based Policies in General

Page 24: 1 Challenge the future KOALA-C: A Task Allocator for Integrated Multicluster and Multicloud Environments Presenter: Lipu Fei Authors: Lipu Fei, Bogdan

24Challenge the future

Policy DesignTAGS-based Policies in General

Page 25: 1 Challenge the future KOALA-C: A Task Allocator for Integrated Multicluster and Multicloud Environments Presenter: Lipu Fei Authors: Lipu Fei, Bogdan

25Challenge the future

Policy DesignTAGS-chain and TAGS-sets

TAGS-chain:

TAGS-sets:

Page 26: 1 Challenge the future KOALA-C: A Task Allocator for Integrated Multicluster and Multicloud Environments Presenter: Lipu Fei Authors: Lipu Fei, Bogdan

26Challenge the future

Outline

• A Multicluster and Multicloud Environment• KOALA-C: a System for MC + MC Environments• Design for Policies without Prediction• Simulation and Real-World Experiments

• Simulation Results• Real-World Experiment Results

• Conclusion

Page 27: 1 Challenge the future KOALA-C: A Task Allocator for Integrated Multicluster and Multicloud Environments Presenter: Lipu Fei Authors: Lipu Fei, Bogdan

27Challenge the future

Simulation Results

• Resources:• 3 cluster sites (32 nodes each)• 1 private cloud site (≤64 VMs) and 1 public cloud site (≤128

VMs)• 6 policies: 5 baselines and TAGS-sets

• Workload:• 8 traces• job size limited to 32 nodes

• 6 Policies and 2 Metrics:• FF, RR, SJF, SJF-ideal, and HSDF as baselines + TAGS-sets• slowdown, total cost

• Simulation results agree with real-world experiment results

• * We have numerous experimental results in the paper

Experimental Setup

Parallel Workload Archive

Page 28: 1 Challenge the future KOALA-C: A Task Allocator for Integrated Multicluster and Multicloud Environments Presenter: Lipu Fei Authors: Lipu Fei, Bogdan

28Challenge the future

Simulation ResultsConfiguration of TAGS-sets

logscale

Page 29: 1 Challenge the future KOALA-C: A Task Allocator for Integrated Multicluster and Multicloud Environments Presenter: Lipu Fei Authors: Lipu Fei, Bogdan

29Challenge the future

Simulation ResultsConfiguration of TAGS-sets

More effect on slowdown

Page 30: 1 Challenge the future KOALA-C: A Task Allocator for Integrated Multicluster and Multicloud Environments Presenter: Lipu Fei Authors: Lipu Fei, Bogdan

30Challenge the future

Simulation ResultsConfiguration of TAGS-sets

More effect on slowdown

No common optimal runtime limit,but 40min seems to be suitable formost traces.

Page 31: 1 Challenge the future KOALA-C: A Task Allocator for Integrated Multicluster and Multicloud Environments Presenter: Lipu Fei Authors: Lipu Fei, Bogdan

31Challenge the future

Simulation ResultsPolicy Evaluation

Page 32: 1 Challenge the future KOALA-C: A Task Allocator for Integrated Multicluster and Multicloud Environments Presenter: Lipu Fei Authors: Lipu Fei, Bogdan

32Challenge the future

Simulation ResultsPolicy Evaluation

SJF-ideal has lowest slowdowns, but it cannot beachieved in reality.

Page 33: 1 Challenge the future KOALA-C: A Task Allocator for Integrated Multicluster and Multicloud Environments Presenter: Lipu Fei Authors: Lipu Fei, Bogdan

33Challenge the future

Simulation ResultsPolicy Evaluation

TAGS-sets has the lowest slowdowns overall

PWA-1PWA-2

PWA-3PWA-4

PWA-5GWA-1

GWA-2GWA-3

Page 34: 1 Challenge the future KOALA-C: A Task Allocator for Integrated Multicluster and Multicloud Environments Presenter: Lipu Fei Authors: Lipu Fei, Bogdan

34Challenge the future

Simulation ResultsPolicy Evaluation

>10x improvement over SJF

>25x improvement over SJF

PWA-1PWA-2

PWA-3PWA-4

PWA-5GWA-1

GWA-2GWA-3

Page 35: 1 Challenge the future KOALA-C: A Task Allocator for Integrated Multicluster and Multicloud Environments Presenter: Lipu Fei Authors: Lipu Fei, Bogdan

35Challenge the future

Simulation ResultsPolicy Evaluation

TAGS-sets can achieve good performance inmulticluster and multicloud environments.

PWA-1PWA-2

PWA-3PWA-4

PWA-5GWA-1

GWA-2GWA-3

Page 36: 1 Challenge the future KOALA-C: A Task Allocator for Integrated Multicluster and Multicloud Environments Presenter: Lipu Fei Authors: Lipu Fei, Bogdan

36Challenge the future

Simulation ResultsPolicy Evaluation

logscale

PWA-1PWA-2

PWA-3PWA-4

PWA-5GWA-1

GWA-2GWA-3

Page 37: 1 Challenge the future KOALA-C: A Task Allocator for Integrated Multicluster and Multicloud Environments Presenter: Lipu Fei Authors: Lipu Fei, Bogdan

37Challenge the future

Simulation ResultsPolicy Evaluation

TAGS-sets incurs 1.3 to 4.2x higher costs than the others

PWA-1PWA-2

PWA-3PWA-4

PWA-5GWA-1

GWA-2GWA-3

Page 38: 1 Challenge the future KOALA-C: A Task Allocator for Integrated Multicluster and Multicloud Environments Presenter: Lipu Fei Authors: Lipu Fei, Bogdan

38Challenge the future

Simulation ResultsPolicy Evaluation

TAGS-sets can achieve a good performance-cost tradeoff in an integrated multicluster and multicloud environment.

PWA-1PWA-2

PWA-3PWA-4

PWA-5GWA-1

GWA-2GWA-3

Page 39: 1 Challenge the future KOALA-C: A Task Allocator for Integrated Multicluster and Multicloud Environments Presenter: Lipu Fei Authors: Lipu Fei, Bogdan

39Challenge the future

Real-World Experiment Results

• Resources:• 2 cluster sites of the DAS-4 system (32 nodes each)• OpenNebula-based private cloud of DAS-4 (up to 32 VMs)• Amazon EC2 as public cloud (up to 64 VMs)

• Workload:• A part of the CTC-SP2 workload (≈12 hours)• 70% average utilization on the system (with the max

cloud size)

• Policies:• FF, SJF, and TAGS-sets• TAGS-sets setup:

Experimental Setup

≤10min

[4] James Patton Jones and Bill Nitzberg, “Scheduling for Parallel Supercomputing: A Historical Perspective of Achievable Utilization”, 1999

Page 40: 1 Challenge the future KOALA-C: A Task Allocator for Integrated Multicluster and Multicloud Environments Presenter: Lipu Fei Authors: Lipu Fei, Bogdan

40Challenge the future

Experimental Results

Page 41: 1 Challenge the future KOALA-C: A Task Allocator for Integrated Multicluster and Multicloud Environments Presenter: Lipu Fei Authors: Lipu Fei, Bogdan

41Challenge the future

Experimental Results

Lower and morebalanced slowdowns

Page 42: 1 Challenge the future KOALA-C: A Task Allocator for Integrated Multicluster and Multicloud Environments Presenter: Lipu Fei Authors: Lipu Fei, Bogdan

42Challenge the future

Experimental Results

Page 43: 1 Challenge the future KOALA-C: A Task Allocator for Integrated Multicluster and Multicloud Environments Presenter: Lipu Fei Authors: Lipu Fei, Bogdan

43Challenge the future

Experimental Results

Short jobs have shorter wait timesLong jobs have longer wait times

Page 44: 1 Challenge the future KOALA-C: A Task Allocator for Integrated Multicluster and Multicloud Environments Presenter: Lipu Fei Authors: Lipu Fei, Bogdan

44Challenge the future

Experimental Results

TAGS-sets has better optimization for short jobs than the other policies.

Page 45: 1 Challenge the future KOALA-C: A Task Allocator for Integrated Multicluster and Multicloud Environments Presenter: Lipu Fei Authors: Lipu Fei, Bogdan

45Challenge the future

Outline

• A Multicluster and Multicloud Environment• KOALA-C: a System for MC + MC Environments• Design for Policies without Prediction• Simulation and Real-World Experiments• Conclusion

Page 46: 1 Challenge the future KOALA-C: A Task Allocator for Integrated Multicluster and Multicloud Environments Presenter: Lipu Fei Authors: Lipu Fei, Bogdan

46Challenge the future

Conclusion

• We designed KOALA-C, a task scheduler for integrated multicluster and multicloud environments.

• We designed two new scheduling policies without prediction for multicluster and multicloud environments.

• Through experiments, we show that two new policies can achieve significant improvement in performance.

• For the future, we plan to extend our policy set.

Page 47: 1 Challenge the future KOALA-C: A Task Allocator for Integrated Multicluster and Multicloud Environments Presenter: Lipu Fei Authors: Lipu Fei, Bogdan

47Challenge the future

More Information• PDS group:

• Parallel and Distributed Systems Group @ TU Delft

• Publications:• see PDS publication database at:www.pds.ewi.tudelft.nl/research-publications/publications

• Home pages:• www.pds.ewi.tudelft.nl/epema• www.pds.ewi.tudelft.nl/~iosup

• Web sites:• KOALA: www.st.ewi.tudelft.nl/koala• DAS-4: www.cs.vu.nl/das4• GWA: gwa.ewi.tudelft.nl

Page 48: 1 Challenge the future KOALA-C: A Task Allocator for Integrated Multicluster and Multicloud Environments Presenter: Lipu Fei Authors: Lipu Fei, Bogdan

48Challenge the future

Thank You

Questions?