Power-aware Resource Allocation for Cpu - and Memory Intense Internet Services

Power-aware Resource Allocation for Cpu- and Memory Intense

Internet ServicesVlasia Anagnostopoulou (vlasia@cs.ucsb.edu),

Susmit Biswas, Heba Saadeldeen, Ricardo Bianchini, Tao Yang, Diana Franklin, Frederic T. Chong

University of California, Santa Barbara

First E2DC Workshop08/05/2012

Cpu- and Memory Intense Internet-Services

MapReduce, Hadoop,…

• Latency-bound• Intense

computation (=>high cpu utilization)

• Petascale data

Datacenter clusters

Datacenter cluster operation

Challenges• Standard middleware algorithms are inefficient

for cpu- and memory-intense internet services• Resource allocation operates at a fine-

granularity– But is oblivious of the SLA

• Power management is SLA-aware– But is only driven by the CPU– Coarse-grained

• Request distribution does not operate at a resource granularity

Overview of solution

• SLA-aware and fine-grained• Two steps:– Configure states of servers (basic power-aware resource

allocation)– Allocate resources to servers (cpu and memory)

Resource Allocation

Power Management

Request distribution

Power-aware Resource Allocation for cpu and memory

Adjusted Request distribution

Standard Middleware Optimized Middleware

Contents

• Introduction• Power-aware Resource Allocation– Basic– With Support for Multiple Applications– Adjusted Request Distribution

• Methodology• Experiments• Conclusion

Basic Power-aware Resource Allocation

• Configure server states: – Active, off, low-power state

• Problem of memory being inaccessible– Internet-services have high memory demand (for

caching)• Solution: use a memory-active, low-power

state (barely-alive)– Memory is on– Server is not operational, but memory can be

remotely accessed– Memory contributes to global cache

Details of Barely-alive state

Basic Power-aware Resource Allocation

• Calculations:• Active servers to service load– N_cpu_act = Load_demand / Cpu_capacity

• Memory-active servers to satisfy memory demand– Active or barely-alive– N_mem_act = Memory_demand/ Mem_capacity

• Configure to maximize energy savings, or to maximize memory allocation

Example• N=5 servers• Cpu-capacity = 1,000 conn.• Mem-capacity = 1GB• Load = 3,000 conn.• Target mem-alloc = 4GB• Maximize energy-savings:

• Maximize memory alloc.:• Mem. usage: 0.8GB/server

• How to control the memory allocation?

Memory Allocation for SLA• Two objectives:• 1) Allocate memory for SLA• 2) Share memory among services with SLA

guarantees– Must be fair; accept priority– Guarantee minimum performance

• Characteristics:• Uniform allocation per server (to avoid

imbalance)• Memory performance monitoring capability

which is SLA-aware

Memory allocation for SLA

• Utilize stack algorithm [Mattson]– Measures contribution of memory

size to the hit-rate– Hit-rate is used as proxy of

performance• Server-level: Calculate alloc for

target-hit-rate– Attach SLA mapping

• Cluster-level: calculate avg size for target hit-rate

• How to allocate memory when constrained?

Size Hits Hit-ratio

1 6/9 66.7%

2 1/9 77.8%

3 0/9 77.8%

Server Size

1 22 2… …5 2

Avg: 2

SLA/Memory Sharing• Aggregate metric of performance

– sum of allocations which yield performance closest to SLA• Linear optimization problem to maximize aggregate

performance: • at each step, allocate memory s.t. to minimize aggregate

performance• subject to memory capacity constraint• guarantee min SLA for each app

Size Hit-ratio

1 66.7% #3

2 77.8% #2

3 77.8% #2

{app1, app2} => Target SLA {#2, #2}

dist_to_SLA_alloc = ∞ dist_to_SLA_alloc = ∞

dist_to_SLA_alloc = 1 dist_to_SLA_alloc = 1

dist_to_SLA_alloc = 0 dist_to_SLA_alloc = 0

Request Distribution

Processing…

Adjusted Request Distribution

Processing…

Contents

• Introduction• Power-aware Resource Allocation– Basic– With Support for Multiple Applications– Adjusted Request Distribution

• Methodology– Simulator– Traces

• Experiments• Conclusion

Methodology• Datacenter-cluster simulator:– 1 rack– trace-based functional simulator

• Simulate all standard and proposed middleware algorithms

• Traces:– Internet-search “snippet”

generator

Contents• Introduction• Power-aware Resource Allocation

– Basic– With Support for Multiple Applications– Adjusted Request Distribution

• Experiments– Basic Algorithm– Shared Cluster

• Conclusion

Experiments – Basic Algorithm• Evaluate various configuration objectives:

• Barely-alive: maximize memory allocation; Mixed: maximize energy savings

• Fix SLA, evaluate energy savings only. Also, evaluate residual memory.

• SLA #1, #2, #3: Response time degradation 1-2%, 2-3%, 3-4%

• Aggressiveness of consolidation: 50, 70, 85%

System Active Off Barely-alive

Baseline Y N NOn/Off Y Y N

BA Y N YMixed Y Y Y

Results – basic algorithm

• Mixed system has highest energy savings; up to 42% (24% over On/Off)

• BA: up to 34% (20% over On/Off)

Results – basic algorithm

• Mixed system is most stable• In barely-alive system savings depend on the SLA level;

can push the parameter for savings aggressiveness• On/off system savings are influenced by both

parameters. Degrade significantly at high SLA levels

Results - Basic algorithm• BA: up to extra 7.5GB memory: allocate to

another application, transition to low-power etc

Results – Cluster Sharing

Results – Cluster sharing

Contents• Introduction• Power-aware Resource Allocation

– Basic– With Support for Multiple Applications– Adjusted Request Distribution

• Experiments– Basic Algorithm– Shared Cluster

• Conclusion

Conclusion• Combine power management and resource

allocation => power-aware resource allocation• SLA-driven, fine grained management of

datacenter clusters– Performance guarantees + energy savings

• Flexibility to different optimizations for datacenter scenarios

• Achieve deep energy savings or potential for more memory utility out of cluster

• Holistic design of middleware software

Thank you for your attention!!!

Questions?

Contact:vlasia@cs.ucsb.edu

URL: www.cs.ucsb.edu/~vlasia

Power-aware Resource Allocation for Cpu - and Memory Intense Internet Services

Documents

Intense experience

Intense Pink

Omron - CS1D-CPU H CPU Units CS1D-CPU S CPU Units ......CS1D CPU Units Function CS1D-CPU@@H CS1D-CPU@@S Duplex CPU, Single I/O Expansion System Duplex CPU, Dual I/O Expan-sion System

CPU-CPU Communication with SIMATIC Controllers

CPU Performance Pipelined CPU - Cornell University

CPU CPU 81,349 CPU CPU X86 CPU CPU CPU CPU CPU ) , CPU

CJ2H-CPU6 -EIP CJ2H-CPU6 CJ2M-CPU CJ2 CPU Unit Hardware · 2020-02-18 · Software setting methods for the CPU Unit (including I/O memory allocation, PLC Setup settings, Special I/O

DDH INVESTMENT ACCESS FUNDS · 2016. 10. 11. · Distributions paid and payable 138,990 461,960 854,974 815,972 365,243 316,091 449,310 627,142 CPU CPU CPU CPU CPU CPU CPU CPU Distributions

Intense Concentration

1 Further Demands on beam Intense, Intense, Intense, ….Intense, Intense, Intense, …. –Very far detector, extremely small cross section, search small osci

Register Allocation Via Graph Coloring · 17/04/2015 · Register Allocation is the process of assigning a large number of target program variables onto a small number of CPU registers

Energy balance during intense and super-intense magnetic storms

CJ1H-CPU H-R, CJ1G/H-CPU H, CJ1G-CPU P, CJ1G-CPU , CJ1M ...€¦ · the CS1D CPU Units for Single-CPU Systems, begins at version 2.0. • The unit version of the CS1D CPU Units for

Adaptive CPU Allocation for Software based Router Systems

Adaptive CPU Allocation for Software based Router Systems Puneet Zaroo

rubio nacarado intenso blond nacré intense 8.22 8Bv light intense beige blonde rubio claro nacarado intenso blond clair nacré intense 9.22 9Bv very light intense beige blonde

RHBC 164: Faithful: Intense Times Require Intense Prayer

The CPU Scheduling CPU Scheduling

Cloud Based Application Architectures using Smart Computing...CPU and memory usage through complete data center management and resource allocation. The result is that Joyent Smart

Siemens - S7-300 Instruction List CPU 31x, CPU 31x, …...S7-300 Instruction List CPU 31xC, CPU 31x, IM 151-7 CPU, IM 151-8 CPU, IM 154-8 CPU, BM 147-1 CPU, BM 147-2 CPU This instruction