28
Design and Analysis of an Energy Agile Cluster Computing System Andrew Krioukov, Prashanth Mohan, Stephen Dawson- Haggerty, Sara Alspaugh, David Culler, Randy Katz 1

Design and Analysis of an Energy Agile Cluster Computing System

  • Upload
    azana

  • View
    17

  • Download
    0

Embed Size (px)

DESCRIPTION

Design and Analysis of an Energy Agile Cluster Computing System. Andrew Krioukov , Prashanth Mohan, Stephen Dawson-Haggerty, Sara Alspaugh , David Culler, Randy Katz. Grid Evolution. renewable, variable, intermittent, greatly non-dispatchable. non-renewable, reactive, dispatchable. - PowerPoint PPT Presentation

Citation preview

Page 1: Design and Analysis of  an Energy Agile Cluster  Computing System

1

Design and Analysis of an Energy Agile Cluster Computing SystemAndrew Krioukov, Prashanth Mohan, Stephen Dawson-Haggerty, Sara Alspaugh, David Culler, Randy Katz

Page 2: Design and Analysis of  an Energy Agile Cluster  Computing System

2

Grid Evolution

SUPPLIES

LOADS

mostly dispatchable

renewable, variable, intermittent, greatly non-dispatchable

oblivious, stochastic, mostly non-power proportional

reactive, mostly power proportional

TODAY IDEAL FUTURE

oblivious, flat

OLD GRID

non-renewable, reactive, dispatchable

Page 3: Design and Analysis of  an Energy Agile Cluster  Computing System

3

Grid Evolution

SUPPLIES

LOADS

mostly dispatchable

renewable, variable, intermittent, greatly non-dispatchable

oblivious, stochastic, mostly non-power proportional

reactive, mostly power proportional

TODAY IDEAL FUTURE

oblivious, flat

OLD GRID

non-renewable, reactive, dispatchable

Page 4: Design and Analysis of  an Energy Agile Cluster  Computing System

4

Grid Evolution

SUPPLIES

LOADS

mostly dispatchable

renewable, variable, intermittent, greatly non-dispatchable

oblivious, stochastic, mostly non-power proportional

power proportional, reactive, grid-aware

TODAY IDEAL FUTURE

oblivious, flat

OLD GRID

non-renewable, reactive, dispatchable

Page 5: Design and Analysis of  an Energy Agile Cluster  Computing System

5

Grid

Internet

SUPPLIES:provide powercommunicate renewable availability, price

LOADS:adapt demand

communicate forecast

electricity

information

Pieces Needed

?

Page 6: Design and Analysis of  an Energy Agile Cluster  Computing System

6

Non-dispatchable, variable supply

Power proportional, grid-aware loads

NREL Western Wind and Solar Integration Study Datasethttp://wind.nrel.gov/Web_nrel/

Pacheco wind farm

Scientific computing cluster

Figure of merit: amount of wind used.How do we get here?

Renewable Integration

Page 7: Design and Analysis of  an Energy Agile Cluster  Computing System

7

POW

ER

TIME

oblivious, flat load

dispatchable supply

power proportionality

grid-awareness

Page 8: Design and Analysis of  an Energy Agile Cluster  Computing System

8

Data Center Loads

IT Equip-ment 61%

Cooling30%

Power Cond.8%

Light1%

data center consumption dominated by IT loadIT load driven by workload

need power proportionalityneed load shaping mechanism

Server Idle:Peak

HP ProLiant DL160

63.5%

Apple XServe 3.1 51.8%IBM System x3450

51.6%

Dell PowerEdge 2950

57.9%

Pelley, et. al, Understanding and Abstracting Total Data Center Power, 2009Barroso et. al. The Case for Energy-Proportional Computing, 2007

SPECpower Results http://www.spec.org/power_ssj2008/results/power_ssj2008.html

5,000 servers at Googleaverage 30% utilization

IT equipment is not power proportional

powe

r (W

)

utilization

Page 9: Design and Analysis of  an Energy Agile Cluster  Computing System

9

Power Proportionality

Spinning Reserve

Page 10: Design and Analysis of  an Energy Agile Cluster  Computing System

10

Architecture

Page 11: Design and Analysis of  an Energy Agile Cluster  Computing System

11

Outline• Motivation• Enabling technology• Methodology• Algorithms• Evaluation

Page 12: Design and Analysis of  an Energy Agile Cluster  Computing System

12

Renewable Energy Component

Page 13: Design and Analysis of  an Energy Agile Cluster  Computing System

13

Formulation

We assume the wind farm is sized for the data center.

Option 1: grid blend (open system)

Wind

Other

Requires assuming load is negligible fraction of grid – not realistic

Option 2: dedicated wind farm (closed system)

Fit load to specific wind farm

http://www.greenhousedata.com/

Page 14: Design and Analysis of  an Energy Agile Cluster  Computing System

14

Wind

Wind power over 48 hours from a wind farm in Monterrey County, California.

Variation in wind power for month long intervals at multiple wind farms.

Page 15: Design and Analysis of  an Energy Agile Cluster  Computing System

15

Workload Component

Page 16: Design and Analysis of  an Energy Agile Cluster  Computing System

16

Workloads

Torque jobs

Num

Jobs

Batch: Less latency sensitive, longer jobse.g., analytics, scientific computing

Requ

est R

ate

Wikipedia traffic

Interactive: Latency sensitive, generally short jobse.g., web app server, email server, etc.

Page 17: Design and Analysis of  an Energy Agile Cluster  Computing System

23

Slackslack = max run time – job duration

Page 18: Design and Analysis of  an Energy Agile Cluster  Computing System

24

Cluster: NERSC FranklinAverage duration: 98 minAverage slack: 68 min

Cluster: EECS PSIAverage duration: 55 minAverage slack: 17 hours

Slack in Real Systems

Page 19: Design and Analysis of  an Energy Agile Cluster  Computing System

25

Grid-Aware Batch Scheduling• example goal:

shape load to match wind availability

• method: exploit temporal slack

Pacheco wind farm

Scientific computing cluster

Page 20: Design and Analysis of  an Energy Agile Cluster  Computing System

26

Greedy AlgorithmB(t) = power budget for next 10 min

Sort jobs by slack

Schedule all jobs with no remaining slack

Schedule other eligible jobs in least-remaining-slack order until B(t) is exceeded

Page 21: Design and Analysis of  an Energy Agile Cluster  Computing System

27

Run-immediately, grid-oblivious scheduler

Greedy, grid-aware scheduler

Grid-aware scheduling increases wind energy use.

Correspondingly, reduces grid dependence.

Page 22: Design and Analysis of  an Energy Agile Cluster  Computing System

29

Reduction in grid dependence is robust to choice of wind farm.

Page 23: Design and Analysis of  an Energy Agile Cluster  Computing System

30

Page 24: Design and Analysis of  an Energy Agile Cluster  Computing System

31

As slack increases, grid dependence diminishes.

PSI Franklin

Page 25: Design and Analysis of  an Energy Agile Cluster  Computing System

32

Grid-aware scheduling is equivalent to 5 hours worth of data center-sized batteries.

Page 26: Design and Analysis of  an Energy Agile Cluster  Computing System

33

Grid-aware scheduling is equivalent to 5 hours worth of data center-sized batteries.

4

Page 27: Design and Analysis of  an Energy Agile Cluster  Computing System

34

Summary• Power proportionality and grid-aware

scheduling• Energy savings, renewable integration,

grid stabilityreduce grid dependence by halfequivalent to 5 hours of batteries

• Next stepsslack in other systems...?

Page 28: Design and Analysis of  an Energy Agile Cluster  Computing System

QUESTIONS?THE END

35