45
© 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Minoo Duraipandy AWS Solution Architect June 30, 2016 Cost Optimization at Scale (Building and Realizing the Economic Case for the AWS Cloud)

Cost Optimization at Scale

Embed Size (px)

Citation preview

Page 1: Cost Optimization at Scale

© 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved.

Minoo Duraipandy

AWS Solution Architect

June 30, 2016

Cost Optimization at Scale

(Building and Realizing the Economic Case for the AWS Cloud)

Page 2: Cost Optimization at Scale

What to expect….

We will introduce our approach for building

the business case for moving to the cloud

and share tips from some of our most

innovative customers who are able to

successfully architect for cost optimization

in order to realize the economics of the

AWS cloud.

Page 3: Cost Optimization at Scale

In the beginning . . .

…there was TCO

Page 4: Cost Optimization at Scale

What is TCO?

Definition: Comparative total cost of ownership analysis (acquisition

and operating costs) for running an infrastructure environment end-to-end

on-premises vs. on AWS.

Used for:

1) Comparing the costs of running an entire infrastructure environment or

specific workload on-premises or in a co-location facility vs. on AWS

2) Budgeting and building the business case for moving to AWS

Page 5: Cost Optimization at Scale

So how do we do it?

Page 6: Cost Optimization at Scale

TCO = acquisition costs + operations costs

Hardware—server, rack

chassis PDUs, Tor switches

(+maintenance)

Software—OS,

virtualization licenses

(+maintenance)

Facilities cost

Hardware—storage disks,

SAN/FC switchesStorage admin costs

Network hardware—LAN

switches, load balancer

bandwidth costsNetwork admin costs

Server admin virtualization admin4

The diagram doesn’t include every cost item. For example, software costs can include database,

management, and middle-tier software costs. Facilities cost can include costs associated with upgrades,

maintenance, building security, taxes, and so on. IT labor costs can include security admin and application

admin costs.

Space Power Cooling

Facilities cost

Space Power Cooling

Facilities cost

Space Power Cooling

Server costs

Storage costs

Network costs

IT labor costs

1

2

3

illustrative

Page 7: Cost Optimization at Scale

Resources to get you started

AWS TCO Calculator

https://awstcocalculator.com

Case studies and research

http://aws.amazon.com/economics/

Page 8: Cost Optimization at Scale

So you’re feeling pretty good.

Page 9: Cost Optimization at Scale

Until your CFO shows up with the bill.

Page 10: Cost Optimization at Scale

Cost optimization is…

going from…

to…

pay for what you use

pay for what you need

Page 11: Cost Optimization at Scale

Where do you start?

Page 12: Cost Optimization at Scale

What levers did they pull?

Page 13: Cost Optimization at Scale

Commercial Cost Optimization Levers

Right-sizingReserved

Instances

Increase

elasticity

Measure,

monitor, and

improve

Page 14: Cost Optimization at Scale

Right-sizing

Right-sizing

• Selecting the cheapest instance available

while meeting performance requirements

• Looking at CPU, RAM, storage, and network

utilization to identify potential instances that

can be downsized

• Leveraging Amazon CloudWatch metrics and

setting up custom RAM metrics

Rule of thumb: Right size, then reserve.(But if you’re in a pinch, reserve first.)

Page 15: Cost Optimization at Scale

Reserved Instances

Commitment level1 year

3 year

AWS services offering RIsAmazon EC2

Amazon RDS

Amazon DynamoDB

Amazon Redshift

Amazon ElastiCache

* Dependent on specific AWS service, size/type, and region

Page 16: Cost Optimization at Scale

Reserved Instances

Step 1: RI Coverage

• Cover always-on resources.

Step 2: RI Utilization

• Leverage RI flexibility to increase utilization.

• Merge and split RIs as needed.

Rule of thumb: Target 70–80% always-on

coverage and 95% RI utilization rate.

Page 17: Cost Optimization at Scale

Increase elasticity

Turn off nonproduction instances

• Look for dev/test, nonproduction instances that

are running always-on and turn them off.

Autoscale production

• Use Auto Scaling to scale up and down based

on demand and usage (for example, spikes).

Rule of thumb: Shoot for 20–30% of Amazon EC2

instances running on demand to be able to

handle elasticity needs.

Page 18: Cost Optimization at Scale

Using right-sizing and elasticity to lower cost

More smaller instances vs. fewer larger instances

29 m4.large @ $0.12 /hr

$2,505.60 / mo*

59 t2.medium @ $0.052/hr

$2,208.96 / mo*

*Assumes Linux instances in US-East at 720 hours per month

Page 19: Cost Optimization at Scale

Putting it all together: case study

Page 20: Cost Optimization at Scale

Example

A Technology Company

Page 21: Cost Optimization at Scale

A Technology Company

In the last three

months…

Page 22: Cost Optimization at Scale

A Technology Company

Doubled the CPU

and traffic used by

its Web servers

Page 23: Cost Optimization at Scale

A Technology Company

Reduced its

instance spend

by 33%

$72k saving per month!

Page 24: Cost Optimization at Scale

Challenge:

Minimizing unit costs

under period of massive

growth.

A consistent measure of

CPU processing power

Elastic compute unit

(ECU)

Page 25: Cost Optimization at Scale

The growth challenge

August 2014

August 2015

584 ECU

1,192 ECU

2x YoY Compute Growth

33% decrease in monthly

EC2 costs!

Page 26: Cost Optimization at Scale

Solving the growth challenge

Page 27: Cost Optimization at Scale

Step 1: Right-size and update instances

m1 on demand

$0.07 per ECU

c4 on demand

$0.02 per ECU

Page 28: Cost Optimization at Scale

The impact of right-sizing

70% reduction

in unit cost

Page 29: Cost Optimization at Scale

Step 2: Reserve

Page 30: Cost Optimization at Scale

The impact of reservations

30% reduction

In unit cost

Page 31: Cost Optimization at Scale

Putting it together

85% reduction

in unit cost!

Page 32: Cost Optimization at Scale

Sounds pretty easy, right?

Not really.

In reality, it is very complex.

• Scale

• Behavioral change

• Visibility

• Ownership

Page 33: Cost Optimization at Scale

Cost optimization governance

(Remember the fourth pillar?)

Page 34: Cost Optimization at Scale

Uncovering the cost optimization opportunities

1. Auto-tag resources.

2. Identify always-on nonprod.

3. Identify instances to down-size.

4. Recommend RIs to purchase.

5. Dashboard our status.

6. Report on savings.

Page 35: Cost Optimization at Scale

Action Changes

1. Allocate costs by tag &

account

2. Turn off Non-Prod instances

daily

3. Quickly change instance

sizes

4. Move underutilized RIs

Automation

What we need to do

Page 36: Cost Optimization at Scale

AWS options

Page 37: Cost Optimization at Scale

Reserved Instances and right-sizing options

Page 38: Cost Optimization at Scale

Example: reasonable optimization dashboard

Page 39: Cost Optimization at Scale

Creating a culture of cost transparency

Targets and metrics Cloud Competency

Center

AWS Enterprise

Support

Page 40: Cost Optimization at Scale

Putting it all together

Page 41: Cost Optimization at Scale

Where to start

Set up a Cloud

Competency Center

Bring in the right

tools

Use metrics to

reinforce behavior

Use partners to

accelerate!

Page 42: Cost Optimization at Scale

Metrics and targets

% instances turned off daily

% of instances right-sized

% always-on resources covered by RIs

% RI utilization

✔ ✔

✔ ✔

Set up metrics to define success and track progress

Page 43: Cost Optimization at Scale

Cycle of cost optimization

✔✘

$

$

$

$

$

Page 44: Cost Optimization at Scale

Handy Tools

Move RIs automatically

https://github.com/jros2300/reservedinstances

Tableau Templates

https://github.com/evancraw/AWSOptimizationTemplates(Dashboards, right-sizing, reserved capacity)

GorillaStack for Automatic Tagging

Start / Stop Non-Prod Daily

ape.gs/PowerCycleReInvent

http://ape.gs/AWSAutoTag

Page 45: Cost Optimization at Scale

Thank You!