AWS re:Invent 2016: From Dial-Up to DevOps - AOL’s Migration to the Cloud (DEV202)

© 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved.

Alan Milford, Systems Architect, AOL Platforms

Ilan Rabinovitch, Director of Technical Community, Datadog

November 30, 2016

DEV202

From Dial-up to DevOps: AOL's Migration to the Cloud

1983

Apple Lisa - Victor R. Ruiz - https://flic.kr/p/AoE6LK

A few things have changed...

USRobotics Sportster 14,400 modem - Ramon Machado - http://bit.ly/2cn8FKc

68 acquisitions

15 advertising companies

Online advertising across desktop, mobile, and TV

What that looks like internally

• Many complex systems

• High traffic, low latency response

• Downtime isn't an option

• Every transaction has a cost and value

What does DevOps mean to us?

Culture, Automation, Measurement, Sharing

John Willis (@botchagalupe), Damon Edwards (@damonedwards)

6 years ago...

Starting a new team

• New advertising product

• New R&D engineering team

• New (untested) software

• No dedicated QA

A recipe for disaster...?

Not necessarily

Time to scale

Moving into production

• Rough estimate of full traffic load

• Look at current CPU/memory/IO

• Do some maths

• Purchase appropriate hardware
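
The "do some maths" step above can be sketched as a back-of-the-envelope calculation. This is a minimal illustration and not the team's actual method: the function name, the 60% utilisation target, and every figure in the usage line are hypothetical assumptions.

```python
import math


def servers_needed(peak_rps, rps_per_server, target_utilization=0.6, spare=1):
    """Estimate how many servers to buy for an expected peak load.

    All parameters are illustrative assumptions:
      peak_rps           -- rough estimate of full traffic load (requests/s)
      rps_per_server     -- throughput one server sustains today, taken from
                            current CPU/memory/IO measurements
      target_utilization -- fraction of that throughput we plan to use
      spare              -- extra machines held back for failures
    """
    if not 0 < target_utilization <= 1:
        raise ValueError("target_utilization must be in (0, 1]")
    if rps_per_server <= 0:
        raise ValueError("rps_per_server must be positive")
    usable_rps = rps_per_server * target_utilization
    return math.ceil(peak_rps / usable_rps) + spare


if __name__ == "__main__":
    # Hypothetical figures: 50,000 req/s peak, 1,200 req/s per server.
    print(servers_needed(50_000, 1_200, target_utilization=0.6, spare=2))
```

The weakness of this approach is that the answer is fixed at purchase time: if the traffic estimate is wrong, the hardware is already bought.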

Done

Made it to production

• Everything was stable

• Traffic was steady

• System worked as expected

• Product owners & engineers were happy

Everything's running smoothly

[Chart: traffic over time]

Wait, what?

[Chart: traffic over time]

What happened?

• Complete data centre outage (not our facility)

• Servers powered up in random order

• Data corruption

• Recovery time was out of our hands

"Success consists of going from failure to failure without loss of enthusiasm."

Winston Churchill

This isn’t going to work

Making our teams better

• Need to speed up development cycle

• Agile project management

• Remove layers of communication

• Remove manual steps

Agile is changing thinking

DevOps is changing doing

Culture, Automation, Measurement, Sharing

Culture

• People and process first

• Teamwork is key

• Initiative to find solutions

• More important than any tool

Automation

• Replaced people with shell scripts*

• Scaled with code rather than people

• Automated builds and deploys

• Made things virtual not physical

* I guarantee someone here is wearing this t-shirt

To the cloud

Internal private cloud

• Improved flexibility

• Management all in-house

• Development speed increased

• Parity between environments

Internal private cloud

• Strong internal security

• Still limited by capacity

• Provisioning more hypervisors is difficult

• Responsible for own geo distribution

That capital cost didn't just go away

Datacenter Backup batteries - Jelson25 - http://bit.ly/2cwldN5

Building on AWS

• Capital expense to variable expense

• No need to worry about hardware refresh

• Lead time to spin up reduced

• Time to deployment reduced

Building on AWS

• Open flexibility to engineers

• Much quicker to try something new

• Iteration cycles speed up

• No waiting on hardware

Treat servers like cattle, not pets

but…

Do you know what this cow is doing?

Collecting data is cheap; not having it when you need it can be expensive

Measurement/metrics

• If it moves, measure it

• How is the application performing?

• Are things getting better? Or worse?

• Remove the guesswork from decisions
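
One way to answer "are things getting better or worse?" without guesswork is to compare a latency percentile across two time windows. The sketch below is an assumption-laden illustration, not the tooling used in the talk: the function names, the choice of the 95th percentile, the 5% tolerance, and the sample values are all hypothetical.

```python
import math


def percentile(samples, pct):
    """Nearest-rank percentile of a non-empty list of numbers."""
    if not samples:
        raise ValueError("samples must not be empty")
    ordered = sorted(samples)
    rank = max(1, math.ceil(pct / 100 * len(ordered)))
    return ordered[rank - 1]


def latency_trend(before_ms, after_ms, pct=95, tolerance=0.05):
    """Classify the change in a latency percentile between two windows.

    Returns "better", "worse", or "unchanged". A relative change smaller
    than `tolerance` (5% by default, an arbitrary choice) counts as noise.
    """
    old = percentile(before_ms, pct)
    new = percentile(after_ms, pct)
    if old <= 0:
        raise ValueError("baseline percentile must be positive")
    change = (new - old) / old
    if change > tolerance:
        return "worse"
    if change < -tolerance:
        return "better"
    return "unchanged"


if __name__ == "__main__":
    # Hypothetical response times in milliseconds, before and after a deploy.
    before = [10, 12, 11, 13, 50]
    after = [10, 12, 11, 13, 20]
    print(latency_trend(before, after))
```

A percentile is used rather than the mean because a handful of slow requests can hide behind a healthy-looking average.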

@honest_update

Sharing

• Encourage teams to share their data

• Help people to solve their own issues

• Everyone plays a part in getting to production

• Play nice with others

Datadog Slack

What did we learn?

• Push hard to refactor applications

• Changing mindsets is harder than changing tech

• Empower people to make their own decisions

• Give them the data to do it

This is a learning process

• Don't search for a silver bullet

• Data is key to knowing what's going on

• Understand workloads

• You wouldn't heat your home all day in summer

Build a culture that can adapt quickly to change

Process should make life easier, not be a checklist

Don't get complacent; technology moves quicker than people

@alz

@irabinovitch

Thank you!

Remember to complete your evaluations!