The Unrealized Role of: Monitoring & Alerting @jasonhand | VictorOps | #AllDayDevOps

The Unrealized Role of Monitoring & Alerting w/ Jason Hand

Page 1: The Unrealized Role of Monitoring & Alerting w/ Jason Hand

The Unrealized Role of: Monitoring & Alerting

@jasonhand | VictorOps | #AllDayDevOps

Page 4

THE UNREALIZED ROLE OF: Monitoring & Alerting

Page 5

JASON HAND

DevOps Evangelist

VictorOps

Page 7

2015 MONITORING SURVEY

Page 9

WHY ARE YOU COLLECTING THIS DATA?

NOTE: You may choose more than one

▸ Performance analysis and trending
▸ Fault and Anomaly detection
▸ Capacity Planning
▸ A/B Testing

Page 10

THE RESULTS

NOTE: Respondents may have chosen more than one

▸ Performance analysis and trending - 63%
▸ Fault and Anomaly detection - 53%
▸ Capacity Planning - 45%
▸ A/B Testing - 11%

Page 11

Tyranny of the S.L.A. (Service Level Agreement)

Page 12

HIGH AVAILABILITY

Prediction & Prevention

Page 15

THAT'S IMPORTANT ... BUT ...

Page 17

BUSINESS OBJECTIVES?

Page 18

HAPPY CAMPER

Page 19

CUSTOMERS want more than just 99.999% UPTIME
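
For scale, an availability target translates directly into a downtime budget. The following is a minimal sketch of that arithmetic, assuming a 365-day year; the function name `downtime_budget_seconds` is illustrative and does not come from the talk.

```python
# Assumption: a 365-day year; leap years are ignored for simplicity.
SECONDS_PER_YEAR = 365 * 24 * 60 * 60


def downtime_budget_seconds(availability_percent, period_seconds=SECONDS_PER_YEAR):
    """Return the downtime (in seconds) allowed by an availability target."""
    unavailable_fraction = 1 - availability_percent / 100
    return period_seconds * unavailable_fraction


# Compare a few common targets over one year.
for target in (99.9, 99.99, 99.999):
    minutes = downtime_budget_seconds(target) / 60
    print(f"{target}% uptime allows about {minutes:.1f} minutes of downtime per year")
```

Under these assumptions, 99.999% leaves a little over five minutes of downtime per year, which is the kind of target the slide suggests customers do not value on its own.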

Page 21

WHERE'S THE INNOVATION?

Page 22

HOW IMPORTANT IS Learning & Innovation?

Page 27

The result of underutilizing monitoring & alerting is that the IT department and the organization have no chance to... LEARN, IMPROVE, OR INNOVATE.

Page 28

CONTINUALLY UNDERSTANDING & RESPONDING TO THE FEEDBACK from monitoring, logging, & alerting allows you to use information about events in the past to drive future actions.

Page 31

It's not just about PREDICTION & PREVENTION

Page 32

RESPOND & REPAIR ... QUICKLY

Page 33

NOPE

Page 34

MTTR Rather Than MTBF
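
One way to make the MTTR/MTBF comparison concrete is to compute both from incident records. The sketch below assumes each incident is a hypothetical (start, end) pair, treats MTBF as the mean uptime between the end of one incident and the start of the next, and uses the common steady-state approximation availability = MTBF / (MTBF + MTTR). The sample timestamps are invented for illustration.

```python
from datetime import datetime, timedelta

# Hypothetical incident log: (outage start, service restored).
INCIDENTS = [
    (datetime(2016, 1, 1, 0, 0), datetime(2016, 1, 1, 1, 0)),
    (datetime(2016, 1, 11, 1, 0), datetime(2016, 1, 11, 1, 30)),
    (datetime(2016, 1, 21, 1, 30), datetime(2016, 1, 21, 2, 0)),
]


def mttr(incidents):
    """Mean time to repair: average outage duration."""
    repairs = [end - start for start, end in incidents]
    return sum(repairs, timedelta()) / len(repairs)


def mtbf(incidents):
    """Mean uptime between the end of one incident and the start of the next.

    Needs at least two incidents.
    """
    ordered = sorted(incidents)
    gaps = [nxt[0] - prev[1] for prev, nxt in zip(ordered, ordered[1:])]
    return sum(gaps, timedelta()) / len(gaps)


def availability(mtbf_value, mttr_value):
    """Steady-state approximation: fraction of time the service is up."""
    return mtbf_value / (mtbf_value + mttr_value)


print("MTTR:", mttr(INCIDENTS))
print("MTBF:", mtbf(INCIDENTS))
print(f"Availability: {availability(mtbf(INCIDENTS), mttr(INCIDENTS)):.4%}")
```

Because failures still occur no matter how large MTBF grows, the repair time is the figure a team can measure and shorten after every incident.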

Page 35

FAILURE IS INEVITABLE

Page 36

US·ER /ˈYOOZƏR/

DISTRIBUTED FAULT INJECTION TEST SUITE FOR PRODUCTION.

credit: Leon Fayer (@papa_fire)

Page 37

SUCCESS is a result of FAILURE

Page 38

UNDERSTAND
LEARN
INNOVATE

Page 39

RE·SIL·IENT /RƏˈZILYƏNT/

The ability to resist, absorb, recover from, or successfully adapt to adversity or a change in conditions

Page 40

CHANGE can cause failure but innovation requires CHANGE

Page 41

CONFLICT

Page 42

CHANGE REQUIRED

Page 43

Without deviation from the norm, progress is not possible
- Frank Zappa

Page 44

What Did You LEARN From the Recovery Efforts? (including monitoring & alerting)

Page 45

POSTMORTEMS / LEARNING REVIEWS:

Stories of: WHAT TOOK PLACE leading up to & during the disruption & recovery efforts

Page 46

WHO WAS INVOLVED?

Page 47

WHAT DID THEY SEE?

Page 48

WHAT WAS SAID?

Page 49

WHAT ACTIONS WERE TAKEN?

jhand.co/chatopsbook

Page 50

HOW DO events & actions CORRELATE OVER TIME?
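
One way to explore that question is to merge what the monitoring system saw with what people did into a single timeline. The sketch below is only an illustration: the event tuples, their wording, and the `build_timeline` helper are hypothetical and not taken from the talk or from any particular tool.

```python
import heapq
from datetime import datetime

# Hypothetical records; each stream is already sorted by timestamp.
alerts = [
    (datetime(2016, 4, 1, 9, 0), "alert", "disk usage above 90% on db-1"),
    (datetime(2016, 4, 1, 9, 12), "alert", "db-1 not responding"),
]
actions = [
    (datetime(2016, 4, 1, 9, 5), "action", "on-call engineer acknowledges the alert"),
    (datetime(2016, 4, 1, 9, 20), "action", "old log files removed, db-1 restarted"),
]


def build_timeline(*streams):
    """Merge time-sorted event streams into one chronological list."""
    return list(heapq.merge(*streams, key=lambda event: event[0]))


# Print alerts and human actions interleaved in the order they happened.
for when, kind, text in build_timeline(alerts, actions):
    print(f"{when:%H:%M}  {kind:<6}  {text}")
```

Reading alerts and actions side by side in time order helps a learning review tell the story of what was seen, said, and done.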

Page 51

5 Whys

Page 53

WHAT IS THE "cause" OF THE PROBLEM?

Root Cause is ...

Page 54

OUR... obsession with "Root Cause"

Page 55

ASKING "WHY" ... leads to ... BLAME

Page 56

BLAMING LEADS TO... operators hiding relevant & important information

Page 57

We must BELIEVE that our operators are doing their best given the constraints of the "system"

Page 58

"We are here to" LEARN From Failure (and success)

Page 59

RATHER THAN ...

Page 60

AVOID FAILURE

Page 61

WHAT'S THE STORY?

Page 62

INNOVATE

Learning from both success & failure to develop & implement small incremental improvements is critical.

Page 63

MONITORING & ALERTING

Helps us understand the story in greater detail

Page 64

LEARNING ORGANIZATION

Page 65

Learning does NOT come from READING & LISTENING

Page 66

Learning comes from DOING

Page 67

Real Learning comes from:

OBSERVING
ORIENTING
DECIDING
ACTING

John Boyd's OODA Loop

Page 68

Example: LEARNING TO PLAY THE DOBRO GUITAR

Page 70

LEARNING

Page 71

WHY?

Go from knowing... to understanding... to learning

NOTE: (Requires making mistakes)

Page 72

We will trade some uptime in exchange for innovation
- Dave Hahn (Netflix), DevOpsDays Boise 2016

Page 73

SHIFT OUR GAZE from: MAINTAINING & PROTECTING

Page 74

LEARNING

Which leads to...

IMPROVING & INNOVATING

Page 75

WE INCREASE VALUE OF:

- Monitoring & Alerting
- IT teams
- Products & Services
- Organization

Page 76

HYPOTHESIZE
EXPLORE
STRETCH
EXPERIMENT
FAIL
LEARN
Try Again

Page 78

LEARNING & INNOVATING leads to uncovering new ways of BUILDING, DEPLOYING, AND MAINTAINING SOFTWARE & INFRASTRUCTURE

Which leads to...

Page 79

RESILIENT SYSTEMS

Page 80

The By-product of a highly RESILIENT system is ...

Page 82

HIGHLY AVAILABLE SYSTEM

Page 83

THE UNREALIZED ROLE OF: Monitoring & Alerting is ....

Page 84

LEARNING & INNOVATION

Page 85

THANK YOU

Be Victorious!

Page 87

References:

Monitoring Survey: https://kartar.net/2015/08/monitoring-survey-2015---metrics/
Firefighter: https://www.learyfirefighters.org/wp-content/uploads/2013/09/cover-slide-1.jpg
Mechanic: https://upload.wikimedia.org/wikipedia/commons/4/4b/Flickr_-_Israel_Defense_Forces_-_Airplane_Technician,_March_2010.jpg
Gnome Plan: http://www.nerdfitness.com/wp-content/uploads/2012/04/Screen-Shot-2012-03-30-at-3.15.38-AM-1024x7591.jpg
NOC: https://upload.wikimedia.org/wikipedia/commons/0/03/

Page 88

References:

Kodak: http://file.answcdn.com/answ-cld/image/upload/v1/tk/brand_image/b59911fc/91d6e71d30a0878dfe3cb30a22751cb874a3ea8c.jpeg
VW Camper: https://upload.wikimedia.org/wikipedia/commons/d/d7/VW_Camper.jpg
Blockbuster: https://jordanandeddie.files.wordpress.com/2013/11/blockbuster-feature.jpg
Borders: http://smashingtops.com/wp-content/uploads/2012/06/borders_logo1.jpg

Page 89

Chained Hands: https://www.google.com/url?sa=i&rct=j&q=&esrc=s&source=images&cd=&ved=0ahUKEwjgrNCDh5TMAhXJs4MKHaoZDssQjBwIBA&url=http%3A%2F%2Fwww.publicdomainpictures.net%2Fdownload-picture.php%3Fadresar%3D50000%26soubor%3Dhands-in-chains.jpg%26id%3D40426&bvm=bv.119745492,d.amc&psig=AFQjCNFIdnDPzSqiLA-znIW5SCTCUHhqEw&ust=1460926880336203
Inevitable: http://vignette4.wikia.nocookie.net/matrix/images/5/51/SMITH.png/revision/latest?cb=20110214092002
Bulb: https://smhttp-ssl-37293.nexcesscdn.net/media/catalog/

Page 90

Accident Free: http://www.compliancesigns.com/media/digital-scoreboard/1000/Safety-Awareness-Sign-DSE-195271000.gif
Stewie: http://chroniclesofredmark.com/wp-content/uploads/2014/01/Stewie.gif
change: http://i.imgur.com/EQyC6N3.gif
Hard drive: https://i.imgur.com/pWsKSEf.gif
Change: https://farm6.staticflickr.com/5208/5270199049df99b234e9od.jpg
Value: https://d13yacurqjgara.cloudfront.net/users/6437/screenshots/1405551/value-cropped.gif