44
Some Conference SCALE 10x LCA 2012 Mistakes were made Selena Deckelmann [email protected] Twitter/IRC: @selenamarie

Mistakes were made - LCA 2012

Embed Size (px)

DESCRIPTION

 

Citation preview

Page 1: Mistakes were made - LCA 2012

Some Conference

SCALE 10x

LCA 2012

Mistakes were madeSelena Deckelmann

[email protected]/IRC: @selenamarie

Page 2: Mistakes were made - LCA 2012

LCA 2012

Failure

Page 3: Mistakes were made - LCA 2012

LCA 2012

“Prevention”“Risk management”

“Risk mitigation”“MTBF, MTTR”

“Success Engineering”

Page 4: Mistakes were made - LCA 2012

LCA 2012

Plan for the worst.Minimize risk.Fail.Recover, gracefully.

Page 5: Mistakes were made - LCA 2012

LCA 2012

“We don’t need a risk management plan,” he emphatically stated, “because this project can’t be allowed to fail.”

- Jim Hightower, http://jimhighsmith.com/2012/01/09/can-do-thinking-makes-risk-

management-impossible/

Page 6: Mistakes were made - LCA 2012

SCALE 10x

LCA 2012

Page 7: Mistakes were made - LCA 2012

LCA 2012

Failure is an option.

Page 8: Mistakes were made - LCA 2012

LCA 2012

SCIENCE

Page 9: Mistakes were made - LCA 2012

LCA 2012

Dr. Jerker Denrell 

Page 10: Mistakes were made - LCA 2012

SCALE 10x

LCA 2012

Page 11: Mistakes were made - LCA 2012

SCALE 10x

LCA 2012

Page 12: Mistakes were made - LCA 2012

SCALE 10x

LCA 2012

Page 13: Mistakes were made - LCA 2012

LCA 2012

"I think getting two accidents of this type at the same time

is a freak occurrence."-David Cunliffe, NZ Communications Minister

Page 14: Mistakes were made - LCA 2012

SCALE 10x

LCA 2012

Page 15: Mistakes were made - LCA 2012

SCALE 10x

LCA 2012

“Further damage was incurred on Tuesday afternoon and our engineers returned to repair the damage,” said Virgin Media.

Page 16: Mistakes were made - LCA 2012

LCA 2012

Plan for when things fail.

Page 17: Mistakes were made - LCA 2012

SCALE 10x

LCA 2012

Page 18: Mistakes were made - LCA 2012

SCALE 10x

LCA 2012

Page 19: Mistakes were made - LCA 2012

Some Conference

SCALE 10x

LCA 2012

Tales of failure to...

Document

Test

Verify

Imagine

Implement

Page 20: Mistakes were made - LCA 2012

SCALE 10x

LCA 2012

Failure to document.

Page 21: Mistakes were made - LCA 2012

SCALE 10x

LCA 2012

Moving Day

Thanks, David Prior!

Page 22: Mistakes were made - LCA 2012

Some Conference

SCALE 10x

LCA 2012

Prevent documentation failures.

• Write documentation.

• Update documentation.

• Make documenting a step in your written process.

• Assign a fixed amount of time to that step.

Page 23: Mistakes were made - LCA 2012

Some Conference

SCALE 10x

LCA 2012

Documentation tools

• Graphic designers. (Pretty wikis. Pretty docs. (Sphinx?) Diagrams.)

• Timelines.

• Bug tracking.

• Ordered todo lists.

Page 24: Mistakes were made - LCA 2012

SCALE 10x

LCA 2012

Failure to test.

Page 25: Mistakes were made - LCA 2012

LCA 2012

“My first day posing as a sysadmin (~1990, no previous training....) I deleted all zero length files on a Sun workstation.”

Page 26: Mistakes were made - LCA 2012

Some Conference

SCALE 10x

LCA 2012

Prevent testing failures.

• Verify success criteria.

• Write tests.

• Test with a buddy.

• Have a plan.

Page 27: Mistakes were made - LCA 2012

Some Conference

SCALE 10x

LCA 2012

Testing tools

• Your favorite test framework

• Repeatable shell scripts

• Staging environments

Page 28: Mistakes were made - LCA 2012

SCALE 10x

LCA 2012

Failure to verify.

Page 29: Mistakes were made - LCA 2012

LCA 2012

“What does ‘-d’ actually do?”

Page 30: Mistakes were made - LCA 2012

Some Conference

SCALE 10x

LCA 2012

Prevent verification failures.

• Have a plan for things going wrong.

• Have a staging environment.

• Test your rollback plan, not just your implementation plan.

Page 31: Mistakes were made - LCA 2012

Some Conference

SCALE 10x

LCA 2012

Verification tools

• Staging environments

• Your buddy

Page 32: Mistakes were made - LCA 2012

LCA 2012

Failure to imagine.

Page 33: Mistakes were made - LCA 2012

LCA 2012

For my group the bottom line was

"don't trust anyone".

Thanks, Maggie!

Page 34: Mistakes were made - LCA 2012

Some Conference

SCALE 10x

LCA 2012

Recover from failures to imagine.

• Share your stories of failure.

• Talk with people who are different from you.

• Act out implementation scenarios.

Page 35: Mistakes were made - LCA 2012

LCA 2012

Failure to implement.

Page 36: Mistakes were made - LCA 2012

Some Conference

SCALE 10x

LCA 2012

Re-implement.

• Learn from mistakes.

Page 37: Mistakes were made - LCA 2012

LCA 2012

Reflection.(or, the Post-Mortem)

Page 38: Mistakes were made - LCA 2012

Some Conference

SCALE 10x

LCA 2012

Before

• Plan to do a post-mortem.

• Document the plan with numbered steps and a timeline.

• Test the plan and the rollback plan.

• Identify a “point of no return”.

Page 39: Mistakes were made - LCA 2012

Some Conference

SCALE 10x

LCA 2012

During

• Screen sharing: UNIX screen, VNC, etc.

• Chatroom: IRC, AIM, Campfire (scrollback!)

• Voice: Campfire, Skype, VOIP, POTS call line

• Headsets!

• Designated time-keeper.

Page 40: Mistakes were made - LCA 2012

Some Conference

SCALE 10x

LCA 2012

After

• Documentation updates

• Post-mortem to identify areas of success and areas for improvement.

• Limit improvements to 1-2 things.

Page 41: Mistakes were made - LCA 2012

LCA 2012

Plan for the worst.Minimize risk.Fail.Recover, gracefully.

Page 42: Mistakes were made - LCA 2012

Some Conference

SCALE 10x

LCA 2012

Thanks!

Page 43: Mistakes were made - LCA 2012

Some Conference

SCALE 10x

LCA 2012

Mistakes were madeSelena Deckelmann

[email protected]/IRC: @selenamarie

Page 44: Mistakes were made - LCA 2012

Some Conference

SCALE 10x

LCA 2012

Photo credits

• Flickr: sheepguardingllama