Toronto 2016 - Proposal

Platinum sponsors

Back to proposals overview - program

Public Postmortem on After Hours Maintenance Gone Awry

Abstract:

We needed to add some capacity to a Redis cluster, which would have customer impact, so it was scheduled for after hours. 20 minutes into a 30 minute window, wanted to close open connections, so we stopped a bunch of services. One of them had had some init changes recently, and it turns out they weren't tested and made it to production. What do you do when you're an Admin but the problem is dev code and it's 1am?

Speaker:

Jason Shaw, @jasonious

Long time Linux SysAdmin with too much empathy for the BOfH style. Currently working on continuous integration and deployment at FreshBooks.

blog comments powered by Disqus
Deloitte


Gold sponsors

Pivotal Blended Perspectives Chef PagerDuty GitHub ThoughtWorks Shopify TriNimbus NewRelic VictorOps
Become a Gold Sponsor!

Silver sponsors


Be the first to become a Silver Sponsor!

Bronze sponsors


Be the first to become a Bronze Sponsor!

Lunch sponsors

telus
Become a Lunch Sponsor!

Media sponsors

O
Become a Media Sponsor!