Henry Ford once said, “The only real mistake is the one from which we learn nothing.” So how can we learn the most from system failures? This session will move beyond “blameless” postmortems to show how we can use data to avoid & mitigate future failures.
I’ll cover best practices for gathering systems-related data and people-related data, and how to use that data to formulate actionable response plans & avoid repeating failures.
Jason is a technical writer and evangelist at Datadog, where he works to inspire developers and ops engineers with the power of metrics and monitoring. He’s also a co-organizer of DevOpsDays Portland. When he’s not speaking at conferences or helping organize them, he likes to spend time on planes “travel hacking” and hunting for interesting regional whiskies.