You put out the fire on production, what’s next?
September 28, 2023
Sh*t happens: releases don't always go as planned, and production systems sometimes break. Whether it's a bug in your code, a bug in a library you use, or an infrastructure or hardware failure, the first thing you need to do is bring the system back to life with minimal damage.
But what happens next? Should we grab a coffee and carry on with our daily tasks?
Have you heard about a ritual called "Post Mortem Analysis"? It seems scary, but it's incredibly valuable when done properly, and that's what I'd like to talk about: what it is, how to conduct such an analysis, who should and who shouldn't be involved, what to watch for along the way, and what its ultimate goals are. After all, you don't want to end up with the same production outage tomorrow, do you?