Lean Operations: Optimising Incident Management
There are a ton of best practices in the operations world. But what really makes sense for your team? The reality is that one size *doesn't* fit all. Lean is a framework for designing and optimising processes. Let's apply it to a typical incident management process and see what comes of it.
Outline/Structure of the Talk
Lean Thinking is an analysis framework that has revolutionised various domains like manufacturing and software development. It's principles are:
Optimize the whole
Build quality in
Deliver fast by managing flow
We will look at each of these principles and apply them to the incident management lifecycle. At the end of this we will see *why* certain best practises make sense, practices that aren't common but should be and common practises which are probably not very helpful
At the end of this we will see *why* certain best practices make sense, practices that aren't common but should be and other common practices which are probably not very helpful in certain use-cases.
This will also give clarity on how to set-up a good incident response process to ensure balancing feature velocity while maintaining system reliability.
Anyone interested in learning how to optimise the incident response & resolution process to maximise uptime & overall system reliability.