Skip to content

Latest commit

 

History

History

0x19-postmortem

Folders and files

NameName
Last commit message
Last commit date

parent directory

..
 
 
 
 

img

Postmoterm/ Incident report

there-was-an-incident

Concepts

For this project, we expect you to look at this concept

About

An incident report postmortem is a retrospective analysis of a specific incident or problem that occurred in a system, service, or organization. The goal of the postmortem is to identify the root cause of the incident, to document the findings, and to develop recommendations for preventing similar incidents in the future.

Background context

incident-report

Any software system will eventually fail, and that failure can come stem from a wide range of possible factors: bugs, traffic spikes, security issues, hardware failures, natural disasters, human error… Failing is normal and failing is actually a great opportunity to learn and improve. Any great Software Engineer must learn from his/her mistakes to make sure that they won’t happen again. Failing is fine, but failing twice because of the same issue is not.

A postmortem is a tool widely used in the tech industry. After any outage, the team(s) in charge of the system will write a summary that has 2 main goals:

  • To provide the rest of the company’s employees easy access to information detailing the cause of the outage. Often outages can have a huge impact on a company, so managers and executives have to understand what happened and how it will impact their work.
  • And to ensure that the root cause(s) of the outage has been discovered and that measures are taken to make sure it will be fixed.

Resources

Read or watch:

  1. Incident report, also known as postmoterm
  2. The importance of an incident postmortem process
  3. What is an Incident Postmortem?