Post-mortem: Downtime on November 30, 2022

After yesterday’s deployment, we faced a downtime on our reference server. We want to share with you a detailed explanation of what happened. Impact Our reference server was offline for 13 minutes. The application responded to every request with a maintenance message. No one was able to work with the API or user interface during that time. Root Causes After deploying changes to production the application could not load some of our Ruby gems anymore…
OpenSUSE Planet