Incident with GitHub PR Checks

Incident Report for Legit Security

Resolved

This incident resulted from a failure in a critical queue responding to status checks, causing some status checks to get ‘stuck’​.

We resolved the specific issue that affected our queues and upgraded our queueing system configuration so that such errors won't happen again. ​

We are implementing a change so that our PR Checks feature will be resilient to queueing errors introducing a new HA component​

We applied additional monitoring to our system so that we will be able to identify similar issues before they become "critical" and affect users.
Posted Feb 06, 2024 - 05:00 UTC