Status: Resolved
Start: 23 Apr 2025 00:18 UTC | End: 23 Apr 2025 01:47 UTC
Total duration: 1 h 29 m
Description
Between 00:18 UTC and 01:47 UTC our Enterprise US ingestion endpoints were unable to accept new connections. End-users experienced HTTP 503 errors and some real-time events during this window were not recorded.
Root Cause
A scheduled down-scaling event reduced the cluster node size. The extra connections that were safely handed off to the remaining nodes exceeded the VMs' maxi...