Resolved
The incident have been resolved. At the peak there was an ~1 hour delay in processing the events.
Monitoring
A fix has been implemented. We are expecting to process the backlog of events in ~15 minutes.
Investigating
The processing pipeline is degraded in performance to the level where it basically stalls. We are still processing occurrences, but with increasing latency. We are investigating the issue.