Previous incidents
Elevated Error Rates on Agents
Resolved May 19, 2025 at 9:02pm UTC
The fix for the server-side errors of the TCE agent has been successfully deployed and server-side error rates on the TCE agent have returned back to normal.
Contextual's team is continuing to monitor system performance and will follow up with a full root cause analysis and corrective actions.
2 previous updates
Maintenance resulting in downtime
Resolved May 2, 2025 at 5:42am UTC
At 9:31 PM PST, cluster maintenance on the Dragon cluster resulted in unexpected downtime and all agents being unavailable.
The incident was mitigated by Contextual on-call engineers and all agents returned to proper function at 10:35 PM PST.
Maintenance Resulting in Downtime
Resolved Apr 11, 2025 at 12:05am UTC
Incident has been mitigated.
2 previous updates
Degraded performance
Resolved Mar 31, 2025 at 1:27am UTC
Incident resolved.
2 previous updates
Frontend Authorization Unavailable
Resolved Mar 25, 2025 at 7:50pm UTC
The incident has been mitigated by the Auth0 team and authentication is back to full availability.
2 previous updates
Unavailability across CE and QCL Agents
Resolved Mar 4, 2025 at 10:08pm UTC
The issue has been root-caused to degraded model infrastructure. Infrastructure has been reset and restored, and availability has been restored across all agents.
We are continuing to monitor agent performance and investigate the full root cause to ensure no recurrences occur in the future.
1 previous update