ERR_AI_OVERLOADED - AI Overloaded
HTTP Status: 503
Retryable: Yes
Automatic retry: The callWithRetry utility retries up to 3 times with exponential backoff. This error surfaces only after all retries are exhausted.
What the User Sees
AI service is temporarily unavailable due to high demand. (ERR_AI_OVERLOADED)
What Causes This Error
Anthropic returns HTTP 529 ("overloaded") when their infrastructure is under heavy load. This is fundamentally different from rate limiting (429): it affects all customers globally, not just our API key. The 529 status code is Anthropic-specific and indicates their servers cannot accept new requests at all. The service reports the condition to its own clients as HTTP 503.
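The distinction between the two upstream statuses can be sketched as a small lookup. This is only an illustration: the function name classifyUpstreamStatus and the AiErrorCode type are hypothetical and not taken from the service's code; only the two error codes and the 529/429 statuses come from this page.

```typescript
// Error codes named in this document.
type AiErrorCode = "ERR_AI_OVERLOADED" | "ERR_AI_RATE_LIMITED";

// Hypothetical helper: map an upstream Anthropic HTTP status to an error code.
// Returns null for statuses this page does not cover.
function classifyUpstreamStatus(upstreamStatus: number): AiErrorCode | null {
  switch (upstreamStatus) {
    case 529: // provider-wide overload, affects all customers
      return "ERR_AI_OVERLOADED";
    case 429: // per-key rate limiting
      return "ERR_AI_RATE_LIMITED";
    default:
      return null;
  }
}
```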
This typically occurs during periods of extremely high demand across the Anthropic platform, or during partial infrastructure incidents. The condition is usually temporary, resolving within minutes, but can last longer during major incidents.
The system automatically retries these requests via callWithRetry with exponential backoff. The user only sees this error if all 3 retry attempts fail, meaning the overload condition persisted for the full retry window (typically 30-60 seconds depending on backoff timing).
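The retry behaviour described above might look roughly like the following. This is a minimal sketch, not the actual callWithRetry implementation: the name callWithRetrySketch, the RetryOptions shape, the baseDelayMs value, and the reading of "3 retries" as one initial attempt plus three retries are all assumptions.

```typescript
interface RetryOptions {
  maxRetries: number; // 3 according to this page
  baseDelayMs: number; // assumed; the real base delay is not documented here
  sleep?: (ms: number) => Promise<void>; // injectable so tests need not wait
}

const defaultSleep = (ms: number): Promise<void> =>
  new Promise<void>((resolve) => setTimeout(resolve, ms));

// Assumption: upstream failures are thrown as objects with a numeric `status`.
function isOverloaded(err: unknown): boolean {
  return typeof err === "object" && err !== null &&
    (err as { status?: unknown }).status === 529;
}

async function callWithRetrySketch<T>(
  fn: () => Promise<T>,
  opts: RetryOptions,
): Promise<T> {
  const sleep = opts.sleep || defaultSleep;
  for (let attempt = 0; ; attempt++) {
    try {
      return await fn();
    } catch (err) {
      // Give up on non-overload errors or once all retries are used.
      if (!isOverloaded(err) || attempt >= opts.maxRetries) throw err;
      // Exponential backoff: baseDelayMs, then 2x, then 4x.
      await sleep(opts.baseDelayMs * Math.pow(2, attempt));
    }
  }
}
```

With an assumed baseDelayMs of 5000, the waits are 5 s, 10 s, and 20 s, for 35 s in total before the error surfaces, which would be consistent with the 30-60 second window mentioned above.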
Unlike rate limiting, there is nothing the service can do to reduce the likelihood of this error; it is entirely dependent on the provider's infrastructure health.
Resolution Steps
For Users
- Wait a few minutes and try again. This is a temporary condition with the AI provider.
- Try the "budget" quality tier, which uses Google Gemini instead of Anthropic. Gemini may not be affected by an Anthropic outage.
- Check https://status.anthropic.com to see if there is an ongoing incident.
- If the error persists for more than 15 minutes, consider trying again later.
For Administrators
- Check https://status.anthropic.com for active incidents or degraded performance notices.
- If the outage is prolonged (more than 30 minutes), consider temporarily routing all conversion traffic through Gemini by enabling budgetMode in the smart cascade converter configuration.
- Monitor the Grafana dashboard for the frequency of 529 errors. A spike that correlates with Anthropic's status page confirms the root cause.
- No action is needed to "fix" this: the condition resolves when Anthropic's infrastructure recovers.
- If users are reporting this frequently but Anthropic's status page shows no issues, the problem may actually be rate limiting (ERR_AI_RATE_LIMITED). Check the raw HTTP status codes in the logs.
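When checking raw status codes, a quick tally of 529 versus 429 responses can help tell overload apart from rate limiting. The sketch below assumes a hypothetical log format of one JSON object per line with a numeric upstreamStatus field; the real log schema is not described on this page, so the field name and the function tallyUpstreamStatuses are illustrative only.

```typescript
// Count occurrences of each upstream HTTP status in a set of log lines.
// Assumed log shape: one JSON object per line, e.g. {"upstreamStatus": 529}.
function tallyUpstreamStatuses(logLines: string[]): Map<number, number> {
  const counts = new Map<number, number>();
  for (const line of logLines) {
    let entry: unknown;
    try {
      entry = JSON.parse(line);
    } catch {
      continue; // skip lines that are not valid JSON
    }
    const status = (entry as { upstreamStatus?: unknown } | null)?.upstreamStatus;
    if (typeof status === "number") {
      counts.set(status, (counts.get(status) ?? 0) + 1);
    }
  }
  return counts;
}
```

If 429 dominates the tally while Anthropic's status page is clear, rate limiting is the more likely cause; a high count of 529 points back to provider overload.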
Related Errors
- ERR_AI_RATE_LIMITED - Per-key rate limiting (429), different from infrastructure overload