Glossary
What is Agentic Resource Exhaustion?
Definition — Agentic Resource Exhaustion (ARE)
Agentic Resource Exhaustion is the condition in which autonomous AI agents consume computational, financial, or operational resources beyond intended or sustainable limits — through recursive reasoning loops, unbounded tool-call chains, retry storms, context-window compounding, or multi-agent coordination overhead — resulting in unplanned cost overruns, margin erosion, or service degradation.
Canonical definition. Goldfinch Economics. April 2026.
Most agent cost problems are not caused by bad models or excessive usage. They are caused by architecture decisions that leave execution unbounded. An agent that retries on every error, re-sends full conversation history on each turn, or fans out into sub-agents without a spend cap can consume orders of magnitude more resources than a well-designed equivalent.
Agentic Resource Exhaustion is predictable and preventable. The term exists to name a condition that teams encounter regularly but have no shared vocabulary to describe — and without a name, it is difficult to govern.
Six mechanisms that cause ARE
ARE manifests through six documented patterns. They can occur individually or compound — multiple mechanisms active simultaneously can produce cost incidents an order of magnitude beyond the baseline.
Documented ARE incidents
ARE is not a theoretical risk. The following incidents are drawn from public community reports and vendor post-mortems. They follow predictable mechanisms — all of which are addressable before deployment.
| Incident | Cost | Mechanism |
|---|---|---|
| Infinite loop between two agents | $47,000 | Two agents in a ping-pong loop for 11 days. No termination condition, no spend cap. |
| Data enrichment retry storm | $47,000 | 2.3 million API calls over a single weekend. Agent misinterpreted errors and retried without backoff. |
| Enterprise pipeline overspend | $40,000 | Exceeded quarterly budget despite alerts and dashboards. Monitoring detected the spend; no policy prevented it. |
| Multi-agent fan-out drain | $10,000 | Three parallel agents, each fanning to five sub-calls. 75 model invocations per top-level task. Under one week. |
| Surprise billing event | $30,000 | Root cause consistent with broad task scoping on a large input set. |
| Plan-tier quota exhaustion | ~$100/session | Users hitting monthly plan limits in under 90 minutes. Illustrates per-session unbounded execution without spend controls. |
Sources: community reports, vendor post-mortems, and public documentation. November 2025 – March 2026.
ARE compared to adjacent terms
Several related terms exist in the security and FinOps communities. ARE fills a specific gap they leave open.
| Term | Scope | Limitation |
|---|---|---|
| OWASP Unbounded Consumption (LLM10:2025) | All LLM applications | Security-framed. Does not address agent-specific mechanics or pre-deployment economics. |
| Denial of Wallet | Adversarial attacks only | Attack framing. Misses accidental exhaustion — which accounts for most real incidents. |
| Runaway agent | Engineering colloquial | Imprecise. No formal definition. No CFO-facing weight for governance conversations. |
| Cost anomaly | FinOps colloquial | Too vague. Applies to any unexpected cloud spend. Does not capture agent-specific mechanisms. |
| Agentic Resource Exhaustion | Agent-specific — accidental and adversarial | The gap filler. Agent-specific mechanisms, both adversarial and accidental causes, with formal definition suitable for governance frameworks. |
ARE is the problem. Shift-Left Costing is the solution.
Shift-Left Costing for Agentic AI is the practice of modeling costs, setting spend policies, and establishing governance controls before an agent is deployed. It is the discipline that makes ARE preventable rather than reactive.
Frequently asked questions
Score your agent costs before you deploy
Free. No account required. Takes 3 minutes.