Utility-Based Agents

Utility-based agents rank options by preference, cost, risk, or reward and then choose the highest-utility action.

Unlike goal-based agents (which ask "does this reach the goal?"), utility-based agents ask "which option gives the best overall value?"

How it works (architecture)

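The agent scores every candidate action with a weighted formula and picks the one with the highest score. For example:

utility = 0.5 * quality + 0.3 * speed + 0.2 * cost_efficiency - risk_penalty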

You can tune weights for each business context.
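
The weighted formula above can be sketched in a few lines of Python. This is a minimal illustration, not a reference implementation: the Option type, the WEIGHTS profiles (including the "cost_sensitive" profile), and all sample scores are hypothetical, and every input is assumed to be normalized to the range 0 to 1.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Option:
    """A candidate action with scores assumed to be normalized to 0..1."""
    name: str
    quality: float
    speed: float
    cost_efficiency: float
    risk_penalty: float  # subtracted from the weighted sum as-is


# Hypothetical weight profiles: (quality, speed, cost_efficiency).
# "default" mirrors the formula in the text; "cost_sensitive" is made up.
WEIGHTS = {
    "default": (0.5, 0.3, 0.2),
    "cost_sensitive": (0.3, 0.2, 0.5),
}


def utility(option: Option, context: str = "default") -> float:
    """Weighted sum of the three scores minus the risk penalty."""
    w_quality, w_speed, w_cost = WEIGHTS[context]
    return (
        w_quality * option.quality
        + w_speed * option.speed
        + w_cost * option.cost_efficiency
        - option.risk_penalty
    )


def choose(options: list[Option], context: str = "default") -> Option:
    """Return the option with the highest utility in the given context."""
    return max(options, key=lambda o: utility(o, context))


options = [
    Option("premium", quality=0.95, speed=0.6, cost_efficiency=0.3, risk_penalty=0.02),
    Option("budget", quality=0.6, speed=0.8, cost_efficiency=0.9, risk_penalty=0.10),
]

print(choose(options).name)                    # default weights
print(choose(options, "cost_sensitive").name)  # cost-heavy weights
```

With these sample numbers the default weights favor the "premium" option, while the cost-heavy profile flips the choice to "budget". The candidates are identical in both runs; only the weights change.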

Examples in practice

OpenAI / Anthropic model routing in production stacks - choose a model tier per request based on cost, latency, and quality targets.
Approximate API pricing (input/output, per 1M tokens): OpenAI often ranges from $0.20/$1.25 up to $2.50/$15; Anthropic often ranges from $1/$5 to $3/$15 for common tiers.
Cloud contact center AI - route conversations by value and risk (self-serve vs. human escalation).
Approximate pricing: usually seat plus usage bundles, frequently $30-$150+ per seat per month depending on the platform.
Commerce recommendation systems - maximize conversion while controlling margin and return risk.
Cost profile: usually infrastructure plus model usage; highly variable with traffic.
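
The model-routing case can be sketched as a constrained choice: pick the cheapest tier that still meets the request's quality and latency targets. This is a simpler variant than a weighted sum and is only one way to do it. The tier names, quality scores, and latencies below are hypothetical; the prices are illustrative values taken from the approximate ranges quoted above, not current list prices.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Tier:
    name: str
    input_price: float   # USD per 1M input tokens (illustrative)
    output_price: float  # USD per 1M output tokens (illustrative)
    quality: float       # hypothetical 0..1 quality score
    latency_s: float     # hypothetical typical latency in seconds


# Hypothetical tiers; replace with measured values for a real system.
TIERS = [
    Tier("small", input_price=1.00, output_price=5.00, quality=0.70, latency_s=1.0),
    Tier("large", input_price=3.00, output_price=15.00, quality=0.90, latency_s=3.0),
]


def request_cost(tier: Tier, input_tokens: int, output_tokens: int) -> float:
    """Estimated USD cost of one request at per-1M-token prices."""
    return (
        input_tokens * tier.input_price + output_tokens * tier.output_price
    ) / 1_000_000


def route(
    input_tokens: int,
    output_tokens: int,
    min_quality: float,
    max_latency_s: float,
) -> Optional[Tier]:
    """Cheapest tier that meets both targets, or None if none qualifies."""
    eligible = [
        t for t in TIERS
        if t.quality >= min_quality and t.latency_s <= max_latency_s
    ]
    if not eligible:
        return None
    return min(eligible, key=lambda t: request_cost(t, input_tokens, output_tokens))


# A request with 2,000 input tokens and 500 output tokens.
easy = route(2_000, 500, min_quality=0.6, max_latency_s=5.0)
hard = route(2_000, 500, min_quality=0.85, max_latency_s=5.0)
print(easy.name, request_cost(easy, 2_000, 500))
print(hard.name, request_cost(hard, 2_000, 500))
```

Returning None when no tier qualifies leaves the fallback decision (relax a target, queue the request, or escalate) to the caller instead of silently picking a tier that misses the targets.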

Common pitfalls

Optimizing one metric while damaging another (for example, lowest cost but poor user outcomes).
Utility gaming (the agent learns shortcuts that inflate the score but hurt business value).
Missing penalties for compliance or brand risk.
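
One possible guard against the first and third pitfalls is to combine a hard quality floor with explicit risk penalties, so that a cheap but poor option can never win on cost alone. The sketch below assumes hypothetical names and values throughout (MIN_QUALITY, RISK_WEIGHTS, the 0.7/0.3 base weights, and the sample candidates); it does not address utility gaming, which usually needs monitoring against real business outcomes.

```python
import math

# Hypothetical guardrail settings; tune per business context.
MIN_QUALITY = 0.6
RISK_WEIGHTS = {"compliance": 2.0, "brand": 1.0}


def guarded_utility(quality: float, cost_efficiency: float, risks: dict) -> float:
    """Utility with a hard quality floor and weighted risk penalties.

    `risks` maps a risk name to an estimated 0..1 level; missing
    entries count as zero.
    """
    if quality < MIN_QUALITY:
        # Hard constraint: below the floor, no cost saving can compensate.
        return -math.inf
    base = 0.7 * quality + 0.3 * cost_efficiency
    penalty = sum(
        weight * risks.get(name, 0.0) for name, weight in RISK_WEIGHTS.items()
    )
    return base - penalty


candidates = {
    "cheapest": dict(quality=0.4, cost_efficiency=1.0, risks={}),
    "risky": dict(quality=0.9, cost_efficiency=0.8, risks={"compliance": 0.3}),
    "balanced": dict(quality=0.8, cost_efficiency=0.6, risks={"brand": 0.05}),
}

best = max(candidates, key=lambda name: guarded_utility(**candidates[name]))
print(best)
```

With these sample values the "cheapest" candidate is excluded by the floor, and the "risky" candidate loses to "balanced" once its compliance penalty is applied.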