Goal-Based Agents
Goal-based agents do not just react. They choose actions that help reach a target state.
This is the "planner" mindset: define a goal, evaluate possible next steps, and execute the best path.
How it works (architecture)
- Define goal and constraints.
- Generate plan candidates.
- Select next action toward goal.
- Execute and observe outcome.
- Re-plan until goal is reached or aborted.
Typical examples of goals
- "Create a market brief in 30 minutes with sources."
- "Automate invoice matching with less than 1% error."
- "Book meetings with qualified leads this week."
Best use cases
- Task completion workflows
- Planning across multiple steps
- Structured assistant behavior
- Research and synthesis pipelines
- Agentic coding and refactor workflows
Trade-offs
- Needs clear goal definition
- Planning cost can grow quickly
- Poor goals create confidently wrong behavior
Real-world company and service examples
- OpenAI - goal-driven assistants built with the Responses API + tools (web search, files, code execution, external APIs).
Approximate API range: $0.20/$1.25 to $2.50/$15 per 1M input/output tokens for many standard model tiers.
- Anthropic Claude - planning workflows with tool use for research, drafting, and operations support.
Approximate API range: often $1/$5 to $3/$15 per 1M input/output tokens for common tiers.
- Microsoft Copilot / GitHub Copilot style task completion flows (goal = "produce result, not just answer").
Approximate pricing: commonly $10-$39 per user/month depending on product tier.
Practical pricing references (consumer + API)
- ChatGPT Plus: about $20/month (web app subscription; API billed separately).
- Claude Pro: about $20/month monthly, or around $17/month annual equivalent in some regions.
- API cost can be lower than subscriptions for light usage, but higher for heavy automated workloads.
How to make goal-based agents reliable
- Write explicit success criteria ("done means X").
- Enforce budget caps (tokens, time, tool calls).
- Add checkpoints after every major step.
- Require citations/evidence for factual tasks.
- Add human approval for high-risk actions.
When to avoid
Avoid pure goal-based setups when:
- You need strict optimization under multiple conflicting objectives (use utility-based).
- Environment changes continuously and model must adapt from feedback (add learning components).