What does /goal do differently from previous agent loops?

Earlier autonomous loops in Claude Code were either wall-clock driven (like /loop from the claude-loop plugin) or human-driven (plan step, code step, review step). /goal is the first official primitive where a fast evaluator model (Haiku) automatically checks after every turn whether a verifiable condition is met. It doesn't run on a timer or a plan — it runs until the goal is reached.

How do I write a verifiable completion condition?

Three elements. One: a single measurable end state (test result, exit code, file count, regex match). Two: a named verification method (npm test exits 0, git status is clean). Three: constraints (what must not change during the run). Bad: 'make it better'. Good: 'npm test exits 0 in the auth module, diff is limited to src/auth, no new warning in the lint output'.

Can /goal burn unlimited tokens?

Yes. /goal has no built-in token cap. One reported case on Reddit: a user burned through the entire weekly token allowance on a Max ($200/month) plan in 14 hours of overnight /goal runs on a vague goal that could never be verified. Always bake a limit directly into the condition: 'or stop after 40 turns' or 'or stop if 90 minutes elapse'. The status view (/goal with no arguments) shows current token spend — watch it during the first hour of use.

Claude Code /goal: Set a Finish Line, Walk Away, Come Back Done

You give the task. You go make coffee. You come back an hour later — Claude is done. That’s a marketing pitch every AI company has promised for five years. It’s failed five times.

This time it works better. Not perfectly — but enough to change how you work this Monday.

/goal shipped on May 11, 2026 in Claude Code 2.1.139. One widely-shared Medium piece calls it “the single most underrated AI feature of 2026”. Pasquale Pillitteri calls it “the command Codex invented and Claude Code copied in 11 days”. Both are overstated, but neither is wrong.

What `/goal` Actually Does

Default Claude Code follows one rule: one prompt → one turn → it waits for you. Even if Claude returns 800 lines of code, it stops and waits.

/goal inverts that. Instead of giving steps, you give a goal — something that can be verified. Claude works on its own across as many turns as needed. After every turn, a fast evaluator model (Haiku) reads the transcript and answers one question: is the condition met?

If no, it says why — and that sentence becomes the next instruction. If yes, /goal clears and control returns to you. No second prompt in between.

In one sentence:

You give a condition. Claude works. Haiku checks the transcript after each turn. The moment it says “yes”, the goal is done.

“
Verifiable end state is everything here. Without it, /goal is just an expensive way to sleep at the keyboard.
”

A Concrete Example

Vague goal:

/goal make this codebase better

This doesn’t work. Haiku has nothing to evaluate. Claude will keep going, the evaluator will keep saying “not measurable”, and tokens will drain. One developer on Reddit reportedly burned through their weekly Max-plan ($200/mo) token allowance in 14 hours of overnight /goal runs on a goal like this.

Measurable goal:

/goal `rg 'legacy_client'` returns no matches under services/checkout;
      `go test ./...` exits 0;
      diff touches only services/checkout;
      or stop after 40 turns

Three measurable conditions plus a hard cap. Haiku reads the transcript, finds the rg result, finds the test exit code, looks at the diff scope, and either says “yes” or names exactly what’s missing.

That’s the difference between “make it better” and “npm test exits 0 and no file outside src/auth/ changes”.

Status checkpoint

Type /goal with no arguments any time and you see:

the active condition
elapsed time
turn count
token spend
the evaluator’s last reason

That’s your early-warning system. If after ten turns the evaluator keeps saying “not yet, claude says it ran tests but didn’t show output”, the condition is badly written — stop (/goal clear) and rewrite.

What `/goal` Doesn’t Do

This matters more than the feature list. Most /goal problems are problems with how people use it, not with the model.

The evaluator can’t run a command. It only sees the transcript. If Claude writes “I ran the tests” instead of actually showing output, Haiku has nothing to grade. Force Claude to print the raw output of whatever you’re measuring — exit code, match count, file contents.

No hard token cap. /goal will keep running until the condition is satisfied or you stop it. Bake limits into the condition itself: or stop after 40 turns, or stop if 90 minutes elapse. The status view (/goal with no args) shows current spend — watch it during the first hour.

One goal per session. A new /goal replaces the previous one. No queues, no nested goals.

Trust mode required. It won’t run in an untrusted workspace. If you have disableAllHooks set, also no.

It’s not the sci-fi autonomous agent. You still need clear scope, sane sandboxing, real measurable outputs. /goal only judges whether what Claude wrote in the transcript matches what you asked for.

`/goal` vs. Other Autonomous Workflows

/goal isn’t the first attempt at autonomous Claude Code. Here’s a quick decision map:

Workflow	Trigger	Loop bounded by	When to choose
`/goal` (official, 2.1.139+)	one verifiable condition	evaluator says “yes”	Measurable end state: tests, build, diff scope, exit code.
Loop (claude-loop plugin)	`/loop` skill	wall-clock interval	Periodic action: check CI, poll an endpoint, wait for an event.
`/feature-dev` → `/recheck` (see Shift+Tab Is a Trap)	step-by-step plan	human review after implementation	Feature work where you want plan → code → verify, not autonomous iteration.
`/writing-plans` from superpowers	structured plan with checkpoints	edge cases + acceptance criteria	Migrations, refactors, infra — work where many things can go wrong at once.

Short answer: use /goal when you have a measurable end state and want to walk away. Use /feature-dev when you want a better plan and human-driven iteration. Use /loop when you need a periodic pulse, not autonomy.

And the frame from Skills vs. Agents still holds: skills first, agents next. /goal gives you a new way to build an autonomous agent out of work you used to do by hand — but it won’t turn a bad brief into a good workflow.

“
Most /goal failures aren’t model failures. They’re vaguely written finish lines.
”

What to Do This Week

Three small moves, no big program:

1. Check your version. claude --version — if it’s below 2.1.139, upgrade. /goal isn’t there.

2. Pick one task with a verifiable goal. Something you currently iterate on by hand: fix failing tests in one module, add missing type signatures, run the linter and clear warnings. The goal has to be hard-measurable — npm test exits 0, tsc --noEmit exits 0, no warning in output.

3. Issue it with a hard cap. Copy and adapt:

/goal <your condition>; or stop after 30 turns

After the first run, type /goal (no arguments) and check what it cost. Under $1 with the condition passing → you have a new workflow. Over $5 and the condition still hasn’t passed → the condition was vague, rewrite it more concretely.

If you want to figure out how to wire /goal smartly into your workflow — Claude Code, agent boundaries, MCP servers, custom skills — and walk it through a live task, 1:1 AI mentoring is exactly the right fit. One hour, a map of your setup, concrete next steps. For full teams a workshop makes more sense.

Summary

/goal is the first official Anthropic autonomous-loop primitive in Claude Code (2.1.139+, May 11, 2026).
Works on verifiable completion conditions — vague goals = expensive loops.
The evaluator is Haiku, transcript-only, can’t run a command. Force Claude to surface raw output.
Always bake a hard cap into the condition: or stop after 40 turns.
Status: /goal (no args). Cancel: /goal clear or aliases.
For the broader autonomy story, the frame from Skills vs. Agents still holds — /goal adds a tool, not a strategy.

Pick one verifiable goal. Bake the cap. Measure. Then decide whether to scale.

— Jirka

Frequently Asked Questions

What does /goal do differently from previous agent loops?: Earlier autonomous loops in Claude Code were either wall-clock driven (like /loop from the claude-loop plugin) or human-driven (plan step, code step, review step). /goal is the first official primitive where a fast evaluator model (Haiku) automatically checks after every turn whether a verifiable condition is met. It doesn't run on a timer or a plan — it runs until the goal is reached.
How do I write a verifiable completion condition?: Three elements. One: a single measurable end state (test result, exit code, file count, regex match). Two: a named verification method (npm test exits 0, git status is clean). Three: constraints (what must not change during the run). Bad: 'make it better'. Good: 'npm test exits 0 in the auth module, diff is limited to src/auth, no new warning in the lint output'.
Can /goal burn unlimited tokens?: Yes. /goal has no built-in token cap. One reported case on Reddit: a user burned through the entire weekly token allowance on a Max ($200/month) plan in 14 hours of overnight /goal runs on a vague goal that could never be verified. Always bake a limit directly into the condition: 'or stop after 40 turns' or 'or stop if 90 minutes elapse'. The status view (/goal with no arguments) shows current token spend — watch it during the first hour of use.

Claude Code /goal: Set a Finish Line, Walk Away, Come Back Done

What `/goal` Actually Does

A Concrete Example

Status checkpoint

What `/goal` Doesn’t Do

`/goal` vs. Other Autonomous Workflows

What to Do This Week

Summary

You Might Also Like

Related posts

AI review without memory is just expensive déjà vu

Shift+Tab Is a Trap: Why Your AI Plan Ignores Half Your Codebase

Why I Never Read the First Plan from Claude Code

What /goal Actually Does

A Concrete Example

Status checkpoint

What /goal Doesn’t Do

/goal vs. Other Autonomous Workflows

What to Do This Week

Summary

You Might Also Like

Related posts

AI review without memory is just expensive déjà vu

Shift+Tab Is a Trap: Why Your AI Plan Ignores Half Your Codebase

Why I Never Read the First Plan from Claude Code

What `/goal` Actually Does

What `/goal` Doesn’t Do

`/goal` vs. Other Autonomous Workflows