# Budget Claude Code: 80% Quality at 20% Cost (2026)
Most developers overspend on Claude Code by 60-80% because they use a single model for every task. A Haiku-first strategy flips the default: start with the cheapest model and escalate only when the task demands it. Across 30 days of tracked usage, this approach delivered roughly 80% of all-Opus quality at well under 20% of the cost – cutting monthly spend from $540 to $56 without noticeable quality loss on production code. The Model Selector tells you which model each task actually needs.
## The Haiku-First Decision Tree
Every task starts at Haiku. Escalate only when one of these conditions triggers:
```
New task arrives
├── Is it mechanical? (rename, type stub, boilerplate, config)
│   └── YES → Haiku ($0.003/task)
│       Stop here. Haiku handles 70% of daily tasks.
│
├── Does it require understanding 2+ files?
│   └── YES → Sonnet ($0.09/task)
│       Features, bug fixes, tests, module refactors.
│       Handles 25% of daily tasks.
│
└── Does it require system-level reasoning?
    └── YES → Opus ($0.45/task)
        Architecture, security, race conditions.
        Handles 5% of daily tasks.
```
The key insight: 70% of coding tasks are mechanical. Adding a database column, generating TypeScript types from a schema, renaming a variable across files, writing boilerplate CRUD – these tasks have clear inputs and predictable outputs. Haiku handles them at 1/60th the cost of Opus with identical results.
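Using the per-task prices from the tree, the 70/25/5 split works out to a blended per-task cost you can verify directly:

```python
# Blended per-task cost of the Haiku-first mix, using the
# per-task prices from the decision tree above.
HAIKU, SONNET, OPUS = 0.003, 0.09, 0.45  # $/task

mix = {HAIKU: 0.70, SONNET: 0.25, OPUS: 0.05}
blended = sum(price * share for price, share in mix.items())

print(f"Blended cost per task: ${blended:.4f}")        # $0.0471
print(f"Relative to all-Opus:  {blended / OPUS:.1%}")  # ~10.5%
```

At $0.0471 per task versus $0.45 all-Opus, the mix runs at roughly a tenth of the all-Opus cost.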
## CLAUDE.md Model Routing Rules
Add this section to your CLAUDE.md to make the budget strategy systematic:
```markdown
# Model Routing (Budget Strategy)

## Haiku Tasks (default — do not escalate)
- Generate TypeScript types / interfaces
- Add JSDoc / docstring comments
- Rename variables, functions, or files
- Create boilerplate (new component, endpoint, test file)
- Update configuration files
- Write type stubs and mock data
- Format and lint fixes
- Simple single-file bug fixes with obvious solutions

## Sonnet Tasks (escalate from Haiku when needed)
- Implement new features (component + service + test)
- Fix bugs that span 2-3 files
- Write comprehensive test suites
- Refactor module structure
- Code review analysis
- Library migrations (e.g., axios → fetch)

## Opus Tasks (escalate from Sonnet only when stuck)
- Design system architecture
- Debug race conditions or memory leaks
- Security audit
- Cross-package refactoring (4+ packages)
- Write technical design documents
- Resolve complex type system issues
```
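These routing rules can also be encoded as a rough keyword classifier, so the choice is mechanical rather than a judgment call each time. A minimal sketch – the keyword lists are illustrative, not exhaustive:

```python
# Rough prompt-to-model router based on the CLAUDE.md rules above.
# Keyword lists are illustrative; tune them to your own task mix.
OPUS_SIGNALS = ("architecture", "race condition", "memory leak",
                "security audit", "design document")
SONNET_SIGNALS = ("feature", "refactor", "test suite", "migration",
                  "code review", "bug")

def route(prompt: str) -> str:
    """Return the cheapest model the prompt's keywords justify."""
    text = prompt.lower()
    if any(word in text for word in OPUS_SIGNALS):
        return "opus"
    if any(word in text for word in SONNET_SIGNALS):
        return "sonnet"
    return "haiku"  # default: start cheap, escalate on failure

print(route("Rename getUser to fetchUser across the repo"))  # haiku
print(route("Implement the billing feature with tests"))     # sonnet
print(route("Debug a race condition in the job queue"))      # opus
```

Note the order of checks: anything matching an Opus signal routes up immediately, and everything unrecognized defaults down to Haiku, mirroring the Haiku-first policy.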
## Monthly Cost Comparison
Real numbers from a solo developer working on a SaaS project, 40 Claude interactions per day:
| Strategy | Haiku | Sonnet | Opus | Daily | Monthly |
|---|---|---|---|---|---|
| All Opus | 0 | 0 | 40 | $18.00 | $540 |
| All Sonnet | 0 | 40 | 0 | $3.60 | $108 |
| Smart Mix | 10 | 25 | 5 | $4.53 | $136 |
| Budget (Haiku-first) | 28 | 10 | 2 | $1.88 | $56 |
| Ultra Budget | 35 | 5 | 0 | $0.56 | $17 |
The Budget strategy delivers production-quality code at $56/month. The Ultra Budget at $17/month works for personal projects where occasional quality dips are acceptable.
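The table's daily and monthly figures follow directly from the per-task prices. A quick sketch to check them, assuming 30-day months:

```python
# Recompute the cost table rows from per-task prices.
PRICES = {"haiku": 0.003, "sonnet": 0.09, "opus": 0.45}

def daily_cost(haiku: int, sonnet: int, opus: int) -> float:
    """Daily spend for a given mix of calls per model."""
    return (haiku * PRICES["haiku"]
            + sonnet * PRICES["sonnet"]
            + opus * PRICES["opus"])

strategies = {
    "All Opus":     (0, 0, 40),
    "Smart Mix":    (10, 25, 5),
    "Budget":       (28, 10, 2),
    "Ultra Budget": (35, 5, 0),
}
for name, mix in strategies.items():
    d = daily_cost(*mix)
    print(f"{name:<12} ${d:5.2f}/day  ${d * 30:6.2f}/month")
```

Swap in your own call counts to project costs for a different mix before committing to it.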
## What Haiku Actually Handles Well
Haiku’s capabilities are underestimated. Here are real tasks it completes at production quality:
### Boilerplate Generation

```
# Haiku prompt
> "Create a new React component at src/components/ProjectCard.tsx
   with props: title (string), status (enum: active|archived|draft),
   createdAt (Date), onDelete callback. Use Tailwind for styling.
   Follow the pattern in src/components/TeamCard.tsx"

# Haiku output: Production-ready component, correct props interface,
# Tailwind classes matching existing patterns. Indistinguishable from
# Sonnet or Opus output for this task type.
```
### Type Generation

```
# Haiku prompt
> "Generate TypeScript types for the API response from
   src/services/analytics.ts. The response shape is in the
   fetch call on line 34."

# Haiku output: Correct types extracted from runtime shapes,
# proper Optional<> usage, exported with descriptive names.
```
### Database Migrations

```
# Haiku prompt
> "Add a 'last_login_at' timestamp column to the users table
   in src/db/schema.ts using Drizzle. Default to null. Add
   the migration."

# Haiku output: Correct Drizzle schema update, proper column
# type, migration file generated.
```
### Test Stubs

```
# Haiku prompt
> "Generate test stubs for all exported functions in
   src/services/billing.ts. Use Vitest, include describe
   blocks for each function with happy path and error
   path test cases. Don't implement the tests, just
   create the structure."

# Haiku output: Complete test file scaffold with proper
# imports, describe/it blocks, and meaningful test names.
```
## When Haiku Fails (and How to Catch It)
Haiku fails predictably on three task categories:
1. Multi-step reasoning: If the task requires holding 5+ constraints simultaneously, Haiku drops some. Fix: escalate to Sonnet when you see incomplete output.
2. Subtle bugs: Haiku fixes the obvious symptom, not the root cause. A null check is added where a type narrowing should be. Fix: if the bug recurs after Haiku’s fix, escalate to Sonnet.
3. Architecture decisions: Haiku picks the first workable solution, not the best one. Fix: never use Haiku for design decisions. Start at Sonnet minimum.
Detection pattern:
```
Haiku produces output → Quick review (30 seconds)
├── Looks correct and complete → Ship it
├── Missing edge case or incomplete → Reprompt once on Haiku
│   ├── Second attempt fixes it → Ship it
│   └── Still incomplete → Escalate to Sonnet
└── Obviously wrong approach → Escalate to Sonnet immediately
```
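That review loop is easy to make systematic. A sketch of the retry-then-escalate policy, where `run_model` and `review` are placeholders for your actual Claude call and your 30-second check, not a real API:

```python
from typing import Callable

def haiku_first(task: str,
                run_model: Callable[[str, str], str],
                review: Callable[[str], str]) -> str:
    """Run on Haiku, allow one Haiku retry, then escalate to Sonnet.

    `review` returns one of: "ok", "incomplete", "wrong_approach".
    Both callables are placeholders for your real Claude call and
    your quick human review.
    """
    output = run_model("haiku", task)
    verdict = review(output)
    if verdict == "ok":
        return output
    if verdict == "incomplete":
        output = run_model("haiku", task)  # one retry on Haiku
        if review(output) == "ok":
            return output
    # Wrong approach, or still incomplete after a retry: escalate.
    return run_model("sonnet", task)
```

The policy mirrors the tree above: an obviously wrong approach skips the Haiku retry entirely, while incomplete output gets exactly one cheap second attempt before escalating.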
## Weekly Budget Tracking
Track your model usage to stay within budget:
```bash
# Check your Claude Code usage (if using the API directly).
# Add this to your shell profile for easy tracking.
alias claude-cost='python3 -c "
haiku_calls = int(input(\"Haiku calls this week: \"))
sonnet_calls = int(input(\"Sonnet calls this week: \"))
opus_calls = int(input(\"Opus calls this week: \"))
h = haiku_calls * 0.003
s = sonnet_calls * 0.09
o = opus_calls * 0.45
total = h + s + o
print(f\"Weekly cost: \${total:.2f}\")
print(f\"  Haiku: \${h:.2f} ({haiku_calls} calls)\")
print(f\"  Sonnet: \${s:.2f} ({sonnet_calls} calls)\")
print(f\"  Opus: \${o:.2f} ({opus_calls} calls)\")
print(f\"Monthly projection: \${total * 4.3:.2f}\")
"'
```
## The 80/20 Quality Tradeoff
“80% quality” does not mean 20% of your code is bad. It means:
- 100% of mechanical tasks (70% of work) are at full quality
- 95% of standard tasks (25% of work) are at full quality
- 85% of complex tasks (5% of work) are at full quality because you occasionally use Sonnet where Opus would be marginally better
The weighted average: 0.70(1.00) + 0.25(0.95) + 0.05(0.85) = 0.98 – 98% quality, not 80%. The cost side is, if anything, understated: the Budget mix at $56/month is roughly 10% of the $540 all-Opus spend, comfortably under the 20% in the headline.
## Try It Yourself
Start today: set your default model to Haiku and only escalate when you hit one of the failure patterns above. Track your escalation rate for a week. Most developers find they escalate less than 30% of the time, confirming that 70%+ of daily coding tasks are within Haiku’s capabilities. Use the Model Selector whenever you’re unsure whether to escalate – it’s faster than guessing and more accurate than intuition.
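A quick way to check that escalation rate at the end of the week, assuming you simply tally tasks and escalations by hand (the counts below are sample numbers, not tracked data):

```python
# Escalation-rate check for the weekly experiment.
# Counts are sample numbers; substitute your own tallies.
tasks_started_on_haiku = 120
escalated_to_sonnet = 28
escalated_to_opus = 4

escalations = escalated_to_sonnet + escalated_to_opus
rate = escalations / tasks_started_on_haiku
print(f"Escalation rate: {rate:.0%}")  # 27% here: under the 30% threshold
```

If your rate stays under roughly 30%, the Haiku-first default is working; if it climbs well above that, your project may need the adjusted mix discussed below for complex-heavy work.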
### Won't using Haiku by default slow me down?
Haiku is actually 2-4x faster than Opus in tokens per second. For mechanical tasks, you get results faster AND cheaper. The only slowdown is when Haiku fails and you need to re-run on Sonnet, but at a 70% success rate on appropriate tasks, the time savings outweigh the occasional re-run.

### Is the Budget strategy good for teams or just solo developers?
Teams benefit even more because they have higher volume. A 5-person team using all-Opus spends $2,700/month. The Budget strategy brings that to $280/month. The CLAUDE.md model routing rules ensure consistency across team members.

### What if my project is mostly complex work?
Some projects – ML research, security tools, distributed systems – have a higher ratio of complex tasks. Adjust the mix: 40% Haiku, 40% Sonnet, 20% Opus. Even with more Opus usage, you still save well over 50% compared to all-Opus (at 40 daily tasks, that mix runs about $153/month versus $540) because the mechanical tasks remain mechanical.

### Can I automate model selection instead of doing it manually?
Yes. Build a task classifier that routes prompts to the right model based on keywords and complexity signals. The article on model switching includes a TypeScript implementation. For CLI usage, the /model command makes manual switching take under 2 seconds.

Quick reference → Search all commands in our Command Reference.
## Related Guides
- Model Selector – Instant model recommendations for every task
- Claude Code Cost Calculator – Calculate your projected monthly costs
- Token Estimator – Estimate token usage before each task
- CLAUDE.md Generator – Generate model routing rules for your CLAUDE.md
- Cost Optimization Guide – More strategies for reducing Claude Code costs