The Error
Extended thinking budget exceeded: used 32768 tokens of 16384 allowed
The Fix
# Increase the extended thinking token budget
claude config set thinking_budget 32768
Why This Works
Extended thinking lets Claude reason through complex problems before responding. When the allocated budget is too low, Claude hits the ceiling mid-reasoning and the request fails. In the error above, the request used 32768 tokens against a budget of 16384, so raising the budget to 32768 tokens covers that request and leaves room for multi-step reasoning while keeping costs predictable.
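To make the arithmetic concrete, here is a minimal Python sketch that reads the two numbers out of the error message and reports the smallest budget that would have covered the request. The function names are hypothetical, and the pattern is assumed from the single error line shown above; other versions of the tool may word the message differently.

```python
import re
from typing import Optional, Tuple

# Pattern assumed from the error text shown in "The Error";
# it is not taken from any official message specification.
_BUDGET_ERROR = re.compile(r"used (\d+) tokens of (\d+) allowed")


def parse_budget_error(message: str) -> Optional[Tuple[int, int]]:
    """Return (used, allowed) from a budget-exceeded message, or None if it does not match."""
    match = _BUDGET_ERROR.search(message)
    if match is None:
        return None
    return int(match.group(1)), int(match.group(2))


def minimum_budget(message: str) -> Optional[int]:
    """Return the smallest budget that would have covered the failed request.

    Assumes the "used" figure reflects what the request needed; returns None
    when the message does not match or the budget was not actually exceeded.
    """
    parsed = parse_budget_error(message)
    if parsed is None:
        return None
    used, allowed = parsed
    return used if used > allowed else None


if __name__ == "__main__":
    error = "Extended thinking budget exceeded: used 32768 tokens of 16384 allowed"
    print(minimum_budget(error))
```

Run against the error above, the sketch reports 32768, which matches the value used in the fix. Treat the result as a floor, not a recommendation: a later request may need more.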
If That Doesn’t Work
# Disable extended thinking entirely if budget errors persist
claude config set thinking_budget 0
# Or break your prompt into smaller, simpler sub-tasks
# that require less reasoning depth
If you are on a rate-limited plan, extended thinking tokens count toward your per-minute token limit. Reduce concurrent sessions or wait for the rate window to reset before retrying with a higher budget.

You can also run claude config get thinking_budget to confirm that the new value persisted. This shows the configured budget, not your token usage. If the old value still appears, check for a workspace-level config, since some of these override global settings.
Prevention
Add to your CLAUDE.md:
Extended thinking budget is set to 32768 tokens. For tasks requiring deep analysis (architecture decisions, complex refactors), use explicit step-by-step instructions to reduce the reasoning depth needed.