On June 9, 2026, Anthropic released Claude Fable 5, the most capable model it has ever made generally available. Fable 5 is a Mythos-class model: the frontier research tier that previously lived behind a private preview, now shipped to everyone with a safeguard layer wrapped around it. It sits above Opus in the lineup, solves 80.3% of SWE-Bench Pro problems against Opus 4.8's 69.2%, holds a one million token context window, and removed several API parameters you may still be sending. This guide covers what the model does, what changed for developers, and how to turn the release into skills you can actually demonstrate.
Building toward certification?
Fable 5 is the newest model, but the architecture skills it rewards are the ones the Claude Certified Architect exam tests. Start with our Complete CCA-F Guide, keep the CCA-F Cheat Sheet handy, and measure yourself with practice tests on Preporato.
What Is Claude Fable 5?
Fable 5 is the public release of Anthropic's internal frontier tier. Anthropic has been training Mythos-class models that exceed everything in the public lineup, and until now access required a gated preview. Fable 5 is that model, identical weights, wrapped in a classifier-based safeguard system that screens requests in three high-risk areas (offensive cybersecurity, biology and chemistry, and large-scale capability distillation) and routes flagged requests to an Opus 4.8 fallback. Anthropic reports the fallback triggers in under 5% of sessions, so the overwhelming majority of users get Fable 5 answering directly.
A sibling model, Claude Mythos 5, is the same model with the cybersecurity safeguards lifted. It is restricted to vetted defenders under Anthropic's Project Glasswing program, so for nearly everyone, Fable 5 is the frontier.
Claude model lineup after June 9, 2026
| Model | API ID | Tier | Context / Max Output | Pricing (per 1M tokens in/out) |
|---|---|---|---|---|
| Claude Fable 5 | claude-fable-5 | Frontier (Mythos-class) | 1M / 128K | $10 / $50 |
| Claude Opus 4.8 | claude-opus-4-8 | Most capable Opus | 1M / 128K | $5 / $25 |
| Claude Sonnet 4.6 | claude-sonnet-4-6 | Speed and intelligence balance | 1M / 64K | $3 / $15 |
| Claude Haiku 4.5 | claude-haiku-4-5 | Fastest, cheapest | 200K / 64K | $1 / $5 |
Preparing for CCA-F? Practice with 390+ exam questions
The Capability Jump, in Numbers
The headline results from Anthropic's announcement and early access partners:
- Software engineering. 80.3% on SWE-Bench Pro (real GitHub issues, fix must pass the tests), against 69.2% for Opus 4.8. Stripe reported a 50-million-line Ruby codebase migration, normally budgeted at two months, completed in one day. Cognition measured the highest FrontierCode score among frontier models at medium effort.
- Long-horizon autonomy. Fable 5 maintains coherent work across millions of tokens and improves its own output using file-based memory: it writes notes to disk as it works and reads them back in later sessions. In Anthropic's Slay the Spire evaluation, file memory improved Fable 5 three times more than it improved Opus 4.8.
- Vision. It rebuilds web app source code from screenshots alone, reads precise values off scientific figures, and completed Pokémon FireRed through a vision-only interface where prior models needed helper tooling.
- Knowledge work and science. Highest score on Hebbia's finance benchmark, a 90% on Hex's analytics benchmark, and roughly a 10x acceleration on internal protein design tasks, with one model-generated hypothesis later corroborated by an external lab.
The pattern across all four areas is the same: the gains concentrate in long, multi-step, tool-heavy work. Which means the engineers who benefit most are the ones who already know how to structure agentic systems.
What Changed in the API
Fable 5 keeps the same request surface as Opus 4.7 and 4.8, with one addition. If you are migrating code, these are the rules:
- Adaptive thinking only.
thinking: {"type": "adaptive"}is the only on-mode. The oldbudget_tokensextended thinking returns a 400 error. - Explicit thinking-disabled is rejected. This is new in Fable 5:
thinking: {"type": "disabled"}returns a 400. If you want thinking off, omit thethinkingparameter entirely. - Sampling parameters are gone.
temperature,top_p, andtop_kall return 400 errors. Steer behavior through prompting and the effort parameter. - Effort controls depth.
output_config: {"effort": ...}runs fromlowthroughmedium,high,xhigh, andmax. High is the sensible default for intelligence-sensitive work; xhigh suits coding and agentic loops. - Prefills stay removed. Last-assistant-turn prefills return 400, as on the 4.6 generation onward. Use structured outputs (
output_config.format) instead.
Here is what the migration looks like in practice. If your code dates from the Opus 4.6 era or earlier, this call contains three things that now return 400 errors:
# Before: written for Opus 4.6, fails on Fable 5
response = client.messages.create(
model="claude-opus-4-6",
max_tokens=16000,
temperature=0.7, # 400: sampling params removed
thinking={"type": "enabled", "budget_tokens": 8000}, # 400: budget_tokens removed
messages=[
{"role": "user", "content": "Extract the fields."},
{"role": "assistant", "content": '{"name": "'}, # 400: prefills removed
],
)
# After: the same intent, expressed the Fable 5 way
response = client.messages.create(
model="claude-fable-5",
max_tokens=16000,
thinking={"type": "adaptive"},
output_config={
"effort": "high",
"format": {"type": "json_schema", "schema": EXTRACTION_SCHEMA},
},
messages=[{"role": "user", "content": "Extract the fields."}],
)
Each removed parameter has a designated replacement: adaptive thinking plus effort replaces the thinking budget, structured outputs replace JSON-forcing prefills, and prompting replaces temperature steering. One subtle case: if you previously sent thinking: {"type": "disabled"} for latency, omitting the thinking parameter is the Fable 5 equivalent, because the explicit disabled value is rejected.
On subscriptions, Fable 5 is included at no extra cost on Pro, Max, Team, and seat-based Enterprise plans through June 22, 2026, after which it moves to usage credits. API access is fully available from day one.
The Safeguard Architecture Is Worth Studying
Beyond the model itself, the deployment pattern deserves your attention as an architect. Anthropic put separate classifier models in front of Fable 5. Every request is screened; flagged requests get answered by Opus 4.8 instead of being refused outright. The bug bounty ran over a thousand hours without producing a universal jailbreak, and the system held against external red-teaming on long-form agentic tasks.
This is a production safety pattern you can learn from directly: a cheap screening layer, a capable fallback, and graceful degradation instead of hard failure. The same shape applies to systems you build, where a guardrail model screens inputs and a policy decides what the agent may do when a check fires. Designing guardrails, escalation rules, and fallback paths is exactly the kind of reliability engineering that separates a demo from a production system.
Master These Concepts with Practice
Our CCA-F practice bundle includes:
- 6 full practice exams (390+ questions)
- Detailed explanations for every answer
- Domain-by-domain performance tracking
30-day money-back guarantee
Which Model Should You Actually Use?
The question every team is asking this week. The answer depends on workload shape, and the safeguard layer makes it more interesting than a simple capability ranking.
Fable 5 vs Opus 4.8 vs Sonnet 4.6: decision guide
| Workload | Pick | Why |
|---|---|---|
| Long-horizon autonomous work: overnight refactors, large migrations, multi-day research | Fable 5 | The capability gains concentrate exactly here; 1M context plus file-based memory compound over long runs |
| Hardest one-shot reasoning where correctness beats cost: architecture review, complex debugging | Fable 5 at high or xhigh effort | An 11-point SWE-Bench Pro gap is worth 2x token pricing when a wrong answer costs engineer-days |
| Tuned production agents with cost ceilings and stable prompts | Opus 4.8 | Half the price, the same API surface, and no safeguard fallback in the loop |
| Security tooling: pentest assistants, exploit analysis, CTF agents (authorized work) | Opus 4.8 directly | Fable 5 routes flagged offensive-security requests to Opus 4.8 anyway; targeting it directly gives you one consistent model instead of a per-request coin flip |
| Biology and chemistry workflows | Opus 4.8 | Anthropic describes the bio/chem safeguards as intentionally conservative, so expect frequent fallbacks in this domain |
| High-volume production default, latency-sensitive features | Sonnet 4.6 | Best speed-to-intelligence ratio at $3/$15; the workhorse tier |
| Classification, routing, simple extraction at scale | Haiku 4.5 | Frontier reasoning is wasted on tasks a $1/$5 model handles |
The security and bio/chem rows deserve emphasis because they are counterintuitive: for those domains the newest model is the wrong default. A session that triggers the classifier gets Opus 4.8 quality while paying Fable 5 attention, and which requests trigger it is outside your control. Architecture questions about model selection always come down to matching the model to the workload, and the safeguard layer is now part of that calculation.
The Cost Math Is Less Brutal Than the Sticker Price
Fable 5 costs exactly double Opus 4.8 per token. Whether it costs double per task is a different question. Work through a realistic agentic session at list prices: 40 model calls, averaging 20K input tokens and 2K output tokens per call.
Worked example: a 40-turn agentic session (list prices, no caching)
| Model | Input cost (800K tokens) | Output cost (80K tokens) | Session total |
|---|---|---|---|
| Opus 4.8 | $4.00 | $2.00 | $6.00 |
| Fable 5 | $8.00 | $4.00 | $12.00 |
At identical turn counts, double the price means double the bill. But early-access partners report Fable 5 finishing equivalent runs 25 to 30 percent faster with fewer turns (Anthropic quotes this for spreadsheet and data work, and Cognition measured its top FrontierCode score at medium effort). If that efficiency holds for your workload and a session that took Opus 4.8 forty turns takes Fable 5 twenty-eight, the Fable 5 session costs $8.40 against $6.00, a 40% premium rather than 100%. Prompt caching narrows the absolute gap further since both models discount cache reads the same way.
Two honest caveats. The fewer-turns figure is a partner-reported result on their workloads, so treat it as a hypothesis to verify on yours before budgeting around it. And turn count is also a wall-clock and review-time number: fewer turns means less time watching an agent work, which for an engineer-in-the-loop workflow is often worth more than the token delta itself. The way to find out is to run your own workload on both models for a day and compare the usage fields, which is a one-line change now that the request surfaces match.
Frontier models reward agentic fundamentals
Fable 5 raises the ceiling on agentic systems. These hands-on labs teach the patterns it amplifies: agent loops, persistent memory, multi-agent coordination, and guardrails, all with running code in your browser.
Build an AI Agent 3 Ways: ReAct vs Tool Calling vs Plan-and-Execute
Build the same SaaS customer support agent three different ways — ReAct, direct tool calling, and plan-and-execute — then compare them on speed, reasoning quality, and reliability to learn when to use each pattern in production.
Add Long-Term Memory to an AI Agent: LangGraph + Milvus
Build a sales intelligence assistant that remembers — short-term conversation state with LangGraph checkpointer, long-term facts in Milvus, and reflection loops that auto-extract knowledge. Learn the memory architecture every production agent needs.
Build a Multi-Agent Supervisor with LangGraph
Build a supervisor agent that routes queries to specialist agents — a core architecture pattern tested on the NCP-AAI exam.
Build NeMo Guardrails for an AI Agent: Jailbreak & Topical Rails
Build a guarded IT support agent that blocks jailbreaks, refuses off-topic questions, and safely handles IT queries — using keyword checks, LLM-based validation, and NeMo Guardrails.
A More Capable Model Raises the Bar for Architects
Here is the practical takeaway for your career. Fable 5 does not replace the engineering judgment around it; it amplifies whatever architecture you give it. A 1M-token context window still suffers from lost-in-the-middle attention if you dump everything into one prompt. File-based memory only compounds across sessions if you design the memory layout. Long-horizon autonomy only pays off when your agentic loop has correct termination logic, structured error handling, and programmatic guardrails. Every one of those is a designed system, and someone has to design it.
That someone is increasingly expected to prove it. Anthropic launched the Claude Certified Architect (CCA-F) certification in March 2026 precisely because enterprises adopting Claude need architects who can build reliable systems around models like this one. The exam tests agentic loop control, tool design, MCP integration, context management, and reliability engineering: the exact skills Fable 5 rewards.
Two things to know if you are preparing:
- The current exam blueprint predates Fable 5. Questions are written against the Opus 4.8 / Sonnet 4.6 / Haiku 4.5 lineup, so anchor model-selection answers to tier reasoning rather than recency. Our exam domains breakdown covers what each domain actually tests.
- The concepts are model-agnostic. Agentic loops, stop_reason handling, tool description quality, and context strategies work the same on Fable 5 as on Sonnet. Time spent practicing them pays off on the exam and on whatever model ships next.
The fastest way to find your gaps is to test against realistic scenarios. Preporato's CCA-F practice tests mirror the real exam's scenario-based format across all five domains, with explanations that teach the architectural reasoning behind every answer.
Frequently Asked Questions
Next Steps
Frontier releases reward the engineers who already have the fundamentals. Pick one agentic pattern you have read about but never built, open the lab, and ship it end to end. Then take a timed practice test and see whether the judgment transfers under pressure.
Build an agent worthy of a frontier model
Agent loops, memory, orchestration, and guardrails: hands-on labs with running code, in your browser.
Ready to measure yourself? Take a CCA-F practice test on Preporato and find your weak domains before the exam does.
Ready to Pass the CCA-F Exam?
Join thousands who passed with Preporato practice tests
![Claude Fable 5: Capabilities, API Changes, and What AI Engineers Should Do Next [2026]](/blog/claude-fable-5-guide-for-ai-engineers.webp)