How much does Claude Fable 5 cost?

On the API, Fable 5 costs $10 per million input tokens and $50 per million output tokens, double Opus 4.8 pricing but less than half of what the old Mythos Preview cost. On subscriptions it is included at no extra cost on Pro, Max, Team, and seat-based Enterprise plans through June 22, 2026, after which it requires usage credits.

How is Fable 5 different from Opus 4.8?

Fable 5 is a full tier above Opus 4.8. On SWE-Bench Pro it solves 80.3% of problems against 69.2% for Opus 4.8, and it shows similar gains in vision, long-horizon agentic work, and scientific reasoning. The API surface is nearly identical, with one extra restriction: an explicit thinking disabled setting returns an error on Fable 5, so omit the thinking parameter instead.

When should I use Fable 5 instead of Opus 4.8?

Use Fable 5 for long-horizon autonomous work and for the hardest reasoning tasks where a wrong answer costs more than tokens do. Use Opus 4.8 for tuned production agents with cost ceilings, and for security or biology workloads, where Fable 5 routes flagged requests to Opus 4.8 anyway and a direct call gives you one consistent model. At list prices Fable 5 costs double per token, but partner-reported results suggest it completes equivalent runs in 25 to 30 percent fewer turns, which shrinks the effective premium to roughly 40 to 50 percent on workloads where that efficiency holds.

What is the difference between Fable 5 and Mythos 5?

They are the same underlying model. Fable 5 ships with classifier safeguards covering offensive cybersecurity, biology and chemistry, and capability distillation. Mythos 5 has the cybersecurity safeguards lifted and is restricted to vetted defenders under the Project Glasswing program. For nearly all developers and teams, Fable 5 is the version you can access.

Does Claude Fable 5 change the CCA-F exam?

Not yet. The current CCA-F question pool was authored against the Opus 4.8, Sonnet 4.6, and Haiku 4.5 generation, so exam answers should anchor to tier reasoning with the documented production lineup. Expect Fable 5 to appear in scenarios only after a blueprint refresh. The architectural concepts the exam tests, like agentic loops, tool design, and context management, apply to Fable 5 unchanged.

Which API parameters does Fable 5 reject?

Fable 5 returns a 400 error for temperature, top_p, top_k, the old budget_tokens extended thinking, last-assistant-turn prefills, and an explicit thinking disabled setting. Use adaptive thinking, the effort parameter inside output_config, structured outputs for format control, and prompting for behavioral steering.

What is the best way to build skills for frontier models like Fable 5?

Build working systems rather than reading announcements. The capabilities Fable 5 amplifies, including long agentic loops, persistent memory, multi-agent coordination, and guardrails, are designed systems you can practice hands-on. Preporato labs walk you through each pattern with running code in the browser, and CCA-F practice tests measure whether the architectural judgment holds up under realistic scenarios.

Claude Fable 5: Capabilities, API Changes, and What AI Engineers Should Do Next [2026]

Q: What is Claude Fable 5?

Claude Fable 5 is the model Anthropic released on June 9, 2026. It is a Mythos-class model, meaning it comes from the frontier research tier that previously required a gated private preview. It sits above Opus in the lineup, with a 1M-token context window, 128K max output tokens, and a classifier-based safeguard layer that routes flagged high-risk requests to an Opus 4.8 fallback.

On June 9, 2026, Anthropic released Claude Fable 5, the most capable model it has ever made generally available. Fable 5 is a Mythos-class model: the frontier research tier that previously lived behind a private preview, now shipped to everyone with a safeguard layer wrapped around it. It sits above Opus in the lineup, solves 80.3% of SWE-Bench Pro problems against Opus 4.8's 69.2%, holds a one-million-token context window, and rejects several API parameters you may still be sending. This guide covers what the model does, what changed for developers, and how to turn the release into skills you can actually demonstrate.

Building toward certification?

Fable 5 is the newest model, but the architecture skills it rewards are the ones the Claude Certified Architect exam tests. Start with our Complete CCA-F Guide, keep the CCA-F Cheat Sheet handy, and measure yourself with practice tests on Preporato.

What Is Claude Fable 5?

Fable 5 is the public release of Anthropic's internal frontier tier. Anthropic has been training Mythos-class models that exceed everything in the public lineup, and until now access required a gated preview. Fable 5 is that model, identical weights, wrapped in a classifier-based safeguard system that screens requests in three high-risk areas (offensive cybersecurity, biology and chemistry, and large-scale capability distillation) and routes flagged requests to an Opus 4.8 fallback. Anthropic reports the fallback triggers in under 5% of sessions, so the overwhelming majority of users get Fable 5 answering directly.

A sibling model, Claude Mythos 5, is the same model with the cybersecurity safeguards lifted. It is restricted to vetted defenders under Anthropic's Project Glasswing program, so for nearly everyone, Fable 5 is the frontier.

Claude model lineup after June 9, 2026

Model	API ID	Tier	Context / Max Output	Pricing (per 1M tokens in/out)
Claude Fable 5	claude-fable-5	Frontier (Mythos-class)	1M / 128K	$10 / $50
Claude Opus 4.8	claude-opus-4-8	Most capable Opus	1M / 128K	$5 / $25
Claude Sonnet 4.6	claude-sonnet-4-6	Speed and intelligence balance	1M / 64K	$3 / $15
Claude Haiku 4.5	claude-haiku-4-5	Fastest, cheapest	200K / 64K	$1 / $5

The Capability Jump, in Numbers

The headline results from Anthropic's announcement and early access partners:

Software engineering. 80.3% on SWE-Bench Pro (real GitHub issues, fix must pass the tests), against 69.2% for Opus 4.8. Stripe reported a 50-million-line Ruby codebase migration, normally budgeted at two months, completed in one day. Cognition measured the highest FrontierCode score among frontier models at medium effort.
Long-horizon autonomy. Fable 5 maintains coherent work across millions of tokens and improves its own output using file-based memory: it writes notes to disk as it works and reads them back in later sessions. In Anthropic's Slay the Spire evaluation, file memory improved Fable 5 three times more than it improved Opus 4.8.
Vision. It rebuilds web app source code from screenshots alone, reads precise values off scientific figures, and completed Pokémon FireRed through a vision-only interface where prior models needed helper tooling.
Knowledge work and science. Highest score on Hebbia's finance benchmark, a 90% on Hex's analytics benchmark, and roughly a 10x acceleration on internal protein design tasks, with one model-generated hypothesis later corroborated by an external lab.
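
The file-based memory described under long-horizon autonomy (notes written to disk during a run and read back in later sessions) can be sketched in a few lines. This is a minimal illustration, not Anthropic's implementation: the `FileMemory` class, the `build_system_prompt` helper, and the one-line-per-note layout are all assumptions made for the example.

```python
from datetime import datetime, timezone
from pathlib import Path


class FileMemory:
    """Append-only notes file that an agent reads at the start of each session.

    Hypothetical helper for illustration; the layout is an assumption.
    """

    def __init__(self, path):
        self.path = Path(path)

    def load(self):
        # Return all notes from earlier sessions, or an empty string on first run.
        if not self.path.exists():
            return ""
        return self.path.read_text(encoding="utf-8")

    def append(self, note):
        # One timestamped line per note keeps the file easy to scan and trim.
        self.path.parent.mkdir(parents=True, exist_ok=True)
        stamp = datetime.now(timezone.utc).strftime("%Y-%m-%dT%H:%M:%SZ")
        with self.path.open("a", encoding="utf-8") as fh:
            fh.write(f"- [{stamp}] {note.strip()}\n")


def build_system_prompt(base_prompt, memory):
    # Prepend earlier notes so a new session starts with what the last one learned.
    notes = memory.load()
    if not notes:
        return base_prompt
    return f"{base_prompt}\n\nNotes from earlier sessions:\n{notes}"
```

In use, the agent would call `memory.append(...)` whenever it learns something worth keeping and pass `build_system_prompt(...)` as the system prompt of the next session. How much a given model gains from this depends on what gets written down, which is a design decision the sketch leaves open.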

[Figure: SWE-Bench Pro, real GitHub issues resolved. "Fable 5 vs the field": bar chart comparing GPT-5.5, Opus 4.8, and Fable 5, with Fable 5 at +11.1 pts vs Opus 4.8. Source: Anthropic, June 2026.]

SWE-Bench Pro: an 11-point jump on a benchmark where progress usually arrives one or two points at a time.

The pattern across all four areas is the same: the gains concentrate in long, multi-step, tool-heavy work. Which means the engineers who benefit most are the ones who already know how to structure agentic systems.

[Figure: Gain from file-based memory, long-horizon eval. "Same memory, triple the compounding": Opus 4.8 vs Fable 5 on a 1x to 3x scale. Slay the Spire eval: improvement over each model's own no-memory baseline. Fable 5 also reached the final act 3x more often. Source: Anthropic, June 2026.]

The same file-based memory setup compounds three times harder on Fable 5 than on Opus 4.8.

What Changed in the API

Fable 5 keeps the same request surface as Opus 4.7 and 4.8, with one added restriction. If you are migrating code, these are the rules:

Adaptive thinking only. thinking: {"type": "adaptive"} is the only on-mode. The old budget_tokens extended thinking returns a 400 error.
Explicit thinking-disabled is rejected. This is new in Fable 5: thinking: {"type": "disabled"} returns a 400. If you want thinking off, omit the thinking parameter entirely.
Sampling parameters are gone. temperature, top_p, and top_k all return 400 errors. Steer behavior through prompting and the effort parameter.
Effort controls depth. output_config: {"effort": ...} runs from low through medium, high, xhigh, and max. High is the sensible default for intelligence-sensitive work; xhigh suits coding and agentic loops.
Prefills stay removed. Last-assistant-turn prefills return 400, as on the 4.6 generation onward. Use structured outputs (output_config.format) instead.
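
The rules above lend themselves to a pre-flight check that runs before a request leaves your code. The sketch below is a local lint over a plain request dictionary, not part of any SDK; the function name `find_rejected_params` and the wording of the messages are assumptions for illustration, and the checks encode only the rules listed in this section.

```python
REJECTED_SAMPLING = ("temperature", "top_p", "top_k")


def find_rejected_params(request):
    """Return reasons this request dict would be rejected under the rules above.

    Hypothetical helper: it inspects a dict locally and never calls the API.
    """
    problems = []
    for name in REJECTED_SAMPLING:
        if name in request:
            problems.append(f"{name}: sampling parameters are removed")

    thinking = request.get("thinking")
    if thinking is not None:
        kind = thinking.get("type")
        if kind == "disabled":
            problems.append("thinking: explicit 'disabled' is rejected; omit the parameter")
        elif kind != "adaptive" or "budget_tokens" in thinking:
            problems.append("thinking: only {'type': 'adaptive'} is accepted")

    messages = request.get("messages") or []
    if messages and messages[-1].get("role") == "assistant":
        problems.append("messages: last-assistant-turn prefill is removed")
    return problems
```

Running it in a unit test or CI step over your stored request templates is one way to catch legacy parameters before they turn into 400 responses in production.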

Here is what the migration looks like in practice. If your code dates from the Opus 4.6 era or earlier, this call contains three things that now return 400 errors:

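# Before: written for Opus 4.6, fails on Fable 5
import anthropic

client = anthropic.Anthropic()  # reads ANTHROPIC_API_KEY from the environment

response = client.messages.create(
    model="claude-opus-4-6",
    max_tokens=16000,
    temperature=0.7,                                       # 400: sampling params removed
    thinking={"type": "enabled", "budget_tokens": 8000},   # 400: budget_tokens removed
    messages=[
        {"role": "user", "content": "Extract the fields."},
        {"role": "assistant", "content": '{"name": "'},    # 400: prefills removed
    ],
)

# After: the same intent, expressed the Fable 5 way
# EXTRACTION_SCHEMA is an illustrative placeholder; substitute your own JSON Schema.
EXTRACTION_SCHEMA = {
    "type": "object",
    "properties": {"name": {"type": "string"}},
    "required": ["name"],
    "additionalProperties": False,
}

response = client.messages.create(
    model="claude-fable-5",
    max_tokens=16000,
    thinking={"type": "adaptive"},                         # the only accepted on-mode
    output_config={
        "effort": "high",                                  # replaces the thinking budget
        "format": {"type": "json_schema", "schema": EXTRACTION_SCHEMA},  # replaces the prefill
    },
    messages=[{"role": "user", "content": "Extract the fields."}],
)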

Each removed parameter has a designated replacement: adaptive thinking plus effort replaces the thinking budget, structured outputs replace JSON-forcing prefills, and prompting replaces temperature steering. One subtle case: if you previously sent thinking: {"type": "disabled"} for latency, omitting the thinking parameter is the Fable 5 equivalent, because the explicit disabled value is rejected.
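
If you have many call sites, the replacements described above can be applied mechanically to a request dictionary. The following is a rough sketch under stated assumptions: `migrate_request` is a hypothetical helper, the default effort of `"high"` is a choice made for the example, and the function cannot recover the formatting intent of a dropped prefill, so that part still needs a hand-written schema.

```python
import copy


def migrate_request(legacy, model="claude-fable-5", effort="high"):
    """Return a copy of a legacy request dict rewritten to follow the rules above.

    Hypothetical helper for illustration; it edits a dict and never calls the API.
    """
    req = copy.deepcopy(legacy)
    req["model"] = model

    # Sampling parameters have no replacement parameter; steer through the prompt.
    for name in ("temperature", "top_p", "top_k"):
        req.pop(name, None)

    # A thinking budget becomes adaptive thinking; explicit "disabled" becomes omission.
    thinking = req.pop("thinking", None)
    if thinking is not None and thinking.get("type") != "disabled":
        req["thinking"] = {"type": "adaptive"}

    req.setdefault("output_config", {}).setdefault("effort", effort)

    # Drop a trailing assistant prefill. Its format intent still has to be
    # re-expressed by the caller as output_config["format"].
    messages = req.get("messages", [])
    if messages and messages[-1].get("role") == "assistant":
        req["messages"] = messages[:-1]
    return req
```

The helper works on a deep copy so the original request stays available for comparison, which is useful when you run the same workload against both models.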

On subscriptions, Fable 5 is included at no extra cost on Pro, Max, Team, and seat-based Enterprise plans through June 22, 2026, after which it moves to usage credits. API access is fully available from day one.

The Safeguard Architecture Is Worth Studying

Beyond the model itself, the deployment pattern deserves your attention as an architect. Anthropic put separate classifier models in front of Fable 5. Every request is screened; flagged requests get answered by Opus 4.8 instead of being refused outright. The bug bounty ran over a thousand hours without producing a universal jailbreak, and the system held against external red-teaming on long-form agentic tasks.

This is a production safety pattern you can learn from directly: a cheap screening layer, a capable fallback, and graceful degradation instead of hard failure. The same shape applies to systems you build, where a guardrail model screens inputs and a policy decides what the agent may do when a check fires. Designing guardrails, escalation rules, and fallback paths is exactly the kind of reliability engineering that separates a demo from a production system.
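
The shape described above (a cheap screening layer, a capable fallback, and graceful degradation) can be sketched with plain callables. This is a toy illustration of the pattern for your own systems, not a description of how Anthropic's classifiers work: `route`, `Routed`, and the keyword-based `keyword_screen` are hypothetical names, and a real screen would be a model rather than a substring match.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class Routed:
    answered_by: str  # "primary" or "fallback"
    text: str


def route(request: str,
          is_flagged: Callable[[str], bool],
          primary: Callable[[str], str],
          fallback: Callable[[str], str]) -> Routed:
    """Screen first, then answer with the primary or degrade to the fallback."""
    if is_flagged(request):
        # Graceful degradation: the caller still gets an answer, not a refusal.
        return Routed("fallback", fallback(request))
    return Routed("primary", primary(request))


def keyword_screen(request: str) -> bool:
    # Toy stand-in for a classifier model; the term list is illustrative only.
    return any(term in request.lower() for term in ("exploit", "pathogen"))
```

Recording `answered_by` alongside each response is a deliberate choice in the sketch: it lets you measure how often the fallback fires on your own traffic instead of guessing.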

[Interactive figure: Fable 5 safeguard routing. A request (for example, "Refactor a 50-file codebase") passes through the classifier. With no flags, Fable 5 answers; flagged requests are answered by the Opus 4.8 fallback, which fires in under 5% of sessions. Standard engineering work passes the classifier and gets the full frontier model, and long-horizon refactors are exactly where Fable 5 gains concentrate.]

The routing decision Anthropic makes on every Fable 5 request. Flagged domains still get an answer, served by Opus 4.8.

Which Model Should You Actually Use?

The question every team is asking this week. The answer depends on workload shape, and the safeguard layer makes it more interesting than a simple capability ranking.

Fable 5 vs Opus 4.8 vs Sonnet 4.6: decision guide

Workload	Pick	Why
Long-horizon autonomous work: overnight refactors, large migrations, multi-day research	Fable 5	The capability gains concentrate exactly here; 1M context plus file-based memory compound over long runs
Hardest one-shot reasoning where correctness beats cost: architecture review, complex debugging	Fable 5 at high or xhigh effort	An 11-point SWE-Bench Pro gap is worth 2x token pricing when a wrong answer costs engineer-days
Tuned production agents with cost ceilings and stable prompts	Opus 4.8	Half the price, the same API surface, and no safeguard fallback in the loop
Security tooling: pentest assistants, exploit analysis, CTF agents (authorized work)	Opus 4.8 directly	Fable 5 routes flagged offensive-security requests to Opus 4.8 anyway; targeting it directly gives you one consistent model instead of a per-request coin flip
Biology and chemistry workflows	Opus 4.8	Anthropic describes the bio/chem safeguards as intentionally conservative, so expect frequent fallbacks in this domain
High-volume production default, latency-sensitive features	Sonnet 4.6	Best speed-to-intelligence ratio at $3/$15; the workhorse tier
Classification, routing, simple extraction at scale	Haiku 4.5	Frontier reasoning is wasted on tasks a $1/$5 model handles

The security and bio/chem rows deserve emphasis because they are counterintuitive: for those domains the newest model is the wrong default. A session that triggers the classifier gets Opus 4.8-quality output on a request you addressed to Fable 5, and which requests trigger it is outside your control. Architecture questions about model selection always come down to matching the model to the workload, and the safeguard layer is now part of that calculation.
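
One way to keep the decision guide from drifting into ad hoc choices is to encode it as a lookup in your own code. The sketch below is an assumption-laden illustration: the workload keys and the `pick_model` function are made up for the example, while the model IDs are the ones listed in the lineup table.

```python
# Workload categories are hypothetical labels for the rows of the decision guide.
MODEL_BY_WORKLOAD = {
    "long_horizon": "claude-fable-5",
    "hard_reasoning": "claude-fable-5",
    "tuned_production_agent": "claude-opus-4-8",
    "security": "claude-opus-4-8",   # Fable 5 would route flagged requests here anyway
    "bio_chem": "claude-opus-4-8",   # conservative safeguards, frequent fallbacks
    "high_volume": "claude-sonnet-4-6",
    "classification": "claude-haiku-4-5",
}


def pick_model(workload, default="claude-sonnet-4-6"):
    """Map a workload label to a model ID, falling back to the workhorse tier."""
    return MODEL_BY_WORKLOAD.get(workload, default)
```

Keeping the mapping in one place also makes the safeguard-driven exceptions visible to reviewers, instead of leaving them buried in individual call sites.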

The Cost Math Is Less Brutal Than the Sticker Price

Fable 5 costs exactly double Opus 4.8 per token. Whether it costs double per task is a different question. Work through a realistic agentic session at list prices: 40 model calls, averaging 20K input tokens and 2K output tokens per call.

Worked example: a 40-turn agentic session (list prices, no caching)

Model	Input cost (800K tokens)	Output cost (80K tokens)	Session total
Opus 4.8	$4.00	$2.00	$6.00
Fable 5	$8.00	$4.00	$12.00

At identical turn counts, double the price means double the bill. But early-access partners report Fable 5 finishing equivalent runs in 25 to 30 percent fewer turns (Anthropic quotes this for spreadsheet and data work, and Cognition measured its top FrontierCode score at medium effort). If that efficiency holds for your workload at the 30 percent end, so that a session that took Opus 4.8 forty turns takes Fable 5 twenty-eight, the Fable 5 session costs $8.40 against $6.00, a 40% premium rather than 100%. At 25 percent fewer turns (thirty turns) it costs $9.00, a 50% premium. Prompt caching narrows the absolute gap further since both models discount cache reads the same way.

Two honest caveats. The fewer-turns figure is a partner-reported result on their workloads, so treat it as a hypothesis to verify on yours before budgeting around it. And turn count is also a wall-clock and review-time number: fewer turns means less time watching an agent work, which for an engineer-in-the-loop workflow is often worth more than the token delta itself. The way to find out is to run your own workload on both models for a day and compare the usage fields, which is a one-line change now that the request surfaces match.
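
The worked example above is simple enough to reproduce in code and rerun with your own session shape. This is a sketch at list prices with no caching; the function names `session_cost` and `fable_premium` are assumptions for the example, and the prices are the per-million-token figures from the lineup table.

```python
# USD per 1M tokens as (input, output), taken from the lineup table.
PRICES = {
    "claude-opus-4-8": (5.00, 25.00),
    "claude-fable-5": (10.00, 50.00),
}


def session_cost(model, turns, input_per_turn, output_per_turn):
    """Total list-price cost of a session, ignoring caching."""
    price_in, price_out = PRICES[model]
    input_cost = turns * input_per_turn * price_in / 1_000_000
    output_cost = turns * output_per_turn * price_out / 1_000_000
    return input_cost + output_cost


def fable_premium(turns, input_per_turn, output_per_turn, fewer_turns_pct=0.0):
    """Fable 5 cost relative to Opus 4.8, as a fraction (1.0 means +100%).

    fewer_turns_pct is the share of turns Fable 5 is assumed to save.
    """
    opus = session_cost("claude-opus-4-8", turns, input_per_turn, output_per_turn)
    fable_turns = turns * (1 - fewer_turns_pct / 100)
    fable = session_cost("claude-fable-5", fable_turns, input_per_turn, output_per_turn)
    return fable / opus - 1
```

With the 40-turn, 20K-input, 2K-output session, `fable_premium(40, 20_000, 2_000)` comes to about 1.0 (a 100% premium), and passing `fewer_turns_pct=30` brings it to about 0.40. Because the token counts per turn are held equal for both models, the premium here depends only on the turn saving; measure real token usage before trusting it.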

Or model it first with the interactive calculator (agentic session cost, Fable 5 vs Opus 4.8, at list prices as of June 2026 with no caching). Its inputs are the number of turns in the session (default 40), input tokens per turn (20K), output tokens per turn (2K), and the percentage by which Fable 5 finishes in fewer turns (default 0%, the worst case). At those defaults it shows $6.00 for Opus 4.8 and $12.00 for Fable 5 over 40 turns, an effective premium of +100%. Setting the efficiency slider to the partner-reported 25 to 30 percent range reproduces the scenario from the worked example above.

Per-token pricing is exactly 2x. The premium per completed task is whatever the turn count makes it. Verify the efficiency on your own workload before budgeting around it.

Build the skills that transfer

Frontier models reward agentic fundamentals

Fable 5 raises the ceiling on agentic systems. These hands-on labs teach the patterns it amplifies: agent loops, persistent memory, multi-agent coordination, and guardrails, all with running code in your browser.

Build an AI Agent 3 Ways: ReAct vs Tool Calling vs Plan-and-Execute (hosted, intermediate, 35 min)

Build the same SaaS customer support agent three different ways (ReAct, direct tool calling, and plan-and-execute), then compare them on speed, reasoning quality, and reliability to learn when to use each pattern in production.

Add Long-Term Memory to an AI Agent: LangGraph + Milvus (hosted, intermediate, 35 min)

Build a sales intelligence assistant that remembers: short-term conversation state with LangGraph checkpointer, long-term facts in Milvus, and reflection loops that auto-extract knowledge. Learn the memory architecture every production agent needs.

Build a Multi-Agent Supervisor with LangGraph (hosted, intermediate, 40 min)

Build a supervisor agent that routes queries to specialist agents, a core architecture pattern tested on the NCP-AAI exam.

Build NeMo Guardrails for an AI Agent: Jailbreak & Topical Rails (hosted, intermediate, 35 min)

Build a guarded IT support agent that blocks jailbreaks, refuses off-topic questions, and safely handles IT queries, using keyword checks, LLM-based validation, and NeMo Guardrails.

A More Capable Model Raises the Bar for Architects

Here is the practical takeaway for your career. Fable 5 does not replace the engineering judgment around it; it amplifies whatever architecture you give it. A 1M-token context window still suffers from lost-in-the-middle attention if you dump everything into one prompt. File-based memory only compounds across sessions if you design the memory layout. Long-horizon autonomy only pays off when your agentic loop has correct termination logic, structured error handling, and programmatic guardrails. Every one of those is a designed system, and someone has to design it.
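
To make "correct termination logic" and "structured error handling" concrete, here is a minimal agent loop written against injected callables so it can run without network access. It is a sketch, not a reference implementation: `run_agent`, `call_model`, and `run_tool` are hypothetical names, and the reply dictionary is a simplified stand-in for an API response that carries a `stop_reason` and a list of content blocks.

```python
def run_agent(call_model, run_tool, messages, max_turns=20):
    """Loop until the model stops asking for tools or the turn budget runs out.

    call_model(messages) is assumed to return {"stop_reason": ..., "content": [...]}.
    run_tool(name, args) is assumed to return a result or raise on failure.
    """
    for turn in range(1, max_turns + 1):
        reply = call_model(messages)
        messages.append({"role": "assistant", "content": reply["content"]})

        # Termination: anything other than a tool request ends the loop.
        if reply["stop_reason"] != "tool_use":
            return {"status": "done", "turns": turn, "messages": messages}

        results = []
        for block in reply["content"]:
            if block.get("type") != "tool_use":
                continue
            try:
                output, is_error = run_tool(block["name"], block["input"]), False
            except Exception as exc:
                # Structured error handling: report the failure back to the model
                # instead of crashing the loop.
                output, is_error = f"{type(exc).__name__}: {exc}", True
            results.append({
                "type": "tool_result",
                "tool_use_id": block["id"],
                "content": str(output),
                "is_error": is_error,
            })
        messages.append({"role": "user", "content": results})

    # Programmatic guardrail: a hard turn budget the model cannot talk its way past.
    return {"status": "max_turns", "turns": max_turns, "messages": messages}
```

The loop decides when to stop from `stop_reason` and the turn budget, never from the text of the model's reply, which is the distinction the paragraph above is pointing at.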

That someone is increasingly expected to prove it. Anthropic launched the Claude Certified Architect (CCA-F) certification in March 2026 precisely because enterprises adopting Claude need architects who can build reliable systems around models like this one. The exam tests agentic loop control, tool design, MCP integration, context management, and reliability engineering: the exact skills Fable 5 rewards.

Two things to know if you are preparing:

The current exam blueprint predates Fable 5. Questions are written against the Opus 4.8 / Sonnet 4.6 / Haiku 4.5 lineup, so anchor model-selection answers to tier reasoning rather than recency. Our exam domains breakdown covers what each domain actually tests.
The concepts are model-agnostic. Agentic loops, stop_reason handling, tool description quality, and context strategies work the same on Fable 5 as on Sonnet. Time spent practicing them pays off on the exam and on whatever model ships next.

The fastest way to find your gaps is to test against realistic scenarios. Preporato's CCA-F practice tests mirror the real exam's scenario-based format across all five domains, with explanations that teach the architectural reasoning behind every answer.

Frequently Asked Questions

Next Steps

Frontier releases reward the engineers who already have the fundamentals. Pick one agentic pattern you have read about but never built, open the lab, and ship it end to end. Then take a timed practice test and see whether the judgment transfers under pressure.

Ready to measure yourself? Take a CCA-F practice test on Preporato and find your weak domains before the exam does.
