
Claude Certified Architect (CCA-F) Certification: Complete Guide [2026]

Preporato Team · March 28, 2026 · 16 min read · CCA-F

Anthropic's Claude Certified Architect - Foundations (CCA-F) is the company's first professional certification, and it has quickly become one of the most sought-after credentials in the AI industry. Unlike certifications that test theoretical knowledge or simple API usage, CCA-F validates your ability to design, build, and maintain production-grade systems powered by Claude. If you have been building with Claude's API, Agent SDK, Claude Code, or the Model Context Protocol (MCP), this certification proves you can do it at an enterprise level with proper architecture patterns, reliability engineering, and operational best practices. This guide covers every domain, every anti-pattern, and every strategy you need to pass on your first attempt.

Start Here

New to CCA-F? Start with What is CCA-F? for a high-level overview. Then use this guide for detailed domain coverage and our CCA-F Cheat Sheet for quick reference.

Exam Quick Facts

Duration: 120 minutes
Cost: $99 USD
Questions: 60
Passing Score: 720 out of 1000
Valid For: 2 years
Format: Online, remotely proctored

Why CCA-F Matters in 2026

The AI certification landscape changed dramatically in 2025 and 2026. While broad certifications like AWS AI Practitioner or Google Cloud Machine Learning Engineer have existed for years, CCA-F represents something fundamentally different: a deep, vendor-specific certification that validates hands-on architecture skills with the fastest-growing foundation model platform in the enterprise market.

The Rise of Claude in Enterprise

Claude's adoption in enterprise environments has been nothing short of explosive. Anthropic's Claude Partner Network, backed by a $100 million investment from Amazon Web Services, has created an ecosystem of consulting partners, system integrators, and technology companies that rely on Claude for mission-critical applications. Major enterprises across finance, healthcare, legal, and technology sectors are deploying Claude-based systems at scale, creating massive demand for professionals who can architect these systems correctly.

This is not just about knowing how to call an API. Enterprise Claude deployments involve complex agentic architectures with multiple coordinating agents, tool integrations via MCP, context management strategies that prevent degradation over long sessions, structured output pipelines with validation and retry logic, and reliability patterns that handle failures gracefully. CCA-F tests all of these skills.

Career Differentiation

The job market for AI engineers is increasingly crowded. Everyone can prompt an LLM. Many can write basic API integration code. But far fewer professionals can architect a production-grade agentic system that coordinates multiple Claude subagents, manages context windows effectively, implements proper error handling and escalation policies, and maintains reliability under real-world conditions. CCA-F certification distinguishes you as someone who operates at that architectural level.

Market Opportunity

According to industry reports, demand for AI architects with production deployment experience grew 340% between 2024 and 2026. Organizations that have adopted Claude as their primary foundation model are actively seeking CCA-F certified professionals for roles ranging from AI Architect ($180K-$280K) to Principal AI Engineer ($220K-$350K). The certification is increasingly listed as preferred or required in job postings across major technology companies, financial institutions, and consulting firms.

The Claude Partner Network Effect

Anthropic's Claude Partner Network has created a formal ecosystem where certified professionals carry significant weight. Partners are evaluated partly on the number of CCA-F certified architects on their teams. This means consulting firms, system integrators, and managed service providers are actively incentivizing their engineers to earn the certification, often covering exam costs and providing study time. If you work for or aspire to join one of these organizations, CCA-F is quickly becoming a baseline expectation.

What CCA-F Proves About You

Holding the CCA-F certification signals several things to employers and clients:

  1. Production Mindset: You think beyond prototypes and demos. You understand the reliability, scalability, and operational concerns that matter in production.
  2. Architecture Skills: You can design multi-agent systems, not just single-turn API calls. You understand when to use subagents, how to isolate context, and when to parallelize.
  3. Tool Ecosystem Mastery: You know the Claude tool ecosystem deeply, from the API and Agent SDK to Claude Code, MCP, and the Batch API.
  4. Anti-Pattern Awareness: You know what NOT to do, which is often more valuable than knowing what to do. The exam heavily tests your ability to identify and avoid common architectural mistakes.
  5. Current Knowledge: The certification has a two-year validity period, ensuring certified professionals stay current with the rapidly evolving Claude platform.


CCA-F Exam Overview

The 6 production scenarios that ground all CCA-F exam questions

Before diving into the five domains, let us understand the exam structure, format, and scoring methodology. The CCA-F exam is carefully designed to test practical, applied knowledge rather than rote memorization.

Exam Format and Structure

The CCA-F exam consists of 60 questions to be completed in 120 minutes. This gives you an average of 2 minutes per question, though the time distribution is uneven. Simple recall questions might take 30 seconds, while scenario-based production questions can require 4-5 minutes of careful analysis.

CCA-F Exam Format Details

Total Questions: 60 multiple-choice and multiple-select
Time Limit: 120 minutes
Passing Score: 720 out of 1000
Proctoring: Online, remotely proctored via webcam
Question Types: Single-answer, multiple-answer, scenario-based
Production Scenarios: 6 available, 4 randomly selected per exam
Retake Policy: Waiting period between attempts
Validity: 2 years from date of passing
Cost: $99 USD per attempt
Languages: English
Prerequisites: None (experience recommended)

The Production Scenario Questions

One of the most distinctive features of the CCA-F exam is its production scenario questions. The exam includes a pool of 6 production scenarios, and each candidate receives 4 randomly selected scenarios during their exam. These are not simple "what does this API do?" questions. Instead, they present realistic enterprise situations where you must make architectural decisions.

A typical production scenario might describe:

  • A financial services company deploying a multi-agent system for regulatory document analysis
  • A healthcare platform using Claude to triage patient inquiries with strict escalation requirements
  • An e-commerce company building a customer service agent that integrates with multiple backend systems
  • A legal firm automating contract review with structured output requirements

For each scenario, you will face multiple questions that test different aspects of the architecture. One question might focus on the agentic loop design, another on tool integration, and another on error handling or context management.

The Critical Distinction: Programmatic vs Prompt-Based

If there is one meta-principle that runs through the entire CCA-F exam, it is this: programmatic enforcement is almost always preferred over prompt-based guidance. This distinction appears across every domain and is the single most reliable tiebreaker when you are stuck between two answer choices.

What does this mean in practice?

  • Ordering and prerequisites: Do not tell Claude "complete step A before step B" in a prompt. Instead, use programmatic workflow logic that only calls step B after step A returns successfully.
  • Output format: Do not rely on prompt instructions like "always return valid JSON." Instead, use tool_use JSON schemas for structural enforcement, then add validation and retry loops.
  • Termination conditions: Do not use natural language parsing to detect when an agent should stop. Instead, check the stop_reason field programmatically: "tool_use" means continue the loop, "end_turn" means the model has finished.
  • Escalation rules: Do not ask Claude to "escalate when it seems appropriate." Instead, define explicit programmatic triggers: the user explicitly requests a human, the policy is ambiguous, or the agent has made no progress after N attempts.

This principle reflects Anthropic's production-first philosophy. Prompts are suggestions; code is enforcement. The exam rewards candidates who understand this deeply.
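The ordering example above can be sketched in code. This is a minimal illustration of the principle, not any specific Anthropic API; the step names and return shapes are hypothetical. The point is that step B is structurally unreachable unless step A succeeds, with no reliance on the model following a prompt instruction:

```python
# Hypothetical two-step workflow: step B runs only after step A
# succeeds -- enforced by control flow, not by prompt wording.

def run_step_a() -> dict:
    # Placeholder for a real tool call (e.g. fetching a document).
    return {"ok": True, "data": "document text"}

def run_step_b(a_output: str) -> dict:
    # Placeholder for the dependent step (e.g. summarizing the document).
    return {"ok": True, "summary": a_output[:20]}

def run_workflow() -> dict:
    result_a = run_step_a()
    if not result_a["ok"]:
        # Step A failed: step B is never invoked, by construction.
        return {"status": "failed", "stage": "A"}
    result_b = run_step_b(result_a["data"])
    return {"status": "done", "summary": result_b["summary"]}
```

No prompt can guarantee this ordering; the `if` statement can.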

Scoring Methodology

The CCA-F uses a scaled scoring system with a maximum of 1000 points and a passing threshold of 720. Not all questions are weighted equally. Production scenario questions and questions testing anti-pattern recognition typically carry higher weight than straightforward knowledge-check questions.

You will not receive a domain-by-domain score breakdown, but you will receive an overall score and a pass/fail result immediately upon completing the exam.

CCA-F exam domain weight distribution across five competency areas

The Five Exam Domains (Detailed)

The CCA-F exam is organized into five domains, each covering a distinct area of Claude architecture. Together, they span the full lifecycle of building, deploying, and operating production Claude systems. Let us examine each domain in depth.

Domain 1: Agentic Architecture & Orchestration (27%)

Domain 1 is the largest and most important domain on the exam, carrying 27% of the total weight. It covers the fundamental patterns for building agentic systems with Claude, from the basic agentic loop to complex multi-agent orchestration.

The Agentic Loop

The agentic loop is the foundational pattern for all Claude-based agents. At its core, the loop follows a simple cycle: send a message to Claude, receive a response, check whether Claude wants to use a tool, execute the tool if so, send the result back, and repeat until Claude indicates it is done.

The critical implementation detail is how you determine when to continue and when to stop. The correct approach is always programmatic: check the stop_reason field in the API response.

  • stop_reason = "tool_use" means Claude wants to call a tool. Your code should execute the tool and continue the loop.
  • stop_reason = "end_turn" means Claude has completed its response. Your code should exit the loop and return the result.
  • stop_reason = "max_tokens" means the response was truncated due to token limits. Your code should handle this as an edge case, potentially by requesting a continuation.

This is the canonical pattern that the exam tests repeatedly. The key anti-pattern to avoid is parsing Claude's natural language output to determine whether the agent should continue. For example, checking if Claude said "I'm done" or "Task complete" is fragile, unreliable, and explicitly called out as an anti-pattern on the exam. Natural language is inherently ambiguous. Claude might say "I've completed the first step" (which means continue) or "I've completed everything" (which means stop), and trying to parse these distinctions with string matching or even another LLM call is error-prone and unnecessary when stop_reason gives you a clean, deterministic signal.

Another anti-pattern is setting arbitrary iteration caps like "stop after 10 loops." While safety limits are reasonable (and recommended as a backstop), using them as the primary termination mechanism rather than checking stop_reason is incorrect. The correct approach is to use stop_reason as the primary termination signal and an iteration cap as a safety net. A good practice is to set a generous iteration cap (e.g., 50 or 100 iterations depending on the task complexity) and log a warning when it is hit, because hitting the cap usually indicates a bug in the agent's tool design or prompt rather than normal operation.
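The loop described above can be sketched as follows. `fake_claude` is a stub standing in for the real Messages API call (so the sketch is self-contained); the response shapes are simplified, but the dispatch on `stop_reason` and the iteration cap as a backstop are the pattern the exam tests:

```python
# Minimal agentic loop sketch with a stubbed model call.

MAX_ITERATIONS = 50  # generous safety net, not the primary terminator

def fake_claude(messages):
    # Stub: first call asks for a tool, second call finishes.
    if len(messages) == 1:
        return {"stop_reason": "tool_use",
                "tool": {"id": "t1", "name": "lookup", "input": {"q": "x"}}}
    return {"stop_reason": "end_turn", "text": "All done."}

def execute_tool(name, tool_input):
    # Placeholder for real tool execution.
    return f"result of {name}({tool_input})"

def run_agent(user_message):
    messages = [{"role": "user", "content": user_message}]
    for _ in range(MAX_ITERATIONS):
        response = fake_claude(messages)
        if response["stop_reason"] == "tool_use":
            # Model wants a tool: execute it, return the result, loop.
            tool = response["tool"]
            result = execute_tool(tool["name"], tool["input"])
            messages.append({"role": "user",
                             "content": [{"type": "tool_result",
                                          "tool_use_id": tool["id"],
                                          "content": result}]})
            continue
        if response["stop_reason"] == "end_turn":
            return response["text"]  # clean, deterministic exit
        if response["stop_reason"] == "max_tokens":
            raise RuntimeError("response truncated; request a continuation")
    raise RuntimeError("iteration cap hit -- likely a tool or prompt bug")
```

Notice that nothing in the loop inspects the text of Claude's reply to decide whether to continue; only `stop_reason` drives control flow.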

Agentic Loop Implementation Considerations

Beyond the basic loop pattern, the exam tests several implementation nuances:

Streaming vs Non-Streaming: In production, you often want to stream responses so that users see partial results in real-time. When streaming, the stop_reason is provided in the final event of the stream. Your implementation must handle both the streaming content events and the final stop event correctly.

Error Recovery Within the Loop: When a tool call fails, you should not simply exit the loop. Instead, return the error to Claude as a tool result (with structured error information), allowing Claude to decide whether to retry the tool, try an alternative approach, or inform the user. This keeps the agentic loop in control of error handling decisions while providing Claude with the information it needs to make good decisions.

Token Budget Management: Long-running agentic loops can accumulate significant token costs. Production implementations should track cumulative token usage across loop iterations and implement budget controls. This is distinct from iteration caps: you might have a generous iteration cap but a strict token budget that terminates the loop if costs exceed a threshold.
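A token budget check might look like the following sketch. The `usage` dicts mimic the per-response token counts the API reports; the budget figure and field names are illustrative. The key design point is that this check is independent of the iteration cap:

```python
# Cumulative token accounting across agentic loop iterations.

TOKEN_BUDGET = 100_000  # illustrative budget, not a recommended value

def within_budget(usage_log, budget=TOKEN_BUDGET):
    # usage_log: one usage dict per completed loop iteration.
    total = sum(u["input_tokens"] + u["output_tokens"] for u in usage_log)
    return total, total <= budget
```

In the loop, you would append each response's usage to the log and terminate (or escalate) when `within_budget` returns False, even if the iteration cap has not been reached.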

Hub-and-Spoke Architecture

For complex applications that require multiple specialized capabilities, the CCA-F emphasizes the hub-and-spoke architecture pattern. This pattern consists of:

  • A coordinator agent (the hub): Receives the user's request, determines which specialized agent(s) to invoke, routes tasks appropriately, and synthesizes results.
  • Specialized subagents (the spokes): Each handles a specific domain or capability. For example, one subagent might handle database queries, another might handle document analysis, and a third might handle email composition.

The coordinator does not do the work itself. Its role is routing, delegation, and synthesis. This separation of concerns is crucial for several reasons:

  1. Context efficiency: Each subagent only receives the context it needs, rather than the entire conversation history.
  2. Specialization: Each subagent can have its own system prompt, tool set, and behavioral configuration optimized for its specific task.
  3. Reliability: If one subagent fails, the coordinator can retry it or route to a fallback without affecting other subagents.
  4. Scalability: New capabilities can be added by creating new subagents without modifying existing ones.
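A toy sketch of the pattern, with the subagents reduced to plain functions. The routing rule (keyword matching) and the spoke names are deliberately simplistic assumptions; in a real system the coordinator would itself be a Claude agent choosing among subagents. What the sketch shows is the separation of concerns: the hub routes and returns, it does no domain work:

```python
# Hub-and-spoke sketch: the coordinator only routes and synthesizes.

def database_agent(task):
    return f"db rows for: {task}"        # spoke 1: database queries

def document_agent(task):
    return f"doc analysis of: {task}"    # spoke 2: document analysis

SPOKES = {"query": database_agent, "analyze": document_agent}

def coordinator(request):
    # Route to the first spoke whose keyword appears in the request.
    for keyword, agent in SPOKES.items():
        if keyword in request:
            return {"routed_to": keyword, "result": agent(request)}
    return {"routed_to": None, "result": "no matching specialist"}
```

Adding a new capability means adding one entry to `SPOKES`; existing spokes are untouched, which is the scalability property listed above.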

Subagent Context Isolation

One of the most heavily tested concepts in Domain 1 is subagent context isolation. The rule is simple but critical: subagents do NOT inherit the parent agent's conversation context. When you spawn a subagent, it starts with a clean context. You must explicitly pass any information the subagent needs.

This is a deliberate design choice, not a limitation. Context isolation prevents several problems:

  • Context pollution: Subagents do not receive irrelevant information from other conversation threads.
  • Security: Sensitive information from the user's conversation does not leak to subagents that do not need it.
  • Predictability: Each subagent's behavior is determined only by what you explicitly provide, making the system easier to reason about and debug.

The exam tests this concept from multiple angles. You might see a scenario where a subagent is "failing to access information" that was discussed earlier in the conversation with the coordinator. The correct diagnosis is that the information was not passed to the subagent (because context is not inherited), and the fix is to explicitly include the relevant information when creating the subagent.
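The fix can be made concrete with a small sketch. The function below is illustrative, not an SDK API: it shows that a subagent's input is assembled from scratch, so any fact the subagent needs (here, an account ID) must be explicitly selected and passed by the parent:

```python
# Context isolation sketch: the subagent sees ONLY what the parent
# explicitly passes -- nothing from the coordinator's own history.

def spawn_subagent(system_prompt, task, relevant_facts):
    # relevant_facts: the parent's explicit hand-off; if a fact is
    # missing here, the subagent simply does not know it.
    context = "\n".join(relevant_facts)
    return {"system": system_prompt,
            "messages": [{"role": "user",
                          "content": f"{context}\n\nTask: {task}"}]}
```

In the exam scenario above, the failing subagent would be fixed by adding the earlier-discussed information to `relevant_facts`, not by trying to make the subagent "inherit" the parent conversation.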

Parallel Task Execution

When multiple independent operations need to happen simultaneously, the CCA-F tests your understanding of parallel Task calls. The Task object in the Agent SDK allows you to create multiple agent invocations that run concurrently.

Key principles for parallel execution:

  • Independence requirement: Only parallelize tasks that are truly independent. If task B depends on the output of task A, they must run sequentially.
  • Result aggregation: The coordinator must gather results from all parallel tasks and synthesize them into a coherent response.
  • Error handling: If one parallel task fails, the coordinator must decide whether to retry it, skip it, or fail the entire operation. This decision depends on whether the task is critical or optional.

A common exam scenario involves a system that needs to search multiple data sources simultaneously. The correct approach is to launch parallel Tasks for each data source, then have the coordinator aggregate and deduplicate the results.
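The multi-source search scenario can be sketched with asyncio. The source names and the `search_source` stub are hypothetical stand-ins for real parallel Task invocations; the structure shows fan-out over independent sources, tolerating individual failures, then aggregation and deduplication:

```python
import asyncio

async def search_source(name, query):
    # Stand-in for a real parallel Task call against one data source.
    await asyncio.sleep(0)
    return [f"{name}:{query}", "shared-hit"]

async def fan_out(query, sources):
    # Launch all searches concurrently; they are independent.
    results = await asyncio.gather(
        *(search_source(s, query) for s in sources),
        return_exceptions=True)  # one failure shouldn't sink the rest
    hits = []
    for r in results:
        if isinstance(r, Exception):
            continue  # optional source failed; skip rather than abort
        hits.extend(r)
    return sorted(set(hits))  # aggregate and deduplicate
```

If one source were critical rather than optional, the coordinator would instead re-raise or retry on its failure, which is exactly the critical-vs-optional decision described above.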

Session Management

Production agentic systems need to handle interruptions, resumptions, and branching. The CCA-F tests your understanding of two key session management features:

  • --resume: Continues an existing session from where it left off. The conversation history is preserved, and the agent can pick up exactly where it stopped.
  • fork_session: Creates a new session that starts with a copy of the current conversation history but diverges from that point forward. This is useful for "what if" scenarios or exploratory branches.

Understanding when to use each is critical. Resumption is for continuing interrupted work. Forking is for exploration without affecting the main conversation thread.

The exam tests these concepts in scenarios like: a customer service agent needs to explore two different resolution paths before committing to one (use fork_session to explore each path independently), or a development agent was interrupted by a system restart and needs to continue where it left off (use --resume to restore the session).

Common Domain 1 Exam Patterns

The exam tests Domain 1 through several recurring question patterns that you should be prepared for:

Architecture Selection: Given a business requirement, choose the correct architecture. Single-agent for simple, focused tasks. Hub-and-spoke for complex tasks requiring multiple specializations. Parallel tasks for independent operations on multiple data sources. The key is matching complexity to architecture rather than defaulting to the most complex option.

Debugging Failures: A scenario describes a multi-agent system that is producing incorrect results. Common diagnoses include: subagent not receiving necessary context (context isolation violation), coordinator routing to the wrong subagent (tool description problem from Domain 2), or agentic loop not terminating properly (stop_reason not being checked).

Scaling Decisions: When a system needs to handle more volume, the correct approach usually involves parallelization (more parallel tasks) or horizontal scaling (more agent instances), not making individual agents more complex. The exam rewards answers that maintain architectural simplicity while scaling throughput.

Domain 2: Tool Design & MCP Integration (18%)

Domain 2 covers how Claude interacts with external systems through tools and the Model Context Protocol. This domain carries 18% of the exam weight and focuses heavily on practical design decisions that affect reliability and accuracy in production.

Tool Descriptions as the Primary Routing Mechanism

This is one of the most important concepts in Domain 2: tool descriptions are the primary mechanism by which Claude decides which tool to call. When Claude receives a user request and has multiple tools available, it reads the tool descriptions to determine which tool is most appropriate.

This means your tool descriptions must be:

  • Clear and specific: Describe exactly what the tool does, not just its name.
  • Differentiating: If you have similar tools, the descriptions must clearly explain when to use each one.
  • Accurate about limitations: If a tool only works with certain input types or has specific requirements, say so in the description.
  • Action-oriented: Describe the tool in terms of what it does and when to use it, not just what it is.

Poor tool descriptions are one of the most common causes of incorrect tool routing in production systems. The exam tests your ability to diagnose routing problems by examining tool descriptions and identify how to fix them.

Optimal Tool Count

Research and practical experience have established clear guidelines for tool count:

  • 4-5 tools is the optimal range for most Claude agents. At this level, the model can reliably distinguish between tools and route accurately.
  • Above 18 tools, routing accuracy degrades significantly. The model struggles to differentiate between many similar tools and makes more routing errors.
  • The solution for complex systems is not more tools but better architecture. Use the hub-and-spoke pattern with specialized subagents, each having their own focused set of 4-5 tools.

The exam frequently presents scenarios where a team has added too many tools and is experiencing routing problems. The correct answer is almost never "improve the tool descriptions" (though that might help marginally). The correct answer is to restructure the architecture so that each agent has fewer, more focused tools.

tool_choice Configuration

The tool_choice parameter controls how Claude interacts with available tools:

  • auto (default): Claude decides whether to use a tool or respond directly. This is appropriate for most general-purpose agents.
  • any: Forces Claude to use one of the available tools. The model chooses which tool, but it must use one. This is useful for tool-focused workflows where a direct text response is never appropriate.
  • {type: "tool", name: "specific_tool"}: Forces Claude to use a specific named tool. This is useful for structured extraction or when you programmatically know which tool should be called.
  • none: Prevents Claude from using any tools. Useful for analysis or synthesis steps where tool use would be inappropriate.

The exam tests your ability to choose the correct tool_choice setting for different scenarios. A common question pattern describes a workflow step and asks which tool_choice configuration is most appropriate.
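Expressed as request-payload fragments, the four settings look like the following (the tool name `extract_invoice` is hypothetical, and the surrounding request fields are omitted):

```python
# The four tool_choice shapes, as dicts sent in the request body.

tool_choice_auto = {"type": "auto"}    # model decides tool vs. text
tool_choice_any = {"type": "any"}      # model must use SOME tool
tool_choice_forced = {"type": "tool",  # model must use THIS tool
                      "name": "extract_invoice"}
tool_choice_none = {"type": "none"}    # no tool use permitted
```

The forced-tool shape is the one used for structured extraction pipelines, where the "tool" is really a JSON schema the output must satisfy.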

MCP Configuration

The Model Context Protocol (MCP) allows Claude to connect to external tools and data sources through a standardized protocol. The CCA-F tests your understanding of MCP configuration at two levels:

  • Project-level configuration (.mcp.json): Located in the project root, this file configures MCP servers that are available to everyone working on the project. It is typically committed to version control.
  • User-level configuration (~/.claude.json): Located in the user's home directory, this file configures MCP servers that are specific to the individual user. It is not shared with the team.

The key distinction the exam tests is scoping. Project-level MCP configuration applies to everyone on the project and should contain shared tools (like a project-specific database connector). User-level configuration should contain personal tools (like a user's personal GitHub integration).

Environment Variable Expansion

MCP configuration files support environment variable expansion using the ${VARIABLE_NAME} syntax. For example, ${GITHUB_TOKEN} in a .mcp.json file will be replaced with the value of the GITHUB_TOKEN environment variable at runtime.

This is important for security: you should never hardcode secrets in MCP configuration files. Instead, reference environment variables that are set in the deployment environment. The exam tests this pattern in scenarios where teams need to configure MCP servers with authentication credentials.
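A project-level .mcp.json using this pattern might look like the sketch below. The server entry shown (the community MCP GitHub server) is illustrative; the point is that the file references ${GITHUB_TOKEN} and is expanded from the environment at runtime, so the committed file contains no secret:

```json
{
  "mcpServers": {
    "github": {
      "command": "npx",
      "args": ["-y", "@modelcontextprotocol/server-github"],
      "env": {
        "GITHUB_PERSONAL_ACCESS_TOKEN": "${GITHUB_TOKEN}"
      }
    }
  }
}
```

Each developer (and the CI environment) sets GITHUB_TOKEN locally; the configuration itself stays safe to commit.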

Structured Error Handling in Tools

When a tool fails, how you report the failure back to Claude matters enormously. The CCA-F emphasizes structured error responses that include:

  • Error type: What category of error occurred (authentication, rate limit, not found, server error, etc.)
  • isRetryable flag: Whether the error is transient (and thus worth retrying) or permanent (and thus requiring a different approach). This is a critical distinction that the exam tests heavily.
  • Contextual information: What was the tool trying to do when it failed, and what information is available about the failure.

The anti-pattern is returning vague error messages like "tool call failed" or, worse, silently returning empty results when a failure occurs.

Access Failures vs Empty Results

This is a subtle but critical distinction that appears frequently on the exam:

  • Access failure: The tool could not complete its operation (network error, authentication failure, timeout). Claude needs to know the operation failed so it can retry or try an alternative approach.
  • Empty result: The tool completed successfully but found no data. Claude needs to know the search succeeded but returned nothing, so it can inform the user rather than trying the same search again.

If you return an empty result when an access failure occurred, Claude will incorrectly conclude that the data simply does not exist, rather than understanding that the retrieval failed. This can lead to confidently incorrect responses.
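The two result shapes can be sketched side by side. The field names (`error_type`, `is_retryable`) are illustrative conventions rather than a fixed schema; what matters is that a failure and an empty success are structurally distinguishable to Claude:

```python
# The two tool_result shapes the text distinguishes.

def rate_limited_result():
    # Access failure: the operation did NOT complete.
    return {"status": "error",
            "error_type": "rate_limit",
            "is_retryable": True,
            "detail": "429 from upstream; retry after backoff"}

def empty_search_result():
    # Success with no data: the search ran and found nothing.
    return {"status": "ok",
            "results": [],
            "detail": "query matched 0 records"}
```

Given the first shape, Claude can reasonably retry or switch approaches; given the second, it should tell the user nothing was found rather than repeat the same search.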

Common Domain 2 Exam Patterns

Domain 2 questions frequently test your ability to diagnose tool-related problems in production scenarios:

Routing Accuracy Degradation: A team reports that Claude is calling the wrong tool more frequently after adding new tools. The diagnosis is almost always that the total tool count has exceeded the optimal range. The fix is architectural restructuring (hub-and-spoke with specialized subagents), not better descriptions.

Tool Description Analysis: The exam may present two or more tool descriptions and ask which has a problem. Look for descriptions that are too vague ("handles data operations"), too similar to other tools (making routing ambiguous), or missing critical information about when to use the tool versus alternatives.

MCP Configuration Debugging: A scenario describes a team where MCP tools work for one developer but not others. The diagnosis typically involves configuration scoping issues: the working developer has the configuration in their user-level ~/.claude.json, but the project-level .mcp.json is missing or misconfigured. The fix is to add the shared configuration to .mcp.json in the project root.

Error Handling Design: Given a tool that can fail in multiple ways (authentication expired, rate limited, resource not found, server error), design the structured error response. The exam rewards answers that include the error type, the isRetryable flag, and contextual information that helps Claude decide its next action.

Domain 3: Claude Code Configuration & Workflows (20%)

Domain 3 covers Claude Code, Anthropic's CLI tool for AI-assisted development. This domain carries 20% of the exam weight and focuses on configuration, workflow patterns, and team collaboration.

CLAUDE.md Hierarchy

CLAUDE.md files provide instructions to Claude Code, and they follow a hierarchical scoping system:

  1. User-level (~/.claude/CLAUDE.md): Personal instructions that apply to all projects for a specific user. These might include preferred coding style, personal workflow preferences, or default behaviors.
  2. Project-level (./CLAUDE.md in project root): Instructions that apply to the entire project. These typically include project architecture, coding standards, testing requirements, and deployment procedures. This file is committed to version control and shared with the team.
  3. Directory-level (./src/components/CLAUDE.md): Instructions that apply only when working within a specific directory. These are useful for component-specific patterns, directory-specific conventions, or subsystem-specific requirements.

The hierarchy works through specificity: more specific (directory-level) instructions take precedence over more general (project-level) instructions, which take precedence over user-level instructions. All applicable levels are loaded and combined, with more specific instructions overriding general ones where they conflict.

The Team Instructions Trap

This is one of the most commonly tested gotchas in Domain 3: user-level CLAUDE.md instructions are NOT shared with the team. If a developer puts team coding standards in their personal ~/.claude/CLAUDE.md, those standards only apply when that specific developer is using Claude Code. Other team members will not see or follow those instructions.

The correct approach is to put shared team instructions in the project-level ./CLAUDE.md file (committed to version control) or in .claude/rules/ files within the project.

The exam frequently presents scenarios where a team lead has established coding standards but other developers are not following them. The diagnosis is almost always that the standards were placed in the wrong CLAUDE.md level.

Skills Frontmatter

Claude Code skills are reusable instruction sets that can be invoked on demand. They are defined with frontmatter that controls their behavior:

  • context: fork: When this skill is invoked, it runs in a forked context. This means it gets a copy of the current conversation but does not affect the main conversation when it completes. This is useful for exploratory or analytical skills.
  • allowed-tools: Specifies which tools the skill is allowed to use. This provides security boundaries, ensuring that a skill cannot use tools outside its intended scope.

Skills are a powerful mechanism for creating reusable, composable workflows. The exam tests your ability to configure skills appropriately for different use cases.
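A hypothetical skill file using the two frontmatter fields above might look like this (the skill body and the tool names listed under allowed-tools are illustrative):

```markdown
---
context: fork
allowed-tools: Read, Grep
---

Analyze the dependency graph of the current package and summarize
any circular imports. Do not modify any files.
```

Because of `context: fork`, the analysis runs on a copy of the conversation and leaves the main thread untouched; because of the restricted `allowed-tools` list, the skill can read and search but cannot write.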

Rules Files with Glob Patterns

The .claude/rules/ directory within a project can contain rule files that apply conditionally based on file glob patterns. For example:

  • .claude/rules/react-components.md with a glob of *.tsx would apply its rules whenever Claude Code is working with TypeScript React files.
  • .claude/rules/api-routes.md with a glob of */routes/** would apply when working within the routes directory.

This is a more granular alternative to directory-level CLAUDE.md files. Rules files are particularly useful when different file types within the same directory need different treatment.

Plan Mode vs Direct Execution

Claude Code supports two primary modes of operation:

  • Plan Mode: Claude analyzes the task, creates a detailed plan, and presents it for review before making any changes. This is appropriate for complex refactoring, architectural changes, or any situation where the consequences of incorrect changes would be significant.
  • Direct Execution: Claude analyzes and implements changes in a single pass. This is appropriate for well-understood, smaller changes where the risk of incorrect implementation is low.

The exam tests your judgment about when to use each mode. The general principle is that plan mode should be used when the scope is large, the consequences of errors are high, or the requirements are ambiguous. Direct execution is appropriate for small, well-defined tasks.

CI/CD Integration

Claude Code can be integrated into CI/CD pipelines using the -p flag (pipe mode), which enables non-interactive operation. In this mode, Claude Code reads from stdin, processes the input, and writes to stdout without requiring human interaction.

Common CI/CD use cases tested on the exam:

  • Automated code review: Claude Code reviews pull requests and provides feedback.
  • Documentation generation: Claude Code generates or updates documentation based on code changes.
  • Test generation: Claude Code writes tests for new or modified code.

A critical principle tested in the exam is independent review over self-review. When using Claude Code for code review, the reviewing instance should be independent of the instance that wrote the code. Having the same Claude instance review its own output is subject to self-review bias and is significantly less effective at catching errors.

Common Domain 3 Exam Patterns

Configuration Hierarchy Debugging: The most common Domain 3 question type presents a scenario where Claude Code is not following expected instructions. You need to diagnose where the instructions are configured and whether they are at the correct level. Remember: user-level is personal, project-level is shared, and directory-level is scoped to specific paths. The most frequent mistake in practice (and the most tested on the exam) is placing shared team standards at the user level.

Skills Configuration: The exam tests your ability to configure skills for specific use cases. Key decisions include whether to use context: fork (for exploratory or analytical skills that should not modify the main conversation) and which tools to allow. A security-sensitive skill (like one that generates deployment configurations) should have a restricted allowed-tools list to prevent unintended side effects.

CI/CD Pipeline Design: When Claude Code is used in automated pipelines, the -p flag enables non-interactive mode, but the exam also tests broader pipeline design principles. For code review pipelines, use a separate Claude Code instance from the one that generated the code. For documentation generation, configure appropriate CLAUDE.md instructions that specify documentation standards. For test generation, scope the Claude Code instance to only have access to testing-related tools and files.

Plan Mode Decision Making: The exam presents scenarios and asks whether Plan Mode or Direct Execution is more appropriate. The heuristic is straightforward: use Plan Mode when the change is large, complex, or high-risk; use Direct Execution when the change is small, well-understood, and low-risk. Edge cases where Plan Mode is preferred include any refactoring that touches more than 3-5 files, any change to core business logic, and any modification to configuration that affects production environments.

Domain 4: Prompt Engineering & Structured Output (20%)

Domain 4 covers the art and science of crafting effective prompts and ensuring Claude produces reliable, structured output. It carries 20% of the exam weight and bridges the gap between theoretical prompt engineering and production-grade output pipelines.

Explicit Criteria Over Vague Guidance

The CCA-F exam strongly favors prompts with explicit, measurable criteria over vague instructions. Consider the difference:

Vague (anti-pattern):

"Write a good summary of this document."

Explicit (correct):

"Write a summary of this document in 3-5 sentences. Include: the main argument, the key evidence cited, and the conclusion. Use formal academic tone. Do not include direct quotes."

Every attribute is measurable and verifiable. "Good" is subjective and unmeasurable. This distinction matters because production systems need consistent, predictable output. When criteria are explicit, you can validate the output programmatically and retry if it does not meet specifications.

The exam presents scenarios where Claude's output is inconsistent or unpredictable, and the correct diagnosis is often that the prompt uses vague rather than explicit criteria.

Few-Shot Examples

Including 2-4 examples in your prompt (few-shot prompting) significantly improves output consistency and accuracy. The CCA-F tests several nuances:

  • 2-4 examples is the optimal range. Fewer than 2 may not establish the pattern clearly. More than 4 typically provides diminishing returns and consumes valuable context space.
  • Examples should cover edge cases, not just the happy path. If your task involves ambiguous inputs, include an example showing how to handle ambiguity.
  • Examples should demonstrate the exact format you expect in the output, including field names, data types, and structure.
  • Diversity matters: Your examples should cover different categories or types of inputs to show the range of expected behavior.

tool_use JSON Schemas

One of the most powerful techniques for ensuring structured output from Claude is using tool_use with JSON schemas. When you define a tool with a specific JSON schema, Claude's responses are constrained to match that schema structurally.

Critical nuance tested on the exam: tool_use schemas enforce syntax, not semantics. The schema ensures that the output has the right fields, data types, and structure. It does NOT ensure that the values are correct, meaningful, or consistent. For example, a schema can ensure that an "age" field is an integer, but it cannot ensure that the integer is a plausible human age.

This means you need both:

  1. Schema enforcement for structural correctness (guaranteed by tool_use)
  2. Validation logic for semantic correctness (implemented in your code)
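The two layers can be sketched in plain Python. The schema below is the kind of JSON schema you would supply as a tool's input schema; the semantic check is ordinary application code. The field names and range limits are illustrative assumptions, not part of any official API.

```python
# Layer 1: a JSON schema for tool_use. Supplied as a tool's input schema, it
# guarantees structure: which fields exist, their types, and which are required.
extract_person_schema = {
    "type": "object",
    "properties": {
        "name": {"type": "string"},
        "age": {"type": "integer"},
    },
    "required": ["name", "age"],
}

# Layer 2: semantic validation the schema cannot express. An "age" of 900
# satisfies {"type": "integer"} but is not a plausible human age.
def semantic_errors(record: dict) -> list[str]:
    errors = []
    if not record["name"].strip():
        errors.append("name must be non-empty")
    if not 0 <= record["age"] <= 120:
        errors.append(f"age {record['age']} is not a plausible human age")
    return errors
```

A record like `{"name": "Ada", "age": 900}` passes the schema but is caught by the semantic layer, which is precisely why both are needed.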

Nullable Fields and "Unclear" Enum Values

Production data is messy. Sometimes the information Claude needs to extract from a document is ambiguous or missing. The CCA-F tests two strategies for handling this:

  • Nullable fields: Define schema fields as nullable when the information might not be present. This allows Claude to explicitly signal "this information is not available" rather than guessing or hallucinating a value.
  • "Unclear" enum values: When a field uses an enum (a fixed set of allowed values), include an "unclear" or "ambiguous" option. This gives Claude a valid way to signal uncertainty rather than forcing it to choose an incorrect category.

Both strategies reflect the same principle: it is better to know that you do not know than to have a confident but wrong answer. Production systems can then route "unclear" or null values to human review rather than proceeding with incorrect data.
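Both strategies show up directly in schema design. The sketch below is a hypothetical extraction schema (the field names and categories are invented for illustration) that applies a nullable field, an "unclear" enum value, and the routing rule that sends uncertain extractions to human review.

```python
# Hypothetical invoice-extraction schema applying both strategies.
invoice_schema = {
    "type": "object",
    "properties": {
        # Nullable field: the due date may simply be absent from the document,
        # so Claude can return null instead of hallucinating a date.
        "due_date": {"type": ["string", "null"]},
        # "unclear" enum value: a valid way to signal ambiguity instead of
        # being forced into an incorrect category.
        "category": {
            "type": "string",
            "enum": ["utilities", "rent", "supplies", "unclear"],
        },
    },
    "required": ["due_date", "category"],
}

def needs_human_review(extraction: dict) -> bool:
    """Route null or 'unclear' values to human review instead of proceeding."""
    return extraction["due_date"] is None or extraction["category"] == "unclear"
```

The routing function is the production payoff: "I don't know" becomes an explicit, machine-readable state that your pipeline can act on.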

Validation Retry Loops

Even with schemas and explicit prompts, Claude's output sometimes fails validation. The production-grade approach is a validation retry loop:

  1. Claude generates output (constrained by tool_use schema)
  2. Your code validates the output against semantic rules
  3. If validation fails, send the output AND the validation errors back to Claude
  4. Claude generates a corrected version
  5. Repeat until validation passes or a maximum retry count is reached

The key insight tested on the exam is that the retry prompt should include both the invalid output and the specific validation errors. This gives Claude the context it needs to correct its mistakes rather than making the same errors again.

This is another example of the "programmatic over prompt-based" principle. Rather than trying to write a perfect prompt that never produces invalid output (impossible), you build a programmatic loop that catches and corrects errors.
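The five steps above can be sketched as a small control loop. In this runnable sketch, `stub_generate` and `stub_validate` are placeholders standing in for the schema-constrained API call and your semantic checks; in production, `generate` would call the API with the feedback folded into the prompt.

```python
def retry_until_valid(generate, validate, max_retries: int = 3):
    feedback = None
    for _ in range(max_retries + 1):
        output = generate(feedback)      # first attempt has no feedback
        errors = validate(output)        # semantic checks in your code
        if not errors:
            return output                # validation passed
        # Key point: send back BOTH the invalid output and the specific
        # errors, so the model can correct rather than repeat its mistakes.
        feedback = {"previous_output": output, "validation_errors": errors}
    raise RuntimeError(f"output still invalid after {max_retries} retries")

# Runnable demo with stubs standing in for the API call and your checks:
attempts = []

def stub_generate(feedback):
    attempts.append(feedback)
    return {"age": 900} if feedback is None else {"age": 36}

def stub_validate(output):
    return [] if 0 <= output["age"] <= 120 else ["implausible age"]

result = retry_until_valid(stub_generate, stub_validate)
```

Note that the loop terminates on a maximum retry count, never on "try until it works": a bounded loop is itself a reliability requirement.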

The Batch API

The Batch API is an important topic on the CCA-F exam. It allows you to submit large numbers of API requests as a batch, with significant cost savings but important trade-offs:

  • Cost: 50% reduction compared to real-time API calls.
  • Latency: Results are returned within 24 hours (no specific SLA within that window).
  • Limitations: No multi-turn conversations. Each request in the batch is independent.
  • Use cases: Large-scale document processing, data extraction, content generation where real-time response is not required.

The critical anti-pattern tested on the exam is using the Batch API for blocking workflows. Because the Batch API has no SLA within its 24-hour window (your results might come back in 5 minutes or 23 hours), it is inappropriate for any workflow where downstream processes are waiting for the results. The Batch API is for background processing, not for real-time pipelines.

Multi-Instance Review

For critical outputs where accuracy is paramount, the CCA-F tests the multi-instance review pattern:

  1. Generate the output with one Claude instance
  2. Review the output with a separate, independent Claude instance
  3. If the reviewer identifies issues, either flag for human review or send back for correction

The key principle is that the reviewing instance must be independent of the generating instance. This is the "independent review over self-review" principle from Domain 3, applied to output validation rather than code review. An independent instance catches errors that the generating instance would overlook because it has the same blind spots.

Common Domain 4 Exam Patterns

Prompt Diagnosis: The exam presents a prompt that produces inconsistent results and asks you to identify the problem. Common diagnoses include vague criteria (replace with explicit, measurable specifications), missing examples (add 2-4 diverse few-shot examples), or format ambiguity (add a tool_use schema to enforce structure).

Schema Design: Given a data extraction task, design the tool_use schema. The exam tests whether you include nullable fields for optional or uncertain data, "unclear" enum values for ambiguous categories, and appropriate data types. A common trap is designing a schema that only handles the happy path and fails when real-world data is messy or incomplete.

Batch API Appropriateness: The exam describes a processing pipeline and asks whether the Batch API is appropriate. The key question is always: is anything waiting for the results in real-time? If yes, the Batch API is inappropriate regardless of cost savings. If no, and the volume justifies the integration effort, the Batch API's 50% cost savings make it an excellent choice. Remember also that the Batch API does not support multi-turn conversations, so any task requiring back-and-forth interaction is automatically disqualified.

Validation Pipeline Design: The exam may ask you to design a complete validation pipeline for structured output. The correct answer typically involves three layers: tool_use schema for syntactic enforcement, programmatic validation for semantic correctness (business rules, data consistency, range checks), and a retry loop that sends validation errors back to Claude for correction. Missing any of these three layers is considered incomplete.

Domain 5: Context Management & Reliability (15%)

Domain 5 covers the operational concerns that separate production systems from prototypes. It carries 15% of the exam weight but often determines whether candidates pass or fail, because these topics require practical experience to answer correctly.

The Lost-in-the-Middle Phenomenon

Large language models, including Claude, exhibit a well-documented phenomenon called "lost in the middle." When presented with a long context, the model pays more attention to information at the beginning and end of the context and less attention to information in the middle.

The CCA-F tests three mitigation strategies:

  1. Persistent case facts at the top: If there is critical information that must be referenced throughout a conversation (like a customer's account details or a legal clause under analysis), place it at the beginning of the context where attention is highest.
  2. Trim verbose outputs: When tool results or retrieved documents are very long, trim them to include only the relevant portions. Less total context means less "middle" for information to get lost in.
  3. Summaries at the top: For long conversations, periodically create summaries of the key information and place them at the beginning of the context. This ensures that the most important facts are in the high-attention zone.

The exam frequently presents scenarios where an agent "forgets" or ignores information that was provided earlier. The correct diagnosis is usually the lost-in-the-middle effect, and the correct solution involves repositioning information or managing context length.
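All three mitigations come together in how you assemble the context before each call. The sketch below is one possible layout, with illustrative limits: persistent facts and the running summary occupy the high-attention top, verbose turns are trimmed, and only the most recent turns appear in raw form.

```python
# Sketch: assemble context so critical facts sit in high-attention zones.
# The turn and character limits are illustrative, not recommended values.
def assemble_context(case_facts: str, summary: str, history: list[str],
                     max_turns: int = 10, max_turn_chars: int = 2000) -> str:
    recent = history[-max_turns:]                   # summary covers older turns
    trimmed = [t[:max_turn_chars] for t in recent]  # trim verbose tool results
    return "\n\n".join([
        f"PERSISTENT CASE FACTS:\n{case_facts}",    # top: highest attention
        f"CONVERSATION SUMMARY SO FAR:\n{summary}",
        "RECENT TURNS:\n" + "\n".join(trimmed),     # end: also high attention
    ])
```

Everything between the pinned facts and the recent turns has been either summarized or dropped, which shrinks the low-attention "middle" where information gets lost.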

Escalation Triggers

When should an agentic system escalate to a human? This is a critical production concern that the CCA-F tests extensively. The correct escalation triggers are:

  1. The user explicitly requests a human. If the user says "I want to talk to a person" or "transfer me to a human agent," the system must comply immediately. This is non-negotiable.
  2. The policy is ambiguous. If the agent encounters a situation where the applicable policy is unclear (for example, a refund request that falls between two policy categories), it should escalate rather than guess.
  3. The agent has made no progress. If the agent has attempted the same approach multiple times without making progress (typically defined as N attempts with no state change), it should escalate.

Critically, the exam tests what should NOT be an escalation trigger:

  • Sentiment analysis is NOT a reliable escalation trigger. The anti-pattern is using sentiment detection (e.g., "the customer seems angry") to trigger escalation. LLMs are poorly calibrated for sentiment analysis, and using sentiment as an escalation trigger leads to both false positives (escalating satisfied customers who use informal language) and false negatives (failing to escalate truly frustrated customers who communicate politely).
  • Self-reported confidence is NOT a reliable escalation trigger. Asking Claude "how confident are you?" and escalating based on the answer is unreliable because LLM self-assessed confidence is poorly calibrated.
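The contrast between valid and invalid triggers is easiest to see in code. In this sketch, the trigger phrases and attempt threshold are illustrative; what matters is that every input is observable system state, and that no sentiment score or self-reported confidence appears anywhere.

```python
# Sketch of programmatic escalation triggers. All three conditions are
# observable state, not model self-assessment.
HUMAN_REQUEST_PHRASES = ("talk to a person", "human agent", "speak to a human")
MAX_ATTEMPTS_WITHOUT_PROGRESS = 3  # illustrative threshold

def should_escalate(user_message: str, policy_ambiguous: bool,
                    attempts_without_progress: int) -> bool:
    # 1. Explicit human request: non-negotiable, comply immediately.
    if any(p in user_message.lower() for p in HUMAN_REQUEST_PHRASES):
        return True
    # 2. Ambiguous policy: escalate rather than guess.
    if policy_ambiguous:
        return True
    # 3. No progress: N attempts with no state change.
    if attempts_without_progress >= MAX_ATTEMPTS_WITHOUT_PROGRESS:
        return True
    # Deliberately absent: no sentiment score, no self-reported confidence.
    return False
```
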

Structured Error Propagation

When errors occur in production agentic systems, how they propagate through the system matters enormously. The CCA-F tests structured error propagation, where errors include:

  • Error type and code: A machine-readable classification of what went wrong.
  • Context: What the system was trying to do when the error occurred.
  • Retryable flag: Whether the error is transient or permanent.
  • Partial results: If the system completed some work before the error, those results should be preserved.
  • Suggested action: What the system or user should do next.

The anti-pattern is unstructured error handling where errors are logged but not propagated to the agent, or where they are propagated as plain text strings without machine-readable metadata.
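A minimal sketch of a structured error carrying the five fields above might look like this (the class name, error codes, and serialization shape are illustrative assumptions, not a standard format):

```python
from dataclasses import dataclass, field
from typing import Any

@dataclass
class ToolError:
    error_type: str                  # machine-readable classification
    code: str                        # e.g. "DB_TIMEOUT" (illustrative code)
    context: str                     # what the system was attempting
    retryable: bool                  # transient vs permanent
    partial_results: list[Any] = field(default_factory=list)
    suggested_action: str = ""

    def to_tool_result(self) -> dict:
        """Serialize for the agent: structured metadata, not a bare string."""
        return {"status": "error", **self.__dict__}
```

Because the payload is machine-readable, the agent (or your orchestration code) can branch on `retryable`, surface `suggested_action`, and preserve `partial_results` instead of discarding completed work.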

Context Degradation Strategies

As conversations grow longer, context window limitations become a practical concern. The CCA-F tests several strategies for managing context degradation:

  • Scratchpad pattern: The agent maintains a structured scratchpad that summarizes key information from the conversation. When the context grows too long, the scratchpad preserves the essential facts while the raw conversation history is trimmed.
  • Subagent delegation: For complex tasks that would consume too much context in a single agent, delegate subtasks to subagents. Each subagent gets a fresh context with only the information it needs, and returns a compact result to the coordinator.
  • /compact command: In Claude Code, the /compact command summarizes the current conversation and starts a fresh context with the summary. This is the manual trigger for context management.

The exam tests your judgment about when each strategy is appropriate. Scratchpads work well for single-agent systems with gradually growing context. Subagent delegation is better for systems that need to process large volumes of diverse information. The /compact command is appropriate for interactive sessions where a human can trigger context management.

Crash Recovery with Manifest Files

Production agentic systems must handle crashes gracefully. The CCA-F tests the manifest file pattern for crash recovery:

  1. Before starting a batch of work, write a manifest file that describes all the tasks to be completed.
  2. As each task completes, update the manifest to record the completion status.
  3. If the system crashes and restarts, read the manifest to determine which tasks were completed and which need to be retried.

This pattern ensures that work is not lost and not duplicated when crashes occur. The exam tests your understanding of what information should be in the manifest (task list, completion status, partial results) and how the recovery process should work.
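The three steps above reduce to a few small functions. The file layout here is an illustrative assumption (a flat JSON map of task ID to status); real manifests often also carry partial results per task.

```python
import json
import tempfile
from pathlib import Path

def write_manifest(path: Path, task_ids: list[str]) -> None:
    """Step 1: before starting, record every task as pending."""
    path.write_text(json.dumps({t: "pending" for t in task_ids}))

def mark_done(path: Path, task_id: str) -> None:
    """Step 2: persist completion status as each task finishes."""
    manifest = json.loads(path.read_text())
    manifest[task_id] = "done"
    path.write_text(json.dumps(manifest))

def pending_tasks(path: Path) -> list[str]:
    """Step 3: on restart after a crash, retry only what was not completed."""
    manifest = json.loads(path.read_text())
    return [t for t, status in manifest.items() if status != "done"]

# Demo: crash after completing t2; a restarted process retries t1 and t3 only.
with tempfile.TemporaryDirectory() as d:
    m = Path(d) / "manifest.json"
    write_manifest(m, ["t1", "t2", "t3"])
    mark_done(m, "t2")
    remaining = pending_tasks(m)
```

The essential property is that the manifest is written before work begins and updated after each completion, so no crash window can lose or duplicate a task.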

Human Review with Stratified Sampling

Not every output from an AI system needs human review, but some outputs are higher risk than others. The CCA-F tests stratified sampling for human review:

  • Stratify by risk: Classify outputs into risk categories (e.g., low-risk routine responses, medium-risk edge cases, high-risk financial or medical outputs).
  • Sample proportionally: Review a higher percentage of high-risk outputs and a lower percentage of low-risk outputs.
  • Use the results to improve: Track which categories have the highest error rates and use that data to improve the system (better prompts, additional validation, new tools) and adjust sampling rates.

The anti-pattern is either reviewing everything (unsustainable at scale) or reviewing randomly without risk stratification (misses the highest-risk outputs). Stratified sampling gives you the best quality assurance coverage for a given human review budget.
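Stratified sampling can be sketched in a few lines. The review rates below are illustrative assumptions, and the sketch assumes an upstream classifier has already tagged each output with a risk tier:

```python
import random

# Illustrative rates: review most high-risk outputs, spot-check low-risk ones.
REVIEW_RATES = {"high": 0.50, "medium": 0.10, "low": 0.01}

def select_for_review(outputs: list[dict], rng: random.Random) -> list[dict]:
    """Each output dict carries a 'risk' key set by an upstream classifier."""
    return [o for o in outputs if rng.random() < REVIEW_RATES[o["risk"]]]
```

Tracking error rates per tier then closes the feedback loop: tiers with high error rates get higher sampling rates (and prompt or validation fixes), tiers that prove reliable get lower ones.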

Common Domain 5 Exam Patterns

Context Degradation Diagnosis: The exam describes an agent that becomes less accurate or starts "forgetting" information over long conversations. The diagnosis involves identifying the lost-in-the-middle effect and recommending appropriate mitigation: repositioning critical information to the top of context, trimming verbose tool outputs, using the scratchpad pattern for persistent facts, or delegating to subagents to keep per-agent context manageable.

Escalation Policy Design: Given a business scenario (e.g., customer service, financial advisory, medical triage), design the escalation policy. The correct answer always includes the three valid triggers (explicit human request, ambiguous policy, no progress) and excludes invalid triggers (sentiment analysis, self-reported confidence). The exam may present tempting answer choices that include sentiment-based escalation with a disclaimer like "as an additional safety measure." These are still incorrect because they introduce unreliable signals into the escalation logic.

Recovery Scenario: The exam describes a long-running agent that crashes mid-operation. The correct recovery approach depends on the scenario: manifest files for batch processing operations, session resumption (--resume) for interactive sessions, and checkpoint-based recovery for multi-stage pipelines. The exam tests whether you can match the recovery pattern to the operational context.

Human Review Workflow Design: Given a production system processing a specified volume of outputs per day, design the human review workflow. The correct answer involves stratified sampling based on risk classification, with higher review rates for high-risk categories. The exam also tests whether you use the review results to improve the system (feedback loop) rather than treating human review as a static quality gate.

For a deeper dive into each domain, see our CCA-F Exam Domains Complete Breakdown, which covers each domain with additional practice scenarios and worked examples.

Seven Anti-Patterns You Must Know

Programmatic enforcement versus prompt-based guidance

The CCA-F exam places significant weight on anti-pattern recognition. These seven anti-patterns appear repeatedly across multiple domains and question types. Understanding why each is wrong is as important as knowing the correct alternative.

Anti-Pattern 1: Prompt-Based Task Ordering

Wrong approach: Telling Claude "First complete step A, then step B, then step C" in the system prompt and hoping it follows the order.

Why it fails: Claude may skip steps, reorder them, or attempt later steps before earlier ones are complete. Natural language instructions are suggestions, not guarantees. In complex multi-step workflows, the probability of misordering increases with each additional step.

Correct approach: Use programmatic prerequisites in your orchestration code. Step B's function only executes after step A returns a success result. The ordering is enforced by your code, not by Claude's interpretation of a prompt.

Exam tip: When you see an answer choice that says "add instructions to the prompt specifying the order," it is almost always wrong. Look for the answer that implements ordering in code.
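The difference is visible in a few lines of orchestration code. In this sketch (the step functions are stubs standing in for Claude-backed steps), step B structurally cannot run before step A succeeds, no matter what any prompt says:

```python
def run_pipeline(step_a, step_b):
    result_a = step_a()
    if result_a.get("status") != "success":
        # B never runs; the ordering guarantee lives in code, not the prompt.
        return {"status": "aborted", "failed_step": "A", "detail": result_a}
    return step_b(result_a)          # B receives A's verified output

# Demo with stubs standing in for Claude-backed steps:
calls = []

def fetch_ok():
    calls.append("A")
    return {"status": "success", "data": 1}

def fetch_fail():
    return {"status": "error", "detail": "upstream timeout"}

def transform(result):
    calls.append("B")
    return {"status": "success", "data": result["data"] + 1}

ok = run_pipeline(fetch_ok, transform)
aborted = run_pipeline(fetch_fail, transform)
```
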

Anti-Pattern 2: Self-Reported Confidence for Escalation

Wrong approach: Asking Claude "On a scale of 1-10, how confident are you in this answer?" and escalating to a human when the confidence is below a threshold.

Why it fails: LLM self-assessed confidence is poorly calibrated. Claude may report high confidence for incorrect answers and low confidence for correct ones. The confidence score is essentially another generation from the model, not an independent measurement of accuracy.

Correct approach: Use programmatic escalation triggers: the user explicitly requests a human, the applicable policy is ambiguous, or the agent has made no progress after N attempts. These are observable, measurable conditions rather than self-reported assessments.

Exam tip: Any answer choice involving self-assessed confidence, self-reported certainty, or sentiment-based escalation is a red flag. Look for answers with concrete, programmatic triggers.

Anti-Pattern 3: Batch API for Blocking Workflows

Wrong approach: Using the Batch API for operations where downstream processes are waiting for the results, such as in a request-response pipeline where the user is waiting for output.

Why it fails: The Batch API provides results within 24 hours but offers no SLA within that window. Your results might come back in 3 minutes or 20 hours. Any workflow that blocks on Batch API results is unpredictable and potentially very slow.

Correct approach: Use the real-time API for blocking workflows. Reserve the Batch API for background processing where the 50% cost savings justify the unpredictable latency: document processing overnight batches, data enrichment pipelines, content generation queues.

Exam tip: When a scenario mentions cost savings as the primary motivation for using the Batch API, check whether the workflow is blocking. If it is, the cost savings do not justify the latency risk.

Anti-Pattern 4: Larger Context Equals Better Attention

Wrong approach: Stuffing as much information as possible into the context window, believing that more context gives Claude more to work with and therefore better results.

Why it fails: The lost-in-the-middle phenomenon means that information in the middle of large contexts receives less attention. Additionally, larger contexts slow down inference, increase costs, and increase the chance of the model being distracted by irrelevant information.

Correct approach: Use per-file or per-section passes. Instead of putting all 50 documents in a single context, process each document (or small batch) in a separate API call with a focused prompt. Then aggregate the results. For analysis tasks that require cross-document comparison, use a two-pass approach: extract key facts from each document individually, then compare the extracted facts in a synthesis pass.

Exam tip: When a scenario involves large volumes of information and accuracy problems, look for answers that reduce context size per call rather than answers that increase the context window.

Anti-Pattern 5: Silent Empty Results on Failure

Wrong approach: When a tool call fails (network error, authentication failure, timeout), returning an empty result set as if the search simply found nothing.

Why it fails: Claude interprets empty results as "the search completed successfully but found no matches." This leads Claude to confidently tell the user "no results were found" when in reality the search failed to execute. The user receives a confident but completely wrong answer.

Correct approach: Return structured errors that distinguish between access failures and empty results. An access failure should include the error type, whether it is retryable, and contextual information. An empty result should explicitly confirm that the search completed successfully but returned no matches. These are fundamentally different outcomes, and Claude needs to know which one occurred.

Exam tip: When you see a scenario where Claude gives confidently wrong "no results" answers, the root cause is almost always that tool failures are being masked as empty results.
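A tool wrapper that avoids this anti-pattern might look like the following sketch, where `search_fn` is a stand-in for the real backend call and the error fields are illustrative:

```python
def safe_search(search_fn, query: str) -> dict:
    try:
        matches = search_fn(query)
    except TimeoutError as exc:
        # Access failure: the search did NOT run to completion.
        return {"status": "error", "error_type": "timeout",
                "retryable": True, "detail": str(exc)}
    if not matches:
        # Explicitly confirm the search ran and genuinely found nothing.
        return {"status": "ok", "matches": [],
                "note": "search completed successfully; no matches found"}
    return {"status": "ok", "matches": matches}
```

With this wrapper, "the backend timed out" and "there are no matching records" produce visibly different payloads, so Claude can tell the user the truth in both cases.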

Anti-Pattern 6: All Tools Available to All Agents

Wrong approach: Making every tool available to every agent in the system, giving each agent maximum flexibility.

Why it fails: As discussed in Domain 2, tool routing accuracy degrades significantly above 18 tools, and the optimal range is 4-5 tools per agent. Giving all tools to all agents means every agent is operating above the optimal range, leading to routing errors across the entire system.

Correct approach: Scope tools to the agents that need them. Use the hub-and-spoke architecture where the coordinator routes to specialized subagents, each with a focused set of 4-5 tools. A database subagent gets database tools. An email subagent gets email tools. Neither needs the other's tools.

Exam tip: When a scenario describes a flat architecture with many tools and routing problems, look for answers that introduce specialization and tool scoping, not answers that improve tool descriptions (though better descriptions might help marginally).

Anti-Pattern 7: Prompt-Only JSON Enforcement

Wrong approach: Relying solely on prompt instructions like "Always respond in valid JSON format" or "Return your answer as a JSON object with the following fields" to ensure structured output.

Why it fails: Prompts are probabilistic, not deterministic. Claude will usually follow JSON formatting instructions, but occasionally produces malformed JSON, missing fields, extra fields, or wrong data types. In production at scale, "usually" is not good enough. Even a 1% failure rate means thousands of errors per day in a high-volume system.

Correct approach: Use tool_use JSON schemas for syntactic enforcement (guarantees correct structure), then add programmatic validation for semantic correctness, then implement a retry loop that sends validation errors back to Claude for correction. This three-layer approach (schema + validation + retry) provides production-grade reliability.

Exam tip: When a scenario describes intermittent JSON formatting problems, look for answers that add schema enforcement and validation, not answers that "improve the prompt instructions." Prompt improvement is the weakest solution.

Study Timeline

Four-phase study journey from foundations to exam day

A structured study plan is essential for CCA-F preparation. Based on the experiences of successful candidates and the breadth of material covered, we recommend an 8-week preparation timeline for candidates with existing Claude experience. Candidates who are new to Claude may need 10-12 weeks.

Foundations

Weeks 1-2
  • Complete Anthropic Academy courses: Claude 101 and Building with the Claude API
  • Read the Claude API reference documentation end-to-end
  • Review the Agent SDK getting started guide
  • Build a simple single-agent application using the API
  • Study the agentic loop pattern and implement stop_reason checking
  • Review the MCP protocol overview documentation

Deep Dive

Weeks 3-4
  • Study Domain 1 in depth: build a hub-and-spoke multi-agent system
  • Study Domain 2: implement 3-4 custom tools with proper descriptions and error handling
  • Study Domain 3: configure CLAUDE.md hierarchy, skills, and rules files
  • Study Domain 4: build a structured output pipeline with tool_use schemas and validation
  • Study Domain 5: implement context management with scratchpad pattern
  • Complete Anthropic Academy courses: Prompt Engineering and Claude Code

Hands-On Practice

Weeks 5-6
  • Build a production-grade project combining all 5 domains
  • Implement crash recovery with manifest files
  • Create a parallel task processing system with error handling
  • Practice configuring MCP servers with environment variable expansion
  • Build a Batch API pipeline for background document processing
  • Implement escalation logic with proper programmatic triggers

Exam Preparation

Weeks 7-8
  • Take practice tests on Preporato and review incorrect answers thoroughly
  • Study all 7 anti-patterns until you can explain why each is wrong
  • Review the programmatic vs prompt-based distinction across all domains
  • Take a second round of practice tests and track improvement
  • Review any domains where practice test scores are below 80%
  • Complete a final full-length practice exam under timed conditions
  • Rest the day before the exam

Phase-by-Phase Breakdown

Phase 1: Foundations (Weeks 1-2) focuses on building core knowledge. If you already use the Claude API regularly, this phase might go quickly. The key is ensuring you have no gaps in the fundamentals: API mechanics, the agentic loop, basic tool use, and MCP concepts. Do not rush this phase even if you feel comfortable, because the exam tests nuances that casual users might miss.

Phase 2: Deep Dive (Weeks 3-4) is where you go domain by domain, building focused projects for each. The emphasis is on hands-on implementation, not just reading documentation. You should build working examples of each major pattern: hub-and-spoke, structured output with validation, CLAUDE.md configuration, and context management. This phase is the most intensive and requires dedicated time.

Phase 3: Hands-On Practice (Weeks 5-6) integrates everything into larger, more realistic projects. The goal is to encounter and solve the kinds of problems that appear in the exam's production scenarios. This is where you develop the practical intuition that the exam tests. Build something that breaks and fix it. Encounter context degradation and manage it. See tool routing failures and diagnose them.

Phase 4: Exam Preparation (Weeks 7-8) focuses on practice tests, review, and filling gaps. Practice tests are essential for calibrating your understanding and identifying areas where you need more study. Do not just take practice tests; review every incorrect answer thoroughly to understand why the correct answer is correct and why the others are wrong. Pay special attention to anti-pattern questions.

Master These Concepts with Practice

Our CCA-F practice bundle includes:

  • 6 full practice exams (390+ questions)
  • Detailed explanations for every answer
  • Domain-by-domain performance tracking

30-day money-back guarantee

Preparation Resources

The CCA-F draws on several categories of resources. Here is a comprehensive guide to what is available and how to use each resource effectively.

Anthropic Academy

Anthropic Academy (hosted on Skilljar) offers free courses covering the Claude platform. These are the closest thing to "official study material" for the CCA-F exam. Courses particularly relevant to CCA-F include:

  • Claude 101: Core concepts, API basics, and fundamental usage patterns.
  • Building with the Claude API: Deep dive into the API, including tool use, streaming, and advanced parameters.
  • Prompt Engineering: Comprehensive prompt engineering techniques with Claude-specific best practices.
  • Claude Code: Configuration, workflows, and development patterns.
  • Agent SDK: Building agentic applications with the official SDK.
  • MCP (Model Context Protocol): Understanding and implementing MCP integrations.

All courses are free and self-paced. We strongly recommend completing all courses relevant to the exam domains, even if you consider yourself experienced. The courses contain specific implementation details and best practices that appear directly on the exam.

Official Documentation

Anthropic's official documentation is the most authoritative reference for exam preparation:

  • Claude API Reference: Complete API documentation including all parameters, response formats, and error codes.
  • Agent SDK Documentation: Building agents, the agentic loop, multi-agent patterns, and session management.
  • Claude Code Documentation: CLI usage, configuration, CLAUDE.md, skills, rules, and CI/CD integration.
  • MCP Specification: Protocol details, server configuration, and tool definition formats.

Practice Tests

Practice tests are the single most effective preparation tool for the CCA-F exam. They help you calibrate your knowledge, identify gaps, practice time management, and get comfortable with the question format.

Preporato's CCA-F practice tests are designed to mirror the actual exam in format, difficulty, and domain coverage. Each practice test includes detailed explanations for every answer, helping you understand not just what the correct answer is but why it is correct and why the alternatives are wrong.

CCA-F Preparation Resources Compared

| Resource | Type | Cost | Best For |
| --- | --- | --- | --- |
| Anthropic Academy | Video courses + labs | Free | Building foundational knowledge across all domains |
| Official Documentation | Reference docs | Free | Deep-dive into specific APIs, parameters, and configurations |
| Preporato Practice Tests | Timed practice exams | Subscription | Exam simulation, gap identification, time management practice |
| Claude API (hands-on) | Building projects | API usage costs | Developing practical intuition through implementation |
| Community Forums | Discussion + Q&A | Free | Edge cases, real-world experiences, study group collaboration |

How to Use Resources Effectively

The most effective preparation strategy combines all resource types in a specific order:

  1. Start with Anthropic Academy courses to build foundational understanding.
  2. Read official documentation to fill in details and nuances.
  3. Build hands-on projects to develop practical skills and intuition.
  4. Take practice tests to identify gaps and calibrate your knowledge.
  5. Return to documentation and building to address the gaps identified by practice tests.
  6. Take more practice tests to verify that the gaps are closed.

This cycle of learn, build, test, and iterate is far more effective than any single study method alone.

Building Effective Study Habits

Based on feedback from candidates who have passed CCA-F, here are study habits that correlate with exam success:

Active Building Over Passive Reading: Candidates who spend more than 50% of their study time building working projects score significantly higher than those who primarily read documentation. The exam tests applied knowledge, and building is the best way to develop that knowledge. When you encounter a concept in the documentation, immediately build a small project that demonstrates it.

Spaced Repetition for Anti-Patterns: The seven anti-patterns are tested from many angles on the exam. Rather than studying them once and moving on, review them regularly throughout your preparation. Create flashcards or a review sheet and revisit them every few days. By exam day, you should be able to name all seven anti-patterns and their correct alternatives without hesitation.

Study Group Benefits: If possible, study with other candidates. Explaining concepts to others is one of the most effective ways to solidify your own understanding. Discussion also surfaces edge cases and nuances that you might miss studying alone. Online communities and forums dedicated to CCA-F preparation can serve this purpose if you do not have local study partners.

Practice Under Exam Conditions: At least twice during your preparation, take a full-length practice test under realistic exam conditions: timed, without external resources, and in a quiet environment. This builds your test-taking stamina and helps you calibrate your time management strategy.

For a condensed study plan if you have less time available, see our CCA-F 30-Day Study Plan, which compresses the essential preparation into an intensive 4-week schedule.

For complete details on what to expect on exam day, including the check-in process, proctoring software, and environment requirements, see our guide on CCA-F Exam Format and Structure.

Exam Day Strategy

A focused exam workspace ready for the proctored CCA-F exam

Knowing the material is necessary but not sufficient. The CCA-F exam also requires effective test-taking strategy, particularly time management and question-analysis technique.

Time Management

With 60 questions and 120 minutes, you have an average of 2 minutes per question. However, the time requirement varies significantly by question type:

  • Knowledge-check questions (30-60 seconds): These test specific facts, like "What is the optimal number of tools per agent?" Use them to bank time.
  • Scenario-analysis questions (2-3 minutes): These present a situation and ask for the best approach. Read carefully but do not overthink.
  • Production scenario questions (3-5 minutes): These are the complex, multi-part questions based on the 4 randomly selected scenarios. They require careful reading and analysis.

A recommended approach:

  1. First pass (about 75 minutes): Work through all 60 questions. Answer the ones you know confidently and flag the ones you are unsure about.
  2. Second pass (about 35 minutes): Return to flagged questions. With the pressure of the "first look" removed, you will often find that the answer is clearer on a second reading.
  3. Final review (about 10 minutes): Check for any unanswered questions or obvious mistakes.

Reading Scenarios Carefully

The production scenario questions are the highest-value and highest-difficulty questions on the exam. When you encounter one:

  1. Read the entire scenario before looking at the question. Understand the system architecture, the problem, and the constraints.
  2. Identify the domain being tested. Is this about agentic architecture, tool design, configuration, prompt engineering, or reliability? The domain tells you which principles to apply.
  3. Look for anti-patterns in the scenario description. Many scenario questions describe a system with a specific problem and ask you to diagnose it. The problem is almost always one of the seven anti-patterns.
  4. Note the constraints. Production scenarios include constraints like latency requirements, cost budgets, or compliance requirements that eliminate certain answer choices.

The Elimination Technique

When you are not sure of the answer, eliminate wrong choices rather than trying to identify the right one:

  1. Eliminate prompt-based solutions when programmatic alternatives are available. This single heuristic eliminates 1-2 choices on many questions.
  2. Eliminate anti-patterns. If an answer choice implements one of the seven anti-patterns, eliminate it.
  3. Eliminate over-engineered solutions. If an answer is technically correct but far more complex than necessary, it is usually not the best answer. The exam favors practical, production-appropriate solutions.
  4. Eliminate solutions that violate context isolation. If an answer assumes that subagents inherit context, it is wrong.

The Programmatic Tiebreaker

When you have narrowed it down to two choices and cannot decide, apply this heuristic: choose the more programmatic solution. Throughout the CCA-F exam, programmatic enforcement is preferred over prompt-based guidance. If one answer implements logic in code and the other adds instructions to a prompt, the code-based answer is almost always correct.

This heuristic reflects the exam's core philosophy: production systems need deterministic behavior, and prompts are probabilistic. Code gives you determinism; prompts give you probability.
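
As a concrete illustration of this philosophy, here is a minimal sketch of programmatic enforcement: a business rule implemented as a guard in the tool handler rather than as a prompt instruction. The names `MAX_REFUND_USD` and `issue_refund` are hypothetical, invented for this example; the point is only that a limit enforced in code holds on every call, while a limit stated in a prompt is followed only probabilistically.

```python
# Sketch: programmatic enforcement vs. prompt-based guidance.
# The refund limit lives in code, not in the prompt, so no amount of
# model drift can bypass it. (`MAX_REFUND_USD` and `issue_refund` are
# hypothetical names for illustration.)

MAX_REFUND_USD = 500.00

def issue_refund(amount: float) -> str:
    """Tool handler: the guard runs deterministically on every call."""
    if amount <= 0 or amount > MAX_REFUND_USD:
        # Return a structured error the calling agent can act on,
        # instead of hoping a prompt instruction was followed.
        return f"error: refund must be between $0 and ${MAX_REFUND_USD:.2f}"
    return f"refund of ${amount:.2f} issued"

print(issue_refund(120.0))   # within the limit: refund proceeds
print(issue_refund(9000.0))  # rejected by code, regardless of the prompt
```

The prompt-based alternative ("never refund more than $500") might work most of the time; the code-based guard works every time, which is exactly the distinction the exam rewards.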

Handling Uncertainty

The CCA-F exam is designed so that some questions have answer choices that are all partially correct. In these cases, you are looking for the BEST answer, not the only correct answer. Here are strategies for these difficult questions:

  • Production appropriateness: An answer might be technically correct but inappropriate for production. For example, parsing natural language to detect tool calls is technically possible but inappropriate for production because stop_reason is more reliable. Always prefer the production-appropriate answer.
  • Completeness: An answer that addresses all aspects of the problem is better than one that addresses only part of it. For example, an answer that includes both schema enforcement AND validation is better than one that includes only schema enforcement, even though schema-only is technically a partial solution.
  • Simplicity: When two answers are equally correct and equally complete, prefer the simpler one. Over-engineering is a real risk in AI systems, and the exam rewards practical solutions over complex ones.
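
The stop_reason point above can be sketched in a few lines. The `response` dict here is a hand-built stand-in for an Anthropic Messages API response (the real SDK exposes the same `stop_reason` and content-block fields on its response object); `pending_tool_calls` is a hypothetical helper name:

```python
# Sketch: detect tool calls via stop_reason instead of parsing prose.
# `response` is a hand-built stand-in for a Messages API response.

response = {
    "stop_reason": "tool_use",
    "content": [
        {"type": "text", "text": "Let me look that up."},
        {"type": "tool_use", "name": "get_weather", "input": {"city": "Oslo"}},
    ],
}

def pending_tool_calls(resp: dict) -> list:
    """Return tool_use blocks only when the model signalled tool use."""
    if resp["stop_reason"] != "tool_use":
        return []  # nothing to run; no fragile text parsing involved
    return [block for block in resp["content"] if block["type"] == "tool_use"]

calls = pending_tool_calls(response)
print([c["name"] for c in calls])
```

Checking the structured `stop_reason` field is the production-appropriate choice; scanning the model's natural-language text for phrases like "I will now call..." is exactly the kind of technically-possible-but-fragile answer the exam expects you to eliminate.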

Managing Exam Anxiety

The CCA-F is a challenging exam, and some anxiety is normal. Here are practical strategies:

  • Do not spend more than 4 minutes on any single question during your first pass. Flag it and move on. You can always come back.
  • Trust your preparation. If you have followed the 8-week study plan and are scoring above 80% on practice tests, you are ready.
  • Read the full question before looking at answers. Many candidates skim the question and jump to answers, which leads to misunderstanding what is actually being asked.
  • Watch for absolute language. Answer choices that say "always," "never," or "guaranteed" are often wrong (though not always). The CCA-F rewards nuanced thinking.

For more detailed exam strategies, including per-question-type tactics and scenario-specific approaches, see our guide on How to Pass CCA-F on Your First Attempt.

CCA-F vs Other AI Certifications

Understanding how CCA-F compares to other AI certifications helps you decide whether it is the right credential for your career goals and how it fits into a broader certification strategy.

| Aspect | CCA-F (Anthropic) | NCP-AAI (NVIDIA) | AWS AI Practitioner |
| --- | --- | --- | --- |
| Focus | Claude production architecture | Agentic AI systems (vendor-agnostic) | AWS AI/ML services overview |
| Level | Intermediate-Advanced | Intermediate | Foundational |
| Questions | 60 | Varies | 85 |
| Duration | 120 minutes | Varies | 120 minutes |
| Passing Score | 720/1000 | 70% | 700/1000 |
| Cost | $99 | $200+ | $100 |
| Validity | 2 years | 2 years | 3 years |
| Prerequisites | None (experience recommended) | None (experience recommended) | None |
| Key Differentiator | Deep vendor-specific production patterns | Broad agentic AI design and deployment | AWS service breadth and basic concepts |
| Best For | Engineers building with Claude in production | Engineers designing multi-agent systems | Professionals seeking AI foundations on AWS |
| Hands-On Focus | Very high (production scenarios) | Moderate to high | Low (conceptual focus) |
| Anti-Pattern Testing | Heavy emphasis | Moderate emphasis | Minimal |

How They Complement Each Other

These certifications are not mutually exclusive. In fact, they complement each other well:

  • CCA-F proves deep expertise with a specific platform (Claude). It is ideal if you are building production systems on Claude and want to demonstrate architectural mastery.
  • NCP-AAI proves broad understanding of agentic AI principles across platforms. It is ideal if you work with multiple AI providers or want to demonstrate platform-agnostic design skills.
  • AWS AI Practitioner proves foundational understanding of AI/ML concepts and AWS services. It is ideal as a starting point for professionals entering the AI field from other domains.

A strong certification portfolio for an AI architect in 2026 might include CCA-F (for Claude depth), NCP-AAI (for agentic breadth), and a cloud-specific certification (AWS, Azure, or GCP) for infrastructure context.

Which Should You Pursue First?

If you are already building with Claude, start with CCA-F. The exam tests practical skills you are developing daily, and the credential has immediate value in the Claude ecosystem.

If you are working with multiple AI platforms or are more interested in agentic AI design principles than a specific vendor, start with NCP-AAI.

If you are new to AI/ML and need foundational knowledge, start with AWS AI Practitioner or a similar foundational certification before tackling the more advanced CCA-F or NCP-AAI.

Common Questions

What are the prerequisites for the CCA-F exam?

There are no formal prerequisites for the CCA-F exam. You do not need other certifications, a specific degree, or a minimum number of years of experience. However, Anthropic recommends at least 6 months of hands-on experience building with the Claude API and familiarity with the Claude tool ecosystem (Agent SDK, Claude Code, MCP). Candidates without this experience typically need a longer preparation period (10-12 weeks instead of 8 weeks) and should focus heavily on hands-on practice during their study.

How much does the CCA-F exam cost?

The CCA-F exam costs $99 USD per attempt. There is a waiting period between retake attempts. Some organizations, particularly those in the Claude Partner Network, cover exam costs for their employees. Check with your employer before paying out of pocket. Anthropic occasionally offers promotional pricing or exam vouchers through their partner programs and events.

What is the passing score?

The passing score is 720 out of 1000 on a scaled scoring system. This does not mean you need to answer 72% of questions correctly, because questions are weighted differently. Production scenario questions and anti-pattern recognition questions typically carry higher weight than simple knowledge-check questions. Aim for mastery across all domains rather than trying to calculate the minimum number of questions you need to get right.

Can I take the exam online?

Yes, the CCA-F exam is online and remotely proctored. You take it from your own computer with a webcam and microphone enabled for the proctor. You will need a stable internet connection, a quiet environment, and a government-issued ID. The proctoring software monitors your screen, webcam, and audio during the exam. You cannot use external resources, second monitors, or other devices during the exam.

How long is the certification valid?

CCA-F is valid for 2 years from the date you pass the exam. After 2 years, you will need to recertify by passing the current version of the exam. This ensures that certified professionals stay current with the rapidly evolving Claude platform. The recertification exam may cover new features, updated best practices, and new architectural patterns that have been introduced since your original certification.

Should I always use the Batch API to save costs?

No. This is a common misconception and a frequently tested anti-pattern on the exam. The Batch API offers a 50% cost reduction but provides results within a 24-hour window with no specific SLA. It is appropriate for background processing tasks like overnight document analysis, data enrichment pipelines, or content generation queues. For any workflow where a user or downstream system is waiting for the response, use the real-time API. The cost savings from the Batch API do not justify unpredictable latency in blocking workflows.

How is CCA-F different from Anthropic Academy courses?

Anthropic Academy courses are free educational resources that teach Claude skills. CCA-F is a professional certification that validates those skills under exam conditions. Think of Academy courses as the textbook and CCA-F as the final exam. Completing Academy courses is excellent preparation for CCA-F, but the certification carries a different weight in the job market because it proves you can apply the knowledge under pressure, not just that you completed the material. Academy courses are free and self-paced; the certification is $99 and proctored.

What happens if I fail the exam?

If you do not pass the CCA-F exam, you can retake it after a waiting period. You will pay the full exam fee ($99 USD) for each retake attempt. There is no limit on the number of retake attempts. When you retake the exam, you will receive a different set of 4 production scenarios (from the pool of 6), so the exam will not be identical to your previous attempt. Use the waiting period to study the domains where you felt weakest.

Do I need to write code during the exam?

The CCA-F exam is not a coding exam; you will not write code during the exam. However, you need to understand code concepts because many questions present code snippets or architectural pseudocode and ask you to identify issues or choose the correct implementation. Most examples in the official documentation and on the exam use Python, but the concepts are language-agnostic. If you can read Python at a basic level (function definitions, dictionaries, loops, conditionals), you will be able to understand the exam questions.

How do the production scenarios work?

The exam includes a pool of 6 production scenarios, and each candidate receives 4 randomly selected scenarios. This means no two candidates take exactly the same exam. Each production scenario includes multiple questions that test different domains. For example, a single scenario about a customer service agent might include one question about agentic architecture (Domain 1), another about tool design (Domain 2), and another about escalation policies (Domain 5). Prepare for all domains because any domain can appear in any scenario.


Conclusion

The Claude Certified Architect (CCA-F) certification represents a significant milestone in the AI certification landscape. It is the first certification that truly tests production-grade architecture skills with a foundation model platform, going far beyond API knowledge to validate practical, applied expertise in building reliable, scalable AI systems.

The exam's emphasis on anti-pattern recognition, programmatic enforcement over prompt-based guidance, and production scenario analysis makes it one of the most rigorous and practical AI certifications available today. Passing CCA-F demonstrates to employers and clients that you do not just know how to use Claude; you know how to architect with Claude at a production level.

Whether you are an AI engineer looking to formalize your Claude expertise, a consultant seeking to differentiate yourself in the market, or an architect transitioning into AI-first system design, CCA-F provides a clear, recognized validation of your skills. The $99 investment and 8 weeks of preparation time yield a credential that can significantly impact your career trajectory in the rapidly growing Claude ecosystem.

Your next steps are clear: start with the Anthropic Academy courses, follow the 8-week study timeline, take practice tests to calibrate your knowledge, and register for the exam when you are consistently scoring above 80% on practice tests. The CCA-F journey is demanding but rewarding, and the demand for certified Claude architects is only growing.

Ready to start preparing? Explore our CCA-F practice tests to assess your current knowledge level and identify the domains where you need the most study. Combined with Anthropic Academy's free courses and hands-on building, practice tests are the fastest path to CCA-F certification.

Ready to Pass the CCA-F Exam?

Join thousands who passed with Preporato practice tests

Instant access · 30-day guarantee · Updated monthly