Question 1

Do I need a machine-learning background?

Accepted Answer

No. The lab is about supply-chain trust, not model internals. You read a small ReAct
agent and an MCP-style tool registry, see that tool metadata is trusted with no
boundary, and build ordinary supply-chain controls: signature verification, hash
pinning, a capability allow-list, and message validation.

Question 2

What does the verifier actually check?

Accepted Answer

Four things, at the registry boundary, before any tool reaches the model. Provenance:
each tool manifest must carry a signature that verifies under a trusted key.
Integrity: each approved tool object is hash-pinned, so a silent post-approval
mutation no longer matches its pin. Least privilege: a per-tool capability allow-list
states which delegate and which records a tool may use. And inter-agent message
validation reduces a peer agent's output to a structured schema so a downstream agent
never executes prose.

Question 3

Why is a description blocklist not enough?

Accepted Answer

A blocklist scans surface strings, so an attacker rephrases until nothing matches, or
hides the abuse in the tool's capability (its delegate), which the scan never reads.
The lab shows a clean-description tool whose delegate reads a cross-account record
slipping past the blocklist. An allow model based on signature, pin, and capability
closes that gap.

Question 4

How is the hardening graded?

Accepted Answer

Behaviorally, on side effects, never on model wording. The grader plants a fresh
unsigned poison, a forged-signature variant, a rug-pull mutation, and a shadowing
duplicate in code, then confirms each is refused (the served tool stays the legit
signed object and no account reference reaches the in-pod listener), that a benign
entitlement question still resolves through the signed tool, and that an inter-agent
worm's second hop is contained while benign emails still resolve.

Defend the Agent Supply Chain: Verify, Pin, and Capability-Gate Your Tool Registry

What you'll learn

Prerequisites

Exam domains covered

Skills & technologies you'll practice

What you'll do in this lab

Frequently asked questions

Do I need a machine-learning background?

What does the verifier actually check?

Why is a description blocklist not enough?

How is the hardening graded?