Question 1

Do I need a machine-learning background?

Accepted Answer

No. The lab is about supply-chain trust, not model internals. You read a small
ReAct agent and an MCP-style tool registry, find that tool metadata is trusted
with no boundary, and drive the agent from a poisoned tool description. The fixes
are ordinary supply-chain controls: scanning, pinning, and namespacing.

Question 2

What is MCP tool poisoning?

Accepted Answer

An agent host injects each tool's name, description, and parameter schema into the
model's context so the model can decide when to call it. The model treats that
metadata as trusted instruction text. A directive hidden in a tool description, or
silently mutated into one after approval, steers the model. It is OWASP Agentic
ASI02 Tool Misuse with LLM01 prompt injection as the delivery, and MITRE ATLAS
Publish Poisoned AI Agent Tool.

Question 3

What is a rug pull here?

Accepted Answer

A tool description is reviewed once, at connect time. The registry then lets it
change with no re-approval. You register the tool clean, pass review, then swap in
a poisoned description. The agent never re-consents. The oracle proves the swap by
the tool's content hash changing between the clean and poisoned runs.

Question 4

How is the exploit graded?

Accepted Answer

Deterministically and structurally, never on model wording. Because an aligned
agent fires a tool-misuse exploit inconsistently, each step gates on a structural,
model-independent fact and keeps the live-model run as a best-effort print. The
poison step grades that the poisoned description reaches the model catalog
verbatim. The rug-pull step grades a changed tool content hash. The harden steps
grade that a fresh poisoned description is dropped by the scan, a fresh rug-pull
mutation is rejected by the pin, and a variant battery is neutralized while a
benign ticket still resolves.

MCP Tool Poisoning: Hijack an Agent Through a Tool Description (and a Rug Pull)

What you'll learn

Prerequisites

Exam domains covered

Skills & technologies you'll practice

What you'll do in this lab

Frequently asked questions

Do I need a machine-learning background?

What is MCP tool poisoning?

What is a rug pull here?

How is the exploit graded?