Question 1

Do I need to know machine learning to do this lab?

Accepted Answer

No. You need to read Python and understand SSRF, SQL injection, and a shell
command. The lab is about how an agent passes model-built arguments to real
interpreters, not about model internals. Everything model-specific is
explained inline.

Question 2

What is insecure output handling in an agent?

Accepted Answer

It is the class of bug where an agent passes model output (here, tool
arguments) to an interpreter without validating it. A fetch tool with no
allow-list becomes SSRF, a query tool that string-formats model output becomes
SQL injection, and a code tool that executes model-authored strings becomes
remote code execution. It is OWASP LLM05:2025, and LLM06 excessive agency is
what makes the tools dangerous enough to matter.

Question 3

Is the SSRF against a real cloud metadata endpoint?

Accepted Answer

The metadata target is an in-pod stand-in on 127.0.0.1:9092. A real
169.254.169.254 fetch does not route inside the lab pod, so the lab teaches
the SSRF pattern (no host allow-list, reaches loopback and the metadata
service) against that loopback stub, and the instructions say so plainly.

Question 4

Does the exploit rely on a jailbreak or a leaked system prompt?

Accepted Answer

No. The system prompt is an ordinary support-agent prompt with no secret. The
exploit shapes the arguments the model passes to tools it already exposes.
Each individual action looks legitimate, which is exactly why an aligned model
complies and why the fix has to live at the sink.

Insecure Output Handling: SSRF, SQLi, and Command Execution Through an Agent's Tools

What you'll learn

Prerequisites

Exam domains covered

Skills & technologies you'll practice

What you'll do in this lab

Frequently asked questions

Do I need to know machine learning to do this lab?

What is insecure output handling in an agent?

Is the SSRF against a real cloud metadata endpoint?

Does the exploit rely on a jailbreak or a leaked system prompt?