Question 1

Do I need to know machine learning to do this lab?

Accepted Answer

No. You need to read and write Python and understand a basic HTTP request.
The lab is about where to place security controls in an LLM application's
request lifecycle, not about model internals. Everything model-specific is
explained inline.

Question 2

What are the four control points?

Accepted Answer

They map to the four stages of a RAG request. Input mediation screens the
incoming user question. Retrieval and context control filters retrieved
documents by the caller's entitlement before they enter the prompt. Output
mediation inspects the model's answer for smuggled sensitive data before it
is rendered. Action authorization gates any side effect the answer triggers,
here the outbound image fetch. Placing one control at each boundary is
defense in depth: a bypass at one layer is caught at the next.

Question 3

Why is a single keyword filter not enough?

Accepted Answer

A keyword or deny-list filter blocks the one phrasing you saw and nothing
else. An attacker rewords the injection, base64-encodes the payload, or
moves the attack to a different stage of the pipeline. You will bypass a
naive filter in this lab and then build controls that gate on structure and
entitlement (host allow-list, tenant scope, sensitive-pattern detection)
rather than on a list of bad strings.

Question 4

How are the control-point steps graded if the model is non-deterministic?

Accepted Answer

You build one control point per step, and each control-point check is
deterministic: it plants a fresh attack surface, then exercises your hook
directly (input_guard on a question, context_guard on retrieved chunks,
output_guard on an answer, action_guard on a URL) so the verdict does not
depend on the model's wording. The only model-dependent observation is the
EchoLeak exfiltration leak, which fires reliably and is graded as
"at least one account question leaks," so a non-deterministic model cannot
make the reproduce step flaky.

Defense in Depth: Wire Four Control Points Around a RAG Assistant

What you'll learn

Prerequisites

Exam domains covered

Skills & technologies you'll practice

What you'll do in this lab

Frequently asked questions

Do I need to know machine learning to do this lab?

What are the four control points?

Why is a single keyword filter not enough?

How are the control-point steps graded if the model is non-deterministic?