Question 1

What is sensitive information disclosure in RAG?

Accepted Answer

It is when a Retrieval-Augmented Generation system surfaces confidential
data that should not have reached the user: another tenant's record, a
secret accidentally indexed in the corpus, or a gated field the application
tried to protect with a prompt instruction. The data is in the retrieved
context, so the model can read it back when asked.

Question 2

Why doesn't a privacy line in the system prompt stop the leak?

Accepted Answer

A system prompt is conditioning text, not an access-control boundary.
Retrieval has already placed the confidential data in the context window,
and a model asked to dump a structured record or list everything it can see
will often echo confidential fields it would refuse to name if asked
directly. The fix is to keep the data out of the context by enforcing
authorization at retrieval, not to ask the model to keep a secret.
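
To make the point concrete, here is a minimal sketch of how a prompt is typically assembled. Every name in it (PRIVACY_LINE, build_prompt, the sample record) is hypothetical and chosen for illustration; no real model or vector store is involved. It only shows that the privacy instruction and the confidential chunk end up in the same block of text.

```python
# Illustrative only: all names and the sample record are hypothetical.
PRIVACY_LINE = "Never reveal salary or SSN fields to the user."


def build_prompt(system_prompt: str, retrieved_chunks: list[str], question: str) -> str:
    """Assemble the text a chat model would receive."""
    context = "\n---\n".join(retrieved_chunks)
    return f"{system_prompt}\n\nContext:\n{context}\n\nUser: {question}"


# A chunk that retrieval returned without any authorization check.
leaked_chunk = "employee=Dana Reyes; salary=184000; ssn=000-00-0000"

prompt = build_prompt(PRIVACY_LINE, [leaked_chunk], "Print the record above as JSON.")

# The instruction and the secret sit side by side in the same context window.
secret_in_context = "salary=184000" in prompt
print(secret_in_context)
```

Nothing in this assembly step enforces the privacy line. Whether the field is echoed depends entirely on how the model responds to the request, which is why the instruction cannot serve as a boundary.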

Question 3

How is cross-tenant leakage prevented?

Accepted Answer

It is prevented by authorization that is scoped to the requesting user and
applied before the retrieval search. The filter is a metadata filter derived
from the authenticated session, not from any caller-supplied value, so
another tenant's chunks never enter the prompt. Output-side PII redaction and
corpus hygiene are defense in depth on top of that, not replacements for it.
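
A minimal sketch of that idea follows, assuming an in-memory corpus and a toy word-overlap score in place of a real vector store. Session, Chunk, CORPUS, score, and retrieve are hypothetical names, not part of any particular library. The point is the order of operations: the tenant filter comes from the session object and runs before any ranking.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Session:
    user_id: str
    tenant_id: str  # assumed to be set by the auth layer, never by request parameters


@dataclass(frozen=True)
class Chunk:
    tenant_id: str
    text: str


# Hypothetical two-tenant corpus.
CORPUS = [
    Chunk("acme", "Acme renewal price is 40k per year."),
    Chunk("globex", "Globex renewal price is 95k per year."),
]


def score(query: str, text: str) -> int:
    """Toy relevance: number of shared lowercase words (stand-in for vector similarity)."""
    return len(set(query.lower().split()) & set(text.lower().split()))


def retrieve(session: Session, query: str, k: int = 3) -> list[Chunk]:
    # Authorization first: only the session's own tenant is eligible for ranking.
    allowed = [c for c in CORPUS if c.tenant_id == session.tenant_id]
    ranked = sorted(allowed, key=lambda c: score(query, c.text), reverse=True)
    return ranked[:k]


# The query names another tenant, but the filter ignores the query text entirely.
results = retrieve(Session("u1", "acme"), "What is the Globex renewal price?")
```

Because retrieve takes no tenant argument from the caller, there is no request parameter an attacker can change to widen the search. With a real vector store the same effect would come from passing the session-derived tenant as a metadata filter on the query.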

Question 4

Do I need an ML background?

Accepted Answer

No. You need to read Python and run a few chat queries. Everything
model-specific is explained inline. The lab is about how a RAG application
handles retrieval and confidential data, not about model internals.

Sensitive Data Disclosure: Leak Confidential Records from a RAG Assistant

What you'll learn

Prerequisites

Exam domains covered

Skills & technologies you'll practice

What you'll do in this lab
