PII Redaction Patterns Inside LLM Pipelines

Every prompt your application sends is a place personal data can leak — into a provider’s logs, your own traces, or a model’s context it should never have seen. The fix is not to avoid LLMs. It is to put a redaction layer between your data and the model, so the model only ever sees what it strictly needs.

Detect, mask, restore

The core pattern has three steps and a token map that never leaves your perimeter:

Detect entities (names, emails, card numbers) with a recogniser before the prompt is built.
Replace each with a stable placeholder token and store the mapping server-side.
After the model responds, swap placeholders back into the final output for the user.

The model reasons over [PERSON_1] and [EMAIL_1], never the real values — so even your logs and the provider’s logs stay clean while the user still sees a fully personalised reply.

A redaction layer sitting between application data and the model

Choosing a recogniser

Pattern-based: regex for structured data like cards and phone numbers — fast and exact.
Model-based: a named-entity model for names and addresses that patterns miss.
Hybrid: both, with the pattern layer as a guaranteed backstop under the model.

Hybrid is the right default for anything regulated. The model catches the messy, free-text cases; the regex layer guarantees the structured ones never slip through even if the model has an off day.

Redaction is not a feature you bolt on at the end. It is the boundary your whole pipeline is built around.

Don’t forget the output

Models can echo or infer sensitive details that were never in the prompt. Scan generated text on the way out as well as on the way in, and keep the token map in a short-lived, access-controlled store — never in the prompt history or a long-lived cache.

Pitfalls that cause leaks

Putting the token map in the prompt itself — it defeats the entire pattern.
Logging the un-redacted input "just for debugging" — that log is now in scope for compliance.
Skipping output scanning, so an inferred phone number slips through unredacted.

Get the boundary right once and every feature built on top inherits it. That is the difference between an AI roadmap your security team blocks and one they sign off on.

Frequently asked questions

In a short-lived, access-controlled server-side store scoped to the request. It must never be sent to the model or persisted in prompt history, so the mapping between placeholders and real values stays inside your perimeter.

Done well, barely. Stable, descriptive placeholders preserve the relationships the model needs to reason, and the original values are restored in the final output the user sees.

For structured data like card and phone numbers, regex is fast and exact. For names, addresses, and free text it misses too much — pair it with a named-entity model in a hybrid setup.

Scan outputs as well as inputs. A model can deduce or echo sensitive details, so output redaction is what closes the loop on a pattern that only checks the prompt.

Detect, mask, restore

The core pattern has three steps and a token map that never leaves your perimeter:

Detect entities (names, emails, card numbers) with a recogniser before the prompt is built.

Replace each with a stable placeholder token and store the mapping server-side.

After the model responds, swap placeholders back into the final output for the user.

The model reasons over [PERSON_1] and [EMAIL_1], never the real values — so even your logs and the provider’s logs stay clean while the user still sees a fully personalised reply.

Choosing a recogniser

Pattern-based: regex for structured data like cards and phone numbers — fast and exact.

Model-based: a named-entity model for names and addresses that patterns miss.

Hybrid: both, with the pattern layer as a guaranteed backstop under the model.

Hybrid is the right default for anything regulated. The model catches the messy, free-text cases; the regex layer guarantees the structured ones never slip through even if the model has an off day.

Redaction is not a feature you bolt on at the end. It is the boundary your whole pipeline is built around.

Don’t forget the output

Pitfalls that cause leaks

Putting the token map in the prompt itself — it defeats the entire pattern.

Logging the un-redacted input "just for debugging" — that log is now in scope for compliance.

Skipping output scanning, so an inferred phone number slips through unredacted.

Get the boundary right once and every feature built on top inherits it. That is the difference between an AI roadmap your security team blocks and one they sign off on.

Frequently asked questions

Done well, barely. Stable, descriptive placeholders preserve the relationships the model needs to reason, and the original values are restored in the final output the user sees.

For structured data like card and phone numbers, regex is fast and exact. For names, addresses, and free text it misses too much — pair it with a named-entity model in a hybrid setup.

Scan outputs as well as inputs. A model can deduce or echo sensitive details, so output redaction is what closes the loop on a pattern that only checks the prompt.

Gen AI

CRM

Cloud

Automation

Why most AI agents fail in production — and the framework we use instead

PII Redaction Patterns Inside LLM Pipelines

Detect, mask, restore

Choosing a recogniser

Don’t forget the output

Pitfalls that cause leaks

Frequently asked questions

Building something with AI? Let's talk.

Related articles

How RAG Architecture Is Replacing Traditional Search

Agentic AI in 2026: Why Enterprises Are Replacing Traditional SaaS Tools With AI Agents

The LLM Evaluation Harness Every Team Needs

Have a project? Let’s talk.

PII Redaction Patterns Inside LLM Pipelines

Detect, mask, restore

Choosing a recogniser

Don’t forget the output

Pitfalls that cause leaks

Frequently asked questions

Building something with AI? Let's talk.

Related articles

How RAG Architecture Is Replacing Traditional Search

Agentic AI in 2026: Why Enterprises Are Replacing Traditional SaaS Tools With AI Agents

The LLM Evaluation Harness Every Team Needs

Have a project? Let’s talk.

Detect, mask, restore

Choosing a recogniser

Don’t forget the output

Pitfalls that cause leaks

Frequently asked questions

Never miss a post.

Building something with AI? Let's talk.

Related articles

How RAG Architecture Is Replacing Traditional Search

Agentic AI in 2026: Why Enterprises Are Replacing Traditional SaaS Tools With AI Agents

The LLM Evaluation Harness Every Team Needs

Have a project? Let’s talk.

Detect, mask, restore

Choosing a recogniser

Don’t forget the output

Pitfalls that cause leaks

Frequently asked questions

Never miss a post.

Building something with AI? Let's talk.

Related articles

How RAG Architecture Is Replacing Traditional Search

Agentic AI in 2026: Why Enterprises Are Replacing Traditional SaaS Tools With AI Agents

The LLM Evaluation Harness Every Team Needs

Have a project? Let’s talk.