Naveo

B6 MCP-DEBUG

DIAGNOSE · 1 CAUSE

Look at the spec. What's the most likely reason the agent never picks this tool?

TOOL SPEC

{
  "name": "scrub_pii_before_prompt",
  "description": "Takes user text and returns a PII-free version, ready to inject into the model's context. Replaces each detected PII with a stable placeholder that the system can de-anonymize when serving the response.",
  "input_schema": {
    "type": "object",
    "properties": {
      "raw_text": { "type": "string" },
      "categories": {
        "type": "array",
        "items": { "enum": ["email", "phone", "id_doc", "address", "name", "credit_card"] }
      }
    },
    "required": ["raw_text", "categories"]
  }
}

POSSIBLE CAUSES · 5

The model should never see what it doesn't need to see

Data classification tells you what doesn't go to the model. This lesson tells you how to get it out of the text before it arrives.

PII detection isn't magic. It's a classic NLP problem the industry has solved, badly, several times. Hex has already seen the three main mistakes:

The team that trusted regex alone. They covered 80% of the obvious cases and were surprised when a user typed "my address is the third door left of hangar 7, contact me through Bruno" and the regex caught neither the address nor the name.
The team that piped everything to an LLM. The system worked fine until the vendor bill arrived and they realized they were sending raw text (with PII) to a third-party model to "detect PII".
The team that asked users to mark their own PII. Honest users forgot. Hostile ones skipped it on purpose. Both cases ended the same way: unmarked PII in the prompt.

The pattern that holds

Three layers, ordered by cost:

Deterministic regex. Email, phone, credit card (with Luhn check), fixed-format IDs. Cheap, fast, high precision on these cases.
Small contextual model. Names, free-form addresses, dates of birth, things that need context. Only runs on text the regex let through. Bounded cost.
Audit log. Every match recorded with the assigned placeholder, never the original datum. Gives you a paper trail for incidents.
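
The first layer can be sketched in a few lines of Python. This is a minimal illustration, not the lesson's reference implementation: the names EMAIL_RE, CARD_RE, luhn_ok and detect_layer1 are hypothetical, and the patterns are deliberately simple (only email and credit card are covered). It shows why the Luhn check matters: a digit run that merely looks like a card number is rejected.

```python
import re

# Hypothetical, deliberately simple patterns for the deterministic layer.
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")
# 13 to 19 digits, optionally separated by single spaces or hyphens.
CARD_RE = re.compile(r"\b\d(?:[ -]?\d){12,18}\b")


def luhn_ok(number: str) -> bool:
    """Return True if the digits in `number` pass the Luhn checksum."""
    digits = [int(c) for c in number if c.isdigit()]
    total = 0
    for i, d in enumerate(reversed(digits)):
        if i % 2 == 1:  # double every second digit, counting from the right
            d *= 2
            if d > 9:
                d -= 9
        total += d
    return len(digits) >= 13 and total % 10 == 0


def detect_layer1(text: str) -> list[tuple[str, int, int]]:
    """Return (category, start, end) spans found by the regex layer."""
    spans = [("email", m.start(), m.end()) for m in EMAIL_RE.finditer(text)]
    for m in CARD_RE.finditer(text):
        # Only keep card-shaped digit runs that also pass the checksum.
        if luhn_ok(m.group()):
            spans.append(("credit_card", m.start(), m.end()))
    return sorted(spans, key=lambda s: s[1])
```

Anything this layer does not flag (names, free-form addresses) would be handed to the contextual model in layer two.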

After scrubbing, the text that reaches the main model has stable placeholders: [EMAIL_1], [NAME_2], [ADDRESS_3]. The system keeps the mapping table on the trusted side. If the model's response needs to re-include the real datum, the system de-anonymizes it on serve.
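
A rough sketch of the trusted-side mapping table, assuming detection has already produced non-overlapping (category, start, end) spans. The PlaceholderVault class and its method names are hypothetical. The same value always gets the same placeholder, which is what makes the placeholders stable, and restore only runs on the trusted side when serving the response.

```python
import re


class PlaceholderVault:
    """Trusted-side mapping between placeholders and original values."""

    def __init__(self) -> None:
        self._by_value: dict[tuple[str, str], str] = {}
        self._by_placeholder: dict[str, str] = {}

    def placeholder_for(self, category: str, value: str) -> str:
        """Return the stable placeholder for a value, creating it if needed."""
        key = (category, value)
        if key not in self._by_value:
            # One running counter, as in [EMAIL_1], [NAME_2], [ADDRESS_3].
            token = f"[{category.upper()}_{len(self._by_placeholder) + 1}]"
            self._by_value[key] = token
            self._by_placeholder[token] = value
        return self._by_value[key]

    def scrub(self, text: str, spans: list[tuple[str, int, int]]) -> str:
        """Replace each span with its placeholder (spans must not overlap)."""
        out, cursor = [], 0
        for category, start, end in sorted(spans, key=lambda s: s[1]):
            out.append(text[cursor:start])
            out.append(self.placeholder_for(category, text[start:end]))
            cursor = end
        out.append(text[cursor:])
        return "".join(out)

    def restore(self, text: str) -> str:
        """De-anonymize known placeholders; unknown ones are left untouched."""
        return re.sub(
            r"\[[A-Z_]+_\d+\]",
            lambda m: self._by_placeholder.get(m.group(), m.group()),
            text,
        )
```

The model only ever sees the output of scrub. The vault itself never leaves the trusted side.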

The rule: the model operates on references, never on real data. The less PII reaches the model, the less PII it can leak. Unreachable is invulnerable.

On the right: five implementations of the scrub_pii_before_prompt tool. Pick the one that holds.