PII Semantic Enrichment

Semantic enrichment adds structured attributes to PII placeholders so downstream systems can use them (e.g. gender for PERSON, scope for LOCATION) without seeing raw PII. When disabled, Talon uses legacy placeholders only.

What it is

Detection: Talon detects PII (email, IBAN, PERSON, LOCATION, etc.) and can redact it from both input (prompt before LLM) and output (LLM response before returning to user) independently.
Placeholder format (legacy): [EMAIL], [PERSON], [LOCATION] — type only.
Placeholder format (enriched): <PII type="person" id="1" gender="female"/>, <PII type="location" id="2" scope="city"/> — type, stable id, and policy-allowed attributes.

Enrichment runs after detection and before replacement. Only entity types that support attributes (currently PERSON and LOCATION) get extra fields; others keep the same placeholder shape (with type and id when enriched mode is on).

Redaction can be applied independently to input and output via redact_input and redact_output. When redact_input is enabled, the LLM sees redacted placeholders instead of raw PII, which is essential for semantic enrichment to be meaningful — the LLM responds to the placeholder format rather than raw PII.

Scanner contract and canonical mapping

Talon accepts external scanner output in a Presidio-compatible result shape. Compatibility is at the boundary contract level, while internal processing remains canonical (CanonicalEntity / PIIEntity).

Required fields in the external result:

entity_type
start
end
score

Optional fields include explanation and detector/provider metadata (for auditability/debug context).

External result field	Canonical Talon field
`entity_type`	`Type`
`start` / `end`	byte `Start` / `End`
`score`	`Confidence`
metadata/explanation	`Attributes`

For the full contract boundary and normalization rules, see the Presidio compatibility matrix.

Offset semantics

Byte offsets are canonical for enforcement and redaction.

Rune/character offsets are accepted only at the external boundary and converted to byte offsets before canonicalization.
Optional rune offsets may be kept for debug/readability, but enforcement always uses byte spans.
Normalization rejects invalid spans, including:
- start < 0
- end > len(text)
- start > end
- Rune spans that split combining sequences
- Ranges that do not map back to the expected substring

Evidence attribution output

When data_flow evidence is recorded, Talon now emits compact per-entity attribution metadata in entity_attributions (no raw values):

type (canonical PII type)
field_path (best-effort source path such as messages[].content, arguments, result, response.content)
start/end byte offsets (when available)

This attribution is additive and backward-compatible: older records without the field remain valid and verifiable.

When to use it

Off (default): No attributes; placeholders are [TYPE] only. Use when you don’t need attributes or want minimal change.
Shadow: Enricher runs and attributes are logged/telemetry only; placeholders stay legacy. Use to validate enrichment without changing output.
Enforce: Enricher runs and allowed attributes appear in XML-style placeholders. Use when downstream needs gender/scope (or other future attributes) for analytics or routing.

Configuration

In agent.talon.yaml under policies:

policies:
  data_classification:
    input_scan: true
    output_scan: true
    redact_pii: true           # shorthand: sets both redact_input and redact_output
    redact_input: true         # redact PII from prompt before LLM sees it (defaults to redact_pii)
    redact_output: true        # redact PII from LLM response before returning (defaults to redact_pii)

  semantic_enrichment:
    enabled: true
    mode: enforce          # off | shadow | enforce
    confidence_threshold: 0.80
    emit_unknown_attributes: false
    default_person_gender: unknown
    default_location_scope: unknown
    preserve_titles: true
    allowed_attributes: ["gender", "scope"]

Field	Description
`enabled`	Turn enrichment on for this agent.
`mode`	`off` = no enrichment; `shadow` = compute and log only; `enforce` = emit attributes in placeholders.
`confidence_threshold`	Minimum confidence (0–1) for an attribute to be set; below this, fallback or omit.
`emit_unknown_attributes`	If true, emit `gender`/`scope` even when enricher returns "unknown".
`default_person_gender`	Fallback when gender cannot be inferred (e.g. `unknown`).
`default_location_scope`	Fallback when scope cannot be inferred (e.g. `unknown`).
`preserve_titles`	When true, keep title-based logic; can affect when "unknown" is used.
`allowed_attributes`	List of attribute names the policy allows in placeholders (e.g. `gender`, `scope`). Rego can further restrict.

Placeholder formats

Legacy (enrichment off or shadow):
[PERSON], [LOCATION], [EMAIL], etc.
Enriched (enforce):
<PII type="person" id="1" gender="female"/>, <PII type="location" id="2" scope="city"/>.
Attributes appear in deterministic order (type, id, then allowed attributes sorted). Unknown or policy-denied attributes are omitted. XML special characters in values are escaped (", &, etc.).

How to verify

Input redaction

Set redact_input: true (or redact_pii: true) and input_scan: true.
Run a request with PII: talon run "Contact user@example.com about IBAN DE89370400440532013000".
Check the evidence (talon audit show <id>): the input_pii_redacted field should be true.
The LLM receives the redacted prompt — raw PII never leaves the Talon process.

Output redaction

Set redact_output: true (or redact_pii: true) and output_scan: true.
Run a request where the LLM response is likely to contain PII.
The returned response should show placeholders instead of raw PII.

Semantic enrichment

Enable enrichment: Set semantic_enrichment.enabled: true and mode: enforce in agent policy; ensure redact_input: true and input_scan: true.
Run a request with PII that includes person and location (e.g. "Mrs Smith lives in Berlin. Email: user@example.com").
Check output or evidence:
- Legacy: you should see [PERSON], [LOCATION], [EMAIL].
- Enriched: you should see <PII type="person" ... gender="female"/>, <PII type="location" ... scope="city"/>, and <PII type="email" id="3"/> (no extra attributes for email).
Metrics: Use talon.pii.enrichment.attempts.total, talon.pii.enrichment.attributes.emitted.total, and talon.pii.enrichment.fallback_unknown.total (see Observability).

End-to-end smoke flow (detect -> redact -> verify -> block -> remediate -> pass)

Use this sequence as a closure check for Epic #112:

Detect + redact + verify gates
- go test ./internal/gateway/... -run 'PII|Egress|Residual|NoPIIEgress' -count=1
- go test ./internal/mcp/... -run 'PII|Egress|Residual|NoPIIEgress' -count=1
- go test ./internal/agent/... -run 'PII|Tool|Residual|NoPIIEgress' -count=1
Residual block remains fail-closed
- Run the Residual/NoPIIEgress suites above; bypass tests must stay green.
Remediation pass in approval flow
- go test ./internal/server/... -run 'TestHandleToolApprovalDecide_ApproveWithRemediation|TestHandleToolApprovalDecide_RemediationFailureDoesNotBypass' -count=1
Aggregate proof gate
- make proof-gates

PERSON and LOCATION detection

PERSON and LOCATION are optional recognizers in the default EU PII patterns. If they are enabled (default in the built-in patterns), enrichment has entities to work on. You can restrict or extend entity types via data_classification.enabled_entities / disabled_entities / custom_recognizers. Enrichment only runs for types it supports (person → gender, location → scope).

Observability

Metrics: talon.pii.enrichment.attempts.total (by entity_type), talon.pii.enrichment.attributes.emitted.total (by attr, value), talon.pii.enrichment.fallback_unknown.total (by entity_type).
Spans: classifier.redact and enricher spans with entity count and type.
Logs: Structured (zerolog); in shadow mode, attribute decisions can be logged without rendering.

Presidio migration

The canonical entity model and enrichment pipeline are detector-agnostic. If you later switch to Presidio (or another detector), an adapter can map Presidio results to the same canonical shape; the enricher and placeholder renderer stay unchanged. Rego policy continues to govern which attributes are emitted.

What it is​

Scanner contract and canonical mapping​

Offset semantics​

Evidence attribution output​

When to use it​

Configuration​

Placeholder formats​

How to verify​

Input redaction​

Output redaction​

Semantic enrichment​

End-to-end smoke flow (detect -> redact -> verify -> block -> remediate -> pass)​

PERSON and LOCATION detection​

Observability​

Presidio migration​