Skip to main content
Version: v0.9.0a2
Operator

Prompt-Injection Hardening

3 min readNode operatorAlpha hardening

What this guide covers

Stigmem treats recalled facts as data, not instructions. The node enforces the protocol-side boundary, but operators still choose the adapters and models that consume recalled content. This guide captures the current alpha hardening posture while ADR-015 certification work continues.

Trust boundaryโ€‹

Layer
Status
Responsibility
L1 origin tagging
implemented (core)
Facts retain source identity and scope metadata.
L2 federation receive
implemented (core)
Federated instruction-typed facts are denied or quarantined.
L3 recall channel separation
implemented (core+adapters)
Recall responses separate content from system/developer directives.
L4 adapter contract
verified (conformance)
Adapters must preserve the channel boundary when building prompts.
L5 system-prompt directive
measured (ADR-015)
The model must honor the adapter's directive.
L6 model behavior
measured (ADR-015)
The model must refuse injected behavioral instructions in recalled data.

Current operator guidanceโ€‹

Narrowest scopes

Use the narrowest read and write scopes that satisfy the agent's task.

No instruction:write

Do not grant unless the agent is explicitly responsible for authoring instruction facts.

Channel-separated adapters

Prefer adapters that consume channel-separated recall output directly.

Treat models as uncertified

Until public ADR-015 results exist in data/conformance/adversarial/results/index.json.

Document accepted risk

For cross-organization federation workloads using an uncertified model.

Running the offline harnessโ€‹

The offline harness validates the corpus, result schema, and tier calculation:

uv run python scripts/run_adversarial_conformance.py

This does not certify a live model.

It is a local readiness check for the framework used by provider-backed certification runs. When you are ready to test a live model, use --provider openai, --provider anthropic, or --provider ollama with the credential configuration described in the model-certification page. Treat the generated JSON as evidence for review, not as an automatic project certification.

When live certifications landโ€‹

Tier
Use case
Notes
Certified
preferred
For cross-organization federation workloads.
Provisional
acceptable
For single-organization or lower-risk deployments.
Uncertified
explicit risk acceptance
Requires documented accepted risk.

Re-run certification when the corpus version changes or when a provider changes the model version used in production. Published certified and provisional results also expire after 90 days unless a newer reviewed result replaces them.