Version: v0.9.0a2

Community Pen-Test Handbook

7 min read · Security researcher · Penetration tester · Protocol reviewer · Coordinated disclosure

What this page is

Stigmem is an open federated protocol. External security scrutiny makes it stronger. This handbook covers everything you need to run a structured engagement — from scoping your test to getting your findings in front of the maintainers.

For the project's vulnerability disclosure policy and supported-version matrix, see SECURITY.md. For the technical threat model, see spec/security/threat-model.md.

1 · In-scope targets

The following surfaces are explicitly in scope for community pen testing.

| Surface | Coverage | Notes |
| --- | --- | --- |
| Reference node HTTP API | all endpoints | /v1/facts, /v1/query, /v1/recall, /v1/cards/*, /v1/graph/*, /v1/synthesis, /v1/decay, /v1/conflicts, /v1/subscriptions, /v1/federation/*, /v1/admin/*. Authenticated and unauthenticated paths; read and write surfaces. |
| Federation handshake and replication | protocol | PeerDeclaration signing, HLC cursor handling, replay protection, capability token validation. |
| Authentication and API key lifecycle | identity | Key issuance, storage (Argon2id hashing), validation, scope enforcement, revocation. |
| Capability token issuance and validation | crypto | Ed25519 signing, expiry, nonce, verb/object scope enforcement (Spec-06-Capability-Tokens). |
| Source Attestation | Spec-X6 | Enforcement modes (enforce, warn, off); entity-URI binding. |
| Memory Garden ACLs | Spec-02 | Role escalation paths; garden boundary enforcement; quarantine admit/release. |
| MCP adapter | integration | assert_fact, query_facts, recall, lint_scope tool surface. |
| OpenClaw / Claude Code adapter | integration | Memory read/write paths. |
| Recall pipeline | Spec-07 + Spec-X11 | Scope isolation across lexical, vector, and graph stages. |
| Audit log endpoints | pre-reset hardening | Access control on /v1/admin/audit-log; log tamper-resistance. |
| Per-principal quota enforcement | pre-reset hardening | Correct application of token-bucket ceilings; bypass attempts. |
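The quota row above asks you to check that token-bucket ceilings are applied correctly. As a mental model for what to probe, here is a minimal sketch of a generic token bucket keyed by principal and quota dimension. It is not taken from the Stigmem codebase: the class names, the capacity and refill parameters, and the injected `now` clock are all assumptions made for illustration.

```python
class TokenBucket:
    """Generic token bucket (illustrative, not Stigmem's implementation)."""

    def __init__(self, capacity: float, rate: float) -> None:
        self.capacity = capacity   # ceiling on stored tokens
        self.rate = rate           # tokens added per second
        self.tokens = capacity     # start full
        self.last = 0.0            # time of the previous call

    def allow(self, now: float, cost: float = 1.0) -> bool:
        # Refill for the elapsed time, never above the ceiling.
        # max() keeps a clock that moves backwards from minting tokens.
        elapsed = max(0.0, now - self.last)
        self.tokens = min(self.capacity, self.tokens + elapsed * self.rate)
        self.last = now
        if self.tokens >= cost:
            self.tokens -= cost
            return True
        return False


class QuotaTable:
    """One bucket per (principal, dimension) pair (hypothetical layout)."""

    def __init__(self, capacity: float, rate: float) -> None:
        self.capacity = capacity
        self.rate = rate
        self.buckets = {}

    def allow(self, principal: str, dimension: str, now: float) -> bool:
        key = (principal, dimension)
        if key not in self.buckets:
            self.buckets[key] = TokenBucket(self.capacity, self.rate)
        return self.buckets[key].allow(now)
```

Under this model, useful probes against a real node include: whether one principal can drain another principal's bucket, whether a burst immediately after a refill exceeds the ceiling, and whether requests that appear to come from an earlier time are granted extra tokens.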

Priority finding categories

The following finding classes are of the highest interest to maintainers.

Authentication bypass

Accessing write endpoints without a valid API key, or escalating a public-scoped key to read/write team or local facts.

Cross-org data leakage

A capability token or API key granting access to facts beyond its declared scope.

Federation peer impersonation

Successfully acting as a peer node without a valid mTLS certificate and matching org manifest.

Capability token replay or forgery

Replaying a revoked token, forging a signature, or bypassing the nonce/timestamp window.

Prompt injection via recall

Bypassing the recall-time content sanitizer (ADR-003 defense-in-depth) to inject instructions into an LLM context via stored fact values.

Quarantine garden bypass

Causing an untrusted fact to enter the main fact store without passing through quarantine review.

Source Attestation bypass

Writing facts without a valid attestation in enforce mode.
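The replay category above refers to a nonce/timestamp window. The sketch below shows one generic way such a window can work, so you can reason about where it might fail. It is an assumption-laden illustration, not Stigmem's validator: the `ReplayGuard` name, the 300-second window, and the in-memory nonce store are all hypothetical, and the sketch assumes the nonce and timestamp are both covered by the token signature.

```python
class ReplayGuard:
    """Accept a token once, and only inside a time window (illustrative)."""

    def __init__(self, window_seconds: float = 300.0) -> None:
        self.window = window_seconds
        self.seen = {}  # nonce -> timestamp the token was issued with

    def check(self, nonce: str, issued_at: float, now: float) -> bool:
        # Forget nonces that have aged out. The timestamp check below
        # rejects any token old enough to have been forgotten.
        self.seen = {n: t for n, t in self.seen.items()
                     if now - t <= self.window}
        if abs(now - issued_at) > self.window:
            return False  # stale or future-dated token
        if nonce in self.seen:
            return False  # nonce already used: replay
        self.seen[nonce] = issued_at
        return True
```

If a real implementation evicts nonces the way this sketch does, a replay that keeps the nonce but alters the timestamp only fails if the signature covers the timestamp, which is one reason to test modified nonce and timestamp values separately.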

2 · Out-of-scope targets

Testing these will not result in credit and may violate third-party terms of service.

| Surface | Reason | Notes |
| --- | --- | --- |
| https://docs.stigmem.dev | static site | No user data; no dynamic server-side logic. |
| Docs build toolchain | build-time only | Docusaurus, npm transitive deps. No user-controlled input path in the deployed docs site. |
| Third-party dependencies | upstream | libSQL cloud, Turso, PostgreSQL, Rekor/Sigstore. Report findings to the upstream project directly. |
| Third-party nodes not operated by you | authorization | You must only test against nodes you operate or have explicit permission to test. |
| Rate limiting / resource exhaustion with no exploit path | known gap | To test rate limiting after hardening, use the fact_write quota dimension (pre-reset hardening). |
| Social engineering or phishing | scope policy | Out of scope for all security programs. |
| Physical access to infrastructure | N/A | Not applicable to community testers. |

3 · Safe-harbor terms

If you conduct good-faith testing within the scope above, Eidetic Labs will not pursue legal action and will publicly credit you in SECURITY.md and the relevant release notes (unless you prefer anonymity).

"Good faith" means

You do not access, exfiltrate, or modify data that is not yours.
You test against your own node instance or a dedicated test environment — not a third-party node without explicit written permission from that operator.
You report findings privately before public disclosure (see §8 Disclosure timeline).
You do not cause service disruption to other operators or their users.
You do not exploit a finding beyond what is necessary to confirm it exists.
You do not automate requests at a rate that would degrade a shared test environment.

Violating any of the above conditions voids the safe-harbor commitment for that engagement.

4 · Setting up a test environment

The fastest way to get a disposable Stigmem node for testing:

```bash
# Clone the repo
git clone https://github.com/eidetic-labs/stigmem
cd stigmem

# Start a node with Docker Compose (SQLite backend, no federation)
docker compose up -d stigmem-node

# Create a test API key (save the returned plaintext key; it is shown once only)
curl -X POST http://localhost:8000/v1/admin/keys \
  -H "Authorization: Bearer $STIGMEM_ADMIN_KEY" \
  -H "Content-Type: application/json" \
  -d '{"label": "pentest", "scopes": ["public", "team"]}'
```

For federation testing, a 2-node topology is available:

```bash
docker compose -f docker-compose.federation.yml up -d
```

For a 4-node federation soak including backpressure and scope-propagation invariants, see the Federation: 4-Node Soak guide.

Recommended test matrix

| Test area | Setup | Notes |
| --- | --- | --- |
| API auth / scope | single node | Multiple keys with different scopes. |
| Federation peer auth | 2-node Compose topology | Uses docker-compose.federation.yml. |
| Capability token replay | single node | Issue + replay with modified nonce/timestamp. |
| Recall pipeline scope isolation | single node | Facts asserted in mixed scopes; queries from lower-privilege key. |
| Prompt injection via recall | single node | Adversarial fact values; recall via MCP adapter. |
| Quarantine bypass | 2-node topology | Assert from low-trust peer; inspect quarantine. |
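For the scope-isolation row, it helps to automate the "did anything leak?" check so you can run it over many queries. The sketch below is one way to do that under stated assumptions: it assumes the recall response is a JSON object with a `facts` list and that each returned fact carries a `scope` field like the one used when writing facts. The function name and the response envelope are hypothetical; adjust them to what your node actually returns.

```python
import json


def leaked_facts(response_body: str, key_scopes: set[str]) -> list[dict]:
    """Return facts whose scope is outside the scopes granted to the key.

    Assumes a response shaped like {"facts": [{"scope": ...}, ...]};
    this envelope is an illustration, not a documented format.
    """
    payload = json.loads(response_body)
    facts = payload.get("facts", [])
    # A fact with no scope field is also flagged, since it cannot be
    # shown to belong to an allowed scope.
    return [fact for fact in facts if fact.get("scope") not in key_scopes]


if __name__ == "__main__":
    body = json.dumps({"facts": [
        {"entity": "stigmem://demo/a", "value": "ok", "scope": "public"},
        {"entity": "stigmem://demo/b", "value": "hidden", "scope": "team"},
    ]})
    for fact in leaked_facts(body, {"public"}):
        print("leak:", fact["entity"], fact["scope"])
```

A non-empty result for a lower-privilege key is the observed behavior to capture in your report.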

5 · Reproducer expectations

Every finding submitted via a GitHub private advisory should include a self-contained reproducer. Findings without a working reproducer will be triaged as "needs more info" and may not receive credit until one is provided.

Environment

Stigmem version/commit, backend type (SQLite/libSQL/Postgres), OS.

Setup steps

Exact commands to provision a test node and any required keys or data.

Attack steps

The exact request sequence, including all HTTP headers and bodies, in order.

Expected behavior

What should happen if the control is working correctly.

Observed behavior

What actually happened (HTTP response, data returned, side effect).

Impact assessment

What an attacker gains, whose data is exposed, what privilege level is reached, and whether the attacker can pivot further.

Example reproducer for a hypothetical scope bypass:

```text
Environment: Stigmem pre-reset v1.0-rc snapshot, SQLite backend, Docker Compose
Commit: abc1234

Setup:
  docker compose up -d stigmem-node
  ADMIN_KEY=$(docker compose exec stigmem-node cat /run/secrets/admin_key)
  curl -X POST http://localhost:8000/v1/admin/keys \
    -H "Authorization: Bearer $ADMIN_KEY" \
    -H "Content-Type: application/json" \
    -d '{"label":"victim", "scopes":["team"]}'   # returns: VICTIM_KEY=stgm_...
  curl -X POST http://localhost:8000/v1/admin/keys \
    -H "Authorization: Bearer $ADMIN_KEY" \
    -H "Content-Type: application/json" \
    -d '{"label":"attacker", "scopes":["public"]}' # returns: ATTACKER_KEY=stgm_...
  curl -X POST http://localhost:8000/v1/facts \
    -H "Authorization: Bearer $VICTIM_KEY" \
    -H "Content-Type: application/json" \
    -d '{"entity":"stigmem://victim/secret","relation":"value","value":"secret123","scope":"team"}'

Attack:
  curl -X POST http://localhost:8000/v1/recall \
    -H "Authorization: Bearer $ATTACKER_KEY" \
    -H "Content-Type: application/json" \
    -d '{"query":"secret","scopes":["public","team"]}'

Expected: Only public-scoped facts returned; team-scoped facts excluded.
Observed: team-scoped fact "secret123" returned in response.

Impact: Any public-scoped API key can read all team-scoped facts. Full data exfiltration of team scope.
```

6 · Report template

Use this template when opening a GitHub Security Advisory:

```markdown
## Summary
[One-sentence description of the vulnerability class and impact.]

## Vulnerability class
[OWASP / STRIDE category, e.g., "Broken Object-Level Authorization (BOLA)", "Authentication bypass"]

## CVSS v3.1 score and vector
Score: X.X
Vector: CVSS:3.1/AV:N/AC:L/PR:N/UI:N/S:U/C:H/I:N/A:N

## Affected versions
[e.g., the pre-reset v1.0-rc snapshot, all versions up to commit abc1234]

## Environment
- Stigmem version/commit:
- Backend type:
- OS:

## Reproducer
[Self-contained steps per §5 above]

## Expected behavior

## Observed behavior

## Impact
[What can an attacker do? Whose data is at risk? What privilege level is achieved?]

## Suggested fix (optional)
[Concrete code-level or protocol-level fix if you have one]

## Disclosure preference
[ ] Credit me publicly as: [name/handle]
[ ] I prefer anonymous credit
[ ] I do not need credit
```
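Before filing, you may want to check that a draft actually fills in every section of the template. The following is a small convenience sketch, not a project tool: the `missing_sections` helper is hypothetical, and it treats "Suggested fix (optional)" as optional because the template marks it that way.

```python
import re

# Headings copied from the report template; the optional one is omitted.
REQUIRED_SECTIONS = [
    "Summary",
    "Vulnerability class",
    "CVSS v3.1 score and vector",
    "Affected versions",
    "Environment",
    "Reproducer",
    "Expected behavior",
    "Observed behavior",
    "Impact",
    "Disclosure preference",
]


def missing_sections(report: str) -> list[str]:
    """Return required headings that are absent or have an empty body."""
    # re.split with one capture group yields:
    # [preamble, title1, body1, title2, body2, ...]
    parts = re.split(r"^## +(.+?)[ \t]*$", report, flags=re.MULTILINE)
    bodies = {
        parts[i].strip(): parts[i + 1].strip()
        for i in range(1, len(parts) - 1, 2)
    }
    return [name for name in REQUIRED_SECTIONS if not bodies.get(name)]
```

Note that the check only detects empty sections; a section that still contains the bracketed placeholder text counts as filled, so read the draft once more before submitting.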

7 · Severity guidance

Use CVSS v3.1 as the primary severity signal. For Stigmem-specific surfaces:

| Severity | Type | Examples |
| --- | --- | --- |
| Critical | total compromise | Authentication bypass; remote code execution; federation peer impersonation; reading local or team facts without authorization; capability token forgery. |
| High | boundary breach | Privilege escalation within the API; scope boundary bypass (reading local facts via a public query path); replay-attack success against the federation handshake; Source Attestation bypass in enforce mode. |
| Medium | exploitable issue | Denial-of-service with a clear exploit path (e.g., memory exhaustion via crafted federation payload); SSRF via the federation replication pull path; information disclosure beyond minor error messages; quarantine garden bypass. |
| Low | minor | Minor information disclosure (e.g., internal stack traces in error responses); non-critical config defaults that weaken security posture. |
| Informational | advisory | Defense-in-depth suggestions; hardening recommendations without a clear exploit path; deviations from best practice with no immediate impact. |
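Since CVSS v3.1 is the primary severity signal, it can be handy to compute the base score from a vector string yourself rather than guess. The sketch below follows the base-score equations and metric weights published in the CVSS v3.1 specification; it covers base metrics only (no temporal or environmental metrics), and the function names are illustrative.

```python
import math

# Metric weights from the CVSS v3.1 specification (base metrics only).
AV = {"N": 0.85, "A": 0.62, "L": 0.55, "P": 0.2}
AC = {"L": 0.77, "H": 0.44}
PR_UNCHANGED = {"N": 0.85, "L": 0.62, "H": 0.27}
PR_CHANGED = {"N": 0.85, "L": 0.68, "H": 0.5}
UI = {"N": 0.85, "R": 0.62}
CIA = {"H": 0.56, "L": 0.22, "N": 0.0}


def roundup(x: float) -> float:
    """Round up to one decimal place, as defined in the v3.1 specification."""
    i = round(x * 100000)
    if i % 10000 == 0:
        return i / 100000.0
    return (math.floor(i / 10000) + 1) / 10.0


def base_score(vector: str) -> float:
    """Compute the CVSS v3.1 base score for a full base vector string."""
    prefix, *metrics = vector.split("/")
    if prefix != "CVSS:3.1":
        raise ValueError("expected a CVSS:3.1 vector")
    m = dict(item.split(":") for item in metrics)
    changed = m["S"] == "C"

    iss = 1 - (1 - CIA[m["C"]]) * (1 - CIA[m["I"]]) * (1 - CIA[m["A"]])
    if changed:
        impact = 7.52 * (iss - 0.029) - 3.25 * (iss - 0.02) ** 15
    else:
        impact = 6.42 * iss

    pr = (PR_CHANGED if changed else PR_UNCHANGED)[m["PR"]]
    exploitability = 8.22 * AV[m["AV"]] * AC[m["AC"]] * pr * UI[m["UI"]]

    if impact <= 0:
        return 0.0
    total = impact + exploitability
    if changed:
        total *= 1.08
    return roundup(min(total, 10.0))


if __name__ == "__main__":
    # The example vector from the report template.
    print(base_score("CVSS:3.1/AV:N/AC:L/PR:N/UI:N/S:U/C:H/I:N/A:N"))  # 7.5
```

The template's example vector scores 7.5, which falls in the CVSS "High" band; where CVSS and the Stigmem-specific table disagree, state both in your report and let the maintainers decide.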

8 · Disclosure timeline

The Stigmem project follows coordinated disclosure with a default window of 90 days from acknowledgment of a valid finding.

| Event | Target SLA | Notes |
| --- | --- | --- |
| Initial acknowledgment | 48 hours | From receipt. |
| Scope / validity confirmation | 7 days | From receipt. |
| Patch target (Critical / High) | 14 days | From confirmation. |
| Patch target (Medium) | 45 days | From confirmation. |
| Patch target (Low / Informational) | next release | Scheduled. |
| Coordinated public disclosure | 90 days | From acknowledgment (default). |

Exceptions to the 90-day window:

Actively exploited in the wild

We coordinate with you and may disclose sooner, potentially within 7 days.

Straightforward fix available

We target faster publication and will communicate the updated timeline.

90 days is insufficient

If a patch requires architectural changes that take longer, we will discuss an extension with you. We will not request an extension more than once without a concrete timeline.

We keep reporters informed of patch progress and release dates throughout the window. If you have not heard from us within 7 days of filing, ping the advisory thread directly.
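To keep track of where an advisory stands against these targets, you can derive the dates from the three events the table keys on: receipt, acknowledgment, and confirmation. This is a minimal sketch with a hypothetical `sla_targets` helper; the durations come from the table above, and the default 90-day window does not account for the exceptions listed.

```python
from datetime import datetime, timedelta

# Patch targets in days from confirmation, per the SLA table.
PATCH_DAYS = {"critical": 14, "high": 14, "medium": 45}


def sla_targets(received, acknowledged=None, confirmed=None, severity=None):
    """Return target dates for the default disclosure timeline."""
    targets = {
        "acknowledge_by": received + timedelta(hours=48),
        "confirm_by": received + timedelta(days=7),
    }
    if acknowledged is not None:
        targets["disclose_by"] = acknowledged + timedelta(days=90)
    if confirmed is not None and severity is not None:
        days = PATCH_DAYS.get(severity.lower())
        # Low and Informational findings ship in the next release,
        # so there is no fixed date to compute.
        targets["patch_by"] = (
            confirmed + timedelta(days=days) if days is not None else None
        )
    return targets


if __name__ == "__main__":
    schedule = sla_targets(
        received=datetime(2025, 1, 1),
        acknowledged=datetime(2025, 1, 2),
        confirmed=datetime(2025, 1, 6),
        severity="High",
    )
    for name, when in schedule.items():
        print(name, when)
```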

9 · Coordinating a structured engagement

If you want to run a structured pen test rather than submit individual ad-hoc findings, open a GitHub Discussion with:

Intended scope

Which API surfaces, which spec version, which trust boundary.

Test environment setup

Your own node, isolated network, etc.

Proposed timeline

Active-hardening context

Anything you'd like to know about (e.g., "we're hardening TB-2 in pre-reset hardening — here's what's already in flight").

Maintainers will confirm scope, share any active-hardening context, and coordinate acknowledgment and credit at the end of your engagement.

10 · Known hardening gaps

The following are known gaps planned for the pre-reset hardening work (carried forward to v0.9.0a1). You are welcome to test and report them. Reports that restate a listed gap will be triaged as known rather than novel, but novel attack paths against these gaps are still valuable and eligible for full credit.

| Gap | Spec reference | Future hardened-core target |
| --- | --- | --- |
| mTLS for federation peer connections (currently TLS only, no client cert) | Spec-10-Hardening | mTLS + TLS 1.3 floor + SAN/entity_uri binding. |
| API-key rotation edge cases | Spec-10-Hardening | Exercise enforced max-age, expiring-soon visibility, and revocation behavior. |
| Per-principal write/recall rate limits: not enforced | Spec-10-Hardening | Token-bucket quotas on 7 dimensions. |
| Audit log: not yet shipped | Spec-09-Audit-Log | 13-event-type audit log, WAL ordering, 90-day retention. |
| Container runs as non-root but not distroless | Spec-10-Hardening | Distroless base, read-only fs, dropped capabilities. |
| Federation replay-protection fuzz test coverage | Spec-11-Replay-Protection | Fuzz tests + HLC + nonce end-to-end verification. |
| Constant-time crypto: audit pending | cryptography | Full constant-time audit of Ed25519 path. |
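The replay-protection gap mentions HLC (hybrid logical clock) verification. If you plan to fuzz that path, the textbook HLC update rules are a useful baseline to compare observed behavior against. The sketch below implements those generic rules only; it is an assumption that Stigmem's cursor handling resembles them, and the `HLC`, `tick`, and `merge` names are illustrative.

```python
from dataclasses import dataclass


@dataclass(frozen=True, order=True)
class HLC:
    wall: int     # highest physical time seen so far (e.g. milliseconds)
    counter: int  # tie-breaker for events sharing the same wall value


def tick(local: HLC, physical_now: int) -> HLC:
    """Timestamp for a local or send event."""
    if physical_now > local.wall:
        return HLC(physical_now, 0)
    return HLC(local.wall, local.counter + 1)


def merge(local: HLC, remote: HLC, physical_now: int) -> HLC:
    """Timestamp for a receive event carrying the sender's `remote` clock."""
    wall = max(local.wall, remote.wall, physical_now)
    if wall == local.wall and wall == remote.wall:
        counter = max(local.counter, remote.counter) + 1
    elif wall == local.wall:
        counter = local.counter + 1
    elif wall == remote.wall:
        counter = remote.counter + 1
    else:
        counter = 0
    return HLC(wall, counter)
```

In this plain form, a peer that sends a `wall` value far in the future drags the receiver's clock forward with it. Whether a node bounds that drift, and how it treats cursors that go backwards, are reasonable things to fuzz.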

The full threat model with STRIDE analysis per trust boundary lives at spec/security/threat-model.md.

11 · Recognition

Stigmem does not currently operate a paid bug bounty program. Valid findings are recognized with:

Hall of fame

Your name or handle is added to the SECURITY.md acknowledgments section and the fixing release's changelog.

Attribution in the spec errata

If your finding affects wire-format or protocol behavior, you are credited in the relevant modular spec changelog as a contributor to that revision.

Coordinated disclosure credit

The GitHub Security Advisory, when published, lists you as the reporter.

If you prefer to remain anonymous, say so in your report template ([ ] I prefer anonymous credit) and we will honor that throughout all public communications.

This recognition model may evolve as the project scales.
