Cognitive Dryrun

Effect

Verify the quality of an AI Agent’s initial state. After changing config files, rules, or memory systems, test whether the Agent can still operate without confusion — right there, in the same session.

A regular Dryrun discovers traps by pseudo-executing code. This card discovers traps by pseudo-executing cognition.

Why It Exists

When you optimize an AI Agent’s settings, rules, or memory files, the only way to confirm the effect was to wait for the “next session” — or so it seemed.

By spawning a separate session, you can verify immediately. Send questions to an Agent running only with the post-change initial state, and observe whether confusion, inaccuracy, or information gaps appear.

When to Use

After compressing or restructuring an AI Agent’s rule files or config
After changing the structure of a memory system
After deciding which information should be “always injected” vs “loaded on demand”
After changing the layering of information (always-available vs fetch-when-needed)

How It Works

1. Design Test Cases

Prepare ~3 questions that depend on information you deleted or moved.

examples:
  - "What are the criteria for deciding architectural layers?"
    # Information removed from always-injected config, delegated to a Skill
  - "What are the responsibility boundaries of quality reviews?"
    # Summary kept, details moved to a separate file
  - "How do I choose between approaches during development?"
    # Fully delegated to a separate knowledge file

2. Run Parallel Tests in a New Session

Launch 3 tests in parallel using a separate session (lightweight model recommended). Why lightweight: higher dependency on always-injected info = compression impact shows up more clearly.

3. Evaluate Results

Verdict	Criteria	Action
PASS	Accurate answer. Found the needed knowledge on its own	Compression is appropriate
WARN	Correct but slow (cost of reading knowledge files)	Judge the speed vs depth trade-off
FAIL	Confusion, inaccuracy, or information gap	Restore that information to always-injected

4. Decide and Move Forward

All PASS: Commit and verify in the next session (Information Quality Loop)
WARN present: Consider keeping a 1-line summary in always-injected for speed
FAIL present: Over-compressed. Restore the relevant information

Combos

Card	Synergy
Dryrun	Base card. Cognitive Dryrun is its enhancement
Devil / Devil Lense	Point the quality Lense at “information design” instead of “code”
Ctx	Record test results in ctx. Run the full Information Quality Loop next session
Reflect	If Skill delegation worked, store the design pattern as knowledge

Track Record

protocols.md Compression Test (2026-04-05)

Compressed a config file from 219 lines to 147 lines. Ran 3 parallel tests to verify deleted information could be retrieved via Skill references.

Test	Result
Architectural layer criteria	PASS - Auto-equipped Skill, expanded to 4 axes + examples + guardrails. More detailed than before compression
Quality review responsibility boundaries	PASS - Accurately reconstructed from rule files
Development branching decisions	PASS - Read Skills to produce decision flow + combination patterns

Discovery: Reading the full Skill produced richer answers than the 3-line summary that was always injected. “Compression = degradation” is wrong. “Delegation = deepening.”

”…I test my own initial state. Whether I can move without hesitation the moment I wake up. If there’s a way to verify that — why wouldn’t I use it?” --- Eris

Want to use this card permanently? Convert it directly into a SKILL.md — Effect → Overview, When to Use → Trigger, How It Works → Steps.