structured investigation protocol
A 5-phase review protocol that forces systematic coverage through pre-commitment predictions, multi-perspective investigation, explicit gap analysis, confidence-gated self-audit, and pragmatic severity calibration.
Before reading the work in detail, predict the 3-5 most likely problem areas based on type and domain. This activates deliberate search — instead of passively reading and reacting, the reviewer enters with specific hypotheses to confirm or reject.
Every technical claim is verified against the actual codebase. No assertion is taken on trust. The approach adapts to the artifact type:
The work is re-examined from distinct angles adapted to what's being reviewed:
Starts in THOROUGH mode. Escalates to ADVERSARIAL if serious problems surface during Phases 2–4 — actively hunting for hidden issues and expanding scope.
This is the phase that makes the skill useful. The reviewer explicitly searches for what isn't there — the omissions, the unconsidered, the conveniently skipped:
A metacognitive check on all findings before finalizing. Each CRITICAL/MAJOR finding is assessed for confidence level, refutability, and whether it's a genuine flaw vs. a stylistic preference. Low-confidence findings are moved to Open Questions rather than scored sections.
Pragmatic severity calibration for CRITICAL and MAJOR findings. Each high-severity finding is pressure-tested with four questions:
Findings where real-world impact doesn't match the label get downgraded. Every downgrade must include a "Mitigated by: ..." statement explaining the real-world factor that justifies the lower severity. Recalibrations are reported in the Verdict Justification.
Findings are compared against pre-commitment predictions, calibrated for severity, and assembled into a structured verdict. The synthesis explicitly tracks which predictions were confirmed, which were wrong, and what was found that wasn't predicted.
Every review produces a report with these exact sections