← ClaudeAtlas

retrolisted

Post-run retrospective: reads .experiments/ JSONL, computes Wilcoxon significance, detects dead iterations, flags suspicious jumps, generates next-hypothesis queue for --hypothesis flag.
Borda/AI-Rig · ★ 19 · AI & Automation · score 77
Install: claude install-skill Borda/AI-Rig
<objective> Post-run retrospective analysis. After `/research:run` completes, reads `.experiments/state/<run-id>/experiments.jsonl`, computes statistical significance, detects dead iterations, flags suspicious metric jumps, generates learning summary with next-hypothesis queue. NOT for: running experiments (use `/research:run`); designing experiments (use `/research:plan`); validating methodology (use `/research:judge`); verifying paper implementation (use `/research:verify`); comparing runs from different programs or goals — `--compare` is valid only for same-program, same-metric runs. Read-only — never modifies code, commits, or experiment state. </objective> <workflow> ## Agent Resolution ```bash _RESEARCH_SHARED=$(python "${CLAUDE_PLUGIN_ROOT:-plugins/research}/bin/resolve_shared.py" 2>/dev/null) # timeout: 5000 [ -z "$_RESEARCH_SHARED" ] && { echo "! Plugin path resolution failed — ensure research plugin installed and CLAUDE_PLUGIN_ROOT set, or invoke /research:retro from project root."; exit 1; } ``` Read `$_RESEARCH_SHARED/agent-resolution.md` (use the path printed by the bash block above — substitute the resolved value, do not pass the literal `$_RESEARCH_SHARED` string to the Read tool). Contains: foundry check + fallback table. `research:scientist` in same plugin — no fallback needed if research plugin installed. ## Retro Mode (Steps T1–T7) Triggered by `retro`, `retro <run-id>`, or `retro <run-id> --compare <run-id-2>`. **Defaults**: `--threshold 0.001`,