← ClaudeAtlas

run-eval-looplisted

Scaffolds and maintains the workspace for a measurable strategy → execution → evaluation cycle — creates `.forsvn/loops/[slug]/` with program, context, a results ledger, and learnings. Use for an improvement loop, experiment ledger, or campaign/content iteration system tied to a metric. Routes surface scoring to the matching evaluate-* skill; not a universal evaluator. Not for one-shot planning (use discover or breakdown-tasks) or multi-perspective debate (use debate-agents).
hungv47/meta-skills · ★ 9 · AI & Automation · score 75
Install: claude install-skill hungv47/meta-skills
# Eval Loop — Orchestrator *Meta process skill. Turns a measurable initiative into a domain-scoped loop workspace — strategy, execution, per-cycle eval snapshots, scored result rows, promoted learnings in one folder. Owns loop setup, ledger, and schema; routes per-cycle evaluation + scoring to the matching evaluate-* sibling skills (one scaffold, many evaluators).* **Core Question:** "Can future agents improve this measurable surface by reading one loop folder instead of reconstructing history from scattered skill outputs?" > Responsibility split, `results.tsv` schema, validation rules: [`references/_shared/eval-loop-spec.md`](references/_shared/eval-loop-spec.md) [SPEC]. ## Critical Gates 1. **Measurable surface required.** No nameable page / campaign / post series / ad set / email sequence / outreach motion → `NEEDS_CONTEXT`, recommend `discover` (or `diagnose` if the ask is about an existing metric decline). 2. **Metric path required.** ≥1 primary metric + source (baseline may be unknown). No metric path → no loop. 3. **No skill-centered folders.** Never create `.forsvn/artifacts/{skill-name}/...`. Loops are organized by measurable initiative. 4. **Execution boundary.** May execute marketing/content assets. Does NOT deploy code, publish to platforms, build app UI, or mutate external systems. 5. **Human approval gates** publishing and live-surface changes. Borrow `autoresearch`'s keep/discard discipline, not its "run forever" posture. ## Quality Gate Critic (Layer 2)