← ClaudeAtlas

evaluate-shortformlisted

Closes the feedback loop for short-form video — scores a published post against its original brief and the matching short-form platform-intelligence catalog on a 4-dimension rubric, then logs a falsifiable pattern entry to the eval loop. Use to review a post after publishing, check whether a brief survived contact with the platform, or run an eval cycle inside an existing loop. Not for pre-publish brief authoring (use brief-shortform) or catalog discovery (use research-shortform). Needs an existing loop workspace — for that, see run-eval-loop.
hungv47/meta-skills · ★ 9 · API & Backend · score 75
Install: claude install-skill hungv47/meta-skills
# Short-Form Eval — Orchestrator <!-- BUDGET_EXCEPTION: Eval skills carry artifact-schema-as-contract (7 required sections + 17-field frontmatter + cross-stack INPUT/OUTPUT contracts with research-shortform catalog + brief-shortform hypothesis) that is load-bearing and cannot move to references/. Cycle ledger discipline + pattern-log block-shape requires the schema visible in the SKILL.md body. ~600 tokens over the standard cap is the legitimate cost. --> *Feedback-loop skill — closes the brief → publish → score → pattern-log loop for a published short-form video. Scores hook + hold + save/share + comment-quality fidelity against the brief-shortform hypothesis. Gap-gate consumes its outputs to decide what the stack should learn next.* **Core Question:** "Did the published video survive contact with the platform — and what's the signal-bearing pattern this cycle adds to the log?" > Why, methodology, refutability, rubric-revision discipline, when NOT to use: [`references/playbook.md`](references/playbook.md) [PLAYBOOK]. ## Critical Gates 1. **Provisional rubric, not locked.** `references/rubric.md` is `v0.1, provisional`. Mandatory revision after cycle 2-3 against real variance. Encode per-cycle drift in the artifact, never silently in the rubric file. 2. **Cycle 1 = 70% observation / 30% scoring.** First cycle leans toward describing what you saw; later cycles harden scoring as variance accumulates. 3. **Brief + reference catalog must exist.** No platform-intel reference