radar-self-evallisted

Compute the radar's self-calibration: weekly funnel metrics (queue dynamics, exploration compliance, off-axis rate), a monthly hit/miss retrospective against what actually became big, and up to 3 curator proposals. Use during every weekly recalibration, after the source-strategy review; results go into the append-only `calibration` section of TRENDS.md and into the weekly report.
Neetx/ai-research-radar · ★ 2 · AI & Automation · score 65

Install: claude install-skill Neetx/ai-research-radar

# Self-evaluation The radar must measure whether it is winning, not assume it. Three parts: metrics (every week), retrospective (monthly), curator proposals (every week). ## 1. Weekly metrics Compute from the week's daily reports (primary source — they list ledger changes) and, where needed, `git log -p -- TRENDS.md` since the previous weekly commit: - **queue funnel**: items added / promoted to trend / dropped / older than 14 days and still unverified (stale) - **ledger**: evidence items added, stage moves (up and down) - **exploration compliance**: daily runs whose `source_rotation` line contains a venue-exploration entry ÷ daily runs executed - **off-axis rate**: share of new queue items that do NOT match any axis in `strategy_notes` (judgment call — name them) - **discovery lag**: for each evidence item added this week, the days between its evidence-line date and the date it entered the ledger (commit date via `git log -p -- TRENDS.md`, or the daily reports). Report the median, split by channel — exploration finds vs queue promotions (backfill) — plus the backfill share of all new evidence. This is the daily-ness KPI. Append ONE dated line to `## calibration` in TRENDS.md: `- YYYY-MM-DD — W<nn>: queue +a/→p/−d/stale s · evidence +e · moves m · exploration c/r · off-axis o/a · lag expl Xd / backfill Yd (Z%)` and include the same numbers, readable, in the weekly report. Interpretation thresholds (act, don't just log): - exploration compliance < 100% → ca