os-improvement-report

Trigger with "show me the improvement chart", "how are we improving", "progress report", "graph the eval scores", "show cycle of improvement", "what's the trend", "are we getting better". Produces a visual/text summary of how the agentic loop is improving across cycles. Do NOT use this to run the learning loop or evaluate a specific skill change.
richfrem/agent-plugins-skills · ★ 3 · Data & Documents · score 67
Install: claude install-skill richfrem/agent-plugins-skills
## Dependencies

This skill requires **Python 3.8+**, `pandas`, and `matplotlib`.

**To install this skill's dependencies:**

```bash
pip-compile ./requirements.in
pip install -r ./requirements.txt
```

See `./requirements.txt` for the dependency lockfile.

---

# Loop Progress Report

Visual and text reporting on the agentic loop improvement cycle, across any plugin that maintains an `improvement-ledger.md` and `results.tsv` per skill.

The reference output is the autoresearch progress chart: green KEEP dots on a timeline, gray DISCARD dots, running-best step line, annotations showing what each improvement was. This skill produces the same chart for agentic-os and exploration-cycle-plugin improvement cycles.

---

## What It Reads

| Source | Priority | Content |
|--------|----------|---------|
| `context/experiment-log/index.md` | **Primary** | All logged runs; filter `result_type: numeric` for KEEP/DISCARD/score data from orchestrator runs |
| `context/memory/improvement-ledger.md` | Legacy fallback | Eval score progression written by os-improvement-loop Stage 4.7; used if experiment log has no numeric entries |
| `.agents/skills/*/evals/results.tsv` | Supplement | Per-skill detailed eval score history |

The experiment log is the unified source of truth for numeric results. The improvement ledger is a legacy format maintained for backward compatibility with older loop runs.

---

## What It Produces

| Output | Description |
|--------|-------------|
| `context/memory/repo