eval-skills

Solid

Audit all skills in the current project for frontmatter completeness, effort level appropriateness, allowed-tools scoping, and content quality. Produces a scored report with effort-level recommendations for each skill. Use when onboarding to a new project, reviewing skill quality before shipping, or adding effort fields to an existing skill library.

AI & Automation 4,608 stars 615 forks Updated 2 days ago CC-BY-SA-4.0

Install

View on GitHub

Quality Score: 96/100

Stars 20%
100
Recency 20%
100
Frontmatter 20%
70
Documentation 15%
100
Issue Health 10%
50
License 10%
100
Description 5%
100

Skill Content

# Skill Evaluator Discover all skills in the project, score them across 6 criteria, and infer the appropriate `effort` level based on content analysis. ## When to Use - New project: run once to establish baseline quality - Before committing a skill to a team repo - After bulk-importing skills from another project - When adding `effort` fields for the first time (v2.1.80+) ## What Gets Audited All `SKILL.md` files and flat `.md` files found in: - `.claude/skills/**` - `~/.claude/skills/**` (if requested) - Any path passed as argument: `/eval-skills ./my-skills-dir` --- ## Pre-Check: Use the Official Validator Before scoring manually, run the official CLI validator first — it catches structural issues in seconds: ```bash # Install (one-time) uv tool install skills-ref # Validate a skill directory skills-ref validate ./my-skill # Validate all skills in project find .claude/skills -name "SKILL.md" -exec dirname {} \; | xargs -I{} skills-ref validate {} ``` If `skills-ref` passes cleanly, proceed to quality scoring below. --- ## Scoring Criteria (14 pts per skill) | # | Criterion | Max | What is checked | |---|-----------|-----|-----------------| | 1 | **name** | 1 | Present, lowercase, hyphens only, matches directory name | | 2 | **description** | 2 | Present + has "Use when" / "when to" / trigger phrasing | | 3 | **allowed-tools** | 2 | Present + not overly broad (Bash without scoping when read-only) | | 4 | **effort** | 3 | Present (1pt) + appropriate for content...

Details

Author
FlorianBruniaux
Repository
FlorianBruniaux/claude-code-ultimate-guide
Created
4 months ago
Last Updated
2 days ago
Language
TypeScript
License
CC-BY-SA-4.0

Integrates with

Similar Skills

Semantically similar based on skill content — not just same category

AI & Automation Listed

skill-evaluator

Evaluates agent skills against Anthropic's best practices. Use when asked to review, evaluate, assess, or audit a skill for quality. Analyzes SKILL.md structure, naming conventions, description quality, content organization, and identifies anti-patterns. Produces actionable improvement recommendations.

385 Updated 4 months ago
gotalab
Code & Development Listed

skill-auditor

Audits and validates skill definitions for quality, completeness, and best practices. Use when reviewing existing skills for proper formatting, evaluating if skills should be split into sub-skills, or ensuring skills follow established conventions.

335 Updated today
aiskillstore
Data & Documents Listed

skill-quality-eval

Validate and audit quality of AI agent slash commands (.claude/commands/*.md) — checks YAML frontmatter, description length, jargon detection, directive language, and duplicate detection. Use when creating, editing, or auditing project commands and skills. Run proactively after adding new commands.

1 Updated today
citedy
AI & Automation Solid

skills-eval

Evaluate Claude skill quality through auditing. Use when reviewing or auditing skills.

297 Updated today
athola
Code & Development Listed

skill-health-check

Validate skill quality against routing, progressive loading, and verification criteria. Use when: auditing skills, checking skill health, reviewing skill design. Not for: code review (use codex-code-review) or doc review (use doc-review). Output: health report with per-skill ratings + Gate.

1 Updated today
hmj1026