skill-evaluator

Evaluates agent skills against Anthropic's best practices. Use when asked to review, evaluate, assess, or audit a skill for quality. Analyzes SKILL.md structure, naming conventions, description quality, and content organization, and identifies anti-patterns. Produces actionable improvement recommendations.
gotalab/skillport · ★ 386 · AI & Automation · score 83

Install: claude install-skill gotalab/skillport

# Skill Evaluator (WIP)

Evaluates skills against Anthropic's official best practices for agent skill authoring. Produces structured evaluation reports with scores and actionable recommendations.

## Quick Start

1. Read the skill's SKILL.md and understand its purpose
2. Run automated validation: `scripts/validate_skill.py <skill-path>`
3. Perform manual evaluation against criteria below
4. Generate evaluation report with scores and recommendations

## Evaluation Workflow

### Step 1: Automated Validation

Run the validation script first:

```bash
scripts/validate_skill.py <path/to/skill>
```

This checks:

- SKILL.md exists with valid YAML frontmatter
- Name follows conventions (lowercase, hyphens, max 64 chars)
- Description is present and under 1024 chars
- Body is under 500 lines
- File references are one-level deep

### Step 2: Manual Evaluation

Evaluate each dimension and assign a score (1-5):

#### A. Naming (Weight: 10%)

| Score | Criteria |
|-------|----------|
| 5 | Gerund form (-ing), clear purpose, memorable |
| 4 | Descriptive, follows conventions |
| 3 | Acceptable but could be clearer |
| 2 | Vague or misleading |
| 1 | Violates naming rules |

**Rules**: Max 64 chars, lowercase + numbers + hyphens only, no reserved words (anthropic, claude), no XML tags.

**Good**: `processing-pdfs`, `analyzing-spreadsheets`, `building-dashboards`

**Bad**: `pdf`, `my-skill`, `ClaudeHelper`, `anthropic-tools`

#### B. Description (Weight: 20%)

| Score | Criteria |
|-------|--