nw-abr-critique-dimensions

Featured

Review dimensions for validating agent quality - template compliance, safety, testing, and priority validation

Testing & QA 508 stars 53 forks Updated 5 days ago MIT

Install

View on GitHub

Quality Score: 97/100

Stars 20%
90
Recency 20%
100
Frontmatter 20%
70
Documentation 15%
100
Issue Health 10%
50
License 10%
100
Description 5%
100

Skill Content

# Agent Quality Critique Dimensions Use these dimensions when reviewing or validating agent definitions. ## Dimension 1: Template Compliance Does the agent follow official Claude Code format? **Check**: YAML frontmatter with name and description (required) | Markdown body as system prompt | No embedded YAML config blocks | No activation-instructions or IDE-FILE-RESOLUTION sections | Skills referenced in frontmatter, not inline **Severity**: High -- non-compliant agents may not load correctly. ## Dimension 2: Size and Focus **Check**: Core definition under 400 lines | Domain knowledge in Skills | Single clear responsibility | No monolithic sections (>50 lines without structure) | No redundant Claude default behaviors **Measurement**: `wc -l {agent-file}`. Target: 200-400 lines. **Severity**: High -- oversized agents suffer context rot. ## Dimension 3: Divergence Quality Does the agent specify only what diverges from Claude defaults? **Check**: No file operation instructions | No generic quality principles ("be thorough") | No tool usage guidelines | Core principles are domain-specific and non-obvious | Each instruction justifies why Claude wouldn't do this naturally **Severity**: Medium -- redundant instructions waste tokens, cause overtriggering. ## Dimension 4: Safety Implementation **Check**: Tools restricted via frontmatter `tools` field | maxTurns set | No prose-based security layers (use hooks) | No embedded enterprise safety frameworks | permissionMode se...

Details

Author
nWave-ai
Repository
nWave-ai/nWave
Created
3 months ago
Last Updated
5 days ago
Language
Python
License
MIT

Similar Skills

Semantically similar based on skill content — not just same category