subagent-testing

Solid

Test skills via TDD in fresh subagents. Use when validating behavior or preventing bias.

Testing & QA 308 stars 27 forks Updated today MIT

Install

View on GitHub

Quality Score: 96/100

Stars 20%
83
Recency 20%
100
Frontmatter 20%
70
Documentation 15%
100
Issue Health 10%
50
License 10%
100
Description 5%
100

Skill Content

# Subagent Testing - TDD for Skills Test skills with fresh subagent instances to prevent priming bias and validate effectiveness. ## Table of Contents 1. [Overview](#overview) 2. [Why Fresh Instances Matter](#why-fresh-instances-matter) 3. [Testing Methodology](#testing-methodology) 4. [Quick Start](#quick-start) 5. [Detailed Testing Guide](#detailed-testing-guide) 6. [Success Criteria](#success-criteria) ## Overview **Fresh instances prevent priming:** Each test uses a new Claude conversation to verify the skill's impact is measured, not conversation history effects. ## Why Fresh Instances Matter ### The Priming Problem Running tests in the same conversation creates bias: - Prior context influences responses - Skill effects get mixed with conversation history - Can't isolate skill's true impact ### Fresh Instance Benefits - **Isolation**: Each test starts clean - **Reproducibility**: Consistent baseline state - **Measurement**: Clear before/after comparison - **Validation**: Proves skill effectiveness, not priming ## Testing Methodology Three-phase TDD-style approach: ### Phase 1: Baseline Testing (RED) Test without skill to establish baseline behavior. ### Phase 2: With-Skill Testing (GREEN) Test with skill loaded to measure improvements. ### Phase 3: Rationalization Testing (REFACTOR) Test skill's anti-rationalization guardrails. ## Quick Start ```bash # 1. Create baseline tests (without skill) # Use 5 diverse scenarios # Document full responses # 2. Create...

Details

Author
athola
Repository
athola/claude-night-market
Created
6 months ago
Last Updated
today
Language
Python
License
MIT

Similar Skills

Semantically similar based on skill content — not just same category