confidencelisted

Rates confidence that the current work fully solves the stated requirement. Supports plan validation, code review, and analysis (root-cause, refactor, diagnose) modes. Plan mode combines LLM judgment with deterministic rule checks (multi-signal gate); a failed rule caps the gate at 89% regardless of LLM score. Use before committing to autonomous execution, after implementation, or during investigation. Triggers on "confidence check", "validate plan", "rate confidence", "quality gate", "/confidence".
mthines/agent-skills · ★ 5 · Code & Development · score 83

Install: claude install-skill mthines/agent-skills

# Confidence Assessment Rate your confidence that the current work fully solves the stated requirement. > **Multi-signal evaluation.** A single LLM-confidence number is unreliable as > a stand-alone gate (token probability ≠ correctness). This skill combines > the LLM's dimensional scoring with **deterministic rule checks** the agent > must run alongside. The final score is gated on BOTH passing. ## Contents - [Mode Detection](#mode-detection) - [Assessment Dimensions](#assessment-dimensions) - [For `plan` mode](#for-plan-mode) — multi-signal: LLM scoring + rule checks (89% cap on failure) - [For `code` mode](#for-code-mode) - [For `analysis` mode](#for-analysis-mode) - [Output Format](#output-format) - [Score Thresholds](#score-thresholds) - [Iteration Protocol (plan mode)](#iteration-protocol-plan-mode) - [Auto-Fix (Fix Mode Only)](#auto-fix-fix-mode-only) --- ## Mode Detection Check the arguments: `$ARGUMENTS` | Argument | Default | Validates | When to use | | ---------------- | ------- | ------------------------------------------------------------------ | ---------------------------------------------------------------------------- | | `plan` | | Implementation plan completeness | After Phase 1 planning, before autonomous execution | | `code` | *