Install: claude install-skill mthines/agent-skills
# Confidence Assessment
Rate your confidence that the current work fully solves the stated requirement.
> **Multi-signal evaluation.** A single LLM-confidence number is unreliable as
> a standalone gate (token probability ≠ correctness). This skill combines
> the LLM's dimensional scoring with **deterministic rule checks** that the agent
> must run alongside it. The final score is gated on BOTH signals passing.
## Contents
- [Mode Detection](#mode-detection)
- [Assessment Dimensions](#assessment-dimensions)
- [For `plan` mode](#for-plan-mode) — multi-signal: LLM scoring + rule checks (89% cap on failure)
- [For `code` mode](#for-code-mode)
- [For `analysis` mode](#for-analysis-mode)
- [Output Format](#output-format)
- [Score Thresholds](#score-thresholds)
- [Iteration Protocol (plan mode)](#iteration-protocol-plan-mode)
- [Auto-Fix (Fix Mode Only)](#auto-fix-fix-mode-only)
---
## Mode Detection
Check the arguments: `$ARGUMENTS`
| Argument | Default | Validates | When to use |
| ---------------- | ------- | ------------------------------------------------------------------ | ---------------------------------------------------------------------------- |
| `plan` | | Implementation plan completeness | After Phase 1 planning, before autonomous execution |
| `code` | *