silent-failure-detectionlisted
Install: claude install-skill Sandeeprdy1729/claude-design-skill
# Silent Failure Detection
The most dangerous Claude output is the one that sounds exactly right and is wrong.
Claude is a language model. It predicts plausible next tokens. Plausibility is not
accuracy. The output that sounds most confident — specific dates, exact numbers,
named causal mechanisms — is often the output most worth probing. Confidence is a
stylistic property, not an epistemic one.
This skill operationalizes the interrogation techniques that expose the gap between
apparent confidence and actual calibration. It makes overconfidence visible before
it causes damage.
---
## SLASH COMMANDS
| Command | Action |
| --- | --- |
| `/probe <output>` | Run the full interrogation protocol on an output |
| `/assumptions` | List every assumption embedded in a given output |
| `/confidence-audit` | Re-score every claim by actual evidence quality |
| `/falsify <claim>` | Find the condition under which a claim would be false |
| `/source-check` | For every specific claim, ask: where does this come from? |
| `/invert <claim>` | Argue the opposite of the claim — what's the case against it? |
| `/boundary <claim>` | Find the conditions under which the claim stops being true |
| `/specificity-trap` | Probe all specific numbers, dates, names for hallucination risk |
| `/mechanism-check <claim>` | Demand the causal mechanism — not just the conclusion |
| `/training-vs-source` | Distinguish what comes from training data vs. the provided context |
| `/calibrate` | Output every unce