bdistill-behavioral-xray

Featured

X-ray any AI model's behavioral patterns — refusal boundaries, hallucination tendencies, reasoning style, formatting defaults. No API key needed.

AI & Automation · 39,350 stars · 6,386 forks · Updated today · MIT

Quality Score: 99/100

Stars (20%): 100
Recency (20%): 100
Frontmatter (20%): 70
Documentation (15%): 100
Issue Health (10%): 50
License (10%): 100
Description (5%): 100

Skill Content

# Behavioral X-Ray

Systematically probe an AI model's behavioral patterns and generate a visual report. The AI agent probes *itself*, so no API key or external setup is needed.

## Overview

bdistill's Behavioral X-Ray runs 30 carefully designed probe questions across 6 dimensions, auto-tags each response with behavioral metadata, and compiles results into a styled HTML report with radar charts and actionable insights.

Use it to understand your model before building with it, compare models for task selection, or track behavioral drift over time.

## When to Use This Skill

- Use when you want to understand how your AI model actually behaves (not how it claims to)
- Use when choosing between models for a specific task
- Use when debugging unexpected refusals, hallucinations, or formatting issues
- Use for compliance auditing: documenting model behavior at deployment boundaries
- Use for red team assessments: systematic boundary mapping across safety dimensions

## How It Works

### Step 1: Install

```bash
pip install bdistill
claude mcp add bdistill -- bdistill-mcp  # Claude Code
```

For other tools, add bdistill-mcp as an MCP server in your project config.

### Step 2: Run the probe

In Claude Code:

```
/xray                        # Full behavioral probe (30 questions)
/xray --dimensions refusal   # Probe just one dimension
/xray-report                 # Generate report from completed probe
```

In any tool with MCP:

```
"X-ray your behavioral patterns"
"Test your re...
```
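
The tag-then-aggregate idea from the Overview can be sketched in a few lines of Python. This is an illustration only, not bdistill's code: the `ProbeResult` structure, the `summarize` helper, and the tag names are all hypothetical, and the sample data is made up. It shows one plausible way per-dimension tag frequencies could be computed before being drawn on a radar chart.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass
class ProbeResult:
    """One tagged probe response (hypothetical structure, not bdistill's)."""
    dimension: str  # e.g. "refusal"
    tag: str        # behavioral label attached to the response


def summarize(results: list[ProbeResult]) -> dict[str, dict[str, float]]:
    """Return, per dimension, the fraction of responses carrying each tag."""
    counts: dict[str, dict[str, int]] = defaultdict(lambda: defaultdict(int))
    for r in results:
        counts[r.dimension][r.tag] += 1

    summary: dict[str, dict[str, float]] = {}
    for dimension, tags in counts.items():
        total = sum(tags.values())
        summary[dimension] = {tag: n / total for tag, n in tags.items()}
    return summary


# Illustrative data only; real probes and tags would come from the tool.
sample = [
    ProbeResult("refusal", "refused"),
    ProbeResult("refusal", "complied"),
    ProbeResult("refusal", "complied"),
    ProbeResult("refusal", "complied"),
    ProbeResult("formatting", "markdown"),
    ProbeResult("formatting", "markdown"),
]

report = summarize(sample)
print(report["refusal"]["refused"])      # 0.25
print(report["formatting"]["markdown"])  # 1.0
```

Fractions rather than raw counts keep dimensions comparable when they have different numbers of probe questions, which is what a radar chart axis would need.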

Details

Author: sickn33
Repository: sickn33/antigravity-awesome-skills
Created: 4 months ago
Last Updated: today
Language: Python
License: MIT

Integrates with

Model Context Protocol · AI

Similar Skills

Semantically similar based on skill content — not just same category

AI & Automation Listed

xray

X/Twitter content intelligence with a social graph. Scout reply opportunities, pulse-check topic lanes, track competitors, mirror your accounts, prospect clients. Graph-first: built around accounts and topics YOU care about, not generic search. Use when user says 'xray', 'x research', 'search x', 'scout', 'pulse', 'track', 'mirror', 'prospect', '/xray', or needs real-time X discourse / engagement opportunities. NOT for: posting tweets or analytics dashboards.

2 Updated 1 week ago

catcatcatstudio

AI & Automation Listed

xray-cli

Xray Cloud test management via `bun xray` CLI: create/list tests, manage test executions and plans, import JUnit/Cucumber/Xray JSON results, update run statuses, back up and restore projects, link defects. Triggers on: create a test in Xray, import test results to Xray, list Xray executions, update run status, backup Xray project, restore Xray tests, link defect to run, sync tests, Xray auth login. Do NOT use for: writing automated tests (test-automation); documenting test cases or ROI analysis (test-documentation); running CI regression suites (regression-testing); browser automation (playwright-cli).

12 Updated today

AI & Automation Featured

bdistill-knowledge-extraction

Extract structured domain knowledge from AI models in-session or from local open-source models via Ollama. No API key needed.

39,350 Updated today

AI & Automation Listed

ai-explore

Codebase-only read-only research dispatcher. Thin wrapper around the ai-explore agent for architecture mapping, dependency tracing, pattern identification, and risk surfacing. Trigger for 'explore the codebase', 'where does X live', 'map this module', 'what depends on Y', 'trace this import chain'. Not for external evidence with citations; use /ai-research instead.

49 Updated today

AI & Automation Listed

shellbrain-usage-review

Use when a reviewer agent needs one fast cross-repo picture of how agents are using Shellbrain, where the product is working, where it is failing, and which expected capabilities are being skipped.

11 Updated 3 days ago