ai-security

Solid

Use when assessing AI/ML systems for prompt injection, jailbreak vulnerabilities, model inversion risk, data poisoning exposure, or agent tool abuse. Covers MITRE ATLAS technique mapping, injection signature detection, and adversarial robustness scoring.

AI & Automation 16,392 stars 2261 forks Updated today MIT

Install

View on GitHub

Quality Score: 93/100

Stars 20%
100
Recency 20%
100
Frontmatter 20%
70
Documentation 15%
100
Issue Health 10%
50
License 10%
100
Description 5%
100

Skill Content

# AI Security AI and LLM security assessment skill for detecting prompt injection, jailbreak vulnerabilities, model inversion risk, data poisoning exposure, and agent tool abuse. This is NOT general application security (see security-pen-testing) or behavioral anomaly detection in infrastructure (see threat-detection) — this is about security assessment of AI/ML systems and LLM-based agents specifically. --- ## Table of Contents - [Overview](#overview) - [AI Threat Scanner Tool](#ai-threat-scanner-tool) - [Prompt Injection Detection](#prompt-injection-detection) - [Jailbreak Assessment](#jailbreak-assessment) - [Model Inversion Risk](#model-inversion-risk) - [Data Poisoning Risk](#data-poisoning-risk) - [Agent Tool Abuse](#agent-tool-abuse) - [MITRE ATLAS Coverage](#mitre-atlas-coverage) - [Guardrail Design Patterns](#guardrail-design-patterns) - [Workflows](#workflows) - [Anti-Patterns](#anti-patterns) - [Cross-References](#cross-references) --- ## Overview ### What This Skill Does This skill provides the methodology and tooling for **AI/ML security assessment** — scanning for prompt injection signatures, scoring model inversion and data poisoning risk, mapping findings to MITRE ATLAS techniques, and recommending guardrail controls. It supports LLMs, classifiers, and embedding models. ### Distinction from Other Security Skills | Skill | Focus | Approach | |-------|-------|----------| | **ai-security** (this) | AI/ML system security | Specialized — LLM injection, mo...

Details

Author
alirezarezvani
Repository
alirezarezvani/claude-skills
Created
7 months ago
Last Updated
today
Language
Python
License
MIT

Integrates with

Related Skills