prompt-injection-detector

Solid

Prompt injection detection and prevention for secure LLM applications

AI & Automation 814 stars 53 forks Updated today MIT

Install

View on GitHub

Quality Score: 93/100

Stars 20%
97
Recency 20%
100
Frontmatter 20%
70
Documentation 15%
54
Issue Health 10%
50
License 10%
100
Description 5%
100

Skill Content

# Prompt Injection Detector Skill ## Capabilities - Detect prompt injection attempts - Implement input sanitization - Configure detection classifiers - Design defense layers - Implement canary token detection - Create injection logging and alerting ## Target Processes - prompt-injection-defense - tool-safety-validation ## Implementation Details ### Detection Methods 1. **Pattern Matching**: Known injection patterns 2. **ML Classifiers**: Trained injection detectors 3. **Canary Tokens**: Detect instruction override 4. **LLM-Based**: Use LLM to detect manipulation 5. **Perplexity Analysis**: Unusual input patterns ### Defense Strategies - Input preprocessing - Prompt structure design - Output validation - Sandboxed execution - Multi-layer defense ### Configuration Options - Detection threshold - Pattern rules - Classifier model - Action policies - Alerting settings ### Best Practices - Defense in depth - Regular pattern updates - Monitor false positives - Test with red-team inputs ### Dependencies - rebuff (optional) - transformers - Custom classifiers

Details

Author
a5c-ai
Repository
a5c-ai/babysitter
Created
4 months ago
Last Updated
today
Language
JavaScript
License
MIT

Related Skills