prompt-injection-detector

Install

View on GitHub

Quality Score: 93/100

Stars 20%

97

Recency 20%

100

Frontmatter 20%

70

Documentation 15%

54

Issue Health 10%

50

License 10%

100

Description 5%

100

Skill Content

# Prompt Injection Detector Skill ## Capabilities - Detect prompt injection attempts - Implement input sanitization - Configure detection classifiers - Design defense layers - Implement canary token detection - Create injection logging and alerting ## Target Processes - prompt-injection-defense - tool-safety-validation ## Implementation Details ### Detection Methods 1. **Pattern Matching**: Known injection patterns 2. **ML Classifiers**: Trained injection detectors 3. **Canary Tokens**: Detect instruction override 4. **LLM-Based**: Use LLM to detect manipulation 5. **Perplexity Analysis**: Unusual input patterns ### Defense Strategies - Input preprocessing - Prompt structure design - Output validation - Sandboxed execution - Multi-layer defense ### Configuration Options - Detection threshold - Pattern rules - Classifier model - Action policies - Alerting settings ### Best Practices - Defense in depth - Regular pattern updates - Monitor false positives - Test with red-team inputs ### Dependencies - rebuff (optional) - transformers - Custom classifiers

Details

Author: a5c-ai
Repository: a5c-ai/babysitter
Created: 4 months ago
Last Updated: today
Language: JavaScript
License: MIT

Install

Quality Score: 93/100

Skill Content

Details

Related Skills

videodb

ck

browser