constitutional-ai-prompts

Solid

Constitutional AI and safety guardrail prompts for aligned LLM behavior

AI & Automation 814 stars 53 forks Updated today MIT

Install

View on GitHub

Quality Score: 93/100

Stars 20%
97
Recency 20%
100
Frontmatter 20%
70
Documentation 15%
54
Issue Health 10%
50
License 10%
100
Description 5%
100

Skill Content

# Constitutional AI Prompts Skill ## Capabilities - Design constitutional AI principles - Implement self-critique and revision prompts - Create harmlessness guidelines - Design refusal patterns for unsafe requests - Implement red-team testing prompts - Create ethics-aware response frameworks ## Target Processes - system-prompt-guardrails - content-moderation-safety ## Implementation Details ### Constitutional Patterns 1. **Critique-Revision**: Self-evaluate and improve responses 2. **Principle Adherence**: Follow defined ethical principles 3. **Harmlessness Focus**: Prioritize safe responses 4. **Helpfulness Balance**: Balance helpfulness with safety 5. **Transparency**: Acknowledge limitations ### Configuration Options - Constitutional principles list - Critique prompts - Revision guidelines - Refusal templates - Escalation triggers ### Best Practices - Define clear constitutional principles - Balance helpfulness and safety - Test with adversarial inputs - Document refusal patterns - Regular principle review ### Dependencies - langchain-core

Details

Author
a5c-ai
Repository
a5c-ai/babysitter
Created
4 months ago
Last Updated
today
Language
JavaScript
License
MIT

Integrates with

Related Skills