content-moderation-api

Solid

Content moderation API integration using OpenAI Moderation, Perspective API, and others

AI & Automation 814 stars 53 forks Updated today MIT

Install

View on GitHub

Quality Score: 93/100

Stars 20%
97
Recency 20%
100
Frontmatter 20%
70
Documentation 15%
53
Issue Health 10%
50
License 10%
100
Description 5%
100

Skill Content

# Content Moderation API Skill ## Capabilities - Integrate OpenAI Moderation API - Set up Perspective API for toxicity detection - Configure moderation thresholds - Implement content filtering pipelines - Design moderation response handling - Create moderation logging and reporting ## Target Processes - content-moderation-safety - system-prompt-guardrails ## Implementation Details ### Moderation APIs 1. **OpenAI Moderation**: Hate, violence, self-harm, sexual content 2. **Perspective API**: Toxicity, insult, profanity, threat 3. **Azure Content Safety**: Text and image moderation 4. **LlamaGuard**: Open-source safety classifier ### Configuration Options - API credentials and endpoints - Category thresholds - Action policies (block, warn, flag) - Logging configuration - Fallback behavior ### Best Practices - Set appropriate thresholds - Handle edge cases gracefully - Log moderation decisions - Regular threshold review - Multi-layer moderation ### Dependencies - openai - google-cloud-language (Perspective) - azure-ai-contentsafety

Details

Author
a5c-ai
Repository
a5c-ai/babysitter
Created
4 months ago
Last Updated
today
Language
JavaScript
License
MIT

Integrates with

Related Skills