adversarial-roleplay

Solid

Tactic: Construct detailed hostile persona, attack artifact from that persona's perspective, record successful attack paths for aggregation.

AI & Automation 331 stars 25 forks Updated today Apache-2.0

Install

View on GitHub

Quality Score: 96/100

Stars 20%
84
Recency 20%
100
Frontmatter 20%
70
Documentation 15%
100
Issue Health 10%
50
License 10%
100
Description 5%
100

Skill Content

# Adversarial Roleplay Tactic Deploy constructed hostile personas to attack the artifact from distinct motivational frames. ## Orchestration 1. **persona-construction** builds detailed adversary profile: - Background and expertise domain - Motivation for attacking (career incentive, resource competition, ideological) - Known blind spots and biases of this persona type - Preferred attack patterns 2. **attack-vector-generation** generates vectors specific to persona's expertise and motivation 3. **probe-execution** executes attacks while maintaining persona consistency 4. Successful attack paths recorded with persona attribution 5. Process repeats for each persona (budget-limited) 6. **finding-aggregation** cross-references findings across personas for convergent vulnerabilities ## Subagents Dispatched - persona-construction (1 call per persona) - attack-vector-generation (1 call per persona) - probe-execution (N calls per persona, budget-limited) - finding-aggregation (1 call at end, cross-persona) ## Termination Conditions - All budgeted personas deployed and exhausted - Convergent vulnerability found by 2+ personas (high-confidence finding) - Single persona finds critical vulnerability (early report) - Budget exhausted (report per-persona findings separately) <!-- BEGIN available-tables (generated) --> ## Available SOPs Optional, no fixed order; the final leaf is always a sop. | SOP | When to use | | --- | --- | | attack-vector-generation | Generate sp...

Details

Author
yogsoth-ai
Repository
yogsoth-ai/de-anthropocentric-research-engine
Created
4 months ago
Last Updated
today
Language
HTML
License
Apache-2.0

Integrates with

Related Skills