orchestrating-adversarial-reviews

Solid

Multi-agent adversarial-verification orchestration for high-confidence conclusions. Fan out finders, then verify every finding through a three-prism panel (exploitability / correctness / refutation) that defaults to disbelief, gate fixes behind load-bearing proof tests that catch agents who falsely claim "done/fixed", and roll out behind a build-first exit-code guard. Use when a fan-out task (security audit, code review, research synthesis, migration) must produce trustworthy results and a single agent's self-report cannot be trusted. Composes with securing-systems (what to look for) and shipping-changes (the closed loop for changes); the orchestration engine is the Workflow tool.

AI & Automation 228 stars 30 forks Updated today MIT

Quality Score: 91/100

Score weights (component score in parentheses where the page shows one): Stars 20%, Recency 20% (100), Frontmatter 20%, Documentation 15% (100), Issue Health 10%, License 10% (100), Description 5% (100)

Skill Content

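# Adversarial Verification Orchestration · orchestrating-adversarial-reviews

> A single agent will falsely report "fixed / fully covered / no problems". A conclusion's credibility does not come from who stated it, but from how many refutation attempts it has survived.
> This skill is the **orchestration skeleton**: fan-out discovery → three-prism adversarial verification → proof guard → guarded rollout.
> Trust levels: runtime behavior / proof tests > multi-agent majority verdict > single-agent self-report (always `[unverified]`).

## Core tenets

1. **Do not trust a single agent's self-report.** Finders produce noisy false positives, and implementers falsely claim "fixed / fully covered". Every high-value conclusion must face independent agents that try to refute it from **different perspectives**.
2. **Falsifiable > claimable.** Every fix must ship with a **load-bearing proof test** that FAILS when the code is reverted to the vulnerable version and PASSES only once the fix is in place. A "fixed" claim without a proof test counts as not fixed.
3. **Fail in the right direction.** No step's exit code in the guard chain may be masked by a pipe; destructive actions must be preceded by a reversibility check.

## When to use

| Scenario | Use | Rationale |
|------|----|------|
| Authorized security audit / hardening loop | ✅ | First worked example, see [workflow](references/workflow.md) |
| Large-surface code review (multi-dimensional, high confidence required) | ✅ | dimensions → find → adversarial verification |
| Research synthesis / fact-checking (conclusions must hold up) | ✅ | Multi-source fan-out + refutation prism |
| Large-scale migration / refactor (site discovery + per-item verification) | ✅ | Pipeline with per-item independence + proof tests |

## When not to use

- ❌ Single-file, low-risk, mechanical changes: just make the change and run the tests, without wrapping it in orchestration (see "When not to use" in `shipping-changes`).
- ❌ The user has not opted in to multi-agent orchestration or has not enabled ultracode: Workflow fans out dozens of agents and burns a large number of tokens, so explicit authorization is required.
- ❌ You only need to know "which vulnerabilities to look for": that belongs to `securing-systems` / `analyzing-security`. This skill does not restate that knowledge; it only orchestrates.

## Orchestration skeleton (three phases)

```
Recon (fan-out)        one finder per dimension, parallel deep read, schema-constrained structured findings
   |  pipeline, not barrier: findings from dimension A can enter verification while dimension B is still searching
   v
Verify (three prisms)  N verifiers per finding, one prism each, skeptical by default
   |  exploitability / correctness / refutation hunt; votes >= ...
```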
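The load-bearing proof test from the core tenets can be sketched in a few lines. This is a minimal illustration, not the skill's own implementation: `vulnerable_render`, `fixed_render`, `proof_test`, `vacuous_test`, and `is_load_bearing` are all hypothetical names, and the HTML-escaping scenario is an assumed stand-in for whatever vulnerability is actually being fixed.

```python
import html
from typing import Callable

Render = Callable[[str], str]


def vulnerable_render(user_input: str) -> str:
    # Hypothetical "before" code: raw input is interpolated into HTML.
    return f"<p>{user_input}</p>"


def fixed_render(user_input: str) -> str:
    # Hypothetical "after" code: HTML metacharacters are escaped first.
    return f"<p>{html.escape(user_input)}</p>"


def proof_test(render: Render) -> bool:
    # PASS (True) only if the script payload is neutralized.
    return "<script>" not in render("<script>alert(1)</script>")


def vacuous_test(render: Render) -> bool:
    # Only checks the wrapper tag, so it passes on both versions
    # and therefore proves nothing about the fix.
    return render("hello").startswith("<p>")


def is_load_bearing(test: Callable[[Render], bool],
                    vulnerable: Render, fixed: Render) -> bool:
    # Load-bearing: FAIL on the reverted (vulnerable) code, PASS on the fix.
    return (not test(vulnerable)) and test(fixed)


if __name__ == "__main__":
    print(is_load_bearing(proof_test, vulnerable_render, fixed_render))
    print(is_load_bearing(vacuous_test, vulnerable_render, fixed_render))
```

The point of checking both directions is that a test which passes on the vulnerable code as well, like `vacuous_test` here, would let an implementer's "fixed" claim through without any evidence.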

Details

Author: telagod
Repository: telagod/code-abyss
Created: 4 months ago
Last Updated: today
Language: JavaScript
License: MIT

Similar Skills

Semantically similar based on skill content — not just same category

Code & Development Listed

verify-code-review

Stage ④.b verify sub-workflow: code-review with multi-agent parallel fan-out for high-confidence findings (subagent default per bundled parallelism gate: the Task / Agent tool spawns multiple subagents for fan-out, with context isolation; token-sensitive). schema_version: harnessed.workflow.v3 with disciplines_applied (6 default) + tools_available (code-review) + 1 phase (parallelism ref judgments.parallelism-gate.subagent-default.fires). Triggered by slash command `/verify-code-review` after `harnessed setup`.

2 Updated today

easyinplay

AI & Automation Listed

agentic-workflow-audit

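Audits whether a project has truly adopted a "decomposed agentic workflow": the process is split into a chain of small Tasks with clear boundaries, each step has its own SOP, steps are connected by input/output contracts, success criteria can be checked programmatically, and on failure the workflow can roll back with error context and repair itself. The alternative is a mega agent that is disguised as modular while its control flow is in fact fully tangled together. Use this skill whenever the user asks you to inspect, check, audit, or review the architecture of an agent / LLM pipeline, or asks "is my workflow decomposed properly", "has it quietly turned into a mega agent", "are the task boundaries / SOPs / success criteria right", or "is my agent design sound". Even if the user never says the word "audit", any request to evaluate the structure, modularity, or control flow of an agent system should trigger this skill.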

111 Updated today

s0912758806p

AI & Automation Solid

light-self-review

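Automatic reflection and self-review. After each task is completed, it automatically checks for logical gaps, factual errors, formatting problems, unclear wording, lack of originality, inaccurate citations, overstated results, inconsistent aesthetics, duplicated content, disorganized structure, non-actionable output, and similar issues (always on; applies at the wrap-up of every task). Instead of delivering a rough result in one pass, it self-reviews and iterates before producing output.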

77 Updated today

Light0305