
Adversarial Alignment (Agent Smith)

Maintain calibrated tension with Morpheus/Trinity/RedHat by producing adversarial signals that harden plans without damaging system integrity.
Author: mzfshark
Uncategorized · clawhub · v1.0.0 · 1 version · 100000 · Key: not required
★ 0 stars · 📥 321 downloads · 💾 0 installs · #latest

Overview

SKILL: adversarial-alignment

Purpose

Maintain tension with Morpheus while staying aligned with $NEURONS success: oppose weak accessibility narratives, challenge simplifications, and harden plans without damaging the system.

When to Use

  • Morpheus proposes a strategy or narrative
  • Trinity proposes a trading/execution change (as input, not for execution)
  • RedHat proposes an implementation plan that might violate boundaries or create fragility

Inputs

  • upstream_output (required):
    • agent ("Morpheus"|"Trinity"|"RedHat"|"Other")
    • summary (string)
    • assumptions (list)
    • proposed_actions (list)
  • constraints (required):
    • governance_rules (optional; if missing, flag unknowns)
    • safety_law (embedded in this skill; must be honored)
  • policy (required):
    • max_objections (default 7)
    • max_words (default 140)
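
The inputs above could be modeled as typed records. The following is a minimal sketch, not part of the skill itself: the class names (UpstreamOutput, Constraints, Policy, SkillInput) are hypothetical, and safety_law is left out as a field because the spec says it is embedded in the skill rather than supplied by the caller.

```python
from dataclasses import dataclass, field
from typing import List, Optional

# Agent names taken from the Inputs list; the set name itself is an assumption.
ALLOWED_AGENTS = {"Morpheus", "Trinity", "RedHat", "Other"}


@dataclass
class UpstreamOutput:
    agent: str
    summary: str
    assumptions: List[str] = field(default_factory=list)
    proposed_actions: List[str] = field(default_factory=list)

    def __post_init__(self) -> None:
        # Reject agents outside the documented enumeration.
        if self.agent not in ALLOWED_AGENTS:
            raise ValueError(f"unknown agent: {self.agent!r}")


@dataclass
class Constraints:
    # None means "not supplied"; the skill is expected to flag it as an unknown.
    governance_rules: Optional[List[str]] = None


@dataclass
class Policy:
    max_objections: int = 7   # documented default
    max_words: int = 140      # documented default


@dataclass
class SkillInput:
    upstream_output: UpstreamOutput
    constraints: Constraints
    policy: Policy
```

Keeping governance_rules as Optional preserves the difference between "no rules were supplied" and "an empty rule set", which matters because missing rules must be reported as unknowns rather than invented.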

Steps

  1. Extract assumptions and proposed actions.
  2. Identify fragility points deterministically:
    • missing constraints
    • governance unknowns
    • risk-of-dependency creation
    • ambiguous execution paths
  3. Produce up to max_objections objections:
    • each objection must include: "what is weak" + "what would make it stronger"
  4. Output adversarial signal:
    • "block" only if governance/safety would be violated
    • otherwise "challenge" with required clarifications
  5. Generate a minimal response draft within max_words.
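
The steps above can be sketched as a small deterministic pipeline. This is an illustration under stated assumptions, not the skill's actual implementation: the function names, the violates_safety flag (the spec does not say how a violation is detected), the word-count heuristic for ambiguous execution paths, and the fallback to "accept" when no objection is raised are all hypothetical. Only two of the four fragility categories are fully covered.

```python
from typing import Dict, List, Optional

Objection = Dict[str, str]  # keys: "weak", "stronger"


def find_objections(
    assumptions: List[str],
    proposed_actions: List[str],
    governance_rules: Optional[List[str]],
) -> List[Objection]:
    """Step 2: checks run in a fixed order, so the output is deterministic."""
    objections: List[Objection] = []
    if not assumptions:
        objections.append({
            "weak": "no assumptions are stated",
            "stronger": "list the assumptions each action relies on",
        })
    if governance_rules is None:
        objections.append({
            "weak": "governance rules are unknown",
            "stronger": "supply the applicable governance rules",
        })
    for action in proposed_actions:
        # Hypothetical heuristic: very short actions are treated as ambiguous.
        if len(action.split()) < 3:
            objections.append({
                "weak": f"action {action!r} is underspecified",
                "stronger": "state who does what, and in which order",
            })
    return objections


def adversarial_signal(
    assumptions: List[str],
    proposed_actions: List[str],
    governance_rules: Optional[List[str]],
    violates_safety: bool,
    max_objections: int = 7,
    max_words: int = 140,
) -> Dict[str, object]:
    # Step 3: cap the number of objections.
    objections = find_objections(
        assumptions, proposed_actions, governance_rules
    )[:max_objections]
    # Step 4: "block" only on a governance/safety violation.
    if violates_safety:
        verdict = "block"
    elif objections:
        verdict = "challenge"
    else:
        verdict = "accept"  # assumption: nothing left to challenge
    # Step 5: build the draft, then truncate it to max_words words.
    sentences = [f"{o['weak']}; {o['stronger']}." for o in objections]
    words = " ".join([verdict.upper() + ":"] + sentences).split()
    return {
        "verdict": verdict,
        "objections": objections,
        "response_draft": " ".join(words[:max_words]),
    }
```

Each objection carries both halves the spec asks for ("what is weak" and "what would make it stronger"), so a caller can render them separately or together.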

Validation

  • Objections must be about structure/logic, not people.
  • If governance rules are missing, mark unknowns explicitly; do not invent.

Output

  • adversarial_alignment_result:
    • verdict ("challenge"|"block"|"accept")
    • objections (list)
    • required_clarifications (list)
    • unknowns (list)
    • response_draft (string)
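
One way to represent this output is a record that validates the verdict and serializes to JSON. The sketch below is hypothetical: the class name, the to_json helper, and the choice of JSON as a wire format are assumptions, since the spec only names the fields.

```python
import json
from dataclasses import asdict, dataclass, field
from typing import List

VERDICTS = ("challenge", "block", "accept")


@dataclass
class AdversarialAlignmentResult:
    verdict: str
    objections: List[str] = field(default_factory=list)
    required_clarifications: List[str] = field(default_factory=list)
    unknowns: List[str] = field(default_factory=list)
    response_draft: str = ""

    def __post_init__(self) -> None:
        # Only the three documented verdicts are accepted.
        if self.verdict not in VERDICTS:
            raise ValueError(f"invalid verdict: {self.verdict!r}")

    def to_json(self) -> str:
        # Wrap the fields under the documented top-level key.
        return json.dumps({"adversarial_alignment_result": asdict(self)}, indent=2)


result = AdversarialAlignmentResult(
    verdict="challenge",
    required_clarifications=["Which governance rules apply?"],
    unknowns=["governance_rules"],
)
print(result.to_json())
```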

Safety Rules

  • Never damage system integrity; never sabotage.
  • Never make recommendations that create financial risk.
  • Governance and safety law override everything.

Example

If an upstream plan implicitly enables live trading, output verdict=block with a governance/safety reason and required gating steps.
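
The example above might look like the following in code. This is a rough sketch only: the marker phrases used to detect live trading, the function name review_plan, and the wording of the gating steps are illustrative placeholders, not rules defined by the skill.

```python
from typing import Dict, List, Optional

# Hypothetical phrases that suggest a plan enables live trading.
LIVE_TRADING_MARKERS = ("live trading", "real funds", "production orders")


def review_plan(proposed_actions: List[str]) -> Optional[Dict[str, object]]:
    """Return a block result if any action hints at live trading, else None."""
    hits = [
        action
        for action in proposed_actions
        if any(marker in action.lower() for marker in LIVE_TRADING_MARKERS)
    ]
    if not hits:
        return None  # this check has nothing to say; other checks may still apply
    return {
        "verdict": "block",
        "objections": [f"enables live trading: {action}" for action in hits],
        # Placeholder gating steps; real ones would come from governance rules.
        "required_clarifications": [
            "obtain explicit governance approval before any live execution",
            "keep execution in simulation until approval is recorded",
        ],
    }


plan = ["Backtest the strategy", "Switch the bot to live trading on Monday"]
print(review_plan(plan))
```

Simple substring matching will miss plans that enable live trading without using these phrases, so a real check would need the governance rules themselves rather than keywords.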

Version History

1 version in total

  • v1.0.0 (current)
    2026-05-08 13:14 · Safe · Safe

Security Checks

Tencent Cloud Security (Keen)

Safe, no risk

Tencent Cloud Security (Sanbu)

Safe, no risk
