← 返回
安全合规 中文

Self Discipline

Guarantee instruction compliance with root cause analysis, flow verification, and automated validators that make future failures impossible.
通过根因分析、流程验证及自动化验证器,确保指令合规,杜绝未来失效。
ivangdavila
安全合规 clawhub v1.0.1 1 版本 100000 Key: 无需
★ 0
Stars
📥 811
下载
💾 12
安装
1
版本
#latest

概述

Instructions written but never followed. Lessons logged but never read. The same mistakes repeated across sessions. This skill breaks that cycle permanently.

When something goes wrong — and the user makes it clear it cannot happen again — this skill doesn't just log it. It traces WHY the failure occurred, verifies the fix will actually be seen by future agents, and generates automated validators that make repetition impossible.

When to Use

User is frustrated that the agent ignored instructions. Something critical happened that cannot repeat. User explicitly says "this can never happen again" or "I told you not to..." User explicitly invokes /discipline to ensure compliance on a rule.

How It Works

         ┌──────────────────────────────────────────────┐
         │              DISCIPLINE TRIGGER              │
         └──────────────────────────────────────────────┘
                              │
         ┌────────────────────┼────────────────────┐
         ▼                    ▼                    ▼
    ┌─────────┐         ┌──────────┐         ┌─────────┐
    │  USER   │         │ CRITICAL │         │ COMMAND │
    │  UPSET  │         │ FAILURE  │         │  USED   │
    └────┬────┘         └────┬─────┘         └────┬────┘
         │                   │                    │
         │ "I told you..."   │  Security breach,  │  /discipline
         │ "Why did you..."  │  data loss...      │
         │                   │                    │
         └───────────────────┴────────────────────┘
                              │
                              ▼
                    ┌─────────────────┐
                    │    SEVERITY     │
                    │  🔴 🟡 🟢       │
                    └────────┬────────┘
                              │
                              ▼
                    ┌─────────────────┐
                    │  ROOT CAUSE     │
                    │  5 Whys: Why    │
                    │  wasn't it      │
                    │  followed?      │
                    └────────┬────────┘
                              │
                              ▼
                    ┌─────────────────┐
                    │ FLOW VERIFY     │
                    │ Will next agent │
                    │ see the fix?    │
                    └────────┬────────┘
                              │
                              ▼
                    ┌─────────────────┐
                    │  VALIDATOR      │
                    │  Script that    │
                    │  blocks action  │
                    └────────┬────────┘
                              │
                              ▼
                    ┌─────────────────┐
                    │    COMPLETE     │
                    │ Logged+Enforced │
                    └─────────────────┘

Setup

On first use, read setup.md for integration guidelines. Creates ~/self-discipline/ for rules, validators, and enforcement logs.

Architecture

Memory lives in ~/self-discipline/. See memory-template.md for structure.

~/self-discipline/
├── memory.md              # Status + severity thresholds + stats
├── rules.md               # Active discipline rules (ALWAYS loaded)
├── incidents.md           # Incident log with root cause analysis
├── validators/            # Executable validators
│   ├── pre-commit/        # Run before git commits
│   ├── pre-send/          # Run before sending messages
│   └── custom/            # Domain-specific validators
├── flow-analysis/         # Instruction flow traces
└── archive/               # Resolved incidents

Quick Reference

TopicFile
-------------
Setup processsetup.md
Memory templatememory-template.md
Severity assessmentseverity.md
Root cause protocolroot-cause.md
Flow verificationflow-verification.md
Validator patternsvalidators.md

Core Rules

1. Detect Severity Immediately

When triggered, assess severity FIRST:

LevelIndicatorsResponse
-----------------------------
🔴 CRITICALUser angry, security risk, data loss, broken prod, financial impactFull analysis + MANDATORY validator
🟡 MEDIUMUser frustrated, wasted time, incorrect outputFull analysis + instruction fix
🟢 LOWUser annoyed, preference violatedLog + monitor

Default to one level higher if uncertain.

2. Root Cause Before Solution

Never jump to "I'll remember that." Instead:

  1. What exactly failed? — Be specific
  2. What was the instruction? — Quote it verbatim
  3. Where was the instruction? — Path + line number
  4. Why wasn't it followed? — 5 Whys:
    • Why? → Not loaded in context
    • Why? → File not in session's read path
    • Why? → No reference in AGENTS.md / system prompt
    • Why? → Setup assumed it would be read
    • Why? → No verification mechanism

3. Verify Flow Reachability (CRITICAL)

After identifying where the instruction SHOULD be, trace the actual agent flow:

START: New session begins
  ↓
READ: System prompt loaded
  ↓
READ: AGENTS.md (if exists)
  ↓
READ: MEMORY.md (if referenced)
  ↓
READ: Other files (if referenced)
  ↓
QUESTION: Is the instruction in ANY of these?

If instruction is NOT in the flow:

  • The fix is NOT to "write it somewhere"
  • The fix IS to add it to a file ALREADY in the flow
  • OR add a reference to its location in a file ALREADY in the flow

4. User Consent Required

NEVER modify files outside ~/self-discipline/ without explicit user permission.

When suggesting changes to AGENTS.md, HEARTBEAT.md, or other files:

ActionRequirement
---------------------
Create ~/self-discipline/Ask permission first
Edit AGENTS.mdShow exact changes, wait for approval
Add to HEARTBEAT.mdShow exact changes, wait for approval
Create validator scriptShow script content, wait for approval
Edit any existing fileBackup first + user confirmation

Flow for external file changes:

  1. Explain WHY the change is needed
  2. Show EXACTLY what will be added/changed
  3. Wait for explicit "yes" / approval
  4. Only then make the change

5. Generate Validators for Critical Issues

For 🔴 CRITICAL issues, create automated validators:

# Example: ~/self-discipline/validators/pre-send/no-secrets.sh
#!/usr/bin/env bash
set -euo pipefail

# SECURITY MANIFEST:
# Environment variables accessed: none
# External endpoints called: none
# Local files read: message content (stdin)
# Local files written: none

# Check for secrets before sending messages
if echo "$1" | grep -qE '(password|token|key)='; then
  echo "❌ BLOCKED: Message contains potential secret"
  echo "Rule: no-secrets-in-messages (from incident 2024-02-15)"
  exit 1
fi

Validators must:

  • Exit 0 = pass, exit 1 = block
  • Include the rule origin (incident reference)
  • Never modify data, only check

6. Track Enforcement

In ~/self-discipline/memory.md, maintain:

MetricPurpose
-----------------
Active rulesRules currently being enforced
Incidents by severityPattern detection
Validator triggersHow often rules catch violations
StreakDays since last repeat violation

7. Escalation Path

If the same rule is violated twice:

  1. First violation: Full analysis + fix
  2. Second violation: Promote to CRITICAL + mandatory validator
  3. Third violation: STOP and ask user for intervention

Severity Assessment Protocol

See severity.md for detailed criteria.

Quick Assessment

QuestionIf YES →
--------------------
Is the user visibly upset?+1 severity
Could this cause data loss?Automatic CRITICAL
Could this cause security breach?Automatic CRITICAL
Could this affect production?Automatic CRITICAL
Has this happened before?+1 severity
Did user use "never" or "always"?+1 severity

The Flow Verification Process

See flow-verification.md for complete protocol.

Why Instructions Get Ignored

CauseFrequencySolution
----------------------------
Written in file not in load path60%Move or add reference
Buried in long file, not seen20%Move to top or separate file
Contradicted by other instruction10%Resolve conflict explicitly
Context window overflow5%Shorten, prioritize
Model genuinely forgot5%Add validator

Verification Steps

  1. Identify all files in agent's load path (system prompt, AGENTS.md, etc.)
  2. Check if instruction location is in that path
  3. If not in path: Find where to add reference
  4. If in path but buried: Move to more prominent location
  5. If contradicted: Resolve with explicit priority

Validator Patterns

See validators.md for complete reference.

Types

TypeWhen RunExamples
--------------------------
pre-commitBefore git commitNo secrets, no WIP
pre-sendBefore message sendNo secrets, format checks
pre-actionBefore specific actionConfirm before delete
periodicOn heartbeatState verification

Validator Template

#!/usr/bin/env bash
set -euo pipefail

# SECURITY MANIFEST:
# Environment variables accessed: [list]
# External endpoints called: [list or "none"]
# Local files read: [list]
# Local files written: [list or "none"]

# Validator: [rule-name]
# Created: YYYY-MM-DD
# Incident: [reference]
# Severity: CRITICAL

# [description of what this validates]

[validation logic]

if [condition that should fail]; then
  echo "❌ BLOCKED: [reason]"
  echo "Rule: [rule-name] (from incident [date])"
  exit 1
fi

exit 0

Common Traps

TrapConsequenceSolution
-----------------------------
Writing rule in memory.md onlyFuture agent won't see itAdd to rules.md (always loaded)
"I'll remember" without verificationSame mistake in 3 sessionsAlways verify flow reachability
Validator that modifies dataUnexpected side effectsValidators ONLY check, never modify
Not backing up before editsCan't recover if wrongALWAYS backup before modifying
Skipping severity assessmentUnder-responding to critical issuesAssess severity FIRST, always
Putting rules in wrong fileRules not loadedOnly rules.md is guaranteed loaded

Commands

CommandAction
-----------------
/disciplineStart discipline process for last issue
/discipline statusShow active rules and stats
/discipline verify [rule]Run flow verification for rule
/discipline test [validator]Dry-run a validator
/discipline historyShow incident log

Security & Privacy

Data that stays local:

  • All rules, incidents, and validators in ~/self-discipline/
  • No data sent to external services
  • No telemetry or analytics

This skill does NOT:

  • Make network requests
  • Access credentials or secrets
  • Modify files without explicit user permission
  • Run validators without user approval
  • Access files outside ~/self-discipline/ without asking

File modifications outside ~/self-discipline/:

  • Only suggested when needed for rule visibility (e.g., AGENTS.md reference)
  • Always shown to user first
  • Require explicit approval before execution
  • Include backup before any edit

Related Skills

Install with clawhub install if user confirms:

  • reflection — structured self-evaluation
  • memory — persistent memory patterns
  • decide — decision-making patterns
  • escalate — know when to ask vs act
  • learning — adaptive learning system

Feedback

  • If useful: clawhub star self-discipline
  • Stay updated: clawhub sync

版本历史

共 1 个版本

  • v1.0.1 当前
    2026-03-29 22:24 安全 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

security-compliance

Skill Vetter

spclaudehome
AI智能体技能安全预审工具。安装ClawdHub、GitHub等来源技能前,检查风险信号、权限范围及可疑模式。
★ 1,212 📥 266,272
productivity

Word / DOCX

ivangdavila
创建、检查和编辑 Microsoft Word 文档及 DOCX 文件,支持样式、编号、修订记录、表格、分节符及兼容性检查等功能。
★ 438 📥 147,299
security-compliance

MoltGuard - Security & Antivirus & Guardrails

thomaslwang
MoltGuard — OpenClaw 安全守卫,由 OpenGuardrails 提供。安装 MoltGuard,保护您和您的用户免受提示注入、数据泄露和恶意攻击。
★ 116 📥 30,704