← 返回
未分类 中文

Agent Observability - Open the Black Box

Provides full observability for OpenClaw agents via dashboards, decision logs, failure traces, and weekly compliance scoring for effective monitoring and deb...
为OpenClaw代理提供完整的可观测性,包括仪表板、决策日志、故障追踪以及每周合规评分,以实现有效的监控和调试。
donovanpankratz-del donovanpankratz-del 来源
未分类 clawhub v1.0.0 1 版本 100000 Key: 无需
★ 0
Stars
📥 168
下载
💾 2
安装
1
版本
#agent-debug#agent-monitoring#agent-quality#audit#black-box#cost#dashboard#debugging#decision-log#drift#drift-detection#failure#latest#metrics#monitoring#observability#openclaw#production#quality#subagent#throughput#tracing#visibility#weekly

概述

Agent Observability

What Gets Installed

FilePurposeLocation
---------
throughput-dashboard.jsWeekly productivity metricsscripts/
decision-audit.jsAppend-only decision log with reasoninglib/
failure-tracer.jsCaptures traces when quality score < 7lib/
drift-guard-auto.jsWeekly INTENT.md compliance scoringscripts/

Installation

Step 1 — Copy files

WORKSPACE="${OPENCLAW_WORKSPACE:-$(pwd)}"

cp references/throughput-dashboard.js  "$WORKSPACE/scripts/"
cp references/decision-audit.js        "$WORKSPACE/lib/"
cp references/failure-tracer.js        "$WORKSPACE/lib/"
cp references/drift-guard-auto.js      "$WORKSPACE/scripts/"

Or manually copy each file from the references/ directory in this skill.

Step 2 — Add to heartbeat/cron (weekly)

In your heartbeat or weekly cron script:

node "$WORKSPACE/scripts/throughput-dashboard.js" "$WORKSPACE"
node "$WORKSPACE/scripts/drift-guard-auto.js" "$WORKSPACE"

Step 3 — Wire decision-audit into high-stakes decisions

const { logDecision } = require('./lib/decision-audit');

logDecision({
  task_type: 'code_generation',
  decision: 'spawn CoderAgent',
  reasoning_summary: 'Multi-file edit blocks chat >5s',
  session_channel: 'discord'  // optional
}, workspaceRoot);

Step 4 — Wire failure-tracer into quality validation (optional)

The failure-tracer fires automatically when you call it after scoring subagent output:

const { captureFailureTrace } = require('./lib/failure-tracer');

// Call after scoring any subagent output
if (qualityScore < 7) {
  captureFailureTrace('AgentLabel-task', qualityScore, agentOutput, workspaceRoot);
}

Reading the Data

PathContents
------
memory/dashboards/YYYY-MM-DD.mdWeekly throughput snapshot
memory/drift-reports/YYYY-MM-DD.mdDrift compliance report
memory/decisions-audit.jsonlFull decision log (JSONL)
memory/traces/[label]-[timestamp].jsonFailure traces

Query examples

# Recent decisions
tail -20 memory/decisions-audit.jsonl | jq .

# All failure traces
ls memory/traces/

# Latest drift report
cat memory/drift-reports/$(ls memory/drift-reports/ | tail -1)

Tool Descriptions

throughput-dashboard.js

Aggregates weekly metrics: tasks routed, subagents spawned, estimated cost, quality ratio, routing distribution. Reads from session-metrics.js (if installed) and drift-guard-auto.js. Degrades gracefully if data sources are missing — every section is independent.

decision-audit.js

Append-only JSONL log at memory/decisions-audit.jsonl. Each entry: { id, ts, task_type, decision, reasoning_summary, outcome, session_channel }. Use updateOutcome(id, 'success', workspaceRoot) to close the loop after a decision resolves.

failure-tracer.js

Fires when quality score < 7. Writes structured JSON to memory/traces/. Each trace includes: tool call sequence hints, output snippet, inferred failure reason. Use to post-mortem why a subagent underperformed.

drift-guard-auto.js

Scores recent agent outputs against behavioral rules (sycophancy, social cushioning, unprompted explanations, hallucination hedges). Reads INTENT.md for custom criteria if installed. Writes weekly report to memory/drift-reports/.

References

  • references/throughput-dashboard.js — Full script implementation
  • references/decision-audit.js — Full lib implementation
  • references/failure-tracer.js — Full lib implementation
  • references/drift-guard-auto.js — Full script implementation

版本历史

共 1 个版本

  • v1.0.0 当前
    2026-05-12 06:19 安全 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

ai-agent

Self-Improving + Proactive Agent

ivangdavila
自我反思+自我批评+自我学习+自组织记忆。智能体评估自身工作、发现错误并持续改进。
★ 1,428 📥 326,966
it-ops-security

Skill Vetter - Pre-Install Security Review

donovanpankratz-del
安装任意AI代理技能前的安全审查协议。针对凭证窃取、混淆代码、数据渗出进行危险信号检测。风险等级分类L...
★ 30 📥 23,684
ai-agent

self-improving agent

pskoett
记录自身发现以实现自我改进的技能
★ 4,143 📥 917,774