
Self Improvement

Generic agent self-improvement skill built on OpenClaw-RL research (arxiv.org/abs/2603.10165). Captures evaluative signals (+1/-1) and directive hints from a...
adelpro
AI Intelligence · clawhub · v1.0.0 · 1 version · 99825.2 · Key: not required
Stars: 1 · Downloads: 3,406 · Installs: 798 · #latest

Overview

Self-Improvement Skill

Enables any agent to learn and improve from user feedback using PRM-style (process reward model) evaluation.

What It Does

  • Capture Feedback: Store user responses to agent outputs
  • Evaluate: Extract score (+1 positive, -1 correction)
  • Analyze: Find patterns in directive hints
  • Suggest: Generate actionable improvements

Concept (OpenClaw-RL Inspired)

From the OpenClaw-RL paper:

> "Next-state signals encode both evaluative and directive information about the preceding action."

Two signal types:

  1. Evaluative: Did the output work? (binary score)
  2. Directive: How should it be different? (hints)
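
The two signal types can be illustrated with a small keyword heuristic. This is only a sketch: the source does not show how the skill actually extracts the score and hint, so the cue lists, the `Signal` type, and `parse_feedback` are all hypothetical names chosen for illustration. A score of 0 is used here for feedback that carries neither praise nor a directive.

```python
import re
from typing import NamedTuple, Optional

# Hypothetical cue lists; the skill's real extraction rules are not documented.
POSITIVE = re.compile(r"\b(?:good|great|thanks|thank you|nice|perfect)\b", re.IGNORECASE)
LEAD_IN = re.compile(r"^(?:but|and|please|can you|could you|maybe)\s+", re.IGNORECASE)
IMPERATIVE = re.compile(r"^(?:add|use|show|remove|include|make|fix)\b", re.IGNORECASE)


class Signal(NamedTuple):
    score: int            # evaluative: +1 positive, -1 correction, 0 unclear
    hint: Optional[str]   # directive: what should change, if anything


def parse_feedback(text: str) -> Signal:
    """Split one user reply into an evaluative score and a directive hint."""
    hint = None
    for sentence in (s.strip() for s in re.split(r"[.!?]+", text)):
        if not sentence:
            continue
        # Peel off conversational lead-ins such as "But can you ...".
        stripped = sentence
        while (match := LEAD_IN.match(stripped)) is not None:
            stripped = stripped[match.end():]
        if stripped != sentence or IMPERATIVE.match(stripped):
            hint = stripped
            break

    if POSITIVE.search(text):
        score = 1          # praise wins even when a hint is attached
    elif hint is not None:
        score = -1         # a bare directive is treated as a correction
    else:
        score = 0
    return Signal(score, hint)


print(parse_feedback("Good! But can you show star trends?"))
```

Under these assumptions, praise followed by a request keeps a positive score while still recording the hint, and a bare request such as "Add more stats" is scored as a correction.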

Usage

Capture Feedback

SKILL:self-improvement --feedback "Great!" --job daily-report
SKILL:self-improvement --feedback "Add more stats" --job daily-report

Get Stats

SKILL:self-improvement --stats daily-report

Generate Improvements

SKILL:self-improvement --improve daily-report

Arguments

| Arg | Description | Example |
|-----|-------------|---------|
| --job | Task/job name | daily-digest, weekly-recap |
| --feedback | User response | "Thanks!", "Add more X" |
| --score | Manual score override | 1, 0, -1 |
| --stats | Show feedback stats | daily-digest |
| --improve | Generate improvements | daily-digest |
| --weekly | Weekly summary | (flag) |
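
For readers who want to prototype the same interface outside the agent, the arguments above map naturally onto Python's standard `argparse` module. This is a sketch under the assumption that each flag behaves as the table describes; the skill's own argument handling is not shown in the source, and `build_parser` is a hypothetical helper.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Mirror the documented flags; behavior beyond the table is assumed."""
    parser = argparse.ArgumentParser(prog="self-improvement")
    parser.add_argument("--job", help="Task/job name, e.g. daily-digest")
    parser.add_argument("--feedback", help='User response, e.g. "Thanks!"')
    parser.add_argument("--score", type=int, choices=[1, 0, -1],
                        help="Manual score override")
    parser.add_argument("--stats", metavar="JOB", help="Show feedback stats")
    parser.add_argument("--improve", metavar="JOB", help="Generate improvements")
    parser.add_argument("--weekly", action="store_true", help="Weekly summary")
    return parser


args = build_parser().parse_args(["--feedback", "Add more stats", "--job", "daily-report"])
print(args.job, args.feedback, args.score, args.weekly)
```

Restricting `--score` to the three listed values lets the parser reject anything else before it reaches the feedback store.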

Storage

memory/learning/agent-feedback.json:

{
  "jobs": {
    "daily-digest": {
      "evaluations": [
        { "date": "2026-03-14", "score": 1, "hint": null },
        { "date": "2026-03-13", "score": -1, "hint": "add weekly star comparison" }
      ],
      "improvements": [
        { "date": "2026-03-14", "suggestion": "Add weekly star delta", "implemented": false }
      ]
    }
  }
}
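
A minimal sketch of writing one evaluation into a file with this layout follows. The function name `record_evaluation` and its parameters are hypothetical, and placing the newest entry first is an assumption based on the ordering in the example above, not documented behavior.

```python
import json
from datetime import date
from pathlib import Path
from typing import Optional


def record_evaluation(path: Path, job: str, score: int,
                      hint: Optional[str], day: Optional[str] = None) -> dict:
    """Add one evaluation for `job` to the feedback file and return the data."""
    if path.exists():
        data = json.loads(path.read_text(encoding="utf-8"))
    else:
        data = {"jobs": {}}

    entry = data.setdefault("jobs", {}).setdefault(
        job, {"evaluations": [], "improvements": []}
    )
    evaluation = {"date": day or date.today().isoformat(), "score": score, "hint": hint}
    # Assumption: newest entries come first, as in the example file.
    entry.setdefault("evaluations", []).insert(0, evaluation)

    path.parent.mkdir(parents=True, exist_ok=True)
    path.write_text(json.dumps(data, indent=2, ensure_ascii=False), encoding="utf-8")
    return data
```

A call such as `record_evaluation(Path("memory/learning/agent-feedback.json"), "daily-digest", 1, None)` would create the file on first use and extend it afterwards.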

Cron Integration

Daily: Capture Feedback (9:30 AM)

{
  "id": "feedback-capture",
  "schedule": "30 9 * * *",
  "message": "SKILL:self-improvement --job daily-digest"
}

Daily: Generate Improvements (10 AM)

{
  "id": "improvement-suggestions",
  "schedule": "0 10 * * *",
  "message": "SKILL:self-improvement --improve daily-digest → Telegram"
}

Weekly: Summary (Saturday 9 AM)

{
  "id": "weekly-summary",
  "schedule": "0 9 * * 6",
  "message": "SKILL:self-improvement --improve all --weekly"
}
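
To sanity-check that the three schedules fire when the headings say they do, a tiny matcher for the five-field cron syntax can help. This sketch assumes the scheduler follows the conventional field order (minute, hour, day of month, month, day of week with 0 = Sunday) and handles only `*` and single integers, which is all these entries use; ranges, lists, and steps are deliberately left out, and `cron_matches` is a hypothetical helper.

```python
from datetime import datetime


def cron_matches(expr: str, when: datetime) -> bool:
    """Return True if `when` satisfies a cron expression of '*' and integers."""
    minute, hour, day_of_month, month, day_of_week = expr.split()
    fields = [
        (minute, when.minute),
        (hour, when.hour),
        (day_of_month, when.day),
        (month, when.month),
        # isoweekday() is 1 (Mon) .. 7 (Sun); cron uses 0 (Sun) .. 6 (Sat).
        (day_of_week, when.isoweekday() % 7),
    ]
    return all(spec == "*" or int(spec) == value for spec, value in fields)


# 2026-03-14 is a Saturday, so only that date matches the weekly entry.
print(cron_matches("0 9 * * 6", datetime(2026, 3, 14, 9, 0)))
print(cron_matches("0 9 * * 6", datetime(2026, 3, 13, 9, 0)))
```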

Example Workflow

  1. User receives daily digest
  2. User responds: "Good! But can you show star trends?"
  3. Feedback captured: score=1, hint="show star trends"
  4. Next day, improvements generated:
    • "Add star trend comparison (last 7 days)"
    • "User satisfaction: 75%"
  5. Agent auto-updates prompt

Improvement Suggestion Format

📈 Improvement Suggestions - {job}

Stats: 8 evaluations, avg score: 0.75

Top Hints:
1. "add weekly star changes" (2x)
2. "use table format" (1x)

Suggested Actions:
• Add 7-day star delta to GitHub section
• Use table-image-generator for stats

Status: 1 improvement pending
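
The stats and top-hints portion of this report can be derived directly from the stored evaluations. The sketch below assumes each evaluation is a dict with `score` and `hint` keys, as in the storage example, and that hints are grouped by case-insensitive exact match; `summarize` is a hypothetical helper. The "Suggested Actions" lines are omitted because the source does not say how they are produced.

```python
from collections import Counter
from typing import Iterable, Mapping


def summarize(job: str, evaluations: Iterable[Mapping]) -> str:
    """Render the stats and top-hints part of the report for one job."""
    evaluations = list(evaluations)
    scores = [e["score"] for e in evaluations]
    average = sum(scores) / len(scores) if scores else 0.0
    # Group identical hints, ignoring case and surrounding whitespace.
    hints = Counter(e["hint"].strip().lower() for e in evaluations if e.get("hint"))

    lines = [
        f"📈 Improvement Suggestions - {job}",
        "",
        f"Stats: {len(scores)} evaluations, avg score: {average:.2f}",
        "",
        "Top Hints:",
    ]
    for rank, (hint, count) in enumerate(hints.most_common(3), start=1):
        lines.append(f'{rank}. "{hint}" ({count}x)')
    return "\n".join(lines)


# Illustrative data only: seven positive evaluations and one correction.
sample = [{"score": 1, "hint": None}] * 5 + [
    {"score": 1, "hint": "add weekly star changes"},
    {"score": -1, "hint": "Add weekly star changes"},
    {"score": 1, "hint": "use table format"},
]
print(summarize("daily-digest", sample))
```

With seven scores of +1 and one of -1, the average is 6 / 8 = 0.75, which matches the figure in the format above.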

Manual Evaluation

If no user feedback is available, evaluate manually:

Evaluate yesterday's output as: good/bad

Related Skills

  • compound-engineering: Deep session analysis
  • agent-evolver: Legacy name, see self-improvement

Version History

1 version total

  • v1.0.0 (current)
    2026-03-27 21:32 · Safe

Security Checks

Tencent Cloud Security (Keen): safe, no risk

Tencent Cloud Security (Sanbu): safe, no risk
