← 返回
效率工具 中文

Skill Test

Test skills before using or publishing. Trial, compare, evaluate in isolation without affecting your environment.
在使用或发布前测试技能。独立试用、比较和评估,不影响您的环境。
ivangdavila
效率工具 clawhub v1.0.0 1 版本 99719.1 Key: 无需
★ 5
Stars
📥 2,740
下载
💾 27
安装
1
版本
#latest

概述

Test Skills Safely

Two use cases:

  1. Try before commit — Test drive skills before installing
  2. Evaluate before publish — Verify quality before publishing

Key principle: Test in isolation. Never affect user's environment.

References:

  • Read sandbox.md — Isolated testing environment
  • Read compare.md — A/B comparison between skills
  • Read evaluate.md — Multi-agent quality evaluation

Quick Start

Trial a skill:

sessions_spawn(
  task="Test skill X: Load ONLY its SKILL.md, run [sample task], report quality",
  model="anthropic/claude-haiku"
)

Compare two skills:

  1. Run same task through each (separate sub-agents)
  2. Present outputs side-by-side
  3. Ask: "Which works better? Why?"

Test Modes

Trial Mode — Before installing

  • Spawn sub-agent with ONLY the test skill
  • Run 2-3 representative tasks
  • Evaluate: Does it help? Clear instructions?
  • Decision: keep, pass, or try another

Evaluation Mode — Before publishing

  • Spawn specialized reviewers (see evaluate.md)
  • Check structure, safety, usefulness
  • Synthesize findings
  • Recommend improvements

Sandbox Isolation

⚠️ Never load test skill into your main context.

Sub-agent approach (recommended):

sessions_spawn(
  task="You have ONE skill loaded: [skill content]. Test by doing: [task]",
  model="anthropic/claude-haiku"
)
  • Complete isolation — main session unaffected
  • Natural cleanup — sub-agent terminates, done
  • Cheap testing — use Haiku

What to check:

  • Does it activate correctly?
  • Are instructions clear?
  • Token cost reasonable?
  • Output quality acceptable?

Edge Cases

Skill requires credentials: Ask user for test credentials or skip auth-dependent features.

Skill not found: Verify slug with npx clawhub info before testing.

Test fails mid-way: Sub-agent terminates cleanly. Review logs, adjust test task, retry.

Skill has many auxiliary files: Load SKILL.md first, reference others only if needed during test.


Test thoroughly. Install only after explicit user approval.

版本历史

共 1 个版本

  • v1.0.0 当前
    2026-03-28 22:18 安全 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

productivity

Word / DOCX

ivangdavila
创建、检查和编辑 Microsoft Word 文档及 DOCX 文件,支持样式、编号、修订记录、表格、分节符及兼容性检查等功能。
★ 438 📥 147,316
ai-intelligence

Self-Improving + Proactive Agent

ivangdavila
自我反思+自我批评+自我学习+自组织记忆。智能体评估自身工作、发现错误并持续改进。
★ 1,353 📥 317,849
productivity

Weather

steipete
获取当前天气和预报(无需API密钥)
★ 445 📥 226,156