← 返回
AI智能 中文

prompt-ab-lab

Design, log, compare, and score prompt experiments so users can systematically improve outputs instead of guessing.
设计、记录、比较并评分提示词实验,让用户系统性地优化输出,告别盲目猜测。
52yuanchangxing
AI智能 clawhub v1.0.0 1 版本 99806.2 Key: 无需
★ 0
Stars
📥 515
下载
💾 6
安装
1
版本
#latest

概述

Prompt A/B Lab

Purpose

Design, log, compare, and score prompt experiments so users can systematically improve outputs instead of guessing.

Trigger phrases

  • 比较两个提示词
  • prompt ab test
  • 提示词实验
  • 哪个 prompt 更好
  • 建一个评测表

Ask for these inputs

  • prompt A and B
  • task
  • evaluation criteria
  • test set
  • weights if any

Workflow

  1. Define what success looks like before comparing prompts.
  2. Generate an evaluation rubric and structured test table.
  3. Log outputs per test case and compute weighted scores.
  4. Summarize tradeoffs instead of declaring a winner too early.
  5. Recommend the next experiment iteration.

Output contract

  • experiment plan
  • scored comparison table
  • rubric
  • next-iteration suggestions

Files in this skill

  • Script: {baseDir}/scripts/prompt_experiment_logger.py
  • Resource: {baseDir}/resources/eval_rubric.md

Operating rules

  • Be concrete and action-oriented.
  • Prefer preview / draft / simulation mode before destructive changes.
  • If information is missing, ask only for the minimum needed to proceed.
  • Never fabricate metrics, legal certainty, receipts, credentials, or evidence.
  • Keep assumptions explicit.

Suggested prompts

  • 比较两个提示词
  • prompt ab test
  • 提示词实验

Use of script and resources

Use the bundled script when it helps the user produce a structured file, manifest, CSV, or first-pass draft.

Use the resource file as the default schema, checklist, or preset when the user does not provide one.

Boundaries

  • This skill supports planning, structuring, and first-pass artifacts.
  • It should not claim that files were modified, messages were sent, or legal/financial decisions were finalized unless the user actually performed those actions.

Compatibility notes

  • Directory-based AgentSkills/OpenClaw skill.
  • Runtime dependency declared through metadata.openclaw.requires.
  • Helper script is local and auditable: scripts/prompt_experiment_logger.py.
  • Bundled resource is local and referenced by the instructions: resources/eval_rubric.md.

版本历史

共 1 个版本

  • v1.0.0 当前
    2026-03-30 03:30 安全 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

ai-intelligence

Proactive Agent

halthelobster
将AI智能体从任务执行者升级为主动预判需求、持续优化的智能伙伴。集成WAL协议、工作缓冲区、自主定时任务及实战验证模式。Hal Stack核心组件 🦞
★ 834 📥 213,006
ai-intelligence

Self-Improving + Proactive Agent

ivangdavila
自我反思+自我批评+自我学习+自组织记忆。智能体评估自身工作、发现错误并持续改进。
★ 1,356 📥 318,070
content-creation

paper-assistant

52yuanchangxing
面向论文选题、提纲、摘要、引言、文献综述、研究方法、结果讨论、润色改写与投稿准备的论文助手。
★ 1 📥 1,949