← 返回
内容创作 中文

Experiment Designer

Use when planning product experiments, writing testable hypotheses, estimating sample size, prioritizing tests, or interpreting A/B outcomes with practical s...
用于产品实验规划、可检验假设编写、样本量估算、测试优先级排序以及A/B实验结果的解读(关注实际显著性)。
alirezarezvani
内容创作 clawhub v2.1.1 1 版本 99906.5 Key: 无需
★ 2
Stars
📥 1,029
下载
💾 29
安装
1
版本
#latest

概述

Experiment Designer

Design, prioritize, and evaluate product experiments with clear hypotheses and defensible decisions.

When To Use

Use this skill for:

  • A/B and multivariate experiment planning
  • Hypothesis writing and success criteria definition
  • Sample size and minimum detectable effect planning
  • Experiment prioritization with ICE scoring
  • Reading statistical output for product decisions

Core Workflow

  1. Write hypothesis in If/Then/Because format
    • If we change [intervention]
    • Then [metric] will change by [expected direction/magnitude]
    • Because [behavioral mechanism]
  1. Define metrics before running test
    • Primary metric: single decision metric
    • Guardrail metrics: quality/risk protection
    • Secondary metrics: diagnostics only
  1. Estimate sample size
    • Baseline conversion or baseline mean
    • Minimum detectable effect (MDE)
    • Significance level (alpha) and power

Use:

python3 scripts/sample_size_calculator.py --baseline-rate 0.12 --mde 0.02 --mde-type absolute
  1. Prioritize experiments with ICE
    • Impact: potential upside
    • Confidence: evidence quality
    • Ease: cost/speed/complexity

ICE Score = (Impact Confidence Ease) / 10

  1. Launch with stopping rules
    • Decide fixed sample size or fixed duration in advance
    • Avoid repeated peeking without proper method
    • Monitor guardrails continuously
  1. Interpret results
    • Statistical significance is not business significance
    • Compare point estimate + confidence interval to decision threshold
    • Investigate novelty effects and segment heterogeneity

Hypothesis Quality Checklist

  • [ ] Contains explicit intervention and audience
  • [ ] Specifies measurable metric change
  • [ ] States plausible causal reason
  • [ ] Includes expected minimum effect
  • [ ] Defines failure condition

Common Experiment Pitfalls

  • Underpowered tests leading to false negatives
  • Running too many simultaneous changes without isolation
  • Changing targeting or implementation mid-test
  • Stopping early on random spikes
  • Ignoring sample ratio mismatch and instrumentation drift
  • Declaring success from p-value without effect-size context

Statistical Interpretation Guardrails

  • p-value < alpha indicates evidence against null, not guaranteed truth.
  • Confidence interval crossing zero/no-effect means uncertain directional claim.
  • Wide intervals imply low precision even when significant.
  • Use practical significance thresholds tied to business impact.

See:

  • references/experiment-playbook.md
  • references/statistics-reference.md

Tooling

scripts/sample_size_calculator.py

Computes required sample size (per variant and total) from:

  • baseline rate
  • MDE (absolute or relative)
  • significance level (alpha)
  • statistical power

Example:

python3 scripts/sample_size_calculator.py \
  --baseline-rate 0.10 \
  --mde 0.015 \
  --mde-type absolute \
  --alpha 0.05 \
  --power 0.8

版本历史

共 1 个版本

  • v2.1.1 当前
    2026-03-29 14:45 安全 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

content-creation

AdMapix

fly0pants
广告情报与应用数据分析助手,支持搜索广告素材、分析应用排名、下载量、收入及市场洞察,用于广告素材和竞品分析。
★ 295 📥 136,547
productivity

marketing-ideas

alirezarezvani
当用户需要为其 SaaS 或软件产品获取营销创意、灵感或策略时使用。也适用于用户提问“营销创意”、“增长”等关键词时。
★ 4 📥 7,614
content-creation

Baidu Wenku AIPPT

ide-rea
使用百度文库 AI 智能生成 PPT,自动根据内容选择模板。
★ 66 📥 46,246