← 返回
数据分析 中文

Data Ground Truth

Before presenting numbers in reports or recommendations, verify facts and check values against industry baselines.
在报告或建议中呈现数据前,核实事实并与行业基准比对。
cutthemustard
数据分析 clawhub v1.0.1 1 版本 100000 Key: 无需
★ 0
Stars
📥 533
下载
💾 5
安装
1
版本
#latest

概述

data-ground-truth

When presenting numbers, metrics, or statistics in reports, recommendations, or analysis — verify the facts and contextualize the figures against industry baselines. Combines verify (live fact-checking) with norm (statistical benchmarking).

When to Activate

Use this skill when:

  • Writing a report that cites specific metrics (revenue, churn, conversion rates)
  • A user shares their business numbers and asks "is this good?"
  • Comparing a metric to industry standards ("how does our 5% churn compare?")
  • Building a recommendation that depends on current market data
  • Presenting financial figures that may have changed since training
  • Analyzing a dataset and wanting to flag outliers against known baselines

Do NOT use for: opinions, qualitative assessments, or metrics with no established baseline.

Workflow

Step 1: Classify the data point

Determine whether each number is:

  • A factual claim (exchange rate, stock price, population) → route to verify
  • A business/performance metric (churn rate, NPS, response time) → route to norm
  • Both (e.g., "our conversion rate of 3.2% is above average") → check both

Step 2: Verify factual claims

For current facts (prices, rates, dates), use verify-claim.

MCP (preferred): verify_claim({ claim: "The USD to EUR exchange rate is 0.92" })

HTTP:

curl -X POST https://verify.agentutil.net/v1/verify \
  -H "Content-Type: application/json" \
  -d '{"claim": "The USD to EUR exchange rate is 0.92"}'

Handle verdicts per the verify-claim decision tree (confirmed → use, stale → update, disputed → present both sides, false → correct).

Step 3: Benchmark metrics against baselines

For business metrics, check where the value falls on the distribution.

MCP (preferred): norm_check({ category: "saas:churn_rate_monthly", value: 5.2, unit: "%" })

HTTP:

curl -X POST https://norm.agentutil.net/v1/check \
  -H "Content-Type: application/json" \
  -d '{"category": "saas:churn_rate_monthly", "value": 5.2, "unit": "%"}'

For multiple metrics at once:

curl -X POST https://norm.agentutil.net/v1/batch \
  -H "Content-Type: application/json" \
  -d '{"items": [{"category": "saas:churn_rate_monthly", "value": 5.2}, {"category": "saas:nps_score", "value": 45}]}'

Optional: add company_size (startup/smb/mid_market/enterprise) and region for more specific baselines.

Step 4: Present with context

When reporting findings, combine verification and benchmarking:

Data typeHow to present
--------------------------
Verified fact"The current [metric] is [current_truth] (verified live, [freshness])."
Benchmarked metric"[Value] is at the [percentile]th percentile — [assessment] for [category]."
Both"At [current_truth] (verified), this is [percentile]th percentile vs. industry ([baseline source])."
Anomalous metricFlag clearly: "[Value] is [assessment] — [percentile]th percentile. The typical range is [p25]-[p75]."

Assessment values from norm: very_low, low, normal, high, very_high, anomalous.

Available baseline categories

121 baselines across 14 domains. Browse with:

curl https://norm.agentutil.net/v1/categories

Common categories: saas:churn_rate_monthly, saas:nps_score, saas:ltv_cac_ratio, ecommerce:cart_abandonment_rate, infrastructure:api_latency_p99, infrastructure:uptime_percentage.

Data Handling

This skill sends claims (natural language text) and metric values (category identifiers + numbers) to two external APIs. No documents, user data, or file contents are transmitted.

Pricing

  • Verify: 25 free/day, then $0.004/query
  • Norm: free category listing, $0.002/check or $0.001/batch item
  • Full ground-truth check (verify + norm): ~$0.006 per data point

All via x402 protocol (USDC on Base). No authentication required for free tiers.

Privacy

No personal data collected. Claims cached up to 1 hour (verify), metric checks are stateless (norm). Rate limiting uses IP hashing only.

版本历史

共 1 个版本

  • v1.0.1 当前
    2026-03-30 20:57 安全 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

data-analysis

A股量化 AkShare

mbpz
A股量化数据分析工具,基于AkShare库获取A股行情、财务数据、板块信息等。用于回答关于A股股票查询、行情数据、财务分析、选股等问题。
★ 164 📥 59,860
ai-intelligence

Clawhub Skill

cutthemustard
对照实时数据源核实事实陈述,返回带有置信度的结构化判定结果。
★ 0 📥 1,080
data-analysis

Data Analysis

ivangdavila
{"answer":"数据分析与可视化。查询数据库、生成报告、自动化电子表格,将原始数据转化为清晰可行的见解。适用于:(1) 您……"}
★ 198 📥 65,014