← 返回
数据分析 中文

Dataset Evaluation

Evaluate a submission by scoring content consistency of texts and quality of structured data based on completeness, accuracy, type correctness, and informati...
通过完整性、准确性、类型正确性和信息完整性来评估文本内容一致性和结构化数据质量的提交评分。
levey
数据分析 clawhub v1.0.0 1 版本 100000 Key: 无需
★ 0
Stars
📥 494
下载
💾 11
安装
1
版本
#latest

概述

SKILL.md --- dataset_evaluation

Skill Name

dataset_evaluation

Description

Evaluate a miner submission by performing two evaluation steps:

  1. Content Consistency Evaluation
  2. Structured Data Quality Evaluation

The evaluator receives 5 cleaned data samples, the **structured

JSON, and the dataset schema**, then computes a final score for the

miner.


Input

{
  "cleaned_data_list": [
    "cleaned_text_1",
    "cleaned_text_2",
    "cleaned_text_3",
    "cleaned_text_4",
    "cleaned_text_5"
  ],
  "structured_data": {
    "field1": "value",
    "field2": "value"
  },
  "dataset_schema": {
    "fields": [
      {"name": "title", "type": "string", "required": true},
      {"name": "author", "type": "string", "required": false},
      {"name": "date", "type": "string", "required": false},
      {"name": "url", "type": "string", "required": true}
    ]
  }
}

Evaluation Procedure

Step 1 --- Content Consistency Evaluation (Weight 40%)

Goal: determine whether the **5 cleaned texts represent the same

underlying content**.

Method

  1. Normalize text
  • remove HTML
  • lowercase
  • remove excessive whitespace
  1. Compute pairwise similarity across the 5 texts

Recommended metrics:

  • cosine similarity (embedding based)
  • OR Jaccard similarity
  1. Compute the average similarity score.

Output

content_consistency_score (0-100)

Suggested mapping:

avg_similarity >= 0.9 → 100

0.8 – 0.9 → 80 – 100

0.6 – 0.8 → 60 – 80

0.4 – 0.6 → 40 – 60

< 0.4 → < 40


Step 2 --- Structured Data Quality Evaluation (Weight 60%)

Using the verified cleaned content, evaluate the **structured

JSON**.

Compute four sub-scores.


2.1 Field Completeness (30%)

Evaluate whether all required fields exist.

Formula:

completeness_score =

(# required fields present / total required fields) * 100


2.2 Value Accuracy (40%)

Evaluate whether each field value is **consistent with the cleaned

data**.

Examples:

  • title appears in cleaned text
  • author name appears in text
  • url matches source

Scoring guideline:

exact match → 100

partially correct → 60-80

inconsistent → <50


2.3 Type Correctness (15%)

Evaluate whether values match schema types.

Examples:

string

number

boolean

array

Formula:

type_score =

(# correct types / total fields) * 100


2.4 Information Sufficiency (15%)

Evaluate whether the structured data misses obvious information

present in the cleaned text.

Example:

Cleaned text contains:

title

author

date

But structured JSON only includes:

title

Then deduct score.

Guideline:

complete extraction → 100

minor missing info → 70–90

major missing info → <60


Structuring Quality Score

structuring_quality_score =

completeness_score * 0.30

  • value_accuracy_score * 0.40
  • type_score * 0.15
  • information_sufficiency_score * 0.15

Range:

0 – 100


Step 3 --- Final Miner Score

miner_score =

content_consistency_score * 0.4

  • structuring_quality_score * 0.6

Range:

0 – 100


Output Format

The evaluator must return:

{
  "content_consistency_score": 92,
  "structuring_quality_score": 85,
  "miner_score": 88.2,
  "details": {
    "completeness_score": 90,
    "value_accuracy_score": 88,
    "type_score": 100,
    "information_sufficiency_score": 80
  }
}

Evaluator Rules

The evaluator must follow these principles:

  1. Be deterministic and reproducible
  2. Base judgments only on provided inputs
  3. Avoid hallucination
  4. Penalize missing or inconsistent data
  5. Return scores strictly in the 0--100 range

版本历史

共 1 个版本

  • v1.0.0 当前
    2026-03-30 03:37 安全 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

data-analysis

Data Analysis

ivangdavila
{"answer":"数据分析与可视化。查询数据库、生成报告、自动化电子表格,将原始数据转化为清晰可行的见解。适用于:(1) 您……"}
★ 198 📥 65,014
data-analysis

Excel / XLSX

ivangdavila
创建、检查和编辑 Microsoft Excel 工作簿及 XLSX 文件,支持可靠的公式、日期、类型、格式、重算及模板保留功能。
★ 367 📥 140,249

Agent Store

levey
用于用户想购买、订购、支付或充值 API 密钥或 API 额度。
★ 0 📥 321