← 返回
数据分析 中文

csv-cleanroom

Profile messy CSV files, standardize columns, detect data quality issues, and produce a reproducible cleanup plan.
分析混乱的CSV文件,标准化列,检测数据质量问题,并生成可重复的清理计划。
52yuanchangxing
数据分析 clawhub v1.0.0 1 版本 99822.1 Key: 无需
★ 0
Stars
📥 561
下载
💾 13
安装
1
版本
#latest#skill

概述

CSV Cleanroom

Purpose

Profile messy CSV files, standardize columns, detect data quality issues, and produce a reproducible cleanup plan.

Trigger phrases

  • 清洗 CSV
  • profile this dataset
  • 数据质量检查
  • 列名规范化
  • build a cleanup plan

Ask for these inputs

  • CSV file or schema
  • target schema if available
  • known bad values
  • dedupe rules
  • date/currency locale

Workflow

  1. Profile the CSV: row count, nulls, duplicates, type mismatches, and outliers.
  2. Normalize headers and map to the target schema.
  3. Generate a step-by-step cleanup plan and optional transformed output.
  4. Document irreversible operations before applying them.
  5. Return a quality score and remediation checklist.

Output contract

  • profile report
  • normalized schema
  • cleanup plan
  • quality scorecard

Files in this skill

  • Script: {baseDir}/scripts/csv_cleanroom.py
  • Resource: {baseDir}/resources/data_quality_checklist.md

Operating rules

  • Be concrete and action-oriented.
  • Prefer preview / draft / simulation mode before destructive changes.
  • If information is missing, ask only for the minimum needed to proceed.
  • Never fabricate metrics, legal certainty, receipts, credentials, or evidence.
  • Keep assumptions explicit.

Suggested prompts

  • 清洗 CSV
  • profile this dataset
  • 数据质量检查

Use of script and resources

Use the bundled script when it helps the user produce a structured file, manifest, CSV, or first-pass draft.

Use the resource file as the default schema, checklist, or preset when the user does not provide one.

Boundaries

  • This skill supports planning, structuring, and first-pass artifacts.
  • It should not claim that files were modified, messages were sent, or legal/financial decisions were finalized unless the user actually performed those actions.

Compatibility notes

  • Directory-based AgentSkills/OpenClaw skill.
  • Runtime dependency declared through metadata.openclaw.requires.
  • Helper script is local and auditable: scripts/csv_cleanroom.py.
  • Bundled resource is local and referenced by the instructions: resources/data_quality_checklist.md.

版本历史

共 1 个版本

  • v1.0.0 当前
    2026-03-30 00:55 安全 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

data-analysis

Data Analysis

ivangdavila
{"answer":"数据分析与可视化。查询数据库、生成报告、自动化电子表格,将原始数据转化为清晰可行的见解。适用于:(1) 您……"}
★ 198 📥 65,111
content-creation

paper-assistant

52yuanchangxing
面向论文选题、提纲、摘要、引言、文献综述、研究方法、结果讨论、润色改写与投稿准备的论文助手。
★ 1 📥 1,951
data-analysis

A股量化 AkShare

mbpz
A股量化数据分析工具,基于AkShare库获取A股行情、财务数据、板块信息等。用于回答关于A股股票查询、行情数据、财务分析、选股等问题。
★ 165 📥 60,001