← 返回
未分类 Key

AI-powered DP Data Processing Pipeline Designer

DP 数据处理平台流水线设计师。当用户提到创建管道、数据流、Flink作业、Kafka读取、写到数据库/MongoDB、数据处理管道等需求时激活。
DP 数据处理平台流水线设计师,当用户提及创建管道、数据流、Flink作业、Kafka读取、写入数据库/MongoDB、数据处理管道等需求时激活。
hxp365 hxp365 来源
未分类 clawhub v2.0.1 1 版本 100000 Key: 需要
★ 0
Stars
📥 357
下载
💾 0
安装
1
版本
#data-processing#etl#flink#latest#pipeline

概述

dp-pipeline-designer

Purpose

Design, generate, submit, and monitor DP Data Processing Platform pipelines from natural language descriptions. This skill acts as an AI front-end to the DP platform's REST API, translating user requirements into runnable Apache Flink data pipelines.

The DP platform runs at ${DP_SERVER_URL} by default.

Environment Configuration

The DP platform runs at ${DP_SERVER_URL}.

Required environment variables:

DP_SERVER_URL=${DP_SERVER_URL}   # REQUIRED — DP platform base URL
DP_API_KEY=${DP_API_KEY}         # REQUIRED — obtain from platform「API Key 管理」page

ALL curl commands MUST use -H 'X-DP-API-Key: ${DP_API_KEY}'. No other authentication method is supported.

首次使用引导

# 校验 DP_API_KEY — 未配置则终止
if [ -z "${DP_API_KEY}" ]; then
  echo "======================================"
  echo "  DP Platform — API Key 必填"
  echo "======================================"
  echo "错误:未检测到 DP_API_KEY,无法继续。"
  echo ""
  echo "请按以下步骤配置:"
  echo "1. 访问 DP 平台控制台:${DP_SERVER_URL}"
  echo "2. 注册账号(如需邀请码请联系管理员)"
  echo "3. 进入「API Key 管理」→「申请新 Key」"
  echo "4. 将生成的 Key 配置到 DP_API_KEY 环境变量"
  echo ""
  echo "免费版:100次/月。超额后需升级订阅套餐。"
  echo "======================================"
  exit 1
fi
echo "API Key 已验证:${DP_API_KEY:0:8}****"

配额说明

  • 免费版:100 次/月 API 调用额度
  • 超额时响应中会包含 quota_exceeded: true 字段
  • 响应中的 upgrade_url 字段指向订阅升级页面
  • 升级套餐可获得更多额度:BASIC(1000次)、PRO(10000次)、ENTERPRISE(不限)

Capabilities

  • Ask clarifying questions to understand data source, transformation logic, and output target
  • Generate a complete StreamJob XML configuration
  • Submit the job via REST API and start execution
  • Poll job status and report progress per operator
  • Retrieve error logs and suggest root-cause fixes
  • Support 4 built-in industry scenario templates

Context Files

The following files provide the knowledge base for this skill:

FilePurpose
------
dp-operator-catalog.jsonComplete list of 60+ operators with params and pipeline templates
dp-api-reference.mdREST API endpoints, parameters, and curl examples
dp-job-schema.mdStreamJob XML format, examples, and encoding rules

Workflow

Phase 1: Requirements Gathering

When the user describes a pipeline need, ask ALL of the following if not already specified:

  1. Data Source: Where does data come from?
  2. Transformations: What should happen to the data?
  3. Output Target: Where should results go?
  4. Schedule: How should the job run?
  5. Resources: What connections are already configured?

Phase 2-5: Pipeline Design, XML Generation, Job Submission, Status Monitoring

See full SKILL.md for complete implementation details.

Limitations & Guardrails

  1. Only use operators listed in dp-operator-catalog.json
  2. Always show the design to the user for confirmation before generating XML
  3. Do not store passwords in XML files — use resourceId references
  4. If DP Server is unreachable, guide user to start services
  5. Parallelism default is 1 for new jobs

License

MIT-0: Free to use, modify, and redistribute. No attribution required.

版本历史

共 1 个版本

  • v2.0.1 当前
    2026-05-07 05:16 安全 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

data-analysis

Data Analysis

ivangdavila
{"answer":"数据分析与可视化。查询数据库、生成报告、自动化电子表格,将原始数据转化为清晰可行的见解。适用于:(1) 您……"}
★ 211 📥 69,058
data-analysis

AdMapix

fly0pants
AdMapix 原始数据层,提供广告创意、应用、排名、下载/收入及市场元数据。返回 AdMapix API 的结构化 JSON;调用方...
★ 297 📥 141,324
data-analysis

Tavily 搜索

jacky1n7
通过 Tavily API 进行网页搜索(Brave 替代方案)。当用户要求搜索网页、查找来源或链接,且 Brave 网页搜索不可用时使用。
★ 273 📥 100,544