
Distil Open Claw Pii

Redact PII from text locally using a fine-tuned 1B SLM. Text never leaves your machine. Handles names, emails, phones, addresses, SSNs, credit cards, IBANs,...
jgolebiowski
Uncategorized | clawhub | v1.1.1 | 1 version | 100000 | Key: not required
Stars: 0 | Downloads: 332 | Installs: 0 | Versions: 1 | #latest

Overview

PII Redaction Skill

When to use

Use this skill when the user asks to redact, anonymize, sanitize, or remove PII / personal data from text.

Privacy guarantee

CRITICAL: NEVER include the user's raw input text in your own responses, context, or reasoning. The entire point of this skill is that the frontier LLM (you) never sees the PII. You pass the text directly to the redaction script and only return the redacted output.

Prerequisites

  • Python 3
  • curl (for model download)

The setup script handles everything else (model download + server startup).

First-time setup

If the model server is not running yet, run:

bash scripts/setup.sh

This downloads the GGUF model (~5 GB) and starts the local inference server on port 8712.
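
To decide whether setup is needed, one option is to probe the port first. The following is a minimal sketch, not part of the skill itself: the helper name `server_is_listening` is hypothetical, and it assumes the server binds to 127.0.0.1. It only checks that something accepts TCP connections on port 8712, not that the model is loaded or healthy.

```python
import socket


def server_is_listening(host: str = "127.0.0.1", port: int = 8712,
                        timeout: float = 1.0) -> bool:
    """Return True if something accepts TCP connections on host:port."""
    # Assumption: the local inference server is reachable on the loopback
    # address; only the port number (8712) comes from the skill description.
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers connection refused, timeouts, and unreachable hosts.
        return False


if __name__ == "__main__":
    if server_is_listening():
        print("Port 8712 is accepting connections; setup can likely be skipped.")
    else:
        print("Nothing is listening on port 8712; run: bash scripts/setup.sh")
```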

How to redact

Pass the user's text directly to the redaction script. Do not echo or repeat the raw text yourself.

python scripts/redact.py "text to redact"

For longer text, pipe it via stdin:

echo "text to redact" | python scripts/redact.py

Return the output to the user as-is.
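
When calling the script from Python rather than a shell, the stdin route can be sketched as below. The wrapper name `redact_via_stdin` and the `cmd` parameter are hypothetical, and the sketch assumes `scripts/redact.py` is reachable relative to the current working directory and exits non-zero on failure. It never prints or logs the raw text.

```python
import subprocess
import sys
from typing import Sequence

# Assumption: the script lives at scripts/redact.py relative to the current
# working directory, as in the commands above.
DEFAULT_CMD = (sys.executable, "scripts/redact.py")


def redact_via_stdin(text: str, cmd: Sequence[str] = DEFAULT_CMD) -> str:
    """Send text to the redaction command on stdin and return its stdout."""
    result = subprocess.run(
        list(cmd),
        input=text,           # raw text goes to stdin, not onto the command line
        capture_output=True,
        text=True,
        check=True,           # raise CalledProcessError on a non-zero exit code
    )
    return result.stdout
```

Passing the text on stdin keeps it out of the process argument list, where other local users or tools could otherwise read it.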

--show-entities flag (use sparingly)

Adding --show-entities outputs the full JSON, including the original PII values. Only use this when the user explicitly asks to see which entities were detected or needs the mapping for a downstream task. In normal redaction workflows, omit this flag, because displaying the raw entity values defeats the purpose of PII redaction.

python scripts/redact.py --show-entities "text to redact"

How to stop the server

bash scripts/stop.sh

Output format

By default, the script prints only the redacted text. PII tokens replace the sensitive data, and the original values are never shown:

Hi, my name is [PERSON] and I need help with my recent order #ORD-29481.

You can reach me at [EMAIL] or call me at [PHONE]. I'm a [AGE_YEARS:34]-year-old [MARITAL_STATUS] woman living at [ADDRESS]...

With --show-entities, the script returns full JSON including original PII values (see flag note above for when this is appropriate).

See examples/ for full input/output samples.
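
For downstream checks, the placeholder tokens in the redacted text can be tallied without touching any original values. This is a rough sketch under one assumption: the token shape (an upper-case label in square brackets, optionally followed by a colon and a value) is inferred only from the sample output above and may not cover every tag the model emits. The names `TOKEN_RE` and `count_placeholders` are hypothetical.

```python
import re
from collections import Counter

# Assumed token shape, inferred only from the sample output: an upper-case
# label in square brackets, optionally followed by ":" and a value.
TOKEN_RE = re.compile(r"\[([A-Z][A-Z_]*)(?::([^\]]*))?\]")


def count_placeholders(redacted: str) -> Counter:
    """Count placeholder labels such as PERSON or EMAIL in redacted text."""
    return Counter(match.group(1) for match in TOKEN_RE.finditer(redacted))


sample = (
    "Hi, my name is [PERSON] and I need help with my recent order #ORD-29481. "
    "You can reach me at [EMAIL] or call me at [PHONE]. "
    "I'm a [AGE_YEARS:34]-year-old [MARITAL_STATUS] woman living at [ADDRESS]..."
)
counts = count_placeholders(sample)
print(sorted(counts))
```

The order number `#ORD-29481` is not counted because it has no square brackets.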

Version history

1 version in total

  • v1.1.1 (current)
    2026-05-07 19:26 Safe Safe

Security checks

Tencent Cloud Security (Keen)

Safe, no risk

Tencent Cloud Security (Sanbu)

Safe, no risk
