← 返回
AI智能 中文

Humanize Chinese 2.0.0

Detect and humanize AI-generated Chinese text. 20+ detection categories, weighted 0-100 scoring with sentence-level analysis, 7 style transforms (casual/zhih...
检测并人性化AI生成的中文文本。支持20+检测类别、0-100加权评分、句子级分析及7种风格转换(如口语/知乎风...)。
0xspeter
AI智能 clawhub v1.0.0 1 版本 100000 Key: 无需
★ 0
Stars
📥 534
下载
💾 51
安装
1
版本
#latest

概述

Humanize Chinese AI Text v2.0

Comprehensive CLI for detecting and transforming Chinese AI-generated text. Makes robotic AI writing natural and human-like.

v2.0 highlights: weighted 0-100 scoring, sentence-level analysis, sentence restructuring (merge/split), context-aware replacement, rhythm variation, vocabulary diversification, 7 style transforms, external pattern config (patterns_cn.json).

Quick Start

# Detect AI patterns (20+ categories, 0-100 score)
python scripts/detect_cn.py text.txt
python scripts/detect_cn.py text.txt -v          # verbose + worst sentences
python scripts/detect_cn.py text.txt -s           # score only
python scripts/detect_cn.py text.txt -j           # JSON output

# Humanize text
python scripts/humanize_cn.py text.txt -o clean.txt
python scripts/humanize_cn.py text.txt --scene social
python scripts/humanize_cn.py text.txt --scene tech -a   # aggressive mode
python scripts/humanize_cn.py text.txt --seed 42         # reproducible

# Apply writing styles
python scripts/style_cn.py text.txt --style zhihu -o zhihu.txt
python scripts/style_cn.py text.txt --style xiaohongshu
python scripts/style_cn.py --list

# Compare before/after
python scripts/compare_cn.py text.txt --scene tech -a
python scripts/compare_cn.py text.txt -o clean.txt

Detection System

Scoring

Weighted 0-100 score with 4 severity levels:

ScoreLevelMeaning
-----------------------
0-24LOWLikely human-written
25-49MEDIUMSome AI signals
50-74HIGHProbably AI-generated
75-100VERY HIGHAlmost certainly AI

Detection Categories

🔴 Critical (weight: 8)

CategoryExamples
--------------------
Three-Part Structure首先...其次...最后, 一方面...另一方面, 其一...其二...其三
Mechanical Connectors值得注意的是, 综上所述, 不难发现, 归根结底, 由此可见
Empty Grand Words赋能, 闭环, 数字化转型, 协同增效, 全方位, 多维度

🟠 High Signal (weight: 4)

CategoryExamples
--------------------
AI High-Frequency Words助力, 彰显, 底层逻辑, 抓手, 触达, 沉淀, 复盘
Filler Phrases值得一提的是, 众所周知, 毫无疑问
Balanced Arguments虽然...但是...同时, 既有...也有...更有
Template Sentences随着...的不断发展, 在当今...时代, 作为...的重要组成部分

🟡 Medium Signal (weight: 2)

CategoryExamples
--------------------
Hedging Language在一定程度上, 某种程度上, 通常情况下 (>5 occurrences)
List AddictionExcessive numbered/bulleted lists
Punctuation OveruseDense em dashes, semicolons
Excessive Rhetoric对偶/排比句过多

⚪ Style Signal (weight: 1.5)

CategoryDescription
-----------------------
Uniform ParagraphsLow CV in paragraph lengths
Low BurstinessMonotonous sentence lengths
Emotional FlatnessLack of emotional/personal expressions
Repetitive StartersSame sentence starters >3 times
Low EntropyLow character-level entropy (predictable text)

Sentence-Level Analysis

With -v (verbose) mode, the detector identifies the most AI-like sentences:

── 最可疑句子 ──
  1. [16分] 随着人工智能技术的不断发展,在当今数字化转型时代...
     原因: 数字化转型, 深度融合, 模板: 随着.*?的(不断)?发展

Humanization Engine

Transforms (applied in order)

  1. Structure cleanup — Remove three-part structure (首先/其次/最后)
  2. Phrase replacement — Context-aware replacement of AI phrases (regex patterns first, then plain text, longest-first matching)
  3. Sentence merge — Merge overly short consecutive sentences
  4. Sentence split — Split long sentences at natural breakpoints (但是/不过/同时)
  5. Punctuation normalization — Reduce excessive semicolons, em dashes
  6. Vocabulary diversification — Replace repeated words (进行/实现/提供 etc.) with synonyms
  7. Paragraph rhythm — Vary uniform paragraph lengths (merge short, split long)
  8. Casual injection — Add human expressions (scene-dependent)
  9. Paragraph shortening — For social/chat scenes

Scenes

SceneCasualnessBest For
----------------------------
general0.3Default, balanced
social0.7Social media, short posts
tech0.3Tech blogs, tutorials
formal0.1Formal articles, reports
chat0.8Conversations, messaging

Aggressive Mode (-a)

Adds +0.3 casualness, more colloquial expressions, stronger sentence restructuring. Typical score reduction: 60-80 points on heavily AI-generated text.

Reproducibility

Use --seed N for reproducible results (same input + seed = same output).


Writing Style Transforms

7 specialized Chinese writing styles:

StyleNameDescription
--------------------------
casual口语化Like chatting with friends — natural, relaxed
zhihu知乎Rational, in-depth, personal opinions
xiaohongshu小红书Enthusiastic, emoji-rich, product-focused
wechat公众号Storytelling, engaging, relatable
academic学术Rigorous, precise, no colloquialisms
literary文艺Poetic, imagery-rich, metaphorical
weibo微博Short, opinionated, shareable

Combine humanize + style

python scripts/humanize_cn.py text.txt --style xiaohongshu -o xhs.txt

This first humanizes (removes AI patterns) then applies the style transform.


External Configuration

All patterns, replacements, and scoring weights are in scripts/patterns_cn.json. Edit this file to:

  • Add new AI vocabulary patterns
  • Customize replacement alternatives
  • Adjust scoring weights per severity
  • Add regex patterns for template detection
  • Set thresholds for hedging language detection

Scripts Reference

detect_cn.py

python scripts/detect_cn.py [file] [-j] [-s] [-v] [--sentences N]
FlagDescription
-------------------
-jJSON output
-sScore only (e.g. "72/100 (high)")
-vVerbose: show worst sentences
--sentences NNumber of worst sentences to show (default: 5)

humanize_cn.py

python scripts/humanize_cn.py [file] [-o output] [--scene S] [--style S] [-a] [--seed N]
FlagDescription
-------------------
-oOutput file
--scenegeneral/social/tech/formal/chat
--stylecasual/zhihu/xiaohongshu/wechat/academic/literary/weibo
-aAggressive mode
--seedRandom seed for reproducibility

style_cn.py

python scripts/style_cn.py [file] --style S [-o output] [--seed N] [--list]

compare_cn.py

python scripts/compare_cn.py [file] [-o output] [--scene S] [--style S] [-a]

Shows score diff, category changes, and metric comparison before/after humanization.


Workflow

# 1. Check AI score
python scripts/detect_cn.py document.txt -v

# 2. Humanize with comparison
python scripts/compare_cn.py document.txt --scene tech -a -o clean.txt

# 3. Verify improvement
python scripts/detect_cn.py clean.txt -s

# 4. Optional: apply specific style
python scripts/style_cn.py clean.txt --style zhihu -o final.txt

Batch Processing

# Scan all files
for f in *.txt; do
  echo "=== $f ==="
  python scripts/detect_cn.py "$f" -s
done

# Transform all markdown
for f in *.md; do
  python scripts/humanize_cn.py "$f" --scene tech -a -o "${f%.md}_clean.md"
done

版本历史

共 1 个版本

  • v1.0.0 当前
    2026-03-30 05:26 安全 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

data-analysis

Binance Pro 1.0.0

0xspeter
币安全功能集成,全球最大加密货币交易所。支持现货、125倍杠杆合约、质押及资产管理。
★ 0 📥 701
ai-intelligence

self-improving agent

pskoett
捕获经验教训、错误和纠正,以实现持续改进。使用时机:(1)命令或操作意外失败;(2)用户纠正……
★ 4,058 📥 797,631
ai-intelligence

Self-Improving + Proactive Agent

ivangdavila
自我反思+自我批评+自我学习+自组织记忆。智能体评估自身工作、发现错误并持续改进。
★ 1,353 📥 317,926