← 返回
内容创作

wechat-article-getter

Extract full text from WeChat Official Account (微信公众号) article URLs. Use when a user shares an mp.weixin.qq.com link and asks to read, summarize, analyze, or...
Extract full text from WeChat Official Account (微信公众号) article URLs. Use when a user shares an mp.weixin.qq.com link and asks to read, summarize, analyze, or...
ppppanda
内容创作 clawhub v1.0.0 1 版本 100000 Key: 无需
★ 0
Stars
📥 823
下载
💾 103
安装
1
版本
#latest

概述

WeChat Article Reader

Extract full article content from mp.weixin.qq.com URLs.

When to Use

  • User shares a WeChat article link (mp.weixin.qq.com/s/xxx)
  • Need to read/summarize/analyze/archive a WeChat article
  • ContentPipe Scout node receives a WeChat URL for reference

Quick Start

# First-time setup (installs headless Chromium ~200MB)
python3 SKILL_DIR/scripts/setup.py

# Extract article
python3 SKILL_DIR/scripts/fetch_article.py "https://mp.weixin.qq.com/s/xxx"

# Output: JSON with title, author, publish_time, content, word_count

How It Works

WeChat articles are JS-rendered — HTTP requests only get an empty shell. This skill uses Playwright headless Chromium to:

  1. Launch headless browser with anti-detection flags
  2. Navigate to the WeChat URL, wait for networkidle
  3. Wait for #js_content (article body container)
  4. Extract title (h1#activity-name), author, time, body text
  5. Clean HTML → plain text (strip scripts/styles, compress whitespace)
  6. Return structured JSON

Fallback: Mirror Search

If Playwright is unavailable, the skill searches Chinese content aggregators (53ai.com, 36kr.com, juejin.cn, woshipm.com) for mirror copies of the article.

Python API

from fetch_article import fetch_wechat_article

result = fetch_wechat_article("https://mp.weixin.qq.com/s/xxx")
# result = {
#   "success": True,
#   "title": "文章标题",
#   "author": "作者名",
#   "publish_time": "2026-03-10",
#   "content": "正文全文...",
#   "word_count": 2500,
#   "source": "playwright",  # or "mirror"
#   "url": "https://mp.weixin.qq.com/s/xxx"
# }

Limitations

  • Requires one-time Chromium install (python3 scripts/setup.py)
  • First fetch takes ~5-10s (browser startup); subsequent fetches ~3-5s (browser reuse)
  • Cannot bypass WeChat login walls (paid content, follower-only articles)
  • Mirror fallback only works for popular/widely-shared articles

版本历史

共 1 个版本

  • v1.0.0 当前
    2026-03-31 03:01 安全 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

content-creation

Humanizer

biostartechnology
消除AI写作痕迹,使文本更自然真实。基于维基百科"AI写作特征"指南,识别并修正夸张象征、宣传用语、肤浅-ing分析、模糊归因、破折号滥用、三项排比、AI词汇、负面平行结构及冗长连接词等模式。
★ 859 📥 199,553
content-creation

Baidu Wenku AIPPT

ide-rea
使用百度文库 AI 智能生成 PPT,自动根据内容选择模板。
★ 66 📥 46,163
content-creation

YouTube

byungkyu
使用托管OAuth集成YouTube Data API,支持搜索视频、管理播放列表、获取频道数据及评论互动,适用于用户需要时使用此技能。
★ 142 📥 41,043