Wechat Mp Fetch

WeChat Official Account article fetcher: extracts the title, body text, and final URL from mp.weixin.qq.com links via Playwright, and follows redirects.
Author: write31bug
Uncategorized · clawhub · v1.1.1 · 1 version · 100000 · Key: not required
★ 1 star · 📥 571 downloads · 💾 0 installs · #latest

Overview

📖 WeChat Article Fetch

> Extract article title, body text, and original URL from WeChat Official Account links (mp.weixin.qq.com)


✨ Features

  • 🎯 Title Extraction — Extracts article title from rendered page
  • 📝 Body Text — Extracts clean text content from #js_content
  • 🔗 URL Resolution — Handles redirects, returns final canonical URL
  • 🌐 Full Rendering — Uses Playwright/Chromium for JS-heavy pages
  • 🔒 Privacy First — 100% local, no data uploaded anywhere

🚀 Quick Start

Installation

cd <skill-path>
npm install
npx playwright install chromium

Usage

node scripts/wx-article-fetch.js "https://mp.weixin.qq.com/s/xxxxx"
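
Before invoking the script, a caller may want to confirm that a link actually points at mp.weixin.qq.com. The following is a minimal sketch of such a pre-check; `isWeChatArticleUrl` is a hypothetical helper and is not claimed to exist in the published script.

```javascript
// Hypothetical pre-check, not part of wx-article-fetch.js:
// accept only http(s) URLs whose host is exactly mp.weixin.qq.com.
function isWeChatArticleUrl(input) {
  let parsed;
  try {
    parsed = new URL(input); // throws on malformed input
  } catch {
    return false;
  }
  const isHttp = parsed.protocol === 'https:' || parsed.protocol === 'http:';
  return isHttp && parsed.hostname === 'mp.weixin.qq.com';
}
```

Rejecting other hosts early avoids launching a browser for links the tool is not designed to handle.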

Output

{
  "success": true,
  "title": "文章标题",
  "content": "正文内容...",
  "url": "https://mp.weixin.qq.com/s/xxxxx"
}
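
A caller that runs the script from another Node.js program will likely want to check this shape before using it. The sketch below assumes the script writes the JSON object to stdout, which the listing implies but does not state; `parseFetchResult` and `runFetch` are hypothetical names introduced for illustration.

```javascript
const { execFile } = require('child_process');

// Validate the JSON shape shown in the Output example above.
function parseFetchResult(stdout) {
  const result = JSON.parse(stdout); // throws SyntaxError on non-JSON output
  if (result.success !== true) {
    throw new Error('fetch did not report success');
  }
  for (const key of ['title', 'content', 'url']) {
    if (typeof result[key] !== 'string') {
      throw new Error(`missing or non-string field: ${key}`);
    }
  }
  return { title: result.title, content: result.content, url: result.url };
}

// Assumption: the script prints its JSON result to stdout.
function runFetch(articleUrl, callback) {
  execFile(
    'node',
    ['scripts/wx-article-fetch.js', articleUrl],
    { maxBuffer: 10 * 1024 * 1024 }, // long articles can exceed the default buffer
    (error, stdout) => {
      if (error) return callback(error);
      let result;
      try {
        result = parseFetchResult(stdout);
      } catch (parseError) {
        return callback(parseError);
      }
      callback(null, result);
    }
  );
}
```

The shape of a failed result is not documented here, so the sketch only distinguishes `success: true` from everything else.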

💡 Usage Scenarios

Scenario | Description
---------|------------
📚 Content Archival | Save articles for offline reading
📝 Note-taking | Convert articles to notes
🔍 Research | Batch collect article content
✍️ Writing Reference | Extract key info for writing
🔄 Content Repurposing | Extract text for rewriting
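
As one illustration of the note-taking scenario, a result object can be turned into a plain Markdown note. This is a minimal sketch; `toMarkdownNote` is a hypothetical helper, and the note layout is an arbitrary choice rather than anything the tool prescribes.

```javascript
// Hypothetical helper: turn one fetch result ({ title, content, url })
// into a small Markdown note with the source link under the heading.
function toMarkdownNote(result) {
  return [
    `# ${result.title}`,
    '',
    `Source: ${result.url}`,
    '',
    result.content.trim(),
    '',
  ].join('\n');
}
```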

⚠️ Known Limitations

Issue | Description
------|------------
🔐 Login Required | Some articles require WeChat login
💰 Paid Content | Paywalled articles cannot be fetched
🔒 Private Accounts | Private official accounts are inaccessible
🖼️ Images | Only text is extracted; images keep their original URLs

🔧 Technical Details

  • Rendering Engine: Playwright + headless Chromium
  • Content Selector: #js_content container
  • No External APIs: All processing is 100% local
  • Browser Mode: Headless, no UI, no state leakage
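
The points above can be sketched roughly as follows. This is not the actual `scripts/wx-article-fetch.js`, whose contents are not shown here. It assumes the page `<title>` holds the article title (the real script may read a different element), and `extractArticle` and `fetchArticle` are hypothetical names.

```javascript
// Sketch only: the real script may differ.
// `page` is a Playwright Page, or any object exposing the same methods.
async function extractArticle(page, articleUrl) {
  await page.goto(articleUrl, { waitUntil: 'domcontentloaded' });
  await page.waitForSelector('#js_content'); // content container named above
  // Assumption: the document <title> carries the article title.
  const title = (await page.title()).trim();
  const content = (await page.locator('#js_content').innerText()).trim();
  // page.url() reflects any redirects followed during navigation.
  return { success: true, title, content, url: page.url() };
}

async function fetchArticle(articleUrl) {
  // Loaded lazily so extractArticle can be exercised without Playwright installed.
  const { chromium } = require('playwright');
  const browser = await chromium.launch({ headless: true });
  try {
    const page = await browser.newPage();
    return await extractArticle(page, articleUrl);
  } finally {
    await browser.close(); // always release the browser, even on failure
  }
}
```

Separating page handling from browser start-up keeps the extraction logic testable with a stub page object.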

🛡️ Security & Privacy

  • 100% Local — All operations run in local browser, no external server
  • No Login Required — No WeChat credentials needed
  • No Data Storage — Content exists only in caller's session
  • No Tracking — No analytics, no telemetry, no third-party deps

📁 Project Structure

wechat-mp-fetch/
├── _meta.json
├── SKILL.md
├── package.json
├── package-lock.json
└── scripts/
    └── wx-article-fetch.js    # Main script

🔗 Links

  • GitHub: https://github.com/write31bug/wechat-mp-fetch
  • npm: https://www.npmjs.com/package/wechat-mp-fetch
  • ClawHub: https://clawhub.ai/skills/wechat-mp-fetch

Version History

1 version in total

  • v1.1.1 (current)
    2026-05-03 04:52 · Safe

Security Scan

Tencent Cloud Security (Keen)

Safe, no risk

Tencent Cloud Security (Sanbu)

Safe, no risk
