
defuddle-web-cleaner

Extract and clean readable article content, metadata, and markdown from URLs or HTML for research, note taking, and web scraping.
extrastu
Productivity · clawhub · v1.0.0 · 1 version · 99940.7 · Key: not required
★ 1 star · 📥 1,664 downloads · 💾 42 installs · #latest

Overview


name: defuddle-web-cleaner

description: Extract clean article content from web pages using Defuddle. Use when a user provides a URL or HTML and wants the readable article text, a Markdown version, or structured metadata. Helpful for web scraping, research workflows, note taking, Obsidian clipping, and converting web pages to Markdown.


Defuddle Web Cleaner

Extract the main readable content from a web page.

This skill removes unnecessary elements such as:

  • navigation bars
  • sidebars
  • ads
  • comments
  • footers
  • social buttons

The result is clean article content.

Supported Inputs

  1. URL
  2. Raw HTML
  3. Web page text
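
The skill does not say how it tells these three inputs apart. As a minimal sketch, assuming a simple heuristic (an `http`/`https` scheme means URL, recognizable tags mean HTML, everything else is plain text), the check might look like the following. The function name `detect_input_type` and the tag list are hypothetical and not part of the skill.

```python
import re
from urllib.parse import urlparse

# Hypothetical heuristic: the skill does not document its detection rules.
_HTML_TAG = re.compile(r"<\s*(!doctype|html|head|body|article|div|p)\b", re.IGNORECASE)


def detect_input_type(value: str) -> str:
    """Return 'url', 'html', or 'text' for a user-supplied string."""
    text = value.strip()
    # A single token with an http(s) scheme and a host is treated as a URL.
    if " " not in text and "\n" not in text:
        parsed = urlparse(text)
        if parsed.scheme in ("http", "https") and parsed.netloc:
            return "url"
    # Common structural tags suggest raw HTML.
    if _HTML_TAG.search(text):
        return "html"
    # Anything else is treated as already-extracted page text.
    return "text"
```

A heuristic like this can misclassify prose that happens to quote HTML tags, so a real implementation may want an explicit input-type option as well.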

Output Format

Default output:

  • Title
  • Author
  • Site
  • Published date
  • Markdown article content

Alternative output (JSON):

{
  title,
  author,
  site,
  description,
  published,
  content,
  contentMarkdown
}
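
To make the JSON shape concrete, here is a rough sketch of a container for these seven fields and a serializer that keeps them in the order listed above. The class name `CleanedArticle`, the choice of which fields are optional, and the reading of `content` as cleaned HTML are assumptions for illustration; the skill only names the keys.

```python
import json
from dataclasses import asdict, dataclass
from typing import Optional

# Key order as listed in the skill's JSON output.
FIELD_ORDER = (
    "title", "author", "site", "description",
    "published", "content", "contentMarkdown",
)


@dataclass
class CleanedArticle:
    """Hypothetical container for the skill's JSON output."""
    title: str
    content: str              # assumed: cleaned article body as HTML
    contentMarkdown: str      # Markdown rendering of the same body
    author: Optional[str] = None       # assumed optional: not every page has one
    site: Optional[str] = None
    description: Optional[str] = None
    published: Optional[str] = None


def to_json(article: CleanedArticle) -> str:
    """Serialize with keys in the documented order; missing values become null."""
    data = asdict(article)
    ordered = {key: data[key] for key in FIELD_ORDER}
    return json.dumps(ordered, ensure_ascii=False, indent=2)
```

Emitting `null` for missing metadata, rather than dropping the key, keeps the shape stable for downstream tools that expect all seven fields.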

Processing Steps

  1. Detect input type
  2. Load page HTML
  3. Run Defuddle parser
  4. Extract metadata
  5. Convert to Markdown if requested
  6. Return clean content
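
The six steps above can be sketched as a small pipeline. This is only an outline under stated assumptions: `fetch_html`, `parse`, and `to_markdown` are hypothetical callables supplied by the caller (standing in for an HTTP client, the Defuddle parser, and an HTML-to-Markdown converter), and none of them reflects Defuddle's real API.

```python
from typing import Callable, Dict, Optional

METADATA_KEYS = ("title", "author", "site", "description", "published")


def clean_page(
    source: str,
    fetch_html: Callable[[str], str],          # hypothetical: URL -> HTML
    parse: Callable[[str], Dict[str, str]],    # hypothetical: HTML -> parsed fields
    to_markdown: Optional[Callable[[str], str]] = None,  # hypothetical converter
    want_markdown: bool = True,
) -> Dict[str, Optional[str]]:
    # 1. Detect input type (simplified: only URL vs. everything else).
    is_url = source.strip().lower().startswith(("http://", "https://"))

    # 2. Load page HTML: fetch URLs, use other input as given.
    html = fetch_html(source.strip()) if is_url else source

    # 3. Run the parser to isolate the main content.
    parsed = parse(html)

    # 4. Extract metadata, using None for anything the parser did not find.
    result: Dict[str, Optional[str]] = {k: parsed.get(k) for k in METADATA_KEYS}
    result["content"] = parsed.get("content", "")

    # 5. Convert to Markdown only if requested and a converter is available.
    if want_markdown and to_markdown is not None:
        result["contentMarkdown"] = to_markdown(result["content"] or "")

    # 6. Return clean content.
    return result
```

Passing the fetcher and parser in as arguments keeps the sketch independent of any particular library and makes each step easy to replace or test in isolation.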

Example

Input:

https://example.com/blog/ai

Output:

Title: AI is Changing Everything

Author: Jane Smith

Site: Example Blog

Markdown:

AI is Changing Everything

Artificial intelligence is transforming industries...
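
As an illustration of how the default output above could be produced from the structured fields, the following sketch renders a result dictionary into the labelled layout used in this example. The function name `render_default` and the decision to skip empty fields are assumptions, not documented behaviour of the skill.

```python
from typing import Mapping, Optional

# Field key -> label, in the order used by the default output.
LABELS = (
    ("title", "Title"),
    ("author", "Author"),
    ("site", "Site"),
    ("published", "Published date"),
)


def render_default(article: Mapping[str, Optional[str]]) -> str:
    """Render labelled metadata lines followed by the Markdown body."""
    blocks = []
    for key, label in LABELS:
        value = article.get(key)
        if value:  # assumed: fields with no value are omitted
            blocks.append(f"{label}: {value}")
    markdown = article.get("contentMarkdown")
    if markdown:
        blocks.append("Markdown:")
        blocks.append(markdown)
    # Blank line between blocks, matching the layout of the example.
    return "\n\n".join(blocks)
```

Called with the title, author, site, and Markdown body from this example and no published date, it would emit the `Title`, `Author`, and `Site` lines followed by the `Markdown:` section.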

Tips

Use this skill when:

  • saving articles to Obsidian
  • building research datasets
  • cleaning webpages for LLM processing
  • summarizing articles

Version History

1 version in total

  • v1.0.0 (current)
    2026-03-19 09:10 · Safe · Safe

Security Check

Tencent Cloud Security (Keen)

Safe, no risk

Tencent Cloud Security (Sanbu)

Safe, no risk
