← 返回
未分类 中文

Web Fetch

Fetch a known URL and return the page as markdown, plain text, raw HTML, or a downloaded image file using the local fetch.js helper. Use this whenever the us...
获取已知 URL 并以 Markdown、纯文本、原始 HTML 或下载的图片文件形式返回,使用本地fetch.js 辅助函数,适用于用户需要获取网页内容的场景。
shikamaru-cc shikamaru-cc 来源
未分类 clawhub v1.0.0 1 版本 99864.1 Key: 无需
★ 0
Stars
📥 735
下载
💾 0
安装
1
版本
#latest

概述

Web Fetch

Use this skill when the target URL is already known and the job is retrieval, not search.

Common cases:

  • fetch a documentation page the user already linked
  • pull an article into readable markdown or plain text before summarizing it
  • inspect raw HTML to debug markup, metadata, or page structure
  • download a remote image to a local path for later inspection or processing

If the user needs help finding the right page first, use web-search before this skill.

Command

Run:

node ./fetch.js --url "https://example.com"

Optional flags:

  • --format markdown|text|html
  • --timeout
  • --output for image responses

Choose the right format

Default to markdown unless the user clearly wants something else.

  • markdown: best for readable docs, articles, and summarization
  • text: best when the user wants the cleanest plain-text extraction
  • html: best when inspecting source markup, metadata, links, embeds, or page structure
  • --output : use when the response is an image and you want a stable saved file path instead of a temp file

Examples:

node ./fetch.js --url "https://example.com/docs" --format markdown
node ./fetch.js --url "https://example.com/page" --format text --timeout 20
node ./fetch.js --url "https://example.com/page" --format html
node ./fetch.js --url "https://example.com/logo.png" --output /tmp/logo.png

How to work with the result

The CLI prints a block.

For text-like responses it includes:

  • title
  • url
  • mime
  • format
  • content

For image responses it includes:

  • title
  • url
  • mime
  • image

After fetching:

  1. Read the returned fields carefully.
  2. Use only the parts relevant to the user's request.
  3. Summarize or quote concise excerpts instead of dumping the whole page unless the user asked for the full content.
  4. If you fetched HTML, explain notable structure or metadata rather than pasting large raw blobs unless needed.
  5. If you downloaded an image, tell the user the saved file path.

Behavior and limits

Keep these in mind while using the tool:

  • http:// URLs are tried as https:// first, then retried as plain HTTP if needed.
  • Default timeout is 30 seconds; maximum is 120 seconds.
  • Responses larger than 5MB are rejected.
  • Browser-like headers are used, with a retry path for some Cloudflare 403 challenge responses.
  • SVG is treated as text, not as a binary image attachment.

Failure handling

If the fetch fails:

  • verify the URL is complete and starts with http:// or https://
  • retry with a different format only if the user's goal changed
  • if the page is too large, blocked, or times out, tell the user plainly and suggest a narrower target URL or an alternate source

版本历史

共 1 个版本

  • v1.0.0 当前
    2026-05-03 04:09 安全 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

data-analysis

Stock Watcher

robin797860
管理和监控个人股票自选列表,支持利用同花顺数据添加、删除、列出股票及汇总近期表现。适用于用户希望追踪特定股票、获取表现汇总或管理自选列表时。
★ 112 📥 46,270
data-analysis

Tavily 搜索

jacky1n7
通过 Tavily API 进行网页搜索(Brave 替代方案)。当用户要求搜索网页、查找来源或链接,且 Brave 网页搜索不可用时使用。
★ 273 📥 100,417
data-analysis

Data Analysis

ivangdavila
{"answer":"数据分析与可视化。查询数据库、生成报告、自动化电子表格,将原始数据转化为清晰可行的见解。适用于:(1) 您……"}
★ 210 📥 68,770