Use the published clawfetch npm CLI to convert a single web page into
normalized markdown with a small metadata header.
This skill is a thin wrapper around the public clawfetch package:
clawfetch npm package in this skill directory viaa separate, explicit bootstrap step.
Typical use cases for agents:
The CLI emits a header like:
--- METADATA ---
Title: ...
Author: ...
Site: ...
FinalURL: ...
Extraction: readability|fallback-container|body-innerText|github-raw-fast-path|reddit-rss
FallbackSelector: ... # only when not readability
--- MARKDOWN ---
<markdown>
This skill assumes:
node / npm are available in the OpenClaw container.clawfetch package and its dependencies.Use the OpenClaw CLI to pull the skill from ClawHub into your active workspace:
openclaw skills install clawfetch
This creates a directory similar to:
~/.openclaw/workspace/skills/clawfetch
At this point only the skill wrapper files exist (SKILL.md, README, manifest,
bootstrap_deps.sh, etc.). The actual npm package with the scraper logic is
not installed yet.
From the skill directory, run the bootstrap script to install the npm package
locally:
cd ~/.openclaw/workspace/skills/clawfetch
bash bootstrap_deps.sh
This script is small and reviewable; it only calls:
npm install clawfetch@0.1.7
Key points:
skills install does not currently auto-run this bootstrap step. Agents or humans must call bootstrap_deps.sh explicitly.
modified.
After this completes, the CLI entrypoint is available at:
~/.openclaw/workspace/skills/clawfetch/node_modules/clawfetch/clawfetch.js
After the bootstrap step, agents can invoke the CLI from this directory as:
node node_modules/clawfetch/clawfetch.js <url> [--max-comments N] [--no-reddit-rss]
Recommended patterns:
then uses Readability + Turndown to extract the main article body.
https://github.com/owner/repo). raw.githubusercontent.com before falling back to browser mode.
https://www.reddit.com/r/.../comments/.../)..rss and fetch the Atom feed using a normaldesktop browser User-Agent.
## Post: ...).### Comment by ...), limited by --max-comments (default 50; 0 means no limit).
--no-reddit-rss to force browser scraping instead of RSS.npm install in thebootstrap step.
published clawfetch package and its declared npm dependencies.
Agents should treat this skill as a read-mostly web → markdown tool and
avoid using it for arbitrary scripting or filesystem operations beyond its
intended CLI interface.
共 1 个版本