Turn messy public pages into clean, reusable data.
Scraper is a safe extraction skill for public, user-authorized pages.
It helps the agent:
python3All outputs are stored locally under:
~/.openclaw/workspace/memory/scraper/jobs.json~/.openclaw/workspace/memory/scraper/output/fetch_page.py --url "https://example.com"extract_text.py --url "https://example.com"save_output.py --url "https://example.com" --title "Example"list_jobs.py| Script | Purpose |
|---|---|
| --- | --- |
init_storage.py | Initialize scraper storage |
fetch_page.py | Download a page with standard headers |
extract_text.py | Convert HTML into cleaned plain text |
save_output.py | Save extracted output and register a job |
list_jobs.py | Show past scraping jobs |
共 1 个版本