Scraper

Structured extraction and cleanup for public, user-authorized web pages. Use when the user wants to collect, clean, summarize, or transform content from acce...

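Structured extraction and cleanup of public or user-authorized web page content, for collecting, cleaning, summarizing, and transforming it.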

agistack

Security & Compliance | clawhub | v1.0.0 | 1 version | 99877.8 | Key: not required

★ 0 stars · 📥 1,635 downloads · 💾 32 installs

Scraper

Turn messy public pages into clean, reusable data.

Core Purpose

Scraper is a safe extraction skill for public, user-authorized pages.

It helps the agent:

fetch page content from a URL
extract readable text
strip boilerplate where possible
save clean output locally
prepare content for later summarization or analysis
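
The extraction and boilerplate-stripping steps above could look roughly like the following. This is a minimal sketch using only the standard library, not the skill's actual `extract_text.py`; the tag lists and the names `TextExtractor` and `html_to_text` are assumptions made for illustration.

```python
from html.parser import HTMLParser

# Assumption: tags whose content is treated as boilerplate and dropped.
SKIP_TAGS = {"script", "style", "noscript", "nav", "header", "footer", "aside"}
# Assumption: tags that start or end a visual block, so they become line breaks.
BLOCK_TAGS = {"p", "div", "br", "li", "tr", "section", "article",
              "h1", "h2", "h3", "h4", "h5", "h6"}


class TextExtractor(HTMLParser):
    """Collect visible text while skipping boilerplate containers."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self._skip_depth = 0  # > 0 while inside a skipped container
        self._parts = []

    def handle_starttag(self, tag, attrs):
        if tag in SKIP_TAGS:
            self._skip_depth += 1
        elif tag in BLOCK_TAGS:
            self._parts.append("\n")

    def handle_endtag(self, tag):
        if tag in SKIP_TAGS and self._skip_depth > 0:
            self._skip_depth -= 1
        elif tag in BLOCK_TAGS:
            self._parts.append("\n")

    def handle_data(self, data):
        if self._skip_depth == 0:
            self._parts.append(data)

    def text(self):
        # Collapse runs of whitespace and drop empty lines.
        lines = (" ".join(line.split()) for line in "".join(self._parts).splitlines())
        return "\n".join(line for line in lines if line)


def html_to_text(html):
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return parser.text()


if __name__ == "__main__":
    sample = "<nav>Menu</nav><h1>Hello</h1><p>World &amp; all</p>"
    print(html_to_text(sample))
```

A tag-based filter like this is deliberately crude: it removes obvious navigation and script content but cannot tell an article body from a sidebar that uses plain `div` elements, which is why the list above says "where possible".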

Safety Boundaries

Only use on public or user-authorized pages
Do not bypass logins, paywalls, captchas, robots restrictions, or rate limits
Do not request or store credentials
Do not perform stealth scraping, account creation, or identity evasion
Save outputs locally only
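
One way to honor the robots restriction before fetching is to consult the site's `robots.txt` with the standard library's `urllib.robotparser`. The sketch below is an assumption about how such a check might be written, not code from the skill; `USER_AGENT` and `is_allowed` are hypothetical names, and the robots.txt text is passed in directly so the check itself needs no network access.

```python
from urllib import robotparser
from urllib.parse import urlsplit

# Hypothetical user-agent string, used only for this example.
USER_AGENT = "scraper-skill-example"


def is_allowed(url, robots_txt, user_agent=USER_AGENT):
    """Return True if robots_txt permits user_agent to fetch url."""
    # Only plain web URLs are in scope; reject file:, ftp:, and similar schemes.
    if urlsplit(url).scheme not in ("http", "https"):
        return False
    parser = robotparser.RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    rules = "User-agent: *\nDisallow: /private/\n"
    print(is_allowed("https://example.com/public/page", rules))
    print(is_allowed("https://example.com/private/page", rules))
```

In a real run the robots.txt body would come from the target site, and a disallowed URL should end the job rather than trigger a retry with different headers.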

Runtime Requirements

Python 3 must be available as python3
No external packages required

Local Storage

All outputs are stored locally under:

~/.openclaw/workspace/memory/scraper/jobs.json
~/.openclaw/workspace/memory/scraper/output/
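
To illustrate how a job registry and an output folder can work together, here is a minimal sketch. The layout of `jobs.json` is not documented in this listing, so the record fields (`id`, `url`, `title`, `output`, `created_at`), the `job-0001.txt` file naming, and the function name `register_job` are all assumptions.

```python
import json
from datetime import datetime, timezone
from pathlib import Path

# Base directory taken from the paths listed above.
DEFAULT_BASE = Path.home() / ".openclaw" / "workspace" / "memory" / "scraper"


def register_job(url, title, text, base_dir=DEFAULT_BASE):
    """Write text to output/ and append a record to jobs.json (assumed schema)."""
    base = Path(base_dir)
    output_dir = base / "output"
    output_dir.mkdir(parents=True, exist_ok=True)

    jobs_path = base / "jobs.json"
    if jobs_path.exists():
        jobs = json.loads(jobs_path.read_text(encoding="utf-8"))
    else:
        jobs = []

    job_id = len(jobs) + 1
    output_path = output_dir / f"job-{job_id:04d}.txt"  # hypothetical naming
    output_path.write_text(text, encoding="utf-8")

    job = {
        "id": job_id,
        "url": url,
        "title": title,
        "output": str(output_path),
        "created_at": datetime.now(timezone.utc).isoformat(),
    }
    jobs.append(job)
    jobs_path.write_text(json.dumps(jobs, indent=2, ensure_ascii=False), encoding="utf-8")
    return job
```

Passing a temporary directory as `base_dir` keeps experiments away from the real workspace.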

Key Workflows

Capture a page: fetch_page.py --url "https://example.com"
Extract readable text: extract_text.py --url "https://example.com"
Save cleaned content: save_output.py --url "https://example.com" --title "Example"
List prior jobs: list_jobs.py

Scripts

| Script | Purpose |
| --- | --- |
| `init_storage.py` | Initialize scraper storage |
| `fetch_page.py` | Download a page with standard headers |
| `extract_text.py` | Convert HTML into cleaned plain text |
| `save_output.py` | Save extracted output and register a job |
| `list_jobs.py` | Show past scraping jobs |
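
As an illustration of "download a page with standard headers", a fetch step can be written with `urllib.request` alone, consistent with the no-external-packages requirement. This is a sketch under assumptions, not the contents of `fetch_page.py`: the header values, the 10-second timeout, and the names `build_request` and `fetch_page` are hypothetical.

```python
import urllib.request
from urllib.parse import urlsplit

# Hypothetical header set; the skill's real headers are not documented here.
DEFAULT_HEADERS = {
    "User-Agent": "scraper-skill-example/1.0",
    "Accept": "text/html,application/xhtml+xml;q=0.9,*/*;q=0.8",
}


def build_request(url):
    """Build a GET request, refusing anything that is not http or https."""
    if urlsplit(url).scheme not in ("http", "https"):
        raise ValueError(f"unsupported URL scheme: {url!r}")
    return urllib.request.Request(url, headers=DEFAULT_HEADERS)


def fetch_page(url, timeout=10.0):
    """Download url and decode it using the charset the server declares."""
    with urllib.request.urlopen(build_request(url), timeout=timeout) as resp:
        charset = resp.headers.get_content_charset() or "utf-8"
        return resp.read().decode(charset, errors="replace")


if __name__ == "__main__":
    print(fetch_page("https://example.com")[:200])
```

Sending an honest, identifiable `User-Agent` fits the safety boundaries: the request does not pretend to be a browser, and it carries no cookies or credentials.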

Version History

1 version in total

v1.0.0 (current)

2026-03-29 06:27 · Safe

Security Checks

Tencent Cloud Security (Keen)

Safe, no risk

Tencent Cloud Security (Sanbu)

Safe, no risk
