
Scraper

Structured extraction and cleanup for public, user-authorized web pages. Use when the user wants to collect, clean, summarize, or transform content from acce...
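Structured extraction and cleanup of content from public or user-authorized web pages, for collecting, cleaning, summarizing, or transforming it.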
agistack
Security & Compliance · clawhub · v1.0.0 · 1 version · 99877.8 · Key: not required
★ 0 stars · 📥 1,635 downloads · 💾 32 installs · #latest

Overview

Scraper

Turn messy public pages into clean, reusable data.

Core Purpose

Scraper is a safe extraction skill for public or user-authorized pages.

It helps the agent:

  • fetch page content from a URL
  • extract readable text
  • strip boilerplate where possible
  • save clean output locally
  • prepare content for later summarization or analysis
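The "extract readable text" and "strip boilerplate" steps could look roughly like the sketch below. It uses only the standard library, in line with the skill's no-external-packages requirement. The class name, the function name, and the set of skipped tags are assumptions made for illustration, not the actual logic of extract_text.py.

```python
from html.parser import HTMLParser


class ReadableTextExtractor(HTMLParser):
    """Collect visible text, skipping tags that rarely hold readable content."""

    # Assumed boilerplate tags; the real skill may use a different set.
    SKIP_TAGS = {"script", "style", "noscript", "nav", "footer"}

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self._skip_depth = 0  # > 0 while inside a skipped element
        self._chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIP_TAGS:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIP_TAGS and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        if self._skip_depth == 0:
            # Collapse runs of whitespace and drop empty fragments.
            text = " ".join(data.split())
            if text:
                self._chunks.append(text)

    def text(self):
        return "\n".join(self._chunks)


def html_to_text(html):
    """Return the readable text of an HTML string, one fragment per line."""
    parser = ReadableTextExtractor()
    parser.feed(html)
    parser.close()
    return parser.text()
```

A tag-based filter like this is deliberately simple: it removes obvious non-content elements but will not catch boilerplate that sits in ordinary div or p elements.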

Safety Boundaries

  • Only use on public or user-authorized pages
  • Do not bypass logins, paywalls, captchas, robots restrictions, or rate limits
  • Do not request or store credentials
  • Do not perform stealth scraping, account creation, or identity evasion
  • Save outputs locally only
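One way to honor the robots restriction before fetching is to consult the site's robots.txt with the standard library's urllib.robotparser. The sketch below assumes the robots.txt body has already been obtained as a string; the helper name is_allowed is hypothetical, and the listing does not say how the skill itself enforces this boundary.

```python
from urllib.robotparser import RobotFileParser


def is_allowed(robots_txt, url, user_agent="*"):
    """Return True if robots_txt permits user_agent to fetch url."""
    parser = RobotFileParser()
    # parse() takes an iterable of lines, so no network access happens here.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


rules = "User-agent: *\nDisallow: /private/\n"
print(is_allowed(rules, "https://example.com/private/page"))
print(is_allowed(rules, "https://example.com/public"))
```

A caller would skip any URL for which the check returns False rather than looking for a way around it.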

Runtime Requirements

  • Python 3 must be available as python3
  • No external packages required

Local Storage

All outputs are stored locally under:

  • ~/.openclaw/workspace/memory/scraper/jobs.json
  • ~/.openclaw/workspace/memory/scraper/output/
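A minimal sketch of how a job registry at these paths might work is shown below. The JSON layout (a list of objects with id, url, title, output, and saved_at fields), the output file naming, and the function name register_job are all assumptions; the listing only states the two paths.

```python
import json
import time
from pathlib import Path

# Default location taken from the paths listed above.
DEFAULT_BASE = Path.home() / ".openclaw" / "workspace" / "memory" / "scraper"


def register_job(url, title, text, base_dir=DEFAULT_BASE):
    """Write text under output/ and append a record to jobs.json."""
    base_dir = Path(base_dir)
    output_dir = base_dir / "output"
    output_dir.mkdir(parents=True, exist_ok=True)

    jobs_path = base_dir / "jobs.json"
    if jobs_path.exists():
        jobs = json.loads(jobs_path.read_text(encoding="utf-8"))
    else:
        jobs = []

    # Hypothetical naming scheme: sequential ids, one text file per job.
    job_id = len(jobs) + 1
    output_path = output_dir / f"job_{job_id:04d}.txt"
    output_path.write_text(text, encoding="utf-8")

    job = {
        "id": job_id,
        "url": url,
        "title": title,
        "output": str(output_path),
        "saved_at": time.strftime("%Y-%m-%dT%H:%M:%S"),
    }
    jobs.append(job)
    jobs_path.write_text(
        json.dumps(jobs, indent=2, ensure_ascii=False), encoding="utf-8"
    )
    return job
```

Passing a different base_dir keeps experiments out of the real workspace directory.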

Key Workflows

  • Capture a page: fetch_page.py --url "https://example.com"
  • Extract readable text: extract_text.py --url "https://example.com"
  • Save cleaned content: save_output.py --url "https://example.com" --title "Example"
  • List prior jobs: list_jobs.py
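The flags in the commands above suggest a conventional command-line interface. As a rough illustration, a script such as save_output.py could parse them with argparse as sketched here. Only the --url and --title flag names come from the listing; whether --title is optional, and the help strings, are assumptions.

```python
import argparse


def build_parser():
    """Build a parser for the flags shown in the workflow commands."""
    parser = argparse.ArgumentParser(
        prog="save_output.py",
        description="Save cleaned page content and register a job (sketch).",
    )
    parser.add_argument("--url", required=True, help="Page URL to process")
    # Assumption: the title is optional and defaults to None.
    parser.add_argument("--title", default=None, help="Label for the saved job")
    return parser


args = build_parser().parse_args(
    ["--url", "https://example.com", "--title", "Example"]
)
print(args.url, args.title)
```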

Scripts

| Script | Purpose |
| --- | --- |
| init_storage.py | Initialize scraper storage |
| fetch_page.py | Download a page with standard headers |
| extract_text.py | Convert HTML into cleaned plain text |
| save_output.py | Save extracted output and register a job |
| list_jobs.py | Show past scraping jobs |
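"Download a page with standard headers" can be done with urllib.request alone. The sketch below is one possible shape, not the contents of fetch_page.py: the header values, the function names, and the 10-second timeout are assumptions. It sends an honest, identifiable User-Agent rather than imitating a browser, in keeping with the skill's rule against stealth scraping.

```python
import urllib.request

# Hypothetical header values; the real script's headers are not documented.
HEADERS = {
    "User-Agent": "scraper-skill-sketch/0.1",
    "Accept": "text/html,application/xhtml+xml",
}


def build_request(url):
    """Create a GET request for an http(s) URL with the headers above."""
    if not url.startswith(("http://", "https://")):
        raise ValueError(f"unsupported URL scheme: {url!r}")
    return urllib.request.Request(url, headers=HEADERS)


def fetch_html(url, timeout=10.0):
    """Download url and decode the body using the declared charset."""
    with urllib.request.urlopen(build_request(url), timeout=timeout) as response:
        charset = response.headers.get_content_charset() or "utf-8"
        return response.read().decode(charset, errors="replace")
```

Separating build_request from fetch_html lets the headers and URL validation be checked without touching the network.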

Version History

1 version in total

  • v1.0.0 (current)
    2026-03-29 06:27 · Safe · Safe

Security Scan

Tencent Cloud Security (Keen)

Safe, no risks found

Tencent Cloud Security (Sanbu)

Safe, no risks found
