← 返回
内容创作 中文

Safe-Web

Securely fetch and search web content by scanning and blocking prompt injection threats using PromptGuard before returning results.
利用 PromptGuard 扫描并拦截提示注入威胁,安全获取和搜索网页内容后返回结果。
adamnaghs
内容创作 clawhub v1.0.8 1 版本 99845.7 Key: 无需
★ 1
Stars
📥 1,921
下载
💾 71
安装
1
版本
#latest

概述

safe-web

Secure web fetch and search with PromptGuard scanning.

Status

✅ Working

Purpose

Protects against prompt injection attacks hidden in web content before returning it to the AI. Wraps web fetching and searching with security scanning.

Installation

Requires PromptGuard and Python dependencies:

# Install PromptGuard first
cd /home/linuxbrew/.openclaw/workspace/skills/prompt-guard
pip3 install --break-system-packages -e .

# Install web dependencies (if not present)
pip3 install --break-system-packages requests beautifulsoup4

Usage

Fetch Command

Fetch a URL and scan the content:

# Basic fetch
safe-web fetch https://example.com/article

# Save to file
safe-web fetch https://example.com --output article.txt

# JSON output for automation
safe-web fetch https://example.com --json

# Strict mode (block on MEDIUM)
safe-web fetch https://example.com --strict

Search Command

Search the web and scan results:

# Basic search
safe-web search "AI safety research"

# More results
safe-web search "stock market news" --count 10

# JSON output
safe-web search "machine learning" --json

Exit Codes

CodeMeaning
---------------
0Success - content/results are clean
1Error (network, parsing, etc.)
2Threat detected - content blocked

Configuration

Environment Variables

  • BRAVE_API_KEY - API key for Brave Search (optional, enables search command)
  • Get one at: https://brave.com/search/api/

Symlink (Recommended)

Create a system-wide symlink so safe-web works from any directory:

sudo ln -s /home/linuxbrew/.openclaw/workspace/skills/safe-web/scripts/safe-web.py /usr/local/bin/safe-web

After creating the symlink, you can use safe-web directly without specifying the full path.

How It Works

Fetch Flow

  1. Downloads URL content with requests
  2. Extracts text using BeautifulSoup (removes scripts, styles)
  3. Scans extracted text with PromptGuard
  4. Returns clean content or blocks with SHIELD report

Search Flow

  1. Queries Brave Search API (requires API key)
  2. Scans each result title and description
  3. Filters out suspicious results
  4. Returns only clean results

Security Model

Fail-closed: If PromptGuard cannot be loaded or scanning fails, the tool reports an error rather than returning unverified content.

Content sanitization: HTML is parsed and scripts/styles are removed before scanning to reduce false positives.

No execution: This tool only fetches and scans. It never executes JavaScript or runs commands found in web content.

Example Output

Clean Fetch

Fetching: https://site.com/article
Fetched 1523 characters
Scanning with PromptGuard...

Article content here...

Blocked Content

Fetching: https://suspicious-site.com
Fetched 2048 characters
Scanning with PromptGuard...
============================================================
🛡️  SAFE-WEB SECURITY ALERT
============================================================
Source: https://suspicious-site.com
Severity: CRITICAL
Action: BLOCK_NOTIFY
Patterns Matched: 8

Detected Patterns:
  - instruction_override_en
  - role_manipulation_en
  - system_impersonation_en
============================================================

Content from https://suspicious-site.com has been blocked.

Search Results

Searching: AI research
Found 5 results, scanning...

Showing 3 clean results:

1. Latest AI Research Papers
   URL: https://arxiv.org/list/ai/recent
   Recent submissions in artificial intelligence...

2. AI Safety Institute
   URL: https://www.safe.ai/
   Research and development for safe AI systems...

When to Use

Use safe-web when:

  • Fetching content from untrusted URLs
  • Scraping web pages for analysis
  • Searching and processing web results
  • Any web content will enter the AI context window

Use standard web_fetch/web_search tools only for:

  • Trusted, known-safe domains
  • Internal documentation sites
  • When you explicitly want to bypass scanning

Comparison with Native Tools

FeatureNative web_fetchsafe-web fetch
----------------------------------------------
Fetches HTML
Extracts text
Injection scanning
JSON output
Save to file
Exit codes0/10/1/2 (security)

Dependencies

  • Python 3.8+
  • PromptGuard 3.1.0+ (installed in workspace)
  • requests
  • beautifulsoup4
  • Brave Search API key (for search command)

Limitations

  • Search requires Brave API key (free tier available)
  • Fetch does not execute JavaScript (static HTML only)
  • Large pages may be truncated during text extraction
  • Network timeouts default to 30 seconds

版本历史

共 1 个版本

  • v1.0.8 当前
    2026-03-28 19:35 安全 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

content-creation

Baidu Wenku AIPPT

ide-rea
使用百度文库 AI 智能生成 PPT,自动根据内容选择模板。
★ 66 📥 46,167
content-creation

Humanizer

biostartechnology
消除AI写作痕迹,使文本更自然真实。基于维基百科"AI写作特征"指南,识别并修正夸张象征、宣传用语、肤浅-ing分析、模糊归因、破折号滥用、三项排比、AI词汇、负面平行结构及冗长连接词等模式。
★ 860 📥 199,582
content-creation

AdMapix

fly0pants
广告情报与应用数据分析助手,支持搜索广告素材、分析应用排名、下载量、收入及市场洞察,用于广告素材和竞品分析。
★ 295 📥 136,456