← 返回
未分类 Key

Firecrawl

Scrape, search, map, and crawl the web for AI agents via the Firecrawl API. Use when your agent needs clean markdown from JS-heavy or SPA sites, search resul...
通过 Firecrawl API 为 AI 代理抓取、搜索、映射和爬取网页,适用于需要从 JS 重度或 SPA 站点获取干净 Markdown 的场景。
dyagil
未分类 clawhub v1.0.0 1 版本 100000 Key: 需要
★ 0
Stars
📥 224
下载
💾 0
安装
1
版本
#latest

概述

Firecrawl Skill — Web Data for Agents

Principle

Firecrawl is a managed API for the three cases where a basic web_fetch agent tool falls short:

  1. JS-heavy sites — React/SPA pages that need a real browser to render.
  2. Search with full content — standard search APIs return only snippets; Firecrawl returns rendered markdown per result.
  3. Map / Crawl — list every URL on a site, mirror a docs tree, etc.

When to Use

TaskRecommended
------
Plain HTML articleweb_fetch (free, fast)
Quick searchbasic web search tool (free)
JS-heavy / SPA pagefc scrape
Search + full contentfc search --scrape
Full doc-site crawlfc crawl
URL inventory of a sitefc map
Page needing clicks / loginfc interact (REST only, see below)

Rule of thumb: if web_fetch returns empty markup or noise, escalate to fc scrape.

Commands

scrape — single page

fc scrape https://example.com                            # markdown to stdout
fc scrape https://example.com --format html --out a.html
fc scrape https://example.com --out ~/scraped/page.md

search

fc search "best running shoes 2026" --limit 5
fc search "topic" --limit 3 --scrape           # include full content of each hit
fc search "..." --out results.json

map — list URLs on a site

fc map https://docs.example.com --limit 200
fc map https://example.com --out urls.txt

crawl — deep crawl of many pages

fc crawl https://docs.example.com --limit 50 --out ~/docs-mirror/

⚠️ Takes minutes. Each page = 1 credit. Don't blast it on a tight free-tier budget.

ask / docs — debug and help

fc ask "why did my scrape return empty?" --jobId abc123
fc docs "how do I bypass cloudflare?"

status

fc status   # prints key prefix + a smoke-test scrape

Pricing (Remember This)

  • Free tier: 500 credits / month.
  • Each scrape or search result ≈ 1 credit.
  • Each crawl page = 1 credit.
  • Past the free tier you pay.

Before large crawl/map calls with high --limit, warn the user about credit cost.

Recommended Patterns

Pattern 1 — Regular article

# Try the free tool first.
# If it returns empty / unreadable:
fc scrape <url> --out /tmp/article.md

Pattern 2 — Research on a topic

fc search "<topic>" --limit 5 --scrape --out /tmp/research.json
# Then parse the JSON and pull what you need.

Pattern 3 — Whole docs site

fc map https://docs.example.com --limit 100    # get the URL list
# Pick relevant URLs and:
fc scrape <selected_urls>...
# Or in bulk:
fc crawl https://docs.example.com --limit 20 --out ~/mirror/

Credentials

  • Key: store at ~/.openclaw/credentials/firecrawl/api_key (chmod 600).
  • Base URL: https://api.firecrawl.dev/v2
  • Auth: Authorization: Bearer fc-...
  • CLI: ~/bin/fc → your local fc.cjs (Node CommonJS, zero dependencies).

Get a key at: https://firecrawl.dev (free tier available).

Direct REST (when the CLI isn't enough)

Example — interact (clicks / form-fill) which isn't wrapped in the CLI yet:

KEY=$(cat ~/.openclaw/credentials/firecrawl/api_key)
curl -X POST https://api.firecrawl.dev/v2/scrape \
  -H "Authorization: Bearer $KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "url": "https://example.com/login",
    "formats": ["markdown"],
    "actions": [
      {"type": "wait", "milliseconds": 1000},
      {"type": "click", "selector": "#login-button"},
      {"type": "write", "selector": "#email", "text": "..."}
    ]
  }'

Full docs: https://docs.firecrawl.dev

Don'ts

  • Don't crawl a whole site without --limit — credits burn fast.
  • Don't reach for Firecrawl when web_fetch works — wasteful.
  • Don't commit the API key to git.
  • Don't use Firecrawl for YouTube transcripts — use a dedicated transcription tool.

版本历史

共 1 个版本

  • v1.0.0 当前
    2026-05-21 15:44 安全 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

Oganim Deploy

dyagil
开发、部署并验证对 Vercel + Supabase 项目(营销网站、CRM 和客户门户)的更改。适用于代理需要发布代码时。
★ 0 📥 296

Services Watchdog

dyagil
设置基于 systemd 的看门狗,使 Node.js 长运行服务(如 Telegram 机器人、Express 仪表盘等)在 shell 退出、SSH 断开等情况下保持运行。
★ 0 📥 271

Supabase Security Audit

dyagil
审计 Supabase + Vercel 项目的 RLS 覆盖、权限提升、跨客户数据泄露、匿名暴露、魔法链接流程正确性以及 HTT 相关安全。
★ 0 📥 289