← 返回
未分类 Key 中文

smart-search

Intelligent web search routing across Gemini and Brave APIs with quota management, circuit breaker, and web_fetch fallback. Routes finance queries to Gemini,...
智能网页搜索路由,跨接Gemini和Brave API,具备配额管理、断路器和web_fetch备用方案。将金融查询路由至Gemini,...
airaalfredsf airaalfredsf 来源
未分类 clawhub v1.0.0 1 版本 100000 Key: 需要
★ 0
Stars
📥 499
下载
💾 0
安装
1
版本
#latest

概述

smart-search

1. WHAT THIS SKILL DOES

This skill routes search queries to the best available provider, manages daily API quota per agent, logs all searches, and degrades gracefully through web scraping when APIs are unavailable. It supports two execution strategies: scheduled agents that prioritise API calls for reliable results, and a general agent that preserves API quota by trying web scraping first. All quota state is persisted to a shared JSON file so multiple agents can coordinate without overrunning daily limits.


2. WHEN TO INVOKE THIS SKILL

Use this skill whenever an agent needs to search the web for information — news, research, stock data, general queries, or any topic requiring up-to-date results.

This skill exposes three tools:

ToolPurpose
------
smart_searchExecute a web search and return results
search_quota_statusCheck remaining quota for an agent or the full system
search_mark_agent_completeRelease unused quota to the shared pool when an agent finishes

3. TOOL REFERENCE

smart_search

Executes a search query using the best available provider for the given agent.

Parameters:

ParameterTypeRequiredDescription
------------
querystringYesThe search query. Max 1000 characters. No control characters.
agent_idstringYesLowercase letters, numbers, and hyphens only (e.g. stock-analysis). Use general for manual/chat searches.
force_providerstringNoOverride routing. Accepts "gemini" or "brave" only.

Return shape:

{
  "results": "<string or array>",
  "provider_used": "gemini | brave | web_fetch",
  "queries_remaining": 12,
  "quota_source": "agent_allocation | shared_pool | provider_direct | none",
  "fallback_used": false,
  "warning": null,
  "web_fetch_engine": null,
  "error": null
}

On failure results is null and error contains the reason.


search_quota_status

Returns quota information for a specific agent or the full quota file.

Parameters:

ParameterTypeRequiredDescription
------------
agent_idstringNoIf omitted, returns the full quota object.

Return shape (single agent):

{
  "agent_id": "stock-analysis",
  "gemini": { "allocated": 15, "used": 3, "remaining": 12 },
  "brave":  { "allocated": 0,  "used": 0, "remaining": 0  },
  "completed": false
}

search_mark_agent_complete

Marks an agent as done for the day and releases any unused allocation to the shared pool for other agents to use.

Parameters:

ParameterTypeRequiredDescription
------------
agent_idstringYesThe agent to mark complete.

Return shape:

{
  "agent_id": "tech-news",
  "released": { "gemini": 7, "brave": 0 },
  "message": "Agent marked complete. 7 Gemini calls released to shared pool."
}

4. TWO EXECUTION STRATEGIES

Strategy A — Scheduled agents (any agent_id except general)

Used by automated agents with a known workload. Prioritises API quality; uses web scraping only as a last resort.

TierProviderCondition
---------
1Primary API (Gemini or Brave)Agent has allocation remaining
2Fallback API (opposite provider)Primary exhausted or failed
3DuckDuckGo web_fetchBoth APIs unavailable — blocked for finance queries
4ErrorAll providers exhausted

Strategy B — General agent (agent_id: "general")

Used for manual chat or ad-hoc queries. Preserves API quota by trying web scraping first. Finance queries are not blocked from web_fetch in this strategy.

TierProviderCondition
---------
1Google web_fetchAlways attempted first
2Bing web_fetchGoogle failed
3API (Gemini or Brave)Both web_fetch failed — quota consumed only here
4ErrorAll providers exhausted

5. PROVIDER ROUTING RULES

Routing is automatic based on query keywords. force_provider overrides all routing.

Routes to Gemini:

Keyword
stock, share price, nse, bse, wse, nyse, nasdaq
earnings, ipo, dividend, funding round, acquisition
merger, valuation, competitor, market share
breaking, just announced, today, hours ago
spearfox, fifthspear

Routes to Brave:

Keyword
space, astronomy, nasa, esa, cosmos

If no keywords match, the query routes to the configured default_provider (defaults to brave). Gemini always wins for finance queries regardless of other keywords.

Keyword lists are configurable in openclaw.json via finance_keywords and brave_keywords arrays.


6. WEB_FETCH ENGINES

EngineUsed byRole
---------
GoogleStrategy B (general)Tier 1 — always tried first
BingStrategy B (general)Tier 2 — tried if Google fails
DuckDuckGoStrategy A (scheduled)Last resort — blocked for finance queries

All engines enforce a 2-second minimum delay between consecutive calls to avoid rate-limiting. Responses are capped at 10,000 characters to protect the model context window. Prompt injection patterns are stripped from all results before they are returned.


7. QUOTA SYSTEM

Each provider has a daily limit set in openclaw.json. Each agent has its own allocation drawn from that pool. Quota is tracked separately per provider (gemini_used, brave_used).

Deduction flow for a single search call:

  1. Agent allocation — deducted first if the agent has remaining quota for the provider
  2. Shared pool — used if the agent allocation is exhausted but other agents have released unused quota
  3. Provider direct — used if the shared pool is empty but the provider daily limit has remaining headroom
  4. Fallback provider — if all sources for the primary provider are exhausted, the opposite provider is tried through the same flow
  5. web_fetch — used if both APIs are unavailable (scheduled agents only; blocked for finance queries)
  6. Error — returned if all paths are exhausted

Config owns the ceilings (daily limits, per-agent allocations). The quota file owns the live counters. Changes to openclaw.json take effect on the next call without restart.

General agent quota is only consumed when both web_fetch attempts fail.


8. CONFIG-DRIVEN BEHAVIOUR

All allocations, limits, and behaviour flags live in openclaw.json under skills.smart-search. Changes take effect on the next skill invocation with no restart required.

{
  "skills": {
    "smart-search": {
      "gemini_model": "gemini-2.0-flash",
      "default_provider": "brave",
      "strict_agents": false,
      "retry_max_attempts": 3,
      "retry_base_delay": 500,
      "finance_keywords": ["stock", "earnings"],
      "brave_keywords": ["space", "astronomy"],
      "providers": {
        "gemini": { "daily_limit": 1500 },
        "brave":  { "daily_limit": 66 }
      },
      "agent_allocations": {
        "stock-analysis": { "gemini": 15, "brave": 0 },
        "general":        { "gemini": 20, "brave": 5 }
      }
    }
  }
}
KeyDefaultEffect
---------
gemini_modelgemini-2.0-flashGemini model used for all API calls
default_providerbraveFallback provider when no keywords match
strict_agentsfalseWhen true, rejects agent IDs not defined in config
retry_max_attempts3Maximum API retry attempts per call
retry_base_delay500Base delay in ms for exponential backoff
finance_keywordsBuilt-in listKeywords that force Gemini routing
brave_keywordsBuilt-in listKeywords that force Brave routing

9. API KEY SECURITY

API keys are read from the env block in openclaw.json and held in memory only for the duration of the call. They are never logged to any file, never included in any error message, and never returned in any response object. Keys are not written to disk by this skill.


10. MANUAL CHAT USAGE

For manual or chat-initiated searches, OpenClaw must pass agent_id: "general". The skill automatically uses web_fetch first for this agent, so API quota is only consumed if both Google and Bing scraping fail. This is an OpenClaw configuration responsibility — the skill does not infer the agent from context.


11. LOG ROTATION

Each day's searches are written to a .jsonl file in the logs/ subdirectory alongside the quota file:

~/.openclaw/workspace/shared/logs/search-YYYY-MM-DD.jsonl

Each line is a JSON object with: timestamp, agent_id, query, provider_used, force_provider, quota_source, queries_remaining, success, response_summary (capped at 500 chars), error, fallback_used, web_fetch_engine.

Log files older than 30 days are deleted automatically based on the filename date, not the filesystem modification time.


12. ERROR REFERENCE

ErrorCauseResolution
---------
query is required and must be a non-empty string.Missing or blank queryPass a non-empty string
query exceeds maximum length of 1000 characters.Query too longShorten the query
query contains invalid control characters.Null bytes or control chars in queryStrip control characters before calling
agent_id is required and must be a non-empty string.Missing agent_idPass a valid agent_id
agent_id must contain only lowercase letters, numbers, and hyphens.Invalid characters in agent_idUse only [a-z0-9-]
Unknown agent: "...". Add it to openclaw.json to use it.strict_agents is true, agent not in configAdd agent to config or disable strict_agents
Invalid force_provider: "...". Use "gemini" or "brave".Unknown provider nameUse gemini or brave only
Perplexity is not yet implemented.force_provider set to perplexityUse gemini or brave
GEMINI_API_KEY is not configured in openclaw.json.Missing API keyAdd key to openclaw.json env block
Brave Search is not configured. Add BRAVE_API_KEY to openclaw.json to enable it.Missing API keyAdd key to openclaw.json env block
Gemini model ... is unavailable.Model name incorrect or deprecatedUpdate gemini_model in config
Gemini API error: 4xxBad request or auth failureCheck API key and query
Brave API error: 4xxBad request or auth failureCheck API key
Gemini returned an empty response.API responded but returned no textRetry or check query
Brave returned no results.API returned empty results arrayTry a different query
Finance and time-sensitive queries require Gemini or Brave API.Finance query hit the web_fetch fallback blockEnsure API quota is available for finance queries
All search providers exhausted or unavailable.All tiers failedCheck API keys, quota, and network
... circuit is open. Try again later.Provider hit 3 consecutive failuresWait 60 seconds for circuit to reset
Quota file is locked. Try again.Concurrent write contentionRetry the call
Cannot read openclaw.json. Check OPENCLAW_CONFIG_PATH.Config file missing or unreadableCheck file path and permissions
Invalid JSON input.stdin payload was not valid JSONEnsure the harness sends valid JSON
Unknown tool: ...Tool name not recognisedUse smart_search, search_quota_status, or search_mark_agent_complete
Agent ... not found in quota.agent_id not in quota fileCall smart_search first to register the agent
Agent ... is already marked complete for today.Double-complete callNo action needed; agent is already released

13. EXAMPLES

Example 1 — Manual chat finance query, Google web_fetch succeeds

{ "tool": "smart_search", "args": { "query": "today AAPL stock price", "agent_id": "general" } }

Strategy B: Google web_fetch is tried first. Succeeds. Zero API quota consumed.

{
  "results": "<html content from Google>",
  "provider_used": "web_fetch",
  "queries_remaining": 20,
  "quota_source": "none",
  "fallback_used": true,
  "warning": "Result retrieved via web scraping. Quality may be lower than API results.",
  "web_fetch_engine": "google"
}

Example 2 — Manual chat finance query, both web_fetch fail, API fallback triggered

{ "tool": "smart_search", "args": { "query": "today AAPL stock price", "agent_id": "general" } }

Strategy B: Google fails, Bing fails. Gemini API is called (finance keyword routes to Gemini). Quota consumed.

{
  "results": "Apple (AAPL) is trading at $189.42...",
  "provider_used": "gemini",
  "queries_remaining": 19,
  "quota_source": "agent_allocation",
  "fallback_used": false,
  "warning": null,
  "web_fetch_engine": null
}

Example 3 — Manual chat general query, Google web_fetch succeeds

{ "tool": "smart_search", "args": { "query": "best noise-cancelling headphones 2025", "agent_id": "general" } }

Strategy B: No finance keywords. Google web_fetch tried first. Succeeds. Zero API quota consumed.

{
  "results": "<html content from Google>",
  "provider_used": "web_fetch",
  "queries_remaining": 20,
  "quota_source": "none",
  "fallback_used": true,
  "warning": "Result retrieved via web scraping. Quality may be lower than API results.",
  "web_fetch_engine": "google"
}

Example 4 — Scheduled agent finance query, Gemini API, agent has allocation

{ "tool": "smart_search", "args": { "query": "NSE market open today", "agent_id": "stock-analysis" } }

Strategy A: Finance keyword routes to Gemini. Agent has 12 remaining. API call succeeds.

{
  "results": "The NSE opened at 22,450 points today...",
  "provider_used": "gemini",
  "queries_remaining": 11,
  "quota_source": "agent_allocation",
  "fallback_used": false,
  "warning": null,
  "web_fetch_engine": null
}

Example 5 — Scheduled agent, both APIs exhausted, finance query → hard error

{ "tool": "smart_search", "args": { "query": "NSE market open today", "agent_id": "stock-analysis" } }

Strategy A: Gemini exhausted, Brave exhausted. Finance keyword blocks DuckDuckGo fallback.

{
  "results": null,
  "provider_used": null,
  "queries_remaining": 0,
  "quota_source": "none",
  "fallback_used": false,
  "warning": null,
  "web_fetch_engine": null,
  "error": "Finance and time-sensitive queries require Gemini or Brave API. web_fetch fallback is not permitted for this query type."
}

Example 6 — Scheduled agent, both APIs exhausted, general query → DuckDuckGo web_fetch

{ "tool": "smart_search", "args": { "query": "latest AI research papers", "agent_id": "tech-news" } }

Strategy A: Gemini exhausted, Brave exhausted. No finance keywords — DuckDuckGo web_fetch is permitted.

{
  "results": "<html content from DuckDuckGo>",
  "provider_used": "web_fetch",
  "queries_remaining": 0,
  "quota_source": "none",
  "fallback_used": true,
  "warning": "Result retrieved via web scraping. Quality may be lower than API results.",
  "web_fetch_engine": "duckduckgo"
}

版本历史

共 1 个版本

  • v1.0.0 当前
    2026-05-02 12:05 安全 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

dev-programming

Github

steipete
使用 `gh` CLI 与 GitHub 交互,通过 `gh issue`、`gh pr`、`gh run` 和 `gh api` 管理议题、PR、CI 运行及高级查询。
★ 686 📥 330,936
dev-programming

YouTube

byungkyu
使用托管OAuth集成YouTube Data API,支持搜索视频、管理播放列表、获取频道数据及评论互动,适用于用户需要时使用此技能。
★ 142 📥 42,095
dev-programming

Mcporter

steipete
使用 mcporter CLI 直接列出、配置、认证及调用 MCP 服务器/工具(支持 HTTP 或 stdio),涵盖临时服务器、配置编辑及 CLI/类型生成功能。
★ 198 📥 68,200