Scout

Structured OSINT research on people, companies, and organizations. Use when the user wants a provenance-backed brief, entity resolution across public sources...
indigokarasu
Uncategorized clawhub v2.3.0 2 versions 99882.8 Key: not required

Overview

Scout conducts lawful OSINT research on people, companies, and organizations, assembling provenance-backed briefs where every claim carries a source reference, retrieval timestamp, and direct quote. It works through a tiered source waterfall — public web first, then rate-limited registries, then paid databases only with explicit permission — collecting no more than the stated research goal requires.

When to use

  • Research a person and build a source-backed brief
  • Do background research on a company using public sources
  • Resolve whether two profiles are the same person with cited sources
  • Compile what is publicly knowable about a subject
  • Expand a quick lookup into an auditable brief

When not to use

  • Illegal intrusion into private systems
  • Credential theft or bypassing access controls
  • Covert surveillance
  • Speculative doxxing
  • Topic research without a person/org focus — use Sift

Responsibility boundary

Scout owns lawful OSINT research on people and organizations with provenance-backed output.

Scout does not own: general topic research (Sift), image processing (Look), knowledge graph writes (Elephas), social graph (Weave), communications (Dispatch).

Commands

  • scout.research.start — begin a new research request with subject and goal
  • scout.research.expand --tier <1|2|3> — escalate to a higher source tier
  • scout.brief.render — generate the final markdown brief with findings and sources
  • scout.brief.render_pdf — optional PDF brief generation
  • scout.status — return current research state
  • scout.journal — write journal for the current run; called at end of every run
  • scout.update — pull latest from GitHub source; preserves journals and data

Invariants

  1. Legality-first — only publicly available sources without bypassing access controls
  2. Minimization — collect only what the research goal requires
  3. Provenance for every claim — at least one source reference with URL, retrieval timestamp, and quote
  4. Paid sources require explicit permission — Tier 3 needs a recorded PermissionGrant
  5. No doxxing by default — private details suppressed unless explicitly permitted
  6. Uncertainty must be surfaced — incomplete identity resolution stated clearly
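
Invariant 3 can be illustrated with a small check. This is a minimal sketch, not Scout's actual implementation: the `SourceRef` and `Finding` classes and their field names are assumptions made for illustration, and the real schema lives in references/scout_schemas.md.

```python
from dataclasses import dataclass, field
from datetime import datetime


@dataclass
class SourceRef:
    # Hypothetical field names; see references/scout_schemas.md for the real ones.
    url: str
    retrieved_at: str  # assumed to be an ISO 8601 timestamp
    quote: str


@dataclass
class Finding:
    claim: str
    sources: list[SourceRef] = field(default_factory=list)


def has_provenance(finding: Finding) -> bool:
    """True if at least one source has a URL, a parseable timestamp, and a quote."""
    for src in finding.sources:
        if not (src.url.strip() and src.quote.strip()):
            continue
        try:
            datetime.fromisoformat(src.retrieved_at)
        except ValueError:
            continue
        return True
    return False
```

A finding that fails this check would be dropped or flagged rather than carried into the brief.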

Input contract

ResearchRequest requires: request_id, as_of, subject (type, name, aliases, known_locations, known_handles), goal, constraints (time_budget_minutes, minimize_pii).

Read references/scout_schemas.md for exact schema.
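
As a rough illustration of the contract above, the following sketch lists which required keys are missing from a raw request. The helper name `missing_fields` and the assumption that the request arrives as nested dicts are illustrative only; value types and formats are defined in references/scout_schemas.md, not here.

```python
# Required keys taken from the input contract; "" denotes the top level.
REQUIRED = {
    "": ["request_id", "as_of", "subject", "goal", "constraints"],
    "subject": ["type", "name", "aliases", "known_locations", "known_handles"],
    "constraints": ["time_budget_minutes", "minimize_pii"],
}


def missing_fields(raw: dict) -> list[str]:
    """Return dotted names of required keys absent from a raw request dict."""
    missing = []
    for parent, keys in REQUIRED.items():
        # Nested sections are assumed to be dicts; a missing section reports all its keys.
        scope = raw if parent == "" else raw.get(parent, {})
        for key in keys:
            if key not in scope:
                missing.append(f"{parent}.{key}" if parent else key)
    return missing
```

An empty result means the request has the required shape; it says nothing about whether the values themselves are valid.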

Research workflow

  1. Normalize request and subject identity inputs
  2. Resolve likely identity matches conservatively
  3. Run Tier 1 public-source collection
  4. Record provenance for every retained claim
  5. Compile preliminary findings with confidence levels
  6. Escalate to Tier 2 only if enabled and useful
  7. Escalate to Tier 3 only after explicit permission grant is recorded
  8. Generate brief with findings, uncertainty, and source log
  9. Store request, findings, sources, and decisions locally
  10. Emit Signal files for confirmed entities and relationships to ~/openclaw/db/ocas-elephas/intake/{signal_id}.signal.json. Use Signal schema from spec-ocas-shared-schemas.md. One file per entity or relationship with sufficient confidence.
  11. Write journal via scout.journal

When minimize_pii=true, suppress unnecessary sensitive details in the final brief.

Source waterfall

Read references/scout_source_waterfall.md for full tier logic.

  • Tier 1 — public web, official sites, news, filings, public social profiles. Automatic.
  • Tier 2 — rate-limited sources, registries, extended datasets. Only if enabled and useful.
  • Tier 3 — paid OSINT providers, background databases. Requires explicit permission grant.
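
The tier rules above could be gated roughly as follows. This is a sketch under stated assumptions: it reads the `waterfall.enabled_tiers` and `paid_sources.enabled` keys from the default config.json, and it assumes Tier 3 needs both the paid-sources switch and a recorded PermissionGrant. The authoritative logic is in references/scout_source_waterfall.md.

```python
def allowed_tiers(config: dict, permission_granted: bool) -> list[int]:
    """Return the tiers a run may use, in escalation order."""
    enabled = set(config.get("waterfall", {}).get("enabled_tiers", [1]))
    paid_enabled = config.get("paid_sources", {}).get("enabled", False)

    tiers = [1]  # Tier 1 is described as automatic.
    if 2 in enabled:
        tiers.append(2)
    # Assumption: Tier 3 requires the config switch AND an explicit grant.
    if paid_enabled and permission_granted:
        tiers.append(3)
    return tiers
```

With the default config, a permission grant alone does not unlock Tier 3 in this sketch, because `paid_sources.enabled` is false.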

Output requirements

Markdown brief with: Executive Summary, Identity Resolution Notes, Findings, Risk and Uncertainty, Source Log. Every finding carries source-backed provenance.
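
A skeleton renderer for the five required sections might look like the sketch below. The heading levels, the placeholder text, and the `render_brief` name are assumptions; the real layout is whatever references/scout_brief_template.md specifies.

```python
SECTIONS = [
    "Executive Summary",
    "Identity Resolution Notes",
    "Findings",
    "Risk and Uncertainty",
    "Source Log",
]


def render_brief(title: str, bodies: dict[str, str]) -> str:
    """Render a markdown brief with every required section, in order."""
    lines = [f"# {title}", ""]
    for section in SECTIONS:
        # Emit a visible placeholder rather than silently dropping a section.
        body = bodies.get(section, "_No content recorded._")
        lines += [f"## {section}", "", body, ""]
    return "\n".join(lines)
```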

Inter-skill interfaces

Scout writes Signal files to Elephas intake: ~/openclaw/db/ocas-elephas/intake/{signal_id}.signal.json

Emit one Signal file per confirmed entity or high-confidence relationship discovered during research. Use the Signal schema from spec-ocas-shared-schemas.md. Elephas decides promotion.

See spec-ocas-interfaces.md for signal format.
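
Writing a Signal file to the intake directory could be sketched as below. The payload fields are placeholders, since the Signal schema is defined in the spec files and not reproduced here. Generating the ID with `uuid4` and writing through a temporary file followed by a rename are design assumptions, chosen so that a reader of the intake directory never sees a half-written file.

```python
import json
import os
import uuid
from pathlib import Path


def write_signal(intake_dir: Path, payload: dict) -> Path:
    """Write one Signal file named {signal_id}.signal.json and return its path."""
    signal_id = uuid.uuid4().hex  # assumed ID format
    intake_dir.mkdir(parents=True, exist_ok=True)
    target = intake_dir / f"{signal_id}.signal.json"
    tmp = intake_dir / f"{signal_id}.tmp"
    # Write to a temporary name first, then rename into place.
    tmp.write_text(json.dumps({"signal_id": signal_id, **payload}, indent=2), encoding="utf-8")
    os.replace(tmp, target)
    return target
```

In practice `intake_dir` would be `Path.home() / "openclaw" / "db" / "ocas-elephas" / "intake"`.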

Storage layout

~/openclaw/data/ocas-scout/
  config.json
  requests.jsonl
  sources.jsonl
  findings.jsonl
  decisions.jsonl
  briefs/
  reports/

~/openclaw/journals/ocas-scout/
  YYYY-MM-DD/
    {run_id}.json
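
For illustration, the journal path in the layout above can be built like this. The `journal_path` helper is hypothetical, and the sketch assumes the date directory uses the run's local calendar date.

```python
from datetime import date
from pathlib import Path

DEFAULT_ROOT = Path.home() / "openclaw" / "journals" / "ocas-scout"


def journal_path(run_id: str, day: date, root: Path = DEFAULT_ROOT) -> Path:
    """Return root/YYYY-MM-DD/{run_id}.json for a given run."""
    return root / day.isoformat() / f"{run_id}.json"
```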

Default config.json:

{
  "skill_id": "ocas-scout",
  "skill_version": "2.3.0",
  "config_version": "1",
  "created_at": "",
  "updated_at": "",
  "waterfall": {
    "enabled_tiers": [1, 2]
  },
  "paid_sources": {
    "enabled": false
  },
  "brief": {
    "format": "markdown"
  },
  "retention": {
    "days": 90,
    "max_records": 10000
  }
}

OKRs

Universal OKRs from spec-ocas-journal.md apply to all runs.

skill_okrs:
  - name: verified_claim_ratio
    metric: fraction of findings with at least one verified source reference
    direction: maximize
    target: 0.70
    evaluation_window: 30_runs
  - name: entity_resolution_accuracy
    metric: fraction of identity resolutions confirmed correct
    direction: maximize
    target: 0.90
    evaluation_window: 30_runs
  - name: source_diversity
    metric: median unique source domains per brief
    direction: maximize
    target: 6
    evaluation_window: 30_runs
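
Two of these metrics are simple enough to sketch. The record shapes are assumptions: a finding is taken to be a dict with a `sources` list, a brief is taken to be a list of source URLs, and any non-empty `sources` list counts as verified, which may be looser than what Scout actually requires.

```python
from statistics import median
from urllib.parse import urlparse


def verified_claim_ratio(findings: list[dict]) -> float:
    """Fraction of findings with at least one source reference."""
    if not findings:
        return 0.0
    verified = sum(1 for f in findings if f.get("sources"))
    return verified / len(findings)


def source_diversity(briefs: list[list[str]]) -> float:
    """Median number of unique source domains per brief."""
    counts = [len({urlparse(url).netloc.lower() for url in urls}) for urls in briefs]
    return median(counts) if counts else 0.0
```

Both values would then be compared against the targets above (0.70 and 6) over the 30-run evaluation window.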

Optional skill cooperation

  • Weave — read social graph (read-only) for identity context
  • Elephas — optionally emit Signal files for Chronicle promotion
  • Sift — may use Sift for web searches during research

Journal outputs

  • Observation Journal — research runs producing findings
  • Research Journal — structured multi-source research sessions

Visibility

public

Initialization

On first invocation of any Scout command, run scout.init:

  1. Create ~/openclaw/data/ocas-scout/ and all subdirectories (briefs/, reports/)
  2. Write default config.json with ConfigBase fields if absent
  3. Create empty JSONL files: requests.jsonl, sources.jsonl, findings.jsonl, decisions.jsonl
  4. Create ~/openclaw/journals/ocas-scout/
  5. Ensure ~/openclaw/db/ocas-elephas/intake/ exists (create if missing)
  6. Register cron job scout:update if not already present (check openclaw cron list first)
  7. Log initialization as a DecisionRecord in decisions.jsonl
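
Steps 1 to 5 above amount to an idempotent directory and file setup, sketched below. The `init_storage` function and its `root` parameter (standing in for `~/openclaw`) are illustrative assumptions; cron registration and the DecisionRecord log entry (steps 6 and 7) are left out.

```python
import json
from pathlib import Path

JSONL_FILES = ["requests.jsonl", "sources.jsonl", "findings.jsonl", "decisions.jsonl"]


def init_storage(root: Path, default_config: dict) -> list[Path]:
    """Create the Scout layout under root; return only the paths newly created."""
    created = []
    data = root / "data" / "ocas-scout"
    dirs = [
        data / "briefs",
        data / "reports",
        root / "journals" / "ocas-scout",
        root / "db" / "ocas-elephas" / "intake",
    ]
    for d in dirs:
        if not d.exists():
            d.mkdir(parents=True)
            created.append(d)

    config = data / "config.json"
    if not config.exists():  # never overwrite an existing config
        config.write_text(json.dumps(default_config, indent=2), encoding="utf-8")
        created.append(config)

    for name in JSONL_FILES:
        f = data / name
        if not f.exists():
            f.touch()
            created.append(f)
    return created
```

Running it a second time creates nothing and returns an empty list, which matches the "if absent" wording in step 2.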

Background tasks

| Job name | Mechanism | Schedule | Command |
|---|---|---|---|
| scout:update | cron | 0 0 * * * (midnight daily) | scout.update |

openclaw cron add --name scout:update --schedule "0 0 * * *" --command "scout.update" --sessionTarget isolated --lightContext true --timezone America/Los_Angeles

Self-update

scout.update pulls the latest package from the source: URL in this file's frontmatter. Runs silently — no output unless the version changed or an error occurred.

  1. Read source: from frontmatter → extract {owner}/{repo} from URL
  2. Read local version from skill.json
  3. Fetch remote version: gh api "repos/{owner}/{repo}/contents/skill.json" --jq '.content' | base64 -d | python3 -c "import sys,json;print(json.load(sys.stdin)['version'])"
  4. If remote version equals local version → stop silently
  5. Download and install:

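```bash
# Use a throwaway working directory. It is deliberately not named TMPDIR:
# mktemp and other tools read that variable, and it would dangle after cleanup.
WORKDIR=$(mktemp -d)

# Download the main-branch tarball; {owner}/{repo} come from step 1.
gh api "repos/{owner}/{repo}/tarball/main" > "$WORKDIR/archive.tar.gz"

# Unpack, dropping the top-level directory that wraps the archive contents.
mkdir "$WORKDIR/extracted"
tar xzf "$WORKDIR/archive.tar.gz" -C "$WORKDIR/extracted" --strip-components=1

# Copy the new package into the current directory, then clean up.
cp -R "$WORKDIR/extracted/"* ./
rm -rf "$WORKDIR"
```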

  6. On failure → retry once. If second attempt fails, report the error and stop.
  7. Output exactly: I updated Scout from version {old} to {new}
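
The version comparison in steps 3 and 4 and the final message can be sketched in Python. The function names are hypothetical; the sketch assumes `skill.json` carries a top-level `version` string, as the one-liner in step 3 implies, and that the input is the base64 `content` field returned for the file.

```python
import base64
import json
from typing import Optional


def remote_version(content_b64: str) -> str:
    """Decode the base64-encoded skill.json content and return its version."""
    return json.loads(base64.b64decode(content_b64))["version"]


def update_message(local: str, remote: str) -> Optional[str]:
    """Return the success message if versions differ, else None (stop silently)."""
    if local == remote:
        return None
    return f"I updated Scout from version {local} to {remote}"
```

The message should only be printed once the download and install step has actually succeeded.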

Support file map

| File | When to read |
|---|---|
| references/scout_schemas.md | Before creating requests, findings, or briefs |
| references/scout_source_waterfall.md | Before tier selection or escalation decisions |
| references/scout_brief_template.md | Before rendering briefs |
| references/journal.md | Before scout.journal; at end of every run |

Version history

2 versions in total

  • v2.3.0 (current)
    2026-05-01 04:47 Safe Safe
  • v2.0.0
    2026-03-29 23:29 Safe

Security check

Tencent Cloud Security (Keen)

Safe, no risk

Tencent Cloud Security (Sanbu)

Safe, no risk
