
OpenAI Deep Research Skill

Execute multi-step deep research with the OpenAI Responses API, including question decomposition, evidence gathering with web search, contradiction tracking,...
Author: guanglechen
Source: clawhub · Version: v0.1.0 · API key: required

OpenAI Deep Research

Overview

Run a deterministic research workflow that separates planning, evidence collection, and report synthesis.

Generate reusable research artifacts under an output directory for auditability and iteration.

Workflow

  1. Define research scope.
  2. Run the script to generate plan, findings, and report artifacts.
  3. Evaluate report quality with the checklist.
  4. Rerun with adjusted depth/model/tool settings when gaps remain.

Quick Start

Install dependencies:

cd openai-deep-research-skill
python3 -m pip install -r scripts/requirements.txt

Run a real research job:

python3 scripts/deep_research.py "中国AI Agent市场2026年商业化路径" \
  --language zh-CN \
  --depth 6 \
  --research-depth deep \
  --max-total-output-tokens 20000 \
  --parallel 3

Run a local dry-run without API calls:

python3 scripts/deep_research.py "sample topic" --dry-run
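
Step 4 of the workflow (rerunning with adjusted settings) can also be driven from Python. The sketch below only assembles the command line from flags documented on this page; `build_command` and `run_research` are hypothetical helper names, not part of the skill.

```python
import subprocess


def build_command(topic, depth=None, research_depth=None, dry_run=False):
    """Assemble an argv list for scripts/deep_research.py (hypothetical helper)."""
    command = ["python3", "scripts/deep_research.py", topic]
    if depth is not None:
        command += ["--depth", str(depth)]
    if research_depth is not None:
        command += ["--research-depth", research_depth]
    if dry_run:
        command.append("--dry-run")
    return command


def run_research(topic, **options):
    """Run the script and raise CalledProcessError on a non-zero exit code."""
    return subprocess.run(build_command(topic, **options), check=True)
```

For example, `run_research("sample topic", dry_run=True)` would invoke the dry-run shown above, and a later call with a different `depth` would repeat the job with adjusted breadth.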

Runtime Inputs

Set OPENAI_API_KEY before running real jobs.

Use OPENAI_BASE_URL only when routing through a compatible gateway.
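
A pre-flight check for these two variables might look like the sketch below. It is an illustration only: `resolve_openai_env` is a hypothetical name, and how the script itself reads its environment is not documented here.

```python
import os


def resolve_openai_env(environ=None):
    """Return the API key and optional base URL, or fail early (hypothetical helper)."""
    env = os.environ if environ is None else environ
    api_key = env.get("OPENAI_API_KEY", "").strip()
    if not api_key:
        raise RuntimeError(
            "OPENAI_API_KEY is not set; real jobs need it (use --dry-run otherwise)."
        )
    # OPENAI_BASE_URL is optional: None means the default endpoint is used.
    base_url = env.get("OPENAI_BASE_URL", "").strip() or None
    return {"api_key": api_key, "base_url": base_url}
```

Passing a plain dictionary instead of `os.environ` keeps the check easy to exercise without touching the real environment.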

Tune key flags:

  • --depth: Control breadth of decomposition (2-12).
  • --research-depth: Control per-question evidence depth (shallow|standard|deep).
  • --parallel: Control concurrent evidence runs (1-8).
  • --planner-model: Choose planning model.
  • --research-model: Choose evidence model.
  • --writer-model: Choose synthesis model.
  • --planner-max-output-tokens: Cap planner response size.
  • --research-max-output-tokens: Cap each sub-question research response size.
  • --writer-max-output-tokens: Cap final report synthesis response size.
  • --max-total-output-tokens: Hard limit for estimated run output tokens.
  • --disable-web-search: Disable web tool for internal-data-only runs.
  • --web-tool-type: Override tool type when endpoint uses a non-default web-search tool name.
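
The documented ranges for `--depth`, `--research-depth`, and `--parallel` could be enforced with `argparse` roughly as follows. This is a sketch, not the script's actual parser: the default values and the `bounded_int` helper are assumptions made for illustration.

```python
import argparse


def bounded_int(low, high):
    """Build an argparse type that accepts integers in [low, high]."""
    def parse(value):
        number = int(value)
        if not low <= number <= high:
            raise argparse.ArgumentTypeError(
                f"expected an integer in [{low}, {high}], got {number}"
            )
        return number
    return parse


def build_parser():
    parser = argparse.ArgumentParser(prog="deep_research.py")
    parser.add_argument("topic")
    # Ranges come from the flag list above; the defaults are assumed.
    parser.add_argument("--depth", type=bounded_int(2, 12), default=4)
    parser.add_argument(
        "--research-depth",
        choices=["shallow", "standard", "deep"],
        default="standard",
    )
    parser.add_argument("--parallel", type=bounded_int(1, 8), default=2)
    parser.add_argument("--disable-web-search", action="store_true")
    return parser
```

With this shape, an out-of-range value such as `--depth 99` is rejected at parse time with a usage error instead of failing later in the run.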

Artifact Contract

Write one run directory per execution: outputs/-/.

Produce these files:

  • run_meta.json: runtime parameters and metadata.
  • plan.json: normalized sub-question plan.
  • plan_raw.txt: raw planner model output.
  • findings.json: per-question evidence summaries.
  • research_raw.json: raw responses per sub-question.
  • report.md: final cited report.
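
To confirm that a run directory honors this contract, a small check along these lines may help. The file names are taken from the list above; `missing_artifacts` is a hypothetical helper, and the run directory path is whatever the script printed or created.

```python
from pathlib import Path

EXPECTED_ARTIFACTS = (
    "run_meta.json",
    "plan.json",
    "plan_raw.txt",
    "findings.json",
    "research_raw.json",
    "report.md",
)


def missing_artifacts(run_dir):
    """Return the expected artifact names that are absent from run_dir."""
    run_path = Path(run_dir)
    return [name for name in EXPECTED_ARTIFACTS if not (run_path / name).is_file()]
```

An empty list means every expected file is present; it says nothing about whether the contents are valid.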

Quality Gate

Apply all checks before accepting report.md:

  1. Verify each sub-question has explicit evidence or explicit gap notes.
  2. Verify source links are absolute URLs and point to relevant content.
  3. Verify contradictory evidence is surfaced in Contradictions and Uncertainty.
  4. Verify recommendation statements are specific and actionable.
  5. Verify weak-confidence sections are marked clearly.
  6. Verify all required top-level sections exist in Markdown (Executive Summary, Key Findings, Evidence by Sub-question, Contradictions and Uncertainty, Recommendations, Sources).
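
Checks 2 and 6 lend themselves to automation. The sketch below assumes the report uses ATX-style Markdown headings (`#`, `##`, ...) at any level and inline `[text](url)` links; both are assumptions about the report format, and the function names are hypothetical. The remaining checks still need a human reader.

```python
import re
from urllib.parse import urlparse

REQUIRED_SECTIONS = (
    "Executive Summary",
    "Key Findings",
    "Evidence by Sub-question",
    "Contradictions and Uncertainty",
    "Recommendations",
    "Sources",
)


def missing_sections(markdown):
    """Return required section titles that do not appear as a heading."""
    headings = {
        match.group(1).lower()
        for match in re.finditer(r"^#{1,6}[ \t]+(.+?)[ \t]*$", markdown, flags=re.MULTILINE)
    }
    return [name for name in REQUIRED_SECTIONS if name.lower() not in headings]


def non_absolute_links(markdown):
    """Return inline link targets that are not absolute http(s) URLs."""
    targets = re.findall(r"\[[^\]]*\]\(([^)\s]+)\)", markdown)
    flagged = []
    for target in targets:
        parsed = urlparse(target)
        if parsed.scheme not in ("http", "https") or not parsed.netloc:
            flagged.append(target)
    return flagged
```

Neither function judges whether a link points to relevant content; that part of check 2 remains manual.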

Use references/research-quality.md for scoring rubric and iteration guidance.

Troubleshooting

If execution fails with missing package errors, install dependencies from scripts/requirements.txt.

If JSON parsing fails, rerun with the same topic and lower --depth, then inspect plan_raw.txt or research_raw.json.

If web-search tool type is rejected, pass a compatible value via --web-tool-type or disable web search.
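
When inspecting plan_raw.txt after a JSON parsing failure, it can help to see whether any valid JSON is embedded in the raw model output (for instance, wrapped in prose or a code fence). The following is a diagnostic sketch using the standard library's `json.JSONDecoder.raw_decode`; `first_json_value` is a hypothetical helper and does not reflect how the script parses its output.

```python
import json


def first_json_value(raw_text):
    """Return the first JSON object or array embedded in raw_text, else None."""
    decoder = json.JSONDecoder()
    for index, char in enumerate(raw_text):
        if char in "{[":
            try:
                value, _end = decoder.raw_decode(raw_text, index)
            except json.JSONDecodeError:
                continue  # Not valid JSON from this position; keep scanning.
            return value
    return None
```

Note that bracketed prose such as `[1]` also parses as a JSON array, so the result should be read as a hint about where the payload sits, not as a validated plan.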

Version History

  • v0.1.0 (current): 2026-05-07 10:51

Security Checks

  • Tencent Cloud Security (Keen): safe, no risk.
  • Tencent Cloud Security (Sanbu): safe, no risk.
