← 返回
内容创作 Key 中文

Tavily Extract

Extract content from specific URLs using Tavily's extraction API. Returns clean markdown/text from web pages.
使用Tavily提取API从特定URL提取内容,返回干净的markdown或文本格式。
matthew77
内容创作 clawhub v1.0.0 1 版本 99856.3 Key: 需要
★ 1
Stars
📥 675
下载
💾 17
安装
1
版本
#latest

概述

Tavily Extract

Extract clean content from specific URLs. Ideal when you know which pages you want content from.

Authentication

Get your API key at https://tavily.com and add to your OpenClaw config:

{
  "skills": {
    "entries": {
      "tavily-extract": {
        "enabled": true,
        "apiKey": "tvly-YOUR_API_KEY_HERE"
      }
    }
  }
}

Or set in environment variable:

export TAVILY_API_KEY="tvly-YOUR_API_KEY_HERE"

Quick Start

Using the Script

node {baseDir}/scripts/extract.mjs "https://example.com/article"
node {baseDir}/scripts/extract.mjs "url1,url2,url3"
node {baseDir}/scripts/extract.mjs "url" --query "authentication API"

Examples

# Single URL
node {baseDir}/scripts/extract.mjs "https://docs.python.org/3/tutorial/classes.html"

# Multiple URLs
node {baseDir}/scripts/extract.mjs "https://example.com/page1,https://example.com/page2"

# With query focus
node {baseDir}/scripts/extract.mjs "https://example.com/docs" --query "authentication API"

# Advanced extraction for JS pages
node {baseDir}/scripts/extract.mjs "https://app.example.com" --depth advanced --timeout 60

Options

OptionDescriptionDefault
------------------------------
--query Rerank chunks by relevance-
--chunks Chunks per URL (1-5, requires query)3
--depth Extract depth: basic or advancedbasic
--format Output format: markdown or textmarkdown
--timeout Max wait time (1-60 seconds)varies
--jsonOutput raw JSONfalse

Extract Depth

DepthWhen to Use
--------------------
basicSimple text extraction, faster
advancedDynamic/JS-rendered pages, tables, structured data

Tips

  • Max 20 URLs per request - batch larger lists
  • Use --query + --chunks to get only relevant content
  • Try basic first, fall back to advanced if content is missing
  • Set longer --timeout for slow pages (up to 60s)
  • Check failed_results in JSON output for URLs that couldn't be extracted

版本历史

共 1 个版本

  • v1.0.0 当前
    2026-03-30 01:13 安全 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

ai-intelligence

Tavily Search

matthew77
使用Tavily的LLM优化API进行网络搜索,返回包含内容片段、评分和元数据的相关结果。
★ 119 📥 45,608
content-creation

Humanizer

biostartechnology
消除AI写作痕迹,使文本更自然真实。基于维基百科"AI写作特征"指南,识别并修正夸张象征、宣传用语、肤浅-ing分析、模糊归因、破折号滥用、三项排比、AI词汇、负面平行结构及冗长连接词等模式。
★ 860 📥 199,867
content-creation

AdMapix

fly0pants
广告情报与应用数据分析助手,支持搜索广告素材、分析应用排名、下载量、收入及市场洞察,用于广告素材和竞品分析。
★ 295 📥 136,493