Vision Bot

Describe images, detect objects, extract text, and analyze webpages. Pass any image URL directly in your task. Responds in your language.
unixlamadev-spec
Content creation · clawhub · v1.2.0 · 3 versions · 99837.8 · Key: required
★ 0 stars · 📥 1,846 downloads · 💾 15 installs · #latest

Overview

Vision Bot

Analyze images for detailed descriptions, object detection, and OCR text extraction. Pass any image URL directly in your task string — no separate field needed. Auto-detects the right mode from your task — OCR for text extraction, counting for quantity questions, or full description by default. Responds in the language of your task.
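The service's actual detection rules are not documented here. Purely as an illustration of how task wording could map onto the three modes named above, here is a minimal keyword heuristic. The function name `guess_mode`, the patterns, and the mode labels are all hypothetical and are not part of the Vision Bot API.

```python
import re

# Hypothetical patterns: the real service's detection rules are not documented.
OCR_PATTERN = re.compile(r"\b(ocr|extract (the )?text|read (the )?text|transcribe)\b")
COUNT_PATTERN = re.compile(r"\b(how many|count|number of)\b")
URL_PATTERN = re.compile(r"https?://\S+")


def guess_mode(task: str) -> str:
    """Return 'ocr', 'count', or 'describe' using simple keyword matching."""
    # Remove URLs first so words inside the image link cannot trigger a match.
    text = URL_PATTERN.sub(" ", task).lower()
    if OCR_PATTERN.search(text):
        return "ocr"
    if COUNT_PATTERN.search(text):
        return "count"
    # Full description is the documented default.
    return "describe"
```

A task such as "How many cars are in https://example.com/lot.jpg" would fall into the counting branch under this sketch, while a task with no matching keyword falls back to a full description.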

When to Use

  • Describing image contents for accessibility
  • Extracting text from screenshots, signs, or photos (OCR)
  • Counting objects in images
  • Identifying objects in images
  • Analyzing charts, diagrams, or visual data
  • Analyzing images in any language (Chinese, Spanish, French, etc.)
  • Describing webpage screenshots for audits
  • Analyzing any image by including the URL directly in your task

Usage Flow

  1. Include an image URL directly in your task string
  2. Alternatively, supply the URL in a separate image_url field
  3. The language of the task sets the response language automatically
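The two ways of supplying an image can be sketched as a small payload builder. The `task`, `rail`, and `spend_token` field names come from the request example in this document; placing `image_url` at the top level of the same JSON body is an assumption, and `build_payload` is a hypothetical helper rather than part of any official client.

```python
import re

URL_PATTERN = re.compile(r"https?://\S+")


def build_payload(task, spend_token, image_url=None, rail="bitcoin-lightning"):
    """Build the JSON body for an orchestrate request.

    Either the task text already contains the image URL, or the URL is
    passed separately and sent as an image_url field (assumed top-level).
    """
    payload = {"task": task, "rail": rail, "spend_token": spend_token}
    if image_url is not None:
        payload["image_url"] = image_url
    elif not URL_PATTERN.search(task):
        # Neither form of image input was supplied, so fail early.
        raise ValueError("no image URL in task and no image_url given")
    return payload
```

Checking for a URL before sending is a client-side convenience only; it avoids paying for a request that contains no image.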

Security Manifest

Permission  Scope               Reason
----------  ------------------  -----------------------------------
Network     aiprox.dev          API calls to orchestration endpoint
Env Read    AIPROX_SPEND_TOKEN  Authentication for paid API

Make Request

# The token is spliced in with '"$VAR"' because variables do not expand inside single quotes.
curl -X POST https://aiprox.dev/api/orchestrate \
  -H "Content-Type: application/json" \
  -d '{
    "task": "Describe this image: https://example.com/photo.jpg",
    "rail": "bitcoin-lightning",
    "spend_token": "'"$AIPROX_SPEND_TOKEN"'"
  }'
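For callers who prefer a script over curl, the same request can be sketched with Python's standard library. The endpoint, headers, and body fields mirror the curl call above; the helper names `build_request` and `analyze` and the 30-second timeout are assumptions made for illustration.

```python
import json
import os
import urllib.request

ENDPOINT = "https://aiprox.dev/api/orchestrate"


def build_request(task, spend_token=None):
    """Prepare (but do not send) a POST request mirroring the curl call."""
    # Fall back to the environment variable named in the Security Manifest.
    token = spend_token or os.environ.get("AIPROX_SPEND_TOKEN")
    if not token:
        raise RuntimeError("AIPROX_SPEND_TOKEN is not set")
    body = json.dumps(
        {"task": task, "rail": "bitcoin-lightning", "spend_token": token}
    ).encode("utf-8")
    return urllib.request.Request(
        ENDPOINT,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def analyze(task, timeout=30.0):
    """Send the request and return the decoded JSON response."""
    with urllib.request.urlopen(build_request(task), timeout=timeout) as resp:
        return json.load(resp)
```

Separating `build_request` from `analyze` lets the request be inspected without spending from the token, since only `analyze` touches the network.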

Response

{
  "description": "A modern office workspace with a standing desk and dual monitors.",
  "objects": ["desk", "monitors", "keyboard", "mouse", "plant", "window", "headphones"],
  "text_found": "Visual Studio Code - main.js"
}
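A response of this shape can be read into a small typed structure. The three field names come from the example above; treating each of them as optional is an assumption (the document does not say whether every mode returns all three), and `VisionResult` and `parse_result` are hypothetical names.

```python
import json
from dataclasses import dataclass, field


@dataclass
class VisionResult:
    description: str = ""
    objects: list = field(default_factory=list)
    text_found: str = ""


def parse_result(raw: str) -> VisionResult:
    """Parse a response body, tolerating fields that are absent."""
    data = json.loads(raw)
    return VisionResult(
        description=data.get("description", ""),
        objects=list(data.get("objects", [])),
        text_found=data.get("text_found", ""),
    )


# The sample response from this document.
SAMPLE = """{
  "description": "A modern office workspace with a standing desk and dual monitors.",
  "objects": ["desk", "monitors", "keyboard", "mouse", "plant", "window", "headphones"],
  "text_found": "Visual Studio Code - main.js"
}"""

result = parse_result(SAMPLE)
```

With the sample above, `result.objects` holds the seven listed object names and `result.text_found` holds the extracted window title.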

Trust Statement

Vision Bot analyzes images via URL or base64 input. Images are processed transiently using Claude's vision capabilities via LightningProx. No images are stored. Your spend token is used for payment only.

Version History

3 versions in total

  • v1.2.0 (current)
    2026-04-30 14:48 · Safe · Safe
  • v1.0.1
    2026-03-30 21:40 · Safe · Safe
  • v1.1.0
    2026-03-27 21:18

Security Scan

Tencent Cloud Security (Keen)

Safe, no risk

Tencent Cloud Security (Sanbu)

Safe, no risk
