← 返回
数据分析 中文

Image Vision

Analyze and interpret images by describing content, extracting text, answering questions, comparing visuals, and extracting structured data from JPG, PNG, GI...
对图像进行分析与解读,包括描述内容、提取文字、回答问题、比较视觉信息,并从JPG、PNG、GIF等格式中抽取结构化数据。
cntuang
数据分析 clawhub v1.0.0 1 版本 97529.4 Key: 无需
★ 4
Stars
📥 7,144
下载
💾 440
安装
1
版本
#latest

概述

Vision Analyze

Analyze images using the built-in vision capabilities of multimodal AI models.

Quick Start

Analyze an Image

Describe what's in an image:

# The agent will automatically use vision when you provide an image path
image("/path/to/image.jpg", prompt="Describe what's in this image")

Extract Text (OCR)

Extract text from images:

image("/path/to/document.png", prompt="Extract all text from this image")

Analyze Multiple Images

Compare or analyze multiple images:

images(["/path/to/image1.jpg", "/path/to/image2.jpg"], 
       prompt="Compare these two images and describe the differences")

Usage Patterns

Visual Q&A

Ask specific questions about image content:

image("menu.jpg", prompt="What are the prices of the main courses?")
image("chart.png", prompt="What trend does this graph show?")
image("screenshot.png", prompt="What error message is displayed?")

Content Moderation

Check image content:

image("upload.jpg", prompt="Is this image appropriate for a professional setting?")

Data Extraction

Extract structured data from visual content:

image("receipt.jpg", prompt="Extract the date, total amount, and items purchased")
image("business_card.png", prompt="Extract name, phone, email, and company")
image("form.jpg", prompt="Extract all filled fields as key-value pairs")

Visual Comparison

Compare images:

images(["before.jpg", "after.jpg"], 
       prompt="What changes were made between these two images?")

Tips

  • Be specific: The more specific your prompt, the better the results
  • Multiple images: You can analyze up to 20 images at once
  • Supported formats: JPG, PNG, GIF, WebP
  • Size limits: Large images are automatically resized

When to Use

  • Reading text from screenshots, documents, or photos
  • Describing visual content for accessibility
  • Analyzing charts, graphs, or diagrams
  • Comparing visual changes
  • Extracting data from forms or receipts
  • Understanding UI elements or error messages

版本历史

共 1 个版本

  • v1.0.0 当前
    2026-03-28 18:21 安全 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

data-analysis

A股量化 AkShare

mbpz
A股量化数据分析工具,基于AkShare库获取A股行情、财务数据、板块信息等。用于回答关于A股股票查询、行情数据、财务分析、选股等问题。
★ 162 📥 59,658
data-analysis

Data Analysis

ivangdavila
{"answer":"数据分析与可视化。查询数据库、生成报告、自动化电子表格,将原始数据转化为清晰可行的见解。适用于:(1) 您……"}
★ 198 📥 64,842
data-analysis

Excel / XLSX

ivangdavila
创建、检查和编辑 Microsoft Excel 工作簿及 XLSX 文件,支持可靠的公式、日期、类型、格式、重算及模板保留功能。
★ 366 📥 139,941