Vision Analyzer

Analyze images using Ollama Cloud's Kimi K2.5 vision capabilities. Use when user wants to describe, understand, or get information about an image. Works with...

使用 Ollama Cloud 的 Kimi K2.5 视觉功能分析图像，适用于用户想要描述、理解或获取图像信息的场景。

未分类 clawhub v1.0.1 1 版本 100000 Key: 需要

★ 0

Stars

📥 409

下载

💾 0

安装

版本

#latest

概述

Vision Analyzer

Analyze images using Kimi K2.5 multimodal vision capabilities through Ollama Cloud API.

When to Use

User wants to know what's in an image
Describing screenshots or photos
Understanding UI elements, text in images
Analyzing memes, charts, diagrams

Quick Start

python3 ~/.openclaw/workspace/skills/vision-analyzer/scripts/vision_analyze.py <image_path> [prompt]

Examples

Describe an image:

python3 ~/.openclaw/workspace/skills/vision-analyzer/scripts/vision_analyze.py photo.jpg

Ask specific question:

python3 ~/.openclaw/workspace/skills/vision-analyzer/scripts/vision_analyze.py screenshot.png "What UI elements do you see?"

Common Image Locations

Downloads: /mnt/chromeos/MyFiles/Downloads/
Screenshots: /mnt/chromeos/MyFiles/Downloads/
Home directory: ~/

Configuration

Set your Ollama API key as environment variable:

export OLLAMA_API_KEY="your-api-key-here"

Get your API key from ollama.com/settings

API Configuration

The skill uses Ollama Cloud API with Kimi K2.5 model.

API key is read from OLLAMA_API_KEY environment variable.

Supported Formats

JPG/JPEG
PNG
GIF
WebP

Output

Returns a natural language description of the image content.

版本历史

共 1 个版本

v1.0.1 当前

2026-05-07 07:29 安全安全

安全检测

腾讯云安全 (Keen)

安全，无风险

查看报告

腾讯云安全 (Sanbu)

安全，无风险

查看报告

🔗 相关推荐

design-media

Openai Whisper

steipete

使用 Whisper CLI 进行本地语音转文字（无需 API 密钥）

★ 333 📥 94,286

design-media

Nano Banana Pro

steipete

使用 Nano Banana Pro (Gemini 3 Pro Image) 生成或编辑图像。支持文生图、图生图及 1K/2K/4K 分辨率，适用于图像创建、修改及编辑请求，使用 --input-image 指定输入图像。

★ 431 📥 117,427

design-media

Video Frames

steipete

使用 ffmpeg 从视频中提取帧或短片。

★ 136 📥 53,057