Llava Vision

Call a local llama.cpp server with the LLaVA model to analyze images.



---
name: llava-vision-local
version: 0.1.0
description: Call a local llama.cpp server with the LLaVA model to analyze images.
license: MIT
metadata:
  openclaw:
    emoji: "🧠"
    requires:
      bins: []
    os:
      - linux
      - darwin
      - win32
---

LLaVA Vision Skill

> This skill forwards an image to a locally running llama.cpp server that hosts a LLaVA model and returns the model’s text description of the image. It accepts either a local file path or a remote image URL.

Usage

clawhub llava-vision --image /path/to/photo.jpg
# or
clawhub llava-vision --image https://example.com/photo.jpg

The skill uses the built-in vision_analyze tool, which expects an image file path. If the image cannot be read or the server is unreachable, the skill returns an error message.
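As a rough illustration of the input handling described above, the following Node.js sketch separates a remote URL from a local path and reports an unreadable file as an error message instead of throwing. The helper names `classifyImageArg` and `checkImage` are hypothetical and are not taken from the skill's source. How the skill actually handles remote URLs is not documented here.

```javascript
const fs = require("node:fs");
const path = require("node:path");

// Hypothetical helper: decide whether the --image value is a remote URL
// or a local file path. Only http(s) URLs count as remote, so Windows
// paths such as C:\photos\cat.png are still treated as files.
function classifyImageArg(arg) {
  try {
    const url = new URL(arg);
    if (url.protocol === "http:" || url.protocol === "https:") {
      return { kind: "url", value: url.href };
    }
  } catch {
    // Not an absolute URL; fall through and treat it as a file path.
  }
  return { kind: "file", value: path.resolve(arg) };
}

// Hypothetical helper: check that a local image is readable and return
// an error message rather than throwing when it is not.
function checkImage(arg) {
  const input = classifyImageArg(arg);
  if (input.kind === "file") {
    try {
      fs.accessSync(input.value, fs.constants.R_OK);
    } catch (err) {
      return { ok: false, error: `Cannot read image: ${input.value} (${err.code})` };
    }
  }
  return { ok: true, input };
}
```

Calling `checkImage("./cat.png")` yields either `{ ok: true, input }` or an `{ ok: false, error }` object that can be printed to the user as is.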

Dependencies

Node.js (the skill itself)
A local llama.cpp server with the LLaVA model exposed at the default endpoint.
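To check that the server dependency is in place, a request along the following lines can be sent from Node.js 18 or later. This is a sketch under stated assumptions, not the skill's own code. It assumes the llama.cpp server was started with a LLaVA model and its multimodal projector, that it listens on `http://127.0.0.1:8080`, and that it serves the OpenAI-compatible `/v1/chat/completions` route. The names `buildRequest` and `describeImage`, the prompt text, and the `max_tokens` value are illustrative.

```javascript
// Assumed address of the local llama.cpp server; adjust to your setup.
const ENDPOINT = "http://127.0.0.1:8080/v1/chat/completions";

// Build an OpenAI-style chat request carrying one text part and one image part.
// imageUrl may be a base64 data URI (data:image/png;base64,...) or an http(s) URL.
function buildRequest(imageUrl, prompt = "Describe this image.") {
  return {
    messages: [
      {
        role: "user",
        content: [
          { type: "text", text: prompt },
          { type: "image_url", image_url: { url: imageUrl } },
        ],
      },
    ],
    max_tokens: 256,
  };
}

// Send the request and return either the model's text or an error message.
async function describeImage(imageUrl, endpoint = ENDPOINT) {
  let response;
  try {
    response = await fetch(endpoint, {
      method: "POST",
      headers: { "Content-Type": "application/json" },
      body: JSON.stringify(buildRequest(imageUrl)),
    });
  } catch (err) {
    // Network-level failure, for example the server is not running.
    return `Error: could not reach server at ${endpoint}: ${err.message}`;
  }
  if (!response.ok) {
    return `Error: server responded with HTTP ${response.status}`;
  }
  const data = await response.json();
  return data?.choices?.[0]?.message?.content ?? "Error: unexpected response shape";
}
```

Returning a string in every case mirrors the behavior described in Usage, where failures surface as an error message.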

Example

$ clawhub run llava-vision --image ./cat.png
The image contains a cat sitting on a windowsill, looking out at a sunny garden.

Version history

1 version in total

v1.0.1 (current)

2026-05-07 08:13

