← 返回
未分类 中文

WebChat Image Support

Enables agents to detect and analyze images sent via WebChat or other channels using vision-capable models or fallback media processing.
使代理能够通过WebChat或其他渠道检测并分析图像,使用具备视觉功能的模型或后备媒体处理。
styoha
未分类 clawhub v1.0.0 1 版本 100000 Key: 无需
★ 1
Stars
📥 416
下载
💾 0
安装
1
版本
#latest

概述

webchat-image-support

Universal image understanding enhancement for OpenClaw. This skill enables image understanding across all channels (WebChat, Discord, Slack, etc.) and works with any model that supports image input.

What It Does

When users send images through any channel, this skill ensures the agent can understand and analyze them:

  • Automatic Detection: Detects when an inbound message contains images
  • Universal Support: Works with Claude, MiniMax, OpenAI, Gemini, or any vision-enabled model
  • Fallback Processing: If model doesn't support images, uses OpenClaw's built-in media understanding pipeline
  • Multi-Image Support: Handles multiple images in a single message

Requirements

  1. Gateway with image support (OpenClaw 2026.3.29+)
  2. At least one vision-capable model configured in models.json:
    • Claude (with vision)
    • MiniMax-VL-01
    • Gemini Pro Vision
    • GPT-4 Vision

Usage

No explicit commands needed. Just send images:

User: [sends a screenshot of error]
Agent: "我看到了错误信息:Unable to load script..."

User: [sends a photo]
Agent: "这张图片显示了一个卡通猪头..."

Configuration

Model Selection

For best results, use a vision-capable model. In ~/.openclaw/agents/main/agent/models.json:

{
  "providers": {
    "minimax": {
      "models": [
        {
          "id": "MiniMax-VL-01",
          "input": ["text", "image"]
        }
      ]
    }
  }
}

Default Behavior

Model SupportBehavior
-------------------------
Model supports imagesDirect image input to model
Model no imagesUse media understanding pipeline

Troubleshooting

Q: Agent doesn't see images

A: Make sure your model supports image input (check input field in models.json)

Q: Images sent but no response

A: Check gateway logs for media processing errors

Q: Works in CLI but not WebChat

A: This skill requires OpenClaw 2026.3.29+ with the MediaPath fix

Related

版本历史

共 1 个版本

  • v1.0.0 当前
    2026-05-03 08:04 安全 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

security-compliance

Skill Vetter

spclaudehome
AI智能体技能安全预审工具。安装ClawdHub、GitHub等来源技能前,检查风险信号、权限范围及可疑模式。
★ 1,215 📥 266,418
developer-tools

Github

steipete
使用 `gh` CLI 与 GitHub 交互,通过 `gh issue`、`gh pr`、`gh run` 和 `gh api` 管理议题、PR、CI 运行及高级查询。
★ 668 📥 324,023
ai-intelligence

ontology

oswalpalash
类型化知识图谱,用于结构化智能体记忆与可组合技能。支持创建/查询实体(人员、项目、任务、事件、文档)及关联...
★ 711 📥 243,715