← 返回
未分类 Key 中文

GLM-V-Prompt-Gen

Analyze images/videos and generate professional prompts for text-to-image and text-to-video AI tools (Midjourney, Stable Diffusion, DALL-E, Sora, Runway, Kli...
分析图像/视频,为文本到图像和文本到视频的AI工具(Midjourney、Stable Diffusion、DALL‑E、Sora、Runway、Kling等)生成专业提示词。
jaredforreal jaredforreal 来源
未分类 clawhub v1.0.3 1 版本 99880.1 Key: 需要
★ 1
Stars
📥 813
下载
💾 3
安装
1
版本
#latest

概述

GLM-V Prompt Generation Skill

Analyze reference images or videos and generate professional prompts for AI image/video generation tools.

When to Use

  • Generate prompts for text-to-image tools (Midjourney, Stable Diffusion, DALL-E, etc.)
  • Generate prompts for text-to-video tools (Sora, Runway, Kling, Pika, etc.)
  • User mentions "生成prompt", "文生图prompt", "文生视频prompt", "prompt工程", "参考图生成prompt", "generate prompt"
  • User provides an image/video and wants to recreate or remix it
  • Extract prompt ideas from reference visual content

Supported Input Types

TypeFormatsMax SizeMax CountBase64
----------------------------------------------------------
Imagejpg, png, jpeg5MB / 6000×6000px50
Videomp4, mkv, mov200MB❌ (URL only)

> ⚠️ Images and videos cannot be used in the same request.

> ⚠️ Videos only support URLs — local paths and base64 are NOT supported.

📋 Output Display Rules (MANDATORY)

After running the script, you must display the full prompt output exactly as returned. Do not summarize, truncate, or only say "prompt generated". Users need the complete prompt (especially the English prompt) for direct copy/paste.

  • Show the full output: content analysis + prompt + prompt breakdown
  • In auto mode, show both text-to-image and text-to-video prompts
  • English prompts are core output and must be shown completely
  • If output was saved (-o), provide the file path and show file content

Output Modes

ModeDescription
---------------------------------------------------------
imageGenerate prompts for text-to-image tools (default)
videoGenerate prompts for text-to-video tools
autoGenerate prompts for both image and video

Resource Links

ResourceLink
------------------------------------------------------------------------------------------------------------------------------------------------
Get API Keyhttps://bigmodel.cn/usercenter/proj-mgmt/apikeys
API DocsChat Completions / 对话补全

Prerequisites

API Key Setup / API Key 配置(Required / 必需)

This script reads the key from the ZHIPU_API_KEY environment variable and shares it with other Zhipu skills.

脚本通过 ZHIPU_API_KEY 环境变量获取密钥,与其他智谱技能共用同一个 key。

Get Key / 获取 Key: Visit Zhipu Open Platform API Keys / 智谱开放平台 API Keys to create or copy your key.

Setup options / 配置方式(任选一种):

  1. OpenClaw config (recommended) / OpenClaw 配置(推荐): Set in openclaw.json under skills.entries.glmv-prompt-gen.env:

```json

"glmv-prompt-gen": { "enabled": true, "env": { "ZHIPU_API_KEY": "你的密钥" } }

```

  1. Shell environment variable / Shell 环境变量: Add to ~/.zshrc:

```bash

export ZHIPU_API_KEY="你的密钥"

```

> 💡 If you already configured another Zhipu skill (for example zhipu-tools or glmv-caption), they share the same ZHIPU_API_KEY, so no extra setup is needed.

> 💡 如果你已为其他智谱 skill(如 zhipu-toolsglmv-caption)配置过 key,它们共享同一个 ZHIPU_API_KEY,无需重复配置。

How to Use

Image → Text-to-Image Prompt

python scripts/prompt_gen.py --images "https://example.com/photo.jpg"
python scripts/prompt_gen.py --images /path/to/photo.png

Image → Text-to-Video Prompt

python scripts/prompt_gen.py --images "https://example.com/scene.jpg" --mode video

Image → Both (Image + Video Prompts)

python scripts/prompt_gen.py --images "https://example.com/photo.jpg" --mode auto

Video → Text-to-Video Prompt

python scripts/prompt_gen.py --videos "https://example.com/clip.mp4" --mode video

Save Result to File

python scripts/prompt_gen.py --images photo.jpg --mode image -o prompt.md

Custom Model

python scripts/prompt_gen.py --images photo.jpg --model glm-4.6v-flash

Output Example (image mode)

### Content Analysis
A cyberpunk cityscape at night, with dense skyscrapers, glowing neon signs, and rain-wet streets reflecting colorful light.

### Prompt
Cyberpunk cityscape at night, towering skyscrapers with glowing neon signs,
rain-wet streets reflecting colorful lights, flying cars in the distance,
volumetric fog, dramatic lighting, ultra detailed, 8K, cinematic composition

### Prompt Breakdown
- **Subject**: Futuristic skyline with skyscrapers and neon lights
- **Style**: Cyberpunk, sci-fi
- **Color**: Cool/warm contrast with blue-purple dominance and neon accents
- **Lighting**: Neon glow, wet-surface reflections, volumetric fog
- **Composition**: Wide-angle perspective with layered depth
- **Mood**: Mysterious, futuristic, high-tech

CLI Reference

python scripts/prompt_gen.py (--images IMG [IMG...] | --videos VID [VID...]) [OPTIONS]
ParameterRequiredDescription
-------------------------------------------------------------------------------
--images, -iOne ofImage paths or URLs (jpg/png/jpeg, base64 OK)
--videos, -vOne ofVideo URLs (mp4/mkv/mov, URL only)
--mode, -mNoOutput mode: image (default), video, or auto
--modelNoModel name (default: glm-4.6v)
--temperature, -tNoSampling temperature 0-1 (default: 0.6)
--max-tokensNoMax output tokens (default: 2048)
--thinkingNoEnable thinking/reasoning mode
--streamNoEnable streaming output
--output, -oNoSave result to file
--prettyNoPretty-print JSON error output

Error Handling

API key not configured: → Guide user to configure ZHIPU_API_KEY

Authentication failed (401/403): → API key invalid/expired → check at Zhipu API Keys / 智谱官网

Rate limit (429): → Quota exhausted → wait and retry

Content filtered:warning field present → content blocked by safety review

Timeout: → Video processing may take time → increase timeout or use smaller files

版本历史

共 1 个版本

  • v1.0.3 当前
    2026-05-03 03:47 安全 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

design-media

Nano Banana Pro

steipete
使用 Nano Banana Pro (Gemini 3 Pro Image) 生成或编辑图像。支持文生图、图生图及 1K/2K/4K 分辨率,适用于图像创建、修改及编辑请求,使用 --input-image 指定输入图像。
★ 428 📥 116,598
ai-agent

GLM-Master-Skill

jaredforreal
仅文档型主技能,用于 GLM 生态系统的发现与安装。此技能不执行脚本或子进程命令,提供精选...
★ 5 📥 1,372
design-media

Openai Whisper

steipete
使用 Whisper CLI 进行本地语音转文字(无需 API 密钥)
★ 330 📥 93,465