Yummy Gen Video

Use when the user wants to generate a video with Gemini Veo through yummycli, including text-to-video, image-to-video (single starting frame), and reference-...
Source: yummysource
Uncategorized · clawhub · v1.1.0 · 1 version · Key: required

Overview

Generate Video

Create videos with yummycli gemini veo using Google Veo.

When to Use

Load this skill when the user asks to generate, create, or animate a video using AI — including text-to-video, animating a still image, or generating a video guided by reference images.

> Prerequisite: Apply the yummy-shared skill first.

This skill covers three generation modes with a single command:

  • Text-to-video (no images)
  • Image-to-video (one starting frame)
  • Reference-guided video (two or three reference images)

Command Contract

Two equivalent entry points are available:

Entry point                                When to use
-----------------------------------------  --------------------------------------------------------
yummycli gemini veo                        Default: human-friendly, Gemini Veo presets applied
yummycli video generate --provider gemini  Scripting / automation: explicit, provider-agnostic form

Both share the same flags and defaults. Prefer gemini veo unless the task explicitly requires the provider-agnostic form.

Basic usage:

yummycli gemini veo --prompt "<prompt>"

With one or more input images:

yummycli gemini veo \
  --prompt "<prompt>" \
  --input-image ./frame.png \
  --input-image ./style.jpg

Optional output controls:

--output <file.mp4>
--model <model>
--aspect-ratio <ratio>
--duration <seconds>
--resolution <resolution>

Default values when omitted: --model veo-3.1-fast-generate-preview, --aspect-ratio 16:9, --duration 8, --resolution 1080p.

Image Routing Rules

The number of --input-image flags determines the API path automatically:

Count  Behaviour
-----  ---------
0      Text-to-video. Prompt drives the entire generation.
1      Image-to-video. The image is used as the starting frame.
2–3    Reference-guided. Images are passed as ASSET reference images; the prompt describes the motion and content.

Never pass more than 3 --input-image flags — the API rejects it.
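
The routing table above can be sketched as a small pre-flight check. The function name veo_mode_for_image_count is hypothetical and not part of yummycli; it only mirrors the documented mapping so a wrapper script can fail early instead of waiting for the API to reject a fourth image.

```shell
# Hypothetical helper (not part of yummycli): map the number of
# --input-image flags to the generation mode described in the table.
veo_mode_for_image_count() {
  case "$1" in
    0)   echo "text-to-video" ;;
    1)   echo "image-to-video" ;;
    2|3) echo "reference-guided" ;;
    *)   echo "unsupported image count: $1 (allowed: 0 to 3)" >&2
         return 1 ;;
  esac
}
```

A wrapper could call this with the number of images it collected and stop before invoking yummycli when the function returns non-zero.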

Model Selection

Default model: veo-3.1-fast-generate-preview.

Use the following mapping when the user explicitly names a model variant:

User says                            Use
-----------------------------------  ---------------------------------------
veo 3.1, 3.1 fast, or no preference  veo-3.1-fast-generate-preview (default)
veo 3.1 full or veo 3.1 standard     veo-3.1-generate-preview
veo 3, veo 3 fast                    veo-3.0-fast-generate-001
veo 3 standard                       veo-3.0-generate-001
veo 2                                veo-2.0-generate-001

Do not switch models from vague quality words alone. Only apply a mapping when the user's wording clearly refers to model choice.
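
As an illustration, the mapping could be written as a lookup that falls back to the default model for anything it does not recognise, which also covers vague quality words. The function name veo_model_for_phrase and the exact-phrase matching are assumptions for this sketch; real user wording will need looser interpretation than a literal match.

```shell
# Hypothetical helper (not part of yummycli): translate an explicit
# model phrase into a model ID, using the mapping table above.
veo_model_for_phrase() {
  # Lowercase the phrase so matching is case-insensitive.
  phrase=$(printf '%s' "$1" | tr '[:upper:]' '[:lower:]')
  case "$phrase" in
    "veo 3.1 full"|"veo 3.1 standard") echo "veo-3.1-generate-preview" ;;
    "veo 3"|"veo 3 fast")              echo "veo-3.0-fast-generate-001" ;;
    "veo 3 standard")                  echo "veo-3.0-generate-001" ;;
    "veo 2")                           echo "veo-2.0-generate-001" ;;
    # "veo 3.1", "3.1 fast", no preference, or anything unrecognised:
    # keep the documented default rather than guessing.
    *)                                 echo "veo-3.1-fast-generate-preview" ;;
  esac
}
```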

Model Compatibility

Supported duration values (seconds)

Duration accepts only discrete values — not a range.

Model                 Valid durations
--------------------  ---------------
veo-2.0-generate-001  5, 6, 7, 8
veo-3.0-*             4, 6, 8
veo-3.1-*             4, 6, 8

Supported resolutions

Model                 Supported resolutions
--------------------  ---------------------
veo-2.0-generate-001  720p only
veo-3.0-*             720p, 1080p
veo-3.1-*             720p, 1080p, 4k

Constraints:

  • 1080p requires --duration 8.
  • 4k requires --duration 8 and a veo-3.1 model.

Supported aspect ratios

All models: 16:9 (landscape) and 9:16 (portrait).
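
The compatibility rules in this section can be combined into one pre-flight check. The sketch below assumes a hypothetical function veo_validate (not a yummycli command) that takes the model, duration, resolution, and aspect ratio, and encodes only the values listed above.

```shell
# Hypothetical pre-flight check (not part of yummycli).
# Usage: veo_validate <model> <duration> <resolution> <aspect-ratio>
veo_validate() {
  model=$1; duration=$2; resolution=$3; ratio=$4
  # Allowed durations and resolutions per model family.
  case "$model" in
    veo-2.0-*) durations="5 6 7 8"; resolutions="720p" ;;
    veo-3.0-*) durations="4 6 8";   resolutions="720p 1080p" ;;
    veo-3.1-*) durations="4 6 8";   resolutions="720p 1080p 4k" ;;
    *) echo "unknown model: $model" >&2; return 1 ;;
  esac
  case "$duration" in
    ""|*[!0-9]*) echo "duration must be a whole number" >&2; return 1 ;;
  esac
  case " $durations " in
    *" $duration "*) ;;
    *) echo "duration $duration is not valid for $model" >&2; return 1 ;;
  esac
  case " $resolutions " in
    *" $resolution "*) ;;
    *) echo "resolution $resolution is not supported by $model" >&2; return 1 ;;
  esac
  # 1080p and 4k are only available for 8-second clips.
  case "$resolution" in
    1080p|4k)
      if [ "$duration" != 8 ]; then
        echo "$resolution requires --duration 8" >&2; return 1
      fi ;;
  esac
  case "$ratio" in
    16:9|9:16) return 0 ;;
    *) echo "aspect ratio must be 16:9 or 9:16" >&2; return 1 ;;
  esac
}
```

For example, `veo_validate veo-3.0-generate-001 4 1080p 16:9` fails because 1080p requires an 8-second clip, while the same call with 720p passes.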

Intent to Parameters

Translate clear user intent into CLI flags when the mapping is obvious.

Aspect ratio guidance:

  • Use --aspect-ratio 9:16 for vertical/portrait outputs: phone wallpaper, short-form vertical video, story format.
  • Use --aspect-ratio 16:9 for landscape outputs: film, presentation, widescreen. This is the default.
  • If the user already specifies a ratio, pass it through directly.

Duration guidance:

  • Use the longest valid duration for the model unless the user requests shorter.
  • If the user says "short clip" or "quick", use --duration 4 (veo-3+) or --duration 5 (veo-2). Because 1080p and 4k require --duration 8, pair any shorter duration with --resolution 720p.
  • Never pass a duration that is not in the valid set for the selected model.
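
A minimal sketch of the duration guidance, assuming a hypothetical helper veo_pick_duration that receives the model ID and the literal word "short" when the user asked for a brief clip:

```shell
# Hypothetical helper (not part of yummycli): choose a --duration value.
# Usage: veo_pick_duration <model> [short]
veo_pick_duration() {
  model=$1; want=${2:-}
  if [ "$want" = "short" ]; then
    # Shortest valid value: 5 for veo-2, 4 for veo-3 and later.
    case "$model" in
      veo-2.0-*) echo 5 ;;
      *)         echo 4 ;;
    esac
  else
    # 8 is the longest valid duration for every listed model.
    echo 8
  fi
}
```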

Resolution guidance:

  • Default (1080p) is appropriate for most requests. With veo-2.0-generate-001, which supports 720p only, pass --resolution 720p explicitly.
  • Use --resolution 4k only when the user explicitly asks for 4K quality and a veo-3.1 model is in use; pair with --duration 8.
  • Use --resolution 720p when the user asks for a smaller or faster result.

Output path guidance:

  • If --output is omitted, yummycli generates a timestamped .mp4 filename in the current working directory. Do not invent your own filename unless the user provides one.
  • The output path must end in .mp4. Reject or correct any other extension.
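
The extension rule can be checked before the command runs. The helper name veo_check_output_path is hypothetical; the sketch rejects a bad path rather than silently correcting it, which is one of the two options the guidance allows.

```shell
# Hypothetical helper (not part of yummycli): accept only paths that
# end in .mp4 and have at least one character before the extension.
veo_check_output_path() {
  case "$1" in
    ?*.mp4) return 0 ;;
    *) echo "output path must end in .mp4: $1" >&2
       return 1 ;;
  esac
}
```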

Output Contract

Video commands return JSON on stdout. Read the response and use the output field as the generated file path.

Example (text-to-video):

{
  "provider": "gemini",
  "output": "veo_20260417_142301_047.mp4",
  "model": "veo-3.1-fast-generate-preview",
  "duration_seconds": 8,
  "aspect_ratio": "16:9",
  "resolution": "1080p",
  "elapsed_seconds": 73
}

Example (image-to-video, one starting frame):

{
  "provider": "gemini",
  "output": "veo_20260417_143010_112.mp4",
  "model": "veo-3.1-fast-generate-preview",
  "duration_seconds": 8,
  "aspect_ratio": "16:9",
  "resolution": "1080p",
  "elapsed_seconds": 89,
  "input_images": ["./dog.jpg"]
}
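
Reading the output field might look like the sketch below. It assumes the response has the flat shape shown in the examples above and uses sed only to stay dependency-free; a real JSON parser such as jq (for instance `jq -r '.output'`) is more robust, since this pattern does not handle escaped quotes inside the value. The function name veo_output_path is hypothetical.

```shell
# Hypothetical helper (not part of yummycli): read the JSON response
# on stdin and print the value of its "output" string field.
veo_output_path() {
  sed -n 's/.*"output"[[:space:]]*:[[:space:]]*"\([^"]*\)".*/\1/p' | head -n 1
}
```

In a script the command's stdout would be piped into the function, for example `yummycli gemini veo --prompt "<prompt>" | veo_output_path`.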

Execution Rules

  • Check yummycli auth status --provider gemini before running if credentials may not be configured.
  • Use one --input-image flag per local image file; preserve the user-specified order.
  • Validate duration and resolution against the selected model's constraints before running.
  • Video generation is slow (typically 45–120 seconds). Inform the user that generation is in progress; do not treat a long wait as an error.
  • If the command returns a validation error (bad duration, unsupported resolution, missing file), fix the arguments before retrying. Do not retry with the same invalid arguments.
  • Report the final output path back to the user after a successful run.

Examples

Text-to-video:

yummycli gemini veo \
  --prompt "A golden retriever puppy chasing a red ball in a sunny park"

Image-to-video (animate a still):

yummycli gemini veo \
  --prompt "The dog starts running toward the camera" \
  --input-image ./dog.jpg

Reference-guided (two images):

yummycli gemini veo \
  --prompt "Combine the character from the first image with the environment from the second" \
  --input-image ./character.png \
  --input-image ./background.jpg

Short portrait clip with veo-2:

yummycli gemini veo \
  --prompt "Falling cherry blossoms in slow motion" \
  --model veo-2.0-generate-001 \
  --aspect-ratio 9:16 \
  --duration 5 \
  --resolution 720p

4K landscape with veo-3.1:

yummycli gemini veo \
  --prompt "Timelapse of clouds moving over mountain peaks at golden hour" \
  --resolution 4k \
  --duration 8

Version History

1 version in total

  • v1.1.0 (current)
    2026-05-07 05:57
