← 返回
内容创作 Key 中文

Kling 3.0 — AI Video Generation by Kuaishou

Generate, animate, and edit AI videos using Kuaishou's Kling 3.0 and Kling Video O3 — featuring cinematic motion quality, physics simulation, reference-based...
使用快手Kling 3.0和Kling Video O3生成、动画化并编辑AI视频,具备电影级运动质量、物理仿真、参考式等特性。
xixihhhh
内容创作 clawhub v1.0.4 2 版本 99922.1 Key: 需要
★ 0
Stars
📥 1,283
下载
💾 107
安装
2
版本
#latest

概述

Kling 3.0 & O3 — AI Video Generation by Kuaishou

Generate, animate, and edit AI videos using Kuaishou's Kling 3.0 and Kling Video O3 — featuring cinematic motion quality, realistic physics simulation, reference-based generation, and natural-language video editing.

Kling 3.0 excels at creating cinematic short clips with realistic motion, complex camera movements, and faithful prompt adherence. Kling Video O3 adds MVL (Multi-modal Visual Language) technology with reference-based generation and video editing capabilities. All models support optional synchronized sound generation.

> Data usage note: This skill sends text prompts, image URLs, and video URLs to the Atlas Cloud API (api.atlascloud.ai) for video generation and editing. No data is stored locally beyond the downloaded output files. API usage incurs charges per second based on the model selected.


Key Capabilities

  • Text-to-Video — Generate video clips from text descriptions
  • Image-to-Video — Animate still images into dynamic video with first/last frame control
  • Reference-to-Video — Generate videos using character, prop, or scene reference images (O3)
  • Video Editing — Natural-language video editing: remove/replace objects, change backgrounds, add effects (O3)
  • Sound Generation — Optional synchronized sound effects and audio
  • Pro & Standard Tiers — Pro for highest quality, Standard for cost-effective production
  • Multiple Aspect Ratios — 16:9, 9:16, 1:1
  • Flexible Duration — V3: 5 or 10 seconds; O3: 3-15 seconds
  • Negative Prompts — Specify what to exclude from generated video (V3)

Setup

  1. Sign up at https://www.atlascloud.ai
  2. Console → API Keys → Create new key
  3. Set env: export ATLASCLOUD_API_KEY="your-key"

Script Usage

This skill includes a Python script for video generation. Zero external dependencies required.

List available video models

python scripts/generate_video.py list-models

Generate a video (text-to-video)

python scripts/generate_video.py generate \
  --model "MODEL_ID" \
  --prompt "Your prompt here" \
  --output ./output \
  duration=5 resolution=720p

Generate a video (image-to-video)

python scripts/generate_video.py generate \
  --model "MODEL_ID" \
  --image "https://example.com/photo.jpg" \
  --prompt "Animate this scene" \
  --output ./output

Upload a local file

python scripts/generate_video.py upload ./local-file.jpg

Run python scripts/generate_video.py generate --help for all options. Extra model params can be passed as key=value (e.g. duration=10 shot_type=multi_camera).


Pricing

All prices are per second of video generated. Atlas Cloud offers 15% off compared to standard API pricing.

Kling V3.0

ModelTierOriginal PriceAtlas CloudBest For
-------------:--------------::-----------:----------
kwaivgi/kling-v3.0-std/text-to-videoStandard~~$0.18/s~~$0.153/sCost-effective text-to-video
kwaivgi/kling-v3.0-std/image-to-videoStandard~~$0.18/s~~$0.153/sCost-effective image animation
kwaivgi/kling-v3.0-pro/text-to-videoPro~~$0.24/s~~$0.204/sHigh-quality text-to-video
kwaivgi/kling-v3.0-pro/image-to-videoPro~~$0.24/s~~$0.204/sHigh-quality image animation

Kling Video O3 Pro

ModelOriginal PriceAtlas CloudBest For
-------:--------------::-----------:----------
kwaivgi/kling-video-o3-pro/text-to-video~~$0.24/s~~$0.204/sMVL-enhanced text-to-video
kwaivgi/kling-video-o3-pro/image-to-video~~$0.24/s~~$0.204/sMVL-enhanced image animation
kwaivgi/kling-video-o3-pro/reference-to-video~~$0.24/s~~$0.204/sReference-based video generation
kwaivgi/kling-video-o3-pro/video-edit~~$0.36/s~~$0.306/sProfessional video editing

Kling Video O3 Standard

ModelOriginal PriceAtlas CloudBest For
-------:--------------::-----------:----------
kwaivgi/kling-video-o3-std/text-to-video-$0.153/sCost-effective MVL text-to-video
kwaivgi/kling-video-o3-std/image-to-video-$0.153/sCost-effective MVL image animation
kwaivgi/kling-video-o3-std/reference-to-video-$0.085/sCost-effective reference-based generation
kwaivgi/kling-video-o3-std/video-edit-$0.238/sBudget video editing

Parameters

Kling V3.0 — Text-to-Video

ParameterTypeRequiredDefaultOptions
---------------------------------------------
promptstringYes-Video description
negative_promptstringNo-What to exclude from the video
durationintegerNo55, 10 seconds
aspect_ratiostringNo16:916:9, 9:16, 1:1
cfg_scalenumberNo0.50-1, controls prompt adherence
soundbooleanNofalseGenerate synchronized audio

Kling V3.0 — Image-to-Video

Same as V3.0 text-to-video, plus:

ParameterTypeRequiredDescription
----------------------------------------
imagestringYesURL of the source image (jpg/jpeg/png, max 10MB, min 300px, aspect ratio 1:2.5 to 2.5:1)
end_imagestringNoURL of the target end frame (for guided motion)

Kling Video O3 — Text-to-Video

ParameterTypeRequiredDefaultOptions
---------------------------------------------
promptstringYes-Video description
aspect_ratiostringNo16:916:9, 9:16, 1:1
durationintegerNo53-15 seconds
soundbooleanNofalseGenerate synchronized audio

Kling Video O3 — Image-to-Video

ParameterTypeRequiredDefaultDescription
-------------------------------------------------
promptstringYes-Video description
imagestringYes-First frame image URL
end_imagestringNo-Last frame image URL
durationintegerNo53-15 seconds
generate_audiobooleanNofalseAuto-add audio to video

Kling Video O3 — Reference-to-Video

ParameterTypeRequiredDefaultDescription
-------------------------------------------------
promptstringYes-Video description
imagesarrayNo-Reference images (up to 7 without video, up to 4 with video)
videostringNo-Reference video URL
keep_original_soundbooleanNotrueKeep original sound from reference video
soundbooleanNofalseGenerate new audio
aspect_ratiostringNo16:916:9, 9:16, 1:1
durationintegerNo53-15 seconds

Kling Video O3 — Video Editing

ParameterTypeRequiredDefaultDescription
-------------------------------------------------
promptstringYes-Editing instruction in natural language
videostringYes-Source video URL (max 10s duration)
imagesarrayNo-Reference images for element, scene, or style (max 4)
keep_original_soundbooleanNotrueKeep original audio from the video

Workflow: Submit → Poll → Download

Text-to-Video Example (V3.0 Pro)

# Step 1: Submit
curl -s -X POST "https://api.atlascloud.ai/api/v1/model/generateVideo" \
  -H "Authorization: Bearer $ATLASCLOUD_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "kwaivgi/kling-v3.0-pro/text-to-video",
    "prompt": "A golden retriever running through a sunlit meadow, camera tracking alongside, wildflowers swaying in the breeze",
    "aspect_ratio": "16:9",
    "duration": 5,
    "cfg_scale": 0.5,
    "sound": true
  }'
# Returns: { "code": 200, "data": { "id": "prediction-id" } }

# Step 2: Poll (every 5 seconds until "completed" or "succeeded")
curl -s "https://api.atlascloud.ai/api/v1/model/prediction/{prediction-id}" \
  -H "Authorization: Bearer $ATLASCLOUD_API_KEY"
# Returns: { "code": 200, "data": { "status": "completed", "outputs": ["https://...video-url..."] } }

# Step 3: Download
curl -o output.mp4 "VIDEO_URL_FROM_OUTPUTS"

Image-to-Video Example (V3.0 Pro)

curl -s -X POST "https://api.atlascloud.ai/api/v1/model/generateVideo" \
  -H "Authorization: Bearer $ATLASCLOUD_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "kwaivgi/kling-v3.0-pro/image-to-video",
    "image": "https://example.com/landscape.jpg",
    "prompt": "The camera slowly pans across the landscape as clouds drift by and trees sway gently",
    "aspect_ratio": "16:9",
    "duration": 5,
    "sound": false
  }'

Reference-to-Video Example (O3 Pro)

curl -s -X POST "https://api.atlascloud.ai/api/v1/model/generateVideo" \
  -H "Authorization: Bearer $ATLASCLOUD_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "kwaivgi/kling-video-o3-pro/reference-to-video",
    "prompt": "A young woman walks through a cherry blossom garden, camera follows from behind",
    "images": ["https://example.com/character-ref.jpg"],
    "aspect_ratio": "16:9",
    "duration": 5,
    "sound": false
  }'

Video Editing Example (O3 Pro)

curl -s -X POST "https://api.atlascloud.ai/api/v1/model/generateVideo" \
  -H "Authorization: Bearer $ATLASCLOUD_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "kwaivgi/kling-video-o3-pro/video-edit",
    "video": "https://example.com/original-video.mp4",
    "prompt": "Remove the person in the background and replace with a blooming cherry tree",
    "keep_original_sound": true
  }'

Standard Tier Example (Cost-Effective)

curl -s -X POST "https://api.atlascloud.ai/api/v1/model/generateVideo" \
  -H "Authorization: Bearer $ATLASCLOUD_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "kwaivgi/kling-v3.0-std/text-to-video",
    "prompt": "Ocean waves crashing on a rocky shore at sunset, seagulls flying overhead",
    "aspect_ratio": "16:9",
    "duration": 5,
    "cfg_scale": 0.5
  }'

Polling Logic

  • processing / starting / running → wait 5s, retry (typically takes ~60-120s)
  • completed / succeeded → done, get URL from data.outputs[]
  • failed → error, read data.error

Atlas Cloud MCP Tools (if available)

If the Atlas Cloud MCP server is configured, use built-in tools:

atlas_generate_video(model="kwaivgi/kling-v3.0-pro/text-to-video", params={...})
atlas_get_prediction(prediction_id="...")

Implementation Guide

  1. Determine task type:
    • Text-to-video: user describes a scene/action in text
    • Image-to-video: user provides an image to animate
    • Reference-to-video: user wants to generate video using character/prop/scene references
    • Video editing: user wants to modify an existing video
  1. Choose model family:
    • Kling V3.0 for standard text-to-video and image-to-video with negative prompts and cfg_scale control
    • Kling Video O3 for MVL-enhanced generation, reference-based video, video editing, and longer durations (3-15s)
  1. Choose tier:
    • Pro for final output, client-facing content, or quality-critical use
    • Standard for most production use, cost-effective generation
  1. Extract parameters:
    • Prompt: describe scene, action, camera movement, and visual details
    • Negative prompt (V3 only): specify undesired elements (e.g., "blurry, distorted faces, watermark")
    • Aspect ratio: infer from context (social reel→9:16, YouTube→16:9, square→1:1)
    • Duration: V3 supports 5 or 10s; O3 supports 3-15s
    • cfg_scale (V3 only): 0.5 default; increase toward 1.0 for stricter prompt adherence
    • Sound: enable if user wants audio; disabled by default
  1. Execute: POST to generateVideo API → poll result → download MP4
  1. Present result: show file path, offer to play

Prompt Tips

Kling produces best results with detailed, descriptive prompts:

  • Scene + Action: "A chef flips a pancake in a busy kitchen, steam rising from the pan"
  • Camera direction: "Camera slowly pans left to reveal...", "Close-up tracking shot of...", "Aerial view sweeping over..."
  • Style: "cinematic", "documentary style", "slow motion", "timelapse", "anime style"
  • Negative prompts (V3): Use to avoid common issues — "blurry, low quality, distorted, watermark, text overlay"
  • cfg_scale tuning (V3): Lower values (0.3-0.5) give more creative freedom; higher values (0.7-1.0) follow the prompt more strictly
  • Reference-to-video (O3): Provide clear character/prop reference images for consistent results

Image Requirements for Image-to-Video

When using image-to-video models, the source image must meet these requirements:

  • Format: JPG, JPEG, or PNG
  • Size: Maximum 10MB
  • Dimensions: Minimum 300px on shortest side
  • Aspect ratio: Between 1:2.5 and 2.5:1

版本历史

共 2 个版本

  • v1.0.4 当前
    2026-03-29 06:53 安全 安全
  • v1.0.2
    2026-03-26 22:18

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

content-creation

AdMapix

fly0pants
广告情报与应用数据分析助手,支持搜索广告素材、分析应用排名、下载量、收入及市场洞察,用于广告素材和竞品分析。
★ 295 📥 136,553
content-creation

Baidu Wenku AIPPT

ide-rea
使用百度文库 AI 智能生成 PPT,自动根据内容选择模板。
★ 66 📥 46,247
content-creation

YouTube

byungkyu
使用托管OAuth集成YouTube Data API,支持搜索视频、管理播放列表、获取频道数据及评论互动,适用于用户需要时使用此技能。
★ 142 📥 41,115