
Seedance 2.0 — AI Video by ByteDance

Generate AI videos using ByteDance's Seedance 1.5 Pro — a native audio-visual joint generation model with cinematic camera control, multi-language lip-sync,...
xixihhhh
Category: Content creation · Source: clawhub · Version: v1.1.2 (2 versions) · API key: required

Overview

Seedance — AI Video Generation by ByteDance

Generate AI videos with synchronized audio using ByteDance's Seedance 1.5 Pro — featuring native audio-visual joint generation, cinematic camera control, multi-language lip-sync, and diverse sound effects.

Seedance excels at creating cinematic short clips with realistic motion, facial expressions, spatial audio, and complex camera movements.

> Data usage note: This skill sends text prompts and image URLs to the Atlas Cloud API (api.atlascloud.ai) for video generation. No data is stored locally beyond the downloaded output files. API usage incurs charges based on the model selected.


Key Capabilities

  • Text-to-Video — Generate video clips from text descriptions with synchronized audio
  • Image-to-Video — Animate still images into dynamic video with motion and audio
  • Native Audio Generation — Dialogue, sound effects, and music generated jointly with video (not post-processed)
  • Multi-Language Lip-Sync — English, Chinese (including dialects), Japanese, Korean, Portuguese, Spanish, Indonesian
  • Cinematic Camera Control — Dolly-in, snap zoom, first-person POV, tripod lock, crane shots
  • Multiple Styles — Realistic, anime, 2D animation, steampunk, ink-wash, and more
  • Resolution — Up to 720p (Pro), 480p available
  • Duration — 5-12 seconds per clip

Setup

  1. Sign up at https://www.atlascloud.ai
  2. Console → API Keys → Create new key
  3. Set env: export ATLASCLOUD_API_KEY="your-key"
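
For callers working from Python rather than the shell, the key exported in step 3 can be read from the environment and turned into the bearer header used by the curl examples in this document. This is a minimal sketch; the helper name `auth_headers` is hypothetical and is not part of the skill's script.

```python
import os


def auth_headers(env=None):
    """Build request headers from ATLASCLOUD_API_KEY (hypothetical helper)."""
    env = os.environ if env is None else env
    key = env.get("ATLASCLOUD_API_KEY", "").strip()
    if not key:
        # Fail early with a clear message instead of sending an empty token.
        raise RuntimeError("ATLASCLOUD_API_KEY is not set; see the Setup steps")
    return {
        "Authorization": f"Bearer {key}",
        "Content-Type": "application/json",
    }
```

Passing a plain dict as `env` makes the helper easy to exercise without touching the real environment.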

Script Usage

This skill includes a Python script for video generation. Zero external dependencies required.

List available video models

python scripts/generate_video.py list-models

Generate a video

python scripts/generate_video.py generate \
  --model "bytedance/seedance-v1.5-pro/text-to-video" \
  --prompt "Your prompt" \
  --output ./output

Image-to-video

python scripts/generate_video.py generate \
  --model "bytedance/seedance-v1.5-pro/image-to-video" \
  --image "https://example.com/photo.jpg" \
  --prompt "Animate" \
  --output ./output

Run python scripts/generate_video.py generate --help for all options.
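
If the script is driven from another Python program, the command line can be assembled as an argument list. The sketch below uses only the subcommand and flags shown above (`generate`, `--model`, `--image`, `--prompt`, `--output`); the function name `build_generate_command` is hypothetical, and any further options should be taken from `--help` rather than guessed.

```python
import sys


def build_generate_command(model, prompt, output="./output", image=None):
    """Return the argv list for scripts/generate_video.py generate."""
    cmd = [sys.executable, "scripts/generate_video.py", "generate",
           "--model", model]
    if image is not None:
        # Only image-to-video models take a source image URL.
        cmd += ["--image", image]
    cmd += ["--prompt", prompt, "--output", output]
    return cmd
```

The resulting list can be handed to `subprocess.run(cmd, check=True)`; passing a list instead of a shell string avoids quoting problems in prompts.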


Pricing

| Model | Tier | Price | Resolution | Best For |
|---|---|---|---|---|
| bytedance/seedance-v1.5-pro/text-to-video | Pro | $0.222/video | Up to 720p | High-quality text-to-video |
| bytedance/seedance-v1.5-pro/image-to-video | Pro | $0.222/video | Up to 720p | Animate images to video |
| bytedance/seedance-v1.5-pro/text-to-video-fast | Fast | $0.018/video | 720p | Quick drafts, prototyping |
| bytedance/seedance-v1.5-pro/image-to-video-fast | Fast | $0.018/video | 720p | Quick image animation |

Pro tier delivers higher quality with more detail and coherence. Fast tier is ~12x cheaper and suitable for drafts and iteration.
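
To make the trade-off concrete, the per-video prices listed in this section can be plugged into a small estimate. The workflow assumed here (several Fast drafts followed by Pro finals) and the function name `estimate_cost` are illustrative only.

```python
# USD per video, copied from the Pricing section.
PRICE_PER_VIDEO = {"pro": 0.222, "fast": 0.018}


def estimate_cost(fast_drafts, pro_finals):
    """Total cost in USD for a mix of Fast drafts and Pro finals."""
    total = (fast_drafts * PRICE_PER_VIDEO["fast"]
             + pro_finals * PRICE_PER_VIDEO["pro"])
    return round(total, 3)


# Ten Fast drafts plus one Pro final.
print(estimate_cost(10, 1))
# Price ratio between the tiers, the basis of the "~12x" figure.
print(round(PRICE_PER_VIDEO["pro"] / PRICE_PER_VIDEO["fast"], 1))
```

Under these prices, ten drafts cost less than a single Pro render, which is why iterating on the Fast tier first is usually the cheaper path.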


Available Models

Text-to-Video

| Model ID | Speed | Quality | Audio |
|---|---|---|---|
| bytedance/seedance-v1.5-pro/text-to-video | Standard (~30-60s) | High | Yes |
| bytedance/seedance-v1.5-pro/text-to-video-fast | Fast (~10-20s) | Good | Yes |

Image-to-Video

| Model ID | Speed | Quality | Audio |
|---|---|---|---|
| bytedance/seedance-v1.5-pro/image-to-video | Standard (~30-60s) | High | Yes |
| bytedance/seedance-v1.5-pro/image-to-video-fast | Fast (~10-20s) | Good | Yes |

Parameters

Text-to-Video

| Parameter | Type | Required | Default | Options |
|---|---|---|---|---|
| prompt | string | Yes | - | Video description |
| aspect_ratio | string | No | 16:9 | 21:9, 16:9, 4:3, 1:1, 3:4, 9:16 |
| duration | integer | No | 5 | 5-12 seconds |
| resolution | string | No | 720p | 720p, 480p (Pro); 720p (Fast) |
| generate_audio | boolean | No | true | Generate synchronized audio |
| camera_fixed | boolean | No | false | Lock camera position (tripod mode) |
| seed | integer | No | -1 (random) | For reproducible results |

Image-to-Video

Same as text-to-video, plus:

| Parameter | Type | Required | Description |
|---|---|---|---|
| image | string | Yes | URL of the source image to animate |
| last_image | string | No | URL of the target end frame (for guided motion) |
| prompt | string | No | Optional text describing desired motion/action |
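
The parameter rules in this section can be checked on the client before a request is sent. The following is a sketch, not the skill's own validation: the function name `build_payload` and its `fast` flag are hypothetical, and the checks only mirror the options listed in this section (the API may enforce more).

```python
ASPECT_RATIOS = {"21:9", "16:9", "4:3", "1:1", "3:4", "9:16"}
MODEL_PREFIX = "bytedance/seedance-v1.5-pro/"


def build_payload(prompt=None, image=None, last_image=None, fast=False,
                  aspect_ratio="16:9", duration=5, resolution="720p",
                  generate_audio=True, camera_fixed=False, seed=-1):
    """Validate inputs against the documented options and build the JSON body."""
    if image is None and not prompt:
        raise ValueError("prompt is required for text-to-video")
    if last_image is not None and image is None:
        raise ValueError("last_image only applies to image-to-video")
    if aspect_ratio not in ASPECT_RATIOS:
        raise ValueError(f"unsupported aspect_ratio: {aspect_ratio}")
    if not 5 <= duration <= 12:
        raise ValueError("duration must be between 5 and 12 seconds")
    allowed = {"720p"} if fast else {"720p", "480p"}
    if resolution not in allowed:
        raise ValueError(f"resolution {resolution} not available for this tier")

    task = "image-to-video" if image is not None else "text-to-video"
    payload = {
        "model": MODEL_PREFIX + task + ("-fast" if fast else ""),
        "aspect_ratio": aspect_ratio,
        "duration": duration,
        "resolution": resolution,
        "generate_audio": generate_audio,
        "camera_fixed": camera_fixed,
        "seed": seed,  # -1 means random, per the parameter list
    }
    # Optional fields are omitted rather than sent as null.
    if prompt:
        payload["prompt"] = prompt
    if image is not None:
        payload["image"] = image
    if last_image is not None:
        payload["last_image"] = last_image
    return payload
```

The model ID is derived from the inputs, so a caller cannot accidentally pair an image with a text-to-video model.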

Workflow: Submit → Poll → Download

Text-to-Video Example

# Step 1: Submit
curl -s -X POST "https://api.atlascloud.ai/api/v1/model/generateVideo" \
  -H "Authorization: Bearer $ATLASCLOUD_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "bytedance/seedance-v1.5-pro/text-to-video",
    "prompt": "A woman walks through a sunlit bamboo forest, camera slowly dollying forward. Birds chirping in the background, gentle wind rustling leaves.",
    "aspect_ratio": "16:9",
    "duration": 5,
    "resolution": "720p",
    "generate_audio": true
  }'
# Returns: { "code": 200, "data": { "id": "prediction-id" } }

# Step 2: Poll (every 5 seconds until "completed" or "succeeded")
curl -s "https://api.atlascloud.ai/api/v1/model/prediction/{prediction-id}" \
  -H "Authorization: Bearer $ATLASCLOUD_API_KEY"
# Returns: { "code": 200, "data": { "status": "completed", "outputs": ["https://...video-url..."] } }

# Step 3: Download
curl -o output.mp4 "VIDEO_URL_FROM_OUTPUTS"
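
The submit step can also be written with Python's standard library, which keeps the zero-dependency property of the skill. This is a sketch under the assumption that the response has the `{ "code": 200, "data": { "id": ... } }` shape shown in the curl example; the function names are hypothetical, and the `urlopen` parameter exists only so the call can be replaced in tests.

```python
import json
import urllib.request

SUBMIT_URL = "https://api.atlascloud.ai/api/v1/model/generateVideo"


def make_submit_request(payload, api_key):
    """Build the POST request equivalent to the Step 1 curl call."""
    return urllib.request.Request(
        SUBMIT_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def submit(payload, api_key, urlopen=urllib.request.urlopen):
    """Send the request and return the prediction ID from data.id."""
    with urlopen(make_submit_request(payload, api_key)) as resp:
        body = json.load(resp)
    return body["data"]["id"]
```

Once polling reports a finished job, the file at the URL in `data.outputs` can be saved with `urllib.request.urlretrieve(url, "output.mp4")`, matching Step 3.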

Image-to-Video Example

curl -s -X POST "https://api.atlascloud.ai/api/v1/model/generateVideo" \
  -H "Authorization: Bearer $ATLASCLOUD_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "bytedance/seedance-v1.5-pro/image-to-video",
    "image": "https://example.com/portrait.jpg",
    "prompt": "The person slowly turns their head and smiles, camera gently zooms in",
    "aspect_ratio": "9:16",
    "duration": 5,
    "generate_audio": true
  }'

Fast Model Example (Quick Draft)

curl -s -X POST "https://api.atlascloud.ai/api/v1/model/generateVideo" \
  -H "Authorization: Bearer $ATLASCLOUD_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "bytedance/seedance-v1.5-pro/text-to-video-fast",
    "prompt": "Ocean waves crashing on a rocky shore at sunset, seagulls flying overhead",
    "aspect_ratio": "16:9",
    "duration": 5,
    "generate_audio": true
  }'

Polling Logic

  • processing / starting / running → wait 5s, retry (Pro takes ~30-60s, Fast takes ~10-20s)
  • completed / succeeded → done, get URL from data.outputs[]
  • failed → error, read data.error
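
The three status rules above map onto a short polling loop. In this sketch, `fetch_prediction` is a hypothetical callable that returns the `data` object from the prediction endpoint; the overall timeout and the decision to treat any unlisted status as an error are assumptions of the example, not documented API behavior.

```python
import time

PENDING = {"processing", "starting", "running"}
DONE = {"completed", "succeeded"}


def wait_for_video(fetch_prediction, interval=5.0, timeout=300.0,
                   sleep=time.sleep, clock=time.monotonic):
    """Poll until the job finishes and return the list in data.outputs."""
    deadline = clock() + timeout
    while True:
        data = fetch_prediction()
        status = data.get("status")
        if status in DONE:
            return data["outputs"]
        if status == "failed":
            raise RuntimeError(data.get("error") or "generation failed")
        if status not in PENDING:
            # Assumption: unknown statuses are surfaced instead of retried.
            raise RuntimeError(f"unexpected status: {status!r}")
        if clock() >= deadline:
            raise TimeoutError("video generation did not finish in time")
        sleep(interval)
```

Injecting `sleep` and `clock` keeps the loop testable without real waiting; in normal use the defaults give the 5-second interval described above.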

Atlas Cloud MCP Tools (if available)

If the Atlas Cloud MCP server is configured, use built-in tools:

atlas_quick_generate(model_keyword="seedance 1.5", type="Video", prompt="...")
atlas_generate_video(model="bytedance/seedance-v1.5-pro/text-to-video", params={...})
atlas_get_prediction(prediction_id="...")

Implementation Guide

  1. Determine task type:
    • Text-to-video: user describes a scene/action in text
    • Image-to-video: user provides an image to animate
  2. Choose model:
    • Pro for final output, client-facing content, or quality-critical use
    • Fast for quick iteration, drafts, or budget-conscious use
  3. Extract parameters:
    • Prompt: describe scene, action, camera movement, and audio cues
    • Aspect ratio: infer from context (social reel→9:16, YouTube→16:9, square→1:1, cinematic→21:9)
    • Duration: default 5s, up to 12s for longer scenes
    • Audio: enabled by default; disable with generate_audio: false if user only wants silent video
    • Camera: set camera_fixed: true for static/tripod shots
  4. Execute: POST to generateVideo API → poll result → download MP4
  5. Present result: show file path, offer to play
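
The model choice and aspect-ratio inference from the guide above can be sketched as two small lookups. The function names and the context keywords used as dictionary keys are hypothetical; a real implementation would need its own way of recognizing the user's intent.

```python
# Context-to-ratio hints taken from the guide; the keys are illustrative.
ASPECT_BY_CONTEXT = {
    "social reel": "9:16",
    "youtube": "16:9",
    "square": "1:1",
    "cinematic": "21:9",
}


def choose_model(has_image, final_quality):
    """Pick the model ID from task type and quality need."""
    task = "image-to-video" if has_image else "text-to-video"
    suffix = "" if final_quality else "-fast"
    return f"bytedance/seedance-v1.5-pro/{task}{suffix}"


def infer_aspect_ratio(context, default="16:9"):
    """Map a context hint to an aspect ratio, falling back to the default."""
    return ASPECT_BY_CONTEXT.get(context.strip().lower(), default)
```

For example, a draft animation of a user-supplied photo for a social reel would resolve to the Fast image-to-video model at 9:16.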

Prompt Tips

Seedance produces best results when prompts describe both visual and audio elements:

  • Scene + Action: "A chef flips a pancake in a busy kitchen, sizzling sounds and clattering pans"
  • Camera direction: "Camera slowly pans left to reveal...", "Close-up tracking shot of...", "First-person POV walking through..."
  • Audio cues: Include sound descriptions — "birds chirping", "rain on window", "jazz music playing softly"
  • Dialogue: For talking videos, include speech in quotes — "The narrator says: 'Welcome to our city'"
  • Style: "cinematic", "anime style", "documentary", "slow motion", "timelapse"
  • Lip-sync: For multi-language dialogue, specify the language — "A woman speaking Japanese says: 'こんにちは'"
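
One way to apply these tips consistently is to assemble prompts from named parts. The sketch below is only a convenience: the function name `compose_prompt`, the sentence order, and the "Audio:" and "Style:" labels are assumptions of this example, and nothing in this document says the model requires that layout.

```python
def compose_prompt(scene, camera=None, audio=(), dialogue=None,
                   language=None, style=None):
    """Join scene, camera, audio, dialogue and style into one prompt string."""
    parts = [scene.strip().rstrip(".")]
    if camera:
        parts.append(camera.strip().rstrip("."))
    if audio:
        parts.append("Audio: " + ", ".join(audio))
    if dialogue:
        # Naming the language follows the lip-sync tip above.
        speaker = f"A person speaking {language}" if language else "The narrator"
        parts.append(f"{speaker} says: '{dialogue}'")
    if style:
        parts.append(f"Style: {style}")
    return ". ".join(parts) + "."
```

Calling it with a scene, a camera direction, and a list of sound cues yields a single string that covers both the visual and the audio side of the clip.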

Coming Soon: Seedance 2.0

Seedance 2.0 is ByteDance's next-generation unified multimodal video generation system, currently in preview. When available on Atlas Cloud, this skill will be upgraded with:

  • Higher resolution — Expected support for 1080p and above
  • Longer duration — Extended video length beyond 12 seconds
  • Multimodal references — Video-to-video, audio-guided generation
  • Director-level control — Fine-grained manipulation of performance, lighting, shadow, and camera
  • Enhanced motion stability — Improved realism and coherence across longer clips

The API workflow and parameter structure are expected to remain compatible. Model IDs will be updated when Seedance 2.0 becomes available — no configuration changes needed on your end.

Version History

2 versions total

  • v1.1.2 (current)
    2026-03-29 02:51, security check: safe
  • v1.1.0
    2026-03-26 21:40

Security Check

Tencent Cloud Security (Keen)

Safe, no risk

Tencent Cloud Security (Sanbu)

Safe, no risk
