runcomfy.com · docs · Seedance 2.0 Pro model page
Seedance 2.0 Pro is ByteDance's multi-modal cinematic short-form video model. This skill generates video with Seedance 2.0 Pro hosted on the RunComfy Model API — no Seedance API key, no GPU rental, just runcomfy run bytedance/seedance-v2/pro from your terminal.
Seedance 2.0 Pro is the second-generation Seedance model from ByteDance, designed for cinematic short-form video with three properties that make Seedance distinct:
Seedance 2.0 Pro generates 4–15 second clips at 480p or 720p, in 7 aspect ratios. Seedance prompts accept Chinese (≤500 chars) or English (≤1000 words).
Pick Seedance 2.0 Pro when any of these is true:
seed for deterministic Seedance output.If the user said "Seedance" / "Seedance 2" / "Seedance Pro" / "Seedance v2" / "ByteDance Seedance" explicitly, route here regardless.
npm i -g @runcomfy/cliruncomfy login opens a browser device-code flow.RUNCOMFY_TOKEN= instead of runcomfy login.bytedance/seedance-v2/proThis is the Seedance 2.0 Pro endpoint. The Seedance Lite tier and earlier Seedance versions run on different endpoints not covered here.
| Field | Type | Required | Default | Notes |
|---|---|---|---|---|
| --- | --- | --- | --- | --- |
prompt | string | yes | — | Seedance accepts CN ≤ 500 chars OR EN ≤ 1000 words. |
image_url | array | no | [] | 0–9 image references for Seedance (JPEG/PNG/WebP/BMP/TIFF/GIF). |
video_url | array | no | [] | 0–3 reference clips for Seedance (MP4/MOV), 2–15s each. |
audio_url | array | no | [] | 0–3 reference audio for Seedance (WAV/MP3), 2–15s, < 15MB each. |
aspect_ratio | enum | no | adaptive | adaptive, 16:9, 9:16, 4:3, 3:4, 1:1, 21:9. |
duration | int | no | 5 | 4–15 (whole seconds). Seedance per-call cap is 15s. |
resolution | enum | no | 720p | 480p or 720p. Seedance Pro tier max is 720p. |
generate_audio | bool | no | true | In-pass synchronized speech / SFX / music from Seedance. |
seed | int | no | — | Reproducibility for Seedance output. |
Default Seedance run (text only, 5s, 720p, with audio):
runcomfy run bytedance/seedance-v2/pro \
--input '{"prompt": "<Seedance prompt>"}' \
--output-dir <absolute/path>
Seedance lip-synced ad with character image reference:
runcomfy run bytedance/seedance-v2/pro \
--input '{
"prompt": "Medium close-up. The woman explains today'\''s special in a warm friendly tone, slow push-in, soft window light, gentle cafe ambience.",
"image_url": ["https://.../barista-headshot.jpg"],
"duration": 8,
"aspect_ratio": "9:16"
}' \
--output-dir <absolute/path>
Multi-modal Seedance call (image + video + audio refs):
runcomfy run bytedance/seedance-v2/pro \
--input '{
"prompt": "Subject from image 1 walks through the café from video 1, voice tone matches audio 1.",
"image_url": ["https://.../subject.jpg"],
"video_url": ["https://.../cafe-locked-shot.mp4"],
"audio_url": ["https://.../voice-ref.mp3"]
}' \
--output-dir <absolute/path>
The CLI submits the Seedance request, polls every 2s, fetches the Seedance result, and downloads any .runcomfy.net / .runcomfy.com URL into --output-dir.
Seedance 2.0 Pro responds to specific prompting patterns better than naive prose. Apply these for sharper Seedance output.
Image vs text division — the single most important Seedance rule. Stable identity (face, costume, brand mark, logo) → put in image_url so Seedance preserves it. Evolving narrative (action, mood, lighting, camera) → put in prompt so Seedance generates it. Trying to verbally describe a face in detail wastes Seedance tokens and produces drift.
Camera + motion in plain language. Seedance 2.0 Pro understands "medium close-up", "slow push-in", "handheld follow", "locked-off wide" as real directives. Combine: "Medium close-up. Slow push-in over 3 seconds. Handheld, slight breathing motion." Seedance executes the camera grammar.
Audio direction with generate_audio: true — tell Seedance the tone: "warm friendly conversational", "calm instructional", "crisp newsroom delivery". For ambient: "gentle cafe chatter, distant traffic, no foreground music". Seedance will synthesize audio matching the directive.
Seedance reference media specs. Reference videos must be 2–15s; reference audio must be ≤15MB and 2–15s. Out-of-range files reject. Match aspect ratio of refs to the Seedance output to avoid crops.
Seedance anti-patterns:
image_url instead.| Use case | Why Seedance 2.0 Pro |
|---|---|
| --- | --- |
| Spokesperson / dialogue ads | Seedance native in-pass lip-sync, no separate TTS step |
| Brand-consistent multi-language narratives | Seedance image refs hold identity; text drives translation |
| Cinematic short-form film previs | Seedance camera-shot grammar + multi-modal refs |
| Ad creatives with reference music / VO tone | Seedance audio refs guide voice / mood |
| Reproducible Seedance variant testing | Seedance seed control + fixed schema |
Default Seedance playground example:
Golden hour on a quiet cafe terrace: a barista wipes the counter, then
looks up and explains today's special in a friendly tone, natural
lip-sync. Medium close-up, slow push-in; warm side light, soft bokeh
through glass, gentle cafe ambience and subtle film grain.
Multi-modal Seedance lip-sync (text + image):
Same person as image 1 in a softly-lit recording booth, leaning into
the mic, says: "We just shipped the biggest update of the year."
Calm conversational tone. Medium close-up, locked tripod, shallow DOF,
warm key light from camera-left.
What's the max Seedance clip duration? A single Seedance 2.0 Pro call generates 4–15 seconds. For longer narratives, segment into multiple Seedance calls and stitch the outputs.
What aspect ratios does Seedance 2.0 Pro support? Seven: adaptive, 16:9, 9:16, 4:3, 3:4, 1:1, 21:9. Seedance defaults to adaptive (matches input refs).
Does Seedance 2.0 Pro do lip-sync? Yes. With generate_audio: true (default), Seedance produces lip-synced speech in-pass. The lip movement on Seedance output is timed to the spoken words.
Can Seedance take an existing audio file as input? Yes — pass it as audio_url. Seedance treats it as a reference (voice tone, mood) rather than a strict lip-sync driver. For audio-driven lip-sync to a literal voiceover, route to a different model.
What languages does Seedance 2.0 Pro accept? Chinese (≤500 chars) or English (≤1000 words) prompts. Seedance output language follows the prompt.
What's the Seedance resolution ceiling? 720p on the Seedance Pro tier here. 4K Seedance variants run on different endpoints not covered by this skill.
How do I get reproducible Seedance output? Pass seed as a fixed int. Same Seedance prompt + same seed = same Seedance generation.
@-syntax for character binding in Seedance — relies on image refs + prompt alignment.| code | meaning |
|---|---|
| --- | --- |
| 0 | Seedance generation succeeded |
| 64 | bad CLI args |
| 65 | bad input JSON for Seedance / schema mismatch |
| 69 | upstream 5xx |
| 75 | retryable: timeout / 429 |
| 77 | not signed in or token rejected |
Full reference: docs.runcomfy.com/cli/troubleshooting.
The skill invokes runcomfy run bytedance/seedance-v2/pro with a JSON body matching the Seedance schema. The CLI POSTs to https://model-api.runcomfy.net/v1/models/bytedance/seedance-v2/pro, polls the Seedance request, fetches the Seedance result, and downloads any .runcomfy.net / .runcomfy.com URL into --output-dir. Ctrl-C cancels the remote Seedance request before exit.
runcomfy login writes the API token to ~/.config/runcomfy/token.json with mode 0600.--input. No shell injection.model-api.runcomfy.net and .runcomfy.net / .runcomfy.com.共 1 个版本