← 返回
未分类 Key 中文

corespeed-studio

Generate video, images, audio, and music using 40+ AI models via fal.ai. Use for video generation (Kling v3, Sora 2, Veo 3.1, LTX 2.3, Pixverse v5), image ge...
通过 fal.ai 使用 40 多种 AI 模型生成视频、图像、音频和音乐。用于视频生成 (Kling v3、Sora 2、Veo 3.1、LTX 2.3、Pixverse v5) 和图像生成
zypher-agent zypher-agent 来源
未分类 clawhub v1.0.0 1 版本 99762.5 Key: 需要
★ 1
Stars
📥 400
下载
💾 2
安装
1
版本
#latest

概述

Corespeed Art — Multi-Model AI Media via fal.ai

Auth: Set FAL_KEY with your fal.ai API key (get one at https://fal.ai/dashboard/keys).

Workflow

  1. Pick a model from the tables below
  2. Read its reference file to get the exact endpoint and parameters
  3. Run the command with the endpoint and JSON parameters

Usage

uv run {baseDir}/scripts/fal.py ENDPOINT --json '{"param":"value"}' -f output.ext [-i input.ext]
  • ENDPOINT — the fal.ai model path from the reference file (e.g. fal-ai/nano-banana-2)
  • --json — model parameters as JSON object
  • -f — output filename
  • -i — input file(s) to upload (repeat for multiple), auto-injected as image_url/image_urls/start_image_url/video_url
  • --audio — audio input file (for lipsync)

Image Generation

ModelBest ForReference
----------------------------
Nano Banana 2Pro quality, web search, thinkingRead nanobanana.md
FLUX 2 ProPhotorealistic, zero-configRead flux.md
FLUX Schnell⚡ Fastest iterationRead flux.md
FLUX Pro v1.1Accelerated, commercial useRead flux.md
FLUX.1 Dev12B params, fine-tuning friendlyRead flux.md
GPT Image 1.5Transparent bg, instruction followingRead gpt.md
Qwen Image 2 ProChinese+English, typography, native 2KRead qwen.md
Recraft V4 ProDesign/marketing, color controlRead recraft.md
Seedream 5 LiteMulti-image editing, reasoningRead seedream.md

Video Generation

ModelBest ForReference
----------------------------
Kling v3 Pro I2VBest I2V, multi-shot, audio, 3–15sRead kling.md
Sora 2 T2VLong video up to 20s, charactersRead sora.md
Sora 2 I2VImage→video with SoraRead sora.md
Veo 3.1 T2VCinematic + native audio/dialogueRead veo.md
Veo 3.1 I2VImage→video with audioRead veo.md
LTX 2.3 T2V Fast⚡ Fast, up to 2160p/20s, open sourceRead ltx.md
LTX 2.3 I2VImage→video, start+end frameRead ltx.md
Pixverse v5 I2VAnime, 3D, clay, cyberpunk stylesRead pixverse.md

Audio / TTS

ModelBest ForReference
----------------------------
MiniMax Speech-02 HD30+ languages, loudness normalizationRead minimax-speech.md

Music & Sound Effects

ModelBest ForReference
----------------------------
Beatoven MusicAI music, up to 90sRead beatoven-music.md

Utilities

ToolBest ForReference
---------------------------
Topaz UpscaleAI image/video upscale 2x–4xRead topaz.md
BRIA RMBGProfessional background removalRead bria-rmbg.md
Sync LipsyncAudio-driven lip sync on videoRead sync-lipsync.md

Notes

  • No manual Python setup required. The script uses PEP 723 inline metadata. uv run automatically creates an isolated virtual environment and installs the fal-client dependency on first run.
  • fal.ai uses a queue system — the script polls until generation completes.
  • Video generation can take 30s–3min.
  • Use timestamps in filenames: yyyy-mm-dd-hh-mm-ss-name.ext.
  • Script prints MEDIA: line for OpenClaw to auto-attach.
  • Do not read generated media back; report the saved path only.

Support

Built by Corespeed. If you need help or run into issues:

版本历史

共 1 个版本

  • v1.0.0 当前
    2026-03-31 01:38 安全 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

office-efficiency

corespeed-slide

zypher-agent
使用 JSX/TSX 配合 Deno 生成专业的 PowerPoint (.pptx) 演示文稿,支持幻灯片、文本、形状、表格、图表(柱状图、折线图、饼图、环形图)、图片等元素。
★ 0 📥 607
design-media

UI/UX Pro Max

xobi667
提供 UI/UX 设计智能与实现指导,帮助打造精美界面。适用于 UI 设计、UX 流程、信息架构、视觉风格、设计系统/标记、组件规格、文案/微文案、无障碍及前端 UI(HTML/CSS/JS、React、Next.js、Vue、Svelte
★ 216 📥 46,599
design-media

Openai Whisper

steipete
使用 Whisper CLI 进行本地语音转文字(无需 API 密钥)
★ 329 📥 92,968