
MiniMax Tools

Direct MiniMax API integration for speech synthesis (TTS), voice cloning, image generation, video generation, and music generation using local Python scripts.
cytwyatt
Uncategorized · clawhub · v0.1.0 · 1 version · Key: required


Use this skill to call MiniMax multimodal APIs directly through local Python wrappers instead of relying on an external MCP server.

Overview

This skill currently supports:

  • Speech synthesis (TTS)
  • Voice cloning
  • Image generation
  • Video generation
  • Music generation

All wrappers are exposed through a single entrypoint script:

python3 scripts/minimax.py <subcommand> ...

Read references/api-notes.md only when you need endpoint details or parameter reminders.

Prerequisites

Expect these environment variables to be available before running the scripts:

  • MINIMAX_API_KEY

Optional:

  • MINIMAX_BASE_URL if you need to override the default API host

Python dependency:

  • requests
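
As a rough illustration of the prerequisites above, the sketch below reads the two environment variables and fails early when the key is missing. The function name `load_config` and the placeholder default host are assumptions made for this example; the skill's real default host is not stated here.

```python
import os

# Placeholder default host: an assumption for illustration only,
# not the host the skill's scripts actually use.
DEFAULT_BASE_URL = "https://api.example.invalid"


def load_config(env=None):
    """Return (api_key, base_url), raising if MINIMAX_API_KEY is missing."""
    env = os.environ if env is None else env
    api_key = env.get("MINIMAX_API_KEY", "").strip()
    if not api_key:
        raise RuntimeError("MINIMAX_API_KEY is not set")
    # MINIMAX_BASE_URL is optional and only overrides the default host
    base_url = env.get("MINIMAX_BASE_URL", "").strip() or DEFAULT_BASE_URL
    return api_key, base_url.rstrip("/")
```

Passing a plain dict as `env` keeps the check easy to exercise without touching the real environment.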

Routing guide

  • Use tts for speech synthesis
  • Use voice for uploading clone inputs, creating cloned voices, and optionally downloading preview audio
  • Use image for text-to-image or reference-image generation
  • Use video for text-to-video, image-to-video, or first/last-frame video workflows
  • Use music for song or instrumental generation
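
The routing above could be sketched with `argparse` subcommands as follows. The subcommand names and wrapper paths come from this document; how `scripts/minimax.py` actually dispatches is not shown here, so `route` is a hypothetical helper rather than the skill's implementation.

```python
import argparse

# Subcommand -> wrapper script, as listed under Resources.
WRAPPERS = {
    "tts": "scripts/minimax_tts.py",
    "voice": "scripts/minimax_voice.py",
    "image": "scripts/minimax_image.py",
    "video": "scripts/minimax_video.py",
    "music": "scripts/minimax_music.py",
}


def route(argv):
    """Split argv into (wrapper script path, remaining arguments)."""
    parser = argparse.ArgumentParser(prog="minimax.py")
    sub = parser.add_subparsers(dest="command", required=True)
    for name in WRAPPERS:
        sub.add_parser(name)
    # parse_known_args leaves wrapper-specific flags unparsed in `rest`
    args, rest = parser.parse_known_args(argv)
    return WRAPPERS[args.command], rest
```

An unknown subcommand makes `argparse` print a usage error and exit, which is usually the desired behavior for a CLI entrypoint.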

TTS defaults

  • Default model: speech-2.8-turbo
  • Default format: mp3
  • Default sample rate: 32000
  • Default bitrate: 128000
  • Default Chinese voice: Chinese (Mandarin)_Lyrical_Voice
  • Default English voice: English_Graceful_Lady
  • If --voice is omitted, the script picks the default voice for --voice-lang (zh or en); --voice-lang itself defaults to zh
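
The defaults above can be captured in a small settings object. This is a minimal sketch: the default values are the ones listed in this section, while `TTSSettings` and `resolve_tts_settings` are hypothetical names that do not necessarily exist in `scripts/minimax_tts.py`.

```python
from dataclasses import dataclass

# Default voice per language, taken from the list above.
DEFAULT_VOICES = {
    "zh": "Chinese (Mandarin)_Lyrical_Voice",
    "en": "English_Graceful_Lady",
}


@dataclass(frozen=True)
class TTSSettings:
    voice: str
    model: str = "speech-2.8-turbo"
    audio_format: str = "mp3"
    sample_rate: int = 32000
    bitrate: int = 128000


def resolve_tts_settings(voice=None, voice_lang="zh"):
    """Use an explicit voice if given, else the default for voice_lang."""
    if voice is None:
        if voice_lang not in DEFAULT_VOICES:
            raise ValueError("voice_lang must be 'zh' or 'en'")
        voice = DEFAULT_VOICES[voice_lang]
    return TTSSettings(voice=voice)
```

An explicit `voice` always wins, so a cloned `voice_id` can be passed straight through without touching `voice_lang`.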

Voice cloning notes

  • Clone source audio constraints:
      • mp3, m4a, or wav
      • 10 seconds to 5 minutes
      • <= 20 MB
  • Optional prompt audio constraints:
      • mp3, m4a, or wav
      • under 8 seconds
      • <= 20 MB
  • If cloning succeeds, the returned voice_id can be used immediately in TTS
  • According to the MiniMax documentation, a cloned voice is temporary unless it is used in a real TTS request within 7 days
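
A pre-upload check for the constraints above might look like the sketch below. It assumes the caller already knows the file size and duration (measuring duration needs an audio library, which is left out), and it reads "20 MB" as 20 MiB; `check_clone_audio` is a hypothetical helper, not part of the skill.

```python
from pathlib import Path

ALLOWED_EXTENSIONS = {".mp3", ".m4a", ".wav"}
MAX_BYTES = 20 * 1024 * 1024  # assumption: "20 MB" interpreted as 20 MiB


def check_clone_audio(filename, size_bytes, duration_s, kind="source"):
    """Return a list of constraint violations; an empty list means OK.

    kind is "source" (10 s to 5 min) or "prompt" (under 8 s).
    """
    if kind not in ("source", "prompt"):
        raise ValueError("kind must be 'source' or 'prompt'")
    problems = []
    if Path(filename).suffix.lower() not in ALLOWED_EXTENSIONS:
        problems.append("format must be mp3, m4a, or wav")
    if size_bytes > MAX_BYTES:
        problems.append("file must be <= 20 MB")
    if kind == "source" and not 10 <= duration_s <= 300:
        problems.append("source audio must be 10 seconds to 5 minutes")
    if kind == "prompt" and not duration_s < 8:
        problems.append("prompt audio must be under 8 seconds")
    return problems
```

Returning every violation at once, rather than raising on the first one, lets the caller report all problems with a file in a single pass.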

Video support

Supported modes:

  • text-to-video: video create
  • image-to-video: video i2v
  • first/last-frame video: video fl2v

Video creation is asynchronous. Use video query, video wait, and video download for task follow-up.
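
Because video jobs are asynchronous, a wait step typically polls with growing delays rather than in a tight loop. The sketch below shows one way to do that. The `query` callable and the status strings `"success"` and `"failed"` are assumptions for illustration; the real `video wait` subcommand and the MiniMax status values may differ.

```python
import time

# Hypothetical terminal states; check the API notes for the real values.
TERMINAL_STATES = {"success", "failed"}


def wait_for_task(query, task_id, *, initial_delay=5.0, max_delay=60.0,
                  timeout=900.0, sleep=time.sleep, clock=time.monotonic):
    """Poll query(task_id) with exponential backoff until a terminal state.

    Raises TimeoutError if the next sleep would pass the deadline.
    """
    delay = initial_delay
    deadline = clock() + timeout
    while True:
        status = query(task_id)
        if status in TERMINAL_STATES:
            return status
        if clock() + delay > deadline:
            raise TimeoutError(f"task {task_id} still '{status}' after {timeout}s")
        sleep(delay)
        # Double the wait each round, capped at max_delay
        delay = min(delay * 2, max_delay)
```

Injecting `sleep` and `clock` keeps the loop testable without real waiting.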

File handling rules

  • Prefer saving outputs locally and returning file paths
  • Local image inputs for image/video wrappers can be converted to Data URLs automatically
  • When MiniMax returns temporary files, prefer URL-based output and download the file immediately
  • Avoid tight polling loops for async video jobs
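
The Data URL conversion mentioned above can be done with the standard library alone. This is a minimal sketch of the idea, not the wrappers' actual code; `to_data_url` and `as_image_input` are hypothetical names.

```python
import base64
import mimetypes
from pathlib import Path


def to_data_url(path):
    """Encode a local image file as a base64 Data URL."""
    path = Path(path)
    mime, _ = mimetypes.guess_type(path.name)
    if mime is None or not mime.startswith("image/"):
        raise ValueError(f"not a recognized image type: {path.name}")
    encoded = base64.b64encode(path.read_bytes()).decode("ascii")
    return f"data:{mime};base64,{encoded}"


def as_image_input(value):
    """Pass remote URLs and Data URLs through; convert local paths."""
    if value.startswith(("http://", "https://", "data:")):
        return value
    return to_data_url(value)
```

Note that base64 encoding grows the payload by roughly a third, so large local images may be better served from a URL.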

Resources

  • scripts/minimax.py - unified CLI entrypoint
  • scripts/minimax_tts.py - TTS wrapper
  • scripts/minimax_voice.py - voice cloning wrapper
  • scripts/minimax_image.py - image generation wrapper
  • scripts/minimax_video.py - video generation wrapper
  • scripts/minimax_music.py - music generation wrapper
  • references/api-notes.md - focused API notes and constraints

Version history

1 version in total

  • v0.1.0 (current)
    2026-03-30 20:23, security scans: safe, safe

Security checks

  • Tencent Cloud Security (Keen): safe, no risk
  • Tencent Cloud Security (Sanbu): safe, no risk
