← 返回
未分类 Key 中文

Xiaomi MiMo-V2-TTS

Converts text to speech using Xiaomi MiMo-V2-TTS with support for emotional styles, Chinese dialects, role voices, and singing synthesis.
使用小米MiMo-V2-TTS将文本转为语音,支持情感风格、汉语方言、角色语音和歌唱合成。
ddpie ddpie 来源
未分类 clawhub v1.0.0 1 版本 99738.2 Key: 需要
★ 0
Stars
📥 381
下载
💾 1
安装
1
版本
#latest

概述

Xiaomi MiMo-V2-TTS Skill

Text-to-speech using Xiaomi's MiMo-V2-TTS model. Supports emotional style control, Chinese dialects (Northeastern/Sichuan/Cantonese/Taiwanese), role-playing voices, and singing synthesis.

When to Use

  • User asks to convert text to speech / audio
  • User mentions "read aloud", "TTS", "voice synthesis", "narrate"
  • User wants specific voice styles, emotions, or dialects

API Details

  • Platform: https://platform.xiaomimimo.com
  • Base URL: https://api.xiaomimimo.com/v1
  • Endpoint: /v1/chat/completions (NOT /audio/speech)
  • Model: mimo-v2-tts
  • Auth: Bearer Token via MIMO_API_KEY env var

Important: API Format

MiMo TTS uses the Chat Completions endpoint with special requirements:

  • ❌ No system role allowed (returns error)
  • ✅ Must include assistant role message (the text to synthesize)
  • user message = style/voice instructions
  • assistant message = text to be spoken
  • Response: choices[0].message.audio.data contains base64-encoded audio

Usage

python3 <skill_dir>/scripts/mimo_tts.py \
  --text "Hello, world!" \
  --output /tmp/openclaw/tts_output.mp3 \
  [--style "cheerful tone"] \
  [--speed 1.0] \
  [--format mp3] \
  [--api-key YOUR_KEY]

Set MIMO_API_KEY environment variable or pass --api-key.

Parameters

ParameterRequiredDescription
----------------------------------
--textText to synthesize (recommended < 5000 chars)
--outputOutput audio file path
--styleNatural language style description
--speedSpeech rate 0.5–2.0 (default 1.0)
--formatmp3/wav/pcm/opus/flac (default mp3)
--api-keyAPI Key (overrides env var)

Style Control Examples

  • Dialects: --style "speak in Cantonese" / "Sichuan dialect" / "Taiwanese accent"
  • Emotions: --style "happy and excited" / "sad and gentle" / "start happy then turn melancholic"
  • Characters: --style "news anchor" / "gentle older sister"
  • Singing: --style "sing it"
  • Combined: --style "Northeastern dialect, enthusiastic and bold"

Notes

  • Pricing: Free during launch period (March 2026), may charge later
  • Supports Chinese and English text
  • Best results with Chinese text and style descriptions in Chinese
  • Get API key at https://platform.xiaomimimo.com

版本历史

共 1 个版本

  • v1.0.0 当前
    2026-05-03 08:45 安全 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

ai-agent

Batch File Processor

ddpie
利用子代理(总结、分析、提取、转换)对大型文件集进行并行批处理。在对多个文件执行相同操作时使用。
★ 0 📥 618
design-media

Nano Banana Pro

steipete
使用 Nano Banana Pro (Gemini 3 Pro Image) 生成或编辑图像。支持文生图、图生图及 1K/2K/4K 分辨率,适用于图像创建、修改及编辑请求,使用 --input-image 指定输入图像。
★ 424 📥 116,013
design-media

UI/UX Pro Max

xobi667
提供 UI/UX 设计智能与实现指导,帮助打造精美界面。适用于 UI 设计、UX 流程、信息架构、视觉风格、设计系统/标记、组件规格、文案/微文案、无障碍及前端 UI(HTML/CSS/JS、React、Next.js、Vue、Svelte
★ 216 📥 46,295