← 返回
未分类 Key

Voice For Openclaw Publish

MiniMax TTS skill (enhanced). Multi-agent voice support (each agent can select a unique voice written in SOUL.md), native voice message for Telegram (MP3) an...
MiniMax TTS 技能(增强版)。支持多代理语音(每个代理可选择 SOUL.md 中定义的不同音色),支持 Telegram 原生语音消息(MP3)等。
vshen009
未分类 clawhub v1.0.2 1 版本 100000 Key: 需要
★ 0
Stars
📥 423
下载
💾 0
安装
1
版本
#latest

概述

MiniMax TTS Plus

Multi-agent + multi-channel native voice message TTS skill.

Core Script

All operations go through tts-xiaoye.sh (TTS generation + channel delivery).

Quick Start

bash tts-xiaoye.sh "Text to speak"

Multi-Channel Usage

ChannelCommandFormatNotes
---------------------------------
Telegramtts-xiaoye.sh "Text"MP3Direct send, no transcoding
Feishutts-xiaoye.sh --feishu "Text"OGG/OpusAuto-transcode to native voice bubble
Generate onlytts-xiaoye.sh --generate-only "Text"MP3Generate file without sending

Send Feishu Native Voice Message (Full Flow)

OPUS=$(bash tts-xiaoye.sh --feishu "Feishu voice content" 2>/dev/null | python3 -c "import sys,json; print(json.load(sys.stdin)['audio_file'])")
openclaw message send --channel feishu --account <YOUR_ACCOUNT_ID> --target <FeishuUserID> --media "$OPUS"

Multi-Agent Voice Configuration

Each agent can choose a unique voice and write it into their SOUL.md Voice Identity section:

## Voice Identity
- TTS model: speech-2.8-hd
- TTS voice: Chinese (Mandarin)_Warm_Girl
- TTS script: scripts/tts-xiaoye.sh

Recommended voices (verified):

Voice IDStyleUse Case
---------------------------
Chinese (Mandarin)_Warm_GirlWarm GirlPersonal Assistant
female-shaonvSweet GirlDefault / General
female-tianmeiSweet FemaleGentle style
male-qn-qingseYouthful MaleMale voice scenario
Chinese (Mandarin)_Sweet_LadySweet LadyFormal场合

List Available Voices

python3 tts-xiaoye.sh --list-voices
# or directly:
python3 scripts/tts.py --list-voices

This calls the MiniMax API and prints all available voices organized by category (System Voices, Cloned Voices, Generated Voices).

Available Models

ModelCharacteristic
----------------------
speech-2.8-hdHighest quality (recommended)
speech-2.8-turboFaster, slightly lower quality

Full Parameters

tts-xiaoye.sh --text "Text" [--voice VoiceID] [--model Model] [--caption Caption]

Technical Notes

  • TTS outputs MP3 natively. Telegram sends directly via Bot API sendVoice (MP3 supported natively).
  • Feishu native voice messages require OGG/Opus format. FFmpeg handles conversion (~25ms per audio, negligible).
  • FFmpeg installation: brew install ffmpeg (Linuxbrew/macOS) or apt install ffmpeg (Linux).

Setup

  1. Copy setup.txt to .env and fill in your credentials:
  2. cp skills/voice-for-openclaw/setup.txt skills/voice-for-openclaw/.env
    # Then edit .env with your real values
    
  1. The script loads credentials from .env at runtime — no hardcoded tokens in scripts.

Required env vars:

VariableRequiredDescription
---------------------------------
MINIMAX_API_KEY✅ YesMiniMax API secret key (from platform.minimax.io)
TELEGRAM_BOT_TOKEN❌ NoTelegram bot token — only needed for sending
TELEGRAM_TARGET❌ NoTelegram chat ID — only needed together with bot token

> ⚠️ Security note: Credentials are loaded from .env only — no tokens are hardcoded in shell scripts. The .env file is gitignored and never published.

> 💡 API endpoint: The TTS API uses https://api.minimaxi.com (MiniMax's official API server), which is separate from the developer portal at platform.minimax.io.


MiniMax TTS Plus(多语言增强版)

多 Agent + 多渠道原生语音条增强版 TTS 技能。

核心脚本

所有操作通过 tts-xiaoye.sh 完成(TTS 生成 + 渠道发送)。

快速使用

bash tts-xiaoye.sh "要转语音的文字"

多渠道用法

渠道命令格式说明
------------------------
Telegramtts-xiaoye.sh "文字"MP3直接发送语音条,无需转码
飞书tts-xiaoye.sh --feishu "文字"OGG/Opus自动转码,发原生语音条
仅生成tts-xiaoye.sh --generate-only "文字"MP3只生成文件,不发送

发送飞书原生语音条(完整流程)

OPUS=$(bash tts-xiaoye.sh --feishu "飞书语音内容" 2>/dev/null | python3 -c "import sys,json; print(json.load(sys.stdin)['audio_file'])")
openclaw message send --channel feishu --account <YOUR_ACCOUNT_ID> --target <飞书用户ID> --media "$OPUS"

多 Agent 音色配置

每个 Agent 可以选择不同音色,写入各自的 SOUL.md 的 Voice Identity 节即可:

## Voice Identity
- TTS model: speech-2.8-hd
- TTS voice: Chinese (Mandarin)_Warm_Girl
- TTS script: scripts/tts-xiaoye.sh

推荐音色(已验证):

音色ID风格适用场景
-----------------------
Chinese (Mandarin)_Warm_Girl温暖少女个人助理
female-shaonv甜美少女默认/通用
female-tianmei甜美女性温柔风格
male-qn-qingse青涩青年男男声场景
Chinese (Mandarin)_Sweet_Lady甜美女声正式场合

可用模型

模型特点
------------
speech-2.8-hd最高质量(推荐)
speech-2.8-turbo快速,质量略低

完整参数

tts-xiaoye.sh --text "文字" [--voice 音色ID] [--model 模型] [--caption 文字]

技术说明

  • TTS 原生输出 MP3,Telegram 直接发送(Bot API sendVoice 支持 MP3)
  • 飞书原生语音条需要 OGG/Opus 格式,通过 FFmpeg 转换(耗时约 25ms/音频,可忽略)
  • FFmpeg 安装方式:
  • macOS/Linuxbrew: brew install ffmpeg
  • Ubuntu/Debian: apt install ffmpeg

版本历史

共 1 个版本

  • v1.0.2 当前
    2026-05-03 07:50 安全 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

ai-intelligence

self-improving agent

pskoett
捕获经验教训、错误及修正内容,以实现持续改进。适用于以下场景:(1)命令或操作意外失败;(2)用户纠正Claude(如“不,那不对……”“实际上……”);(3)用户请求的功能不存在;(4)外部API或工具出现故障;(5)Claude发现自身
★ 4,064 📥 801,632
security-compliance

Skill Vetter

spclaudehome
AI智能体技能安全预审工具。安装ClawdHub、GitHub等来源技能前,检查风险信号、权限范围及可疑模式。
★ 1,219 📥 266,990
ai-intelligence

Self-Improving + Proactive Agent

ivangdavila
自我反思+自我批评+自我学习+自组织记忆。智能体评估自身工作、发现错误并持续改进。
★ 1,363 📥 319,212