Zhipu AI TTS

Text-to-speech conversion using Zhipu AI (BigModel) GLM-TTS model. Use when you need to convert text to audio files with various voice options. Supports Chin...
Uses Zhipu AI's GLM-TTS model for text-to-speech, with multiple voices, and converts text into audio files.
franklu0819-lang
Category: data analysis · clawhub v1.0.0 · 1 version · API key: required

Overview

Zhipu AI Text-to-Speech

Convert Chinese text to natural-sounding speech using Zhipu AI's GLM-TTS model.

Setup

1. Get your API Key:

Get a key from Zhipu AI Console

2. Set it in your environment:

export ZHIPU_API_KEY="your-key-here"

Available Voices

System Voices (Pre-built)

  • tongtong (彤彤) - Default voice, balanced tone
  • chuichui (锤锤) - Male voice, deeper tone
  • xiaochen (小陈) - Young professional voice
  • jam - Jam, a character voice from 动动动物圈
  • kazi - Kazi, a character voice from 动动动物圈
  • douji - Douji, a character voice from 动动动物圈
  • luodo - Luodo, a character voice from 动动动物圈

Usage

Basic Text-to-Speech

Convert text to speech with default settings (tongtong voice, normal speed, WAV format):

bash scripts/text_to_speech.sh "你好,今天天气怎么样"

Advanced Options

Specify voice, speed, format, and output filename:

bash scripts/text_to_speech.sh "欢迎使用智能语音服务" xiaochen 1.2 wav greeting.wav

Parameters:

  • text (required): Chinese text to convert (max 1024 characters)
  • voice (optional): tongtong (default), chuichui, xiaochen, jam, kazi, douji, luodo
  • speed (optional): Speech speed from 0.5 to 2.0 (default: 1.0)
  • output_format (optional): wav (default), pcm
  • output_file (optional): Output filename (default: output.{format})
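
The constraints listed above can be checked before a request is sent. The following bash sketch is not part of the skill: `validate_tts_args` is a hypothetical helper name, and the allowed voices, speed range, and formats are simply copied from the parameter list.

```shell
# Hypothetical pre-flight check (not shipped with the skill).
# usage: validate_tts_args [VOICE] [SPEED] [FORMAT]
validate_tts_args() {
  local voice="${1:-tongtong}" speed="${2:-1.0}" format="${3:-wav}"

  case "$voice" in
    tongtong|chuichui|xiaochen|jam|kazi|douji|luodo) ;;
    *) echo "unknown voice: $voice" >&2; return 1 ;;
  esac

  # Speed must be a plain decimal number such as 1 or 1.25.
  if ! [[ "$speed" =~ ^[0-9]+(\.[0-9]+)?$ ]]; then
    echo "speed is not a number: $speed" >&2; return 1
  fi
  # awk does the floating-point comparison that bash arithmetic lacks.
  if ! awk -v s="$speed" 'BEGIN { exit !(s >= 0.5 && s <= 2.0) }'; then
    echo "speed out of range (0.5-2.0): $speed" >&2; return 1
  fi

  case "$format" in
    wav|pcm) ;;
    *) echo "unknown format: $format" >&2; return 1 ;;
  esac
}

# Example:
#   validate_tts_args xiaochen 1.2 wav &&
#     bash scripts/text_to_speech.sh "欢迎使用智能语音服务" xiaochen 1.2 wav greeting.wav
```

The script itself may already perform similar checks; this wrapper only fails early with a readable message.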

Voice Selection Guide

Choose tongtong (default) for:

  • General purpose narration
  • Professional presentations
  • Balanced tone requirements

Choose chuichui for:

  • Male voice needed
  • Deeper, authoritative tone
  • Documentary or formal content

Choose xiaochen for:

  • Young, energetic tone
  • Modern, casual content
  • Friendly assistant vibe

Choose jam/kazi/douji/luodo for:

  • Entertainment content
  • Character voices
  • Creative projects

Speed Control

Recommended speeds:

  • 0.8-1.0: Clear, professional narration
  • 1.0-1.2: Natural conversational pace (default: 1.0)
  • 1.2-1.5: Energetic, upbeat delivery
  • 1.5-2.0: Fast-paced summaries (may reduce clarity)

Output Formats

WAV (recommended):

  • Standard audio format
  • Widely compatible
  • Better quality preservation

PCM:

  • Raw audio format
  • Slightly smaller file (no container header)
  • Requires additional processing for playback
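
One form of the "additional processing" is to prepend a standard 44-byte WAV header to the raw samples. The bash sketch below does this without external tools. `le_bytes` and `pcm_to_wav` are hypothetical helper names. The 24000 Hz default matches the sample rate stated in this document, but the mono, 16-bit defaults are assumptions that should be checked against the actual output.

```shell
# Write VALUE as NBYTES little-endian bytes to stdout.
le_bytes() {  # usage: le_bytes VALUE NBYTES
  local value=$1 n=$2 i
  for (( i = 0; i < n; i++ )); do
    printf "\\x$(printf '%02x' $(( (value >> (8 * i)) & 0xff )))"
  done
}

# Wrap raw PCM in a canonical 44-byte RIFF/WAVE header.
# ASSUMPTION: mono, 16-bit samples; only the 24000 Hz rate is documented.
pcm_to_wav() {  # usage: pcm_to_wav SRC.pcm DST.wav [RATE] [CHANNELS] [BITS]
  local src="$1" dst="$2" rate="${3:-24000}" channels="${4:-1}" bits="${5:-16}"
  local data_size block_align byte_rate
  data_size=$(wc -c < "$src")
  data_size=$(( data_size ))              # normalizes padded wc output
  block_align=$(( channels * bits / 8 ))  # bytes per sample frame
  byte_rate=$(( rate * block_align ))     # bytes per second
  {
    printf 'RIFF'; le_bytes $(( 36 + data_size )) 4; printf 'WAVE'
    printf 'fmt '; le_bytes 16 4          # size of the fmt chunk body
    le_bytes 1 2                          # audio format 1 = linear PCM
    le_bytes "$channels" 2
    le_bytes "$rate" 4
    le_bytes "$byte_rate" 4
    le_bytes "$block_align" 2
    le_bytes "$bits" 2
    printf 'data'; le_bytes "$data_size" 4
    cat "$src"
  } > "$dst"
}

# Example: pcm_to_wav output.pcm output.wav
```

If the result plays as noise or at the wrong pitch, the assumed sample width or channel count is probably wrong.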

Examples

Create a professional greeting:

bash scripts/text_to_speech.sh "您好,感谢致电智能客服,请按1选择中文服务" tongtong 1.0 wav greeting.wav

Generate an energetic announcement:

bash scripts/text_to_speech.sh "热烈欢迎各位嘉宾参加今天的活动!" xiaochen 1.3 wav announcement.wav

Create a calm narration:

bash scripts/text_to_speech.sh "在这个宁静的夜晚,让我们一起欣赏美丽的星空" chuichui 0.9 wav narration.wav

Character Limits

  • Maximum input: 1024 characters per request
  • For longer texts, split into multiple segments
  • Combine audio files post-generation
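
Splitting can be scripted. The sketch below cuts a string into fixed-size chunks, one per output line. `split_text` is a hypothetical helper, and it assumes a UTF-8 locale so that bash counts characters rather than bytes. It cuts at a fixed offset, so a chunk may end mid-sentence; splitting at punctuation would sound more natural but is left out to keep the example short.

```shell
# Print TEXT in chunks of at most MAX characters, one chunk per line.
# ASSUMPTION: a UTF-8 locale, so ${#text} and ${text:o:n} count characters.
# Text that already contains newlines needs a different delimiter.
split_text() {  # usage: split_text TEXT [MAX]
  local text="$1" max="${2:-1024}" offset=0
  while (( offset < ${#text} )); do
    printf '%s\n' "${text:offset:max}"
    offset=$(( offset + max ))
  done
}

# Example: one request per chunk, numbered output files.
#   i=0
#   while IFS= read -r chunk; do
#     i=$(( i + 1 ))
#     bash scripts/text_to_speech.sh "$chunk" tongtong 1.0 wav "part_$i.wav"
#   done < <(split_text "$long_text")
```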

Audio Quality Tips

Best practices:

  • Use punctuation for natural pauses (commas, periods)
  • Break long sentences into shorter segments
  • Use appropriate line breaks for paragraph pauses
  • Test speed settings for your specific content

Sample rate: generated audio uses a 24000 Hz sampling rate.

Troubleshooting

Text Length Issues:

  • Split texts longer than 1024 characters
  • Process segments separately
  • Combine using audio editing tools
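
As an alternative to an audio editor, segments requested in PCM format can be joined by plain concatenation, because raw PCM has no header. The following bash sketch rests on several assumptions: `synthesize_segments` and the `TTS_CMD`, `VOICE`, and `SPEED` variables are hypothetical names, and the script is assumed to accept a full path as its output_file argument.

```shell
# Synthesize each SEGMENT as raw PCM and append the results to OUT.pcm.
# ASSUMPTION: the script writes to the exact path given as output_file.
synthesize_segments() {  # usage: synthesize_segments OUT.pcm SEGMENT...
  local out="$1"; shift
  local tts_cmd="${TTS_CMD:-bash scripts/text_to_speech.sh}"
  local tmpdir segment i=0
  tmpdir="$(mktemp -d)"
  : > "$out"                                   # start with an empty file
  for segment in "$@"; do
    i=$(( i + 1 ))
    # $tts_cmd is left unquoted on purpose so it splits into command + args.
    $tts_cmd "$segment" "${VOICE:-tongtong}" "${SPEED:-1.0}" pcm \
      "$tmpdir/part_$i.pcm" || { rm -rf "$tmpdir"; return 1; }
    cat "$tmpdir/part_$i.pcm" >> "$out"        # headerless, so append as-is
  done
  rm -rf "$tmpdir"
}

# Example:
#   synthesize_segments full.pcm "第一段文字。" "第二段文字。"
```

The combined file is still raw PCM and needs conversion before most players will open it. All segments should use the same voice and speed so the joins are not audible.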

Audio Quality Issues:

  • Check text encoding (use UTF-8)
  • Verify punctuation placement
  • Adjust speed settings
  • Try different voices

File Playback Issues:

  • Ensure format compatibility with your player
  • WAV format works on most systems
  • PCM may require conversion

API Notes

  • Responses are returned as audio files
  • Watermarking enabled by default (can be disabled in account settings)
  • No strict rate limiting documented
  • Audio generation typically completes in 1-3 seconds

Version History

1 version in total

  • v1.0.0 (current)
    2026-03-29 07:27, security status: safe

Security Checks

Tencent Cloud Security (Keen): safe, no risk

Tencent Cloud Security (Sanbu): safe, no risk
