← 返回
未分类 中文

Microsoft Edge TTS

Use Microsoft Edge online TTS service to convert text to speech. Supports command line and module invocation, no API key required.
使用微软 Edge 在线 TTS 服务将文本转换为语音,支持命令行和模块调用,无需 API 密钥。
openlark openlark 来源
未分类 clawhub v1.0.0 1 版本 100000 Key: 无需
★ 0
Stars
📥 327
下载
💾 0
安装
1
版本
#latest

概述

Microsoft Edge TTS

Use Microsoft Edge's online TTS service to convert text to speech without requiring an API key. Use this skill when users need to convert text to speech, generate audio files, or read content aloud.

Trigger Conditions

Trigger this skill when the user mentions any of the following keywords:

  • TTS
  • Speech synthesis
  • Text-to-speech
  • text-to-speech
  • Read aloud
  • edge-tts

Quick Start

Command Line Usage

# Basic usage
npx node-edge-tts -t 'Hello World'

# Specify output file
npx node-edge-tts -t 'Hello World' -f './output.mp3'

# Specify voice and language
npx node-edge-tts -t 'Hello world' -v 'en-US-AriaNeural' -l 'en-US'

# Adjust speaking rate and pitch
npx node-edge-tts -t 'Hello World' -r '+10%' --pitch '-5%'

# Generate subtitle file
npx node-edge-tts -t 'Hello World' -s

Module Invocation

const { EdgeTTS } = require('node-edge-tts')
// or
import { EdgeTTS } from 'node-edge-tts'

const tts = new EdgeTTS()
await tts.ttsPromise('Hello World', './output.mp3')

Full Parameters

ParameterShortDescriptionDefault
--------------------------
--text-tText to convert (required)-
--filepath-fOutput file path./output.mp3
--voice-vVoice namezh-CN-XiaoyiNeural
--lang-lLanguage codezh-CN
--outputFormat-oOutput formataudio-24khz-48kbitrate-mono-mp3
--rate-rSpeaking ratedefault
--pitchPitchdefault
--volumeVolumedefault
--saveSubtitles-sSave subtitlesfalse
--proxy-pProxy settings-
--timeoutTimeout (ms)10000

Advanced Configuration

const tts = new EdgeTTS({
  voice: 'zh-CN-XiaoxiaoNeural',
  lang: 'zh-CN',
  outputFormat: 'audio-24khz-96kbitrate-mono-mp3',
  saveSubtitles: true,
  proxy: 'http://localhost:7890',
  pitch: '-10%',
  rate: '+10%',
  volume: '-50%',
  timeout: 10000
})

await tts.ttsPromise('Text to convert', './output.mp3')

Available Voices

  • Chinese: zh-CN-XiaoyiNeural, zh-CN-XiaoxiaoNeural, zh-CN-YunjianNeural, zh-CN-YunxiNeural, zh-CN-YunxiaNeural
  • English: en-US-AriaNeural, en-US-GuyNeural, en-US-JennyNeural
  • Japanese: ja-JP-KeitaNeural, ja-JP-NanamiNeural
  • More: Refer to Microsoft Voice Support Documentation

Subtitle Format

Enabling -s generates a .json subtitle file with the same name:

[
  { "part": "Hello", "start": 100, "end": 500 },
  { "part": "World", "start": 500, "end": 900 }
]

Time units are in milliseconds, part is the text segment.

Common Scenarios

1. Quick Speech Generation

npx node-edge-tts -t 'Welcome to speech synthesis'

2. Long Text Segmentation

For very long texts, it is recommended to process in segments and then merge.

3. Multilingual Mixed

# Chinese
npx node-edge-tts -t 'Hello World' -v 'zh-CN-XiaoxiaoNeural'

# English
npx node-edge-tts -t 'Hello World' -v 'en-US-AriaNeural'

Important Notes

  1. No API Key Required: Directly uses Microsoft Edge's free online service
  2. Network Dependent: Requires internet connection
  3. Rate Limiting: Frequent calls may be restricted; it is recommended to control call frequency appropriately
  4. Proxy Support: If encountering network issues, set a proxy via the -p parameter

版本历史

共 1 个版本

  • v1.0.0 当前
    2026-05-07 14:36 安全 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

design-media

Openai Whisper

steipete
使用 Whisper CLI 进行本地语音转文字(无需 API 密钥)
★ 331 📥 93,889
content-creation

Toutiao Graphic Publisher

openlark
通过浏览器自动化在头条发布图文内容,支持智能排版、自动生成热门标签等功能。
★ 2 📥 1,011
design-media

Nano Banana Pro

steipete
使用 Nano Banana Pro (Gemini 3 Pro Image) 生成或编辑图像。支持文生图、图生图及 1K/2K/4K 分辨率,适用于图像创建、修改及编辑请求,使用 --input-image 指定输入图像。
★ 430 📥 117,015