← 返回
未分类 中文

Jarvis Vocal

Authentic J.A.R.V.I.S. voice synthesis using Piper TTS with HuggingFace-trained model. Generates movie-accurate voice locally and can push to connected Andro...
使用 Piper TTS 与HuggingFace 训练的模型实现真实的 J.A.R.V.I.S. 语音合成,能够在本地生成电影级精准语音,并推送到已连接的 Android 设备。
kishen35 kishen35 来源
未分类 clawhub v1.0.0 1 版本 100000 Key: 无需
★ 0
Stars
📥 436
下载
💾 0
安装
1
版本
#jarvis#latest#piper#tts#voice

概述

jarvis-vocal

Uses the authentic J.A.R.V.I.S. voice model from HuggingFace (trained on actual movie lines) via Piper TTS. No audio effects needed — the voice is naturally cinematic and British.

> Credit: Voice model by jgkawell — see the discussion for details on training and samples.

Usage

Generate a WAV file:

{baseDir}/bin/jarvis-tts "Text to speak" ./output.wav

Stream directly to an Android device (if ADB connected):

{baseDir}/bin/jarvis-tts "Text to speak" - | adb push - /sdcard/Download/temp.wav

Installation

Prerequisites

pipx install piper-tts
sudo apt install ffmpeg  # or equivalent

Install Voice Model

# Create voice directory
mkdir -p ~/.local/share/piper/voices/en_GB

# Download models via HuggingFace CLI
cd ~/.local/share/piper/voices/en_GB
hf download jgkawell/jarvis en/en_GB/jarvis/high/jarvis-high.onnx --local-dir .
hf download jgkawell/jarvis en/en_GB/jarvis/high/jarvis-high.onnx.json --local-dir .
# Optional: medium quality model
hf download jgkawell/jarvis en/en_GB/jarvis/medium/jarvis-medium.onnx --local-dir .
hf download jgkawell/jarvis en/en_GB/jarvis/medium/jarvis-medium.onnx.json --local-dir .

Integration

Works with OpenClaw Android nodes via ADB over Tailscale. Use jarvis-speak wrapper for one-command push+play:

jarvis-speak "Systems at your service, Sir."

Or use streaming mode (faster, ephemeral):

jarvis-speak "Message" --stream

Configuration

SettingDefaultDescription
-------------------------------
Modeljarvis-highVoice quality: high (114MB) or medium (63MB)
Speed1.0 (native)Piper length-scale — adjust for faster/slower speech
Volume1.0Post-processing volume boost

Edit jarvis-speak script to change defaults.

Troubleshooting

"Model not found" → Download models to ~/.local/share/piper/voices/en_GB/jarvis-*

ADB connection refused → Ensure phone's ADB over WiFi is enabled and paired with laptop (port 5555)

Audio doesn't play → Check Android receives the file at /sdcard/Download/jarvis-current.wav and has a WAV-capable media player

License

MIT — The voice model is MIT licensed by jgkawell.

Credits

  • Voice model: jgkawell/jarvis on HuggingFace — trained on Marvel movie lines
  • TTS engine: Piper by Rhasspy
  • Integration: OpenClaw by Aidan Park

版本历史

共 1 个版本

  • v1.0.0 当前
    2026-05-03 08:54 安全 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

design-media

Nano Banana Pro

steipete
使用 Nano Banana Pro (Gemini 3 Pro Image) 生成或编辑图像。支持文生图、图生图及 1K/2K/4K 分辨率,适用于图像创建、修改及编辑请求,使用 --input-image 指定输入图像。
★ 430 📥 117,266
design-media

Openai Whisper

steipete
使用 Whisper CLI 进行本地语音转文字(无需 API 密钥)
★ 332 📥 94,152
design-media

Video Frames

steipete
使用 ffmpeg 从视频中提取帧或短片。
★ 134 📥 53,005