Voice Memo

Send native iMessage voice bubbles with ElevenLabs TTS via BlueBubbles. Use when: user asks to send a voice message, wants something spoken aloud, storytelli...
amzzzzzzz
Productivity · clawhub · v0.1.2 · 1 version · API key: required
★ 0 stars · 📥 804 downloads · 💾 8 installs · #latest

Overview

Send native iMessage voice bubbles (not file attachments) using ElevenLabs TTS and BlueBubbles.

Quick Start

Run the script with text and recipient:

scripts/send-voice-memo.sh "Your message here" +14169060839

This will:

  1. Generate TTS audio via ElevenLabs (Rachel voice by default)
  2. Convert to Opus CAF @ 24kHz (iMessage native format)
  3. Send as native voice bubble via BlueBubbles
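
Step 2 is the part that depends on macOS tooling. A minimal sketch of the conversion, assuming `afconvert` on your macOS version accepts the `caff` container with an `opus@24000` data format (check `afconvert -h`). The function name and the mono `-c 1` setting are illustrative choices and are not taken from the skill's script:

```bash
# Convert TTS output (for example an MP3) to Opus in a CAF container at 24 kHz.
# Assumption: these afconvert flags are supported on your macOS version.
to_voice_caf() {
  in_file="${1:-}"
  out_file="${2:-}"
  [ -n "$in_file" ] && [ -n "$out_file" ] || {
    echo "usage: to_voice_caf INPUT OUTPUT.caf" >&2
    return 2
  }
  # -f caff        CAF container
  # -d opus@24000  Opus codec at a 24 kHz sample rate
  # -c 1           mono (illustrative choice)
  afconvert -f caff -d opus@24000 -c 1 "$in_file" "$out_file"
}
```

Usage: `to_voice_caf speech.mp3 voice.caf`, then pass `voice.caf` to the send step.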

Requirements

  • BlueBubbles running locally with Private API enabled
  • ElevenLabs API key (for TTS)
  • macOS (for afconvert audio conversion)
  • Environment variables in ~/.openclaw/.env:

```bash
ELEVENLABS_API_KEY=your-key-here
BLUEBUBBLES_PASSWORD=your-password-here

# Optional overrides:
ELEVENLABS_VOICE_ID=21m00Tcm4TlvDq8ikWAM # Rachel (default)
ELEVENLABS_MODEL_ID=eleven_turbo_v2_5 # Turbo v2.5 (default)
```
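
To fail fast on a missing key, a wrapper could load and validate this file before doing any work. The sketch below assumes the file is plain `KEY=value` shell syntax that can be sourced; `load_env` is a hypothetical helper name, and whether the skill's own script loads the file this way is not confirmed by this page:

```bash
# Load the env file, require the two mandatory keys, and apply the
# documented defaults for the optional overrides.
load_env() {
  env_file="${1:-$HOME/.openclaw/.env}"
  [ -f "$env_file" ] || { echo "missing $env_file" >&2; return 1; }

  set -a            # export every variable the file assigns
  . "$env_file"
  set +a

  for name in ELEVENLABS_API_KEY BLUEBUBBLES_PASSWORD; do
    eval "value=\${$name:-}"
    [ -n "$value" ] || { echo "$name is not set in $env_file" >&2; return 1; }
  done

  # Fall back to the defaults listed above when no override is given.
  : "${ELEVENLABS_VOICE_ID:=21m00Tcm4TlvDq8ikWAM}"
  : "${ELEVENLABS_MODEL_ID:=eleven_turbo_v2_5}"
  export ELEVENLABS_VOICE_ID ELEVENLABS_MODEL_ID
}
```

Calling `load_env || exit 1` at the top of a script stops it before any API request is made with an empty key.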

The Working Formula

Critical parameters discovered 2026-02-19:

| Parameter | Value | Why |
|-----------|-------|-----|
| chatGuid | any;-;+PHONE | NOT iMessage;-; (causes timeouts) |
| method | private-api | Required for native bubble |
| isAudioMessage | true | Required |
| Audio format | Opus @ 24kHz in CAF | iMessage native format |
| Pre-convert | Yes | Don't let BlueBubbles convert (wrong codec) |
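
As a rough illustration of how these parameters fit into one request, here is a sketch using `curl`. The endpoint path `/api/v1/message/attachment`, the `password` query parameter, the `tempGuid` and `name` fields, and the `http://localhost:1234` address are assumptions about the BlueBubbles REST API that should be checked against your server. `BLUEBUBBLES_URL`, `chat_guid`, and `send_voice_bubble` are hypothetical names:

```bash
# Build the chatGuid in the any;-;+PHONE form from the table.
chat_guid() {
  printf 'any;-;%s' "$1"
}

# Send a pre-converted CAF file as a native voice bubble.
send_voice_bubble() {
  phone="$1"
  caf_file="$2"
  # Hypothetical override variable; the default address is an assumption.
  base_url="${BLUEBUBBLES_URL:-http://localhost:1234}"
  # --form-string sends each value literally. With plain -F, curl would
  # treat the ';' characters in the chatGuid as option separators.
  # The password is placed in the URL unencoded, so it must be URL-safe.
  curl -sS --fail --max-time 60 \
    --form-string "chatGuid=$(chat_guid "$phone")" \
    --form-string "tempGuid=voice-memo-$(date +%s)" \
    --form-string "name=voice-memo.caf" \
    --form-string "method=private-api" \
    --form-string "isAudioMessage=true" \
    -F "attachment=@$caf_file" \
    "$base_url/api/v1/message/attachment?password=$BLUEBUBBLES_PASSWORD"
}
```

The file passed in should already be Opus @ 24kHz in CAF, so that BlueBubbles has nothing to convert.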

Voice Options

Default voice: Rachel (ElevenLabs)

  • Voice ID: 21m00Tcm4TlvDq8ikWAM
  • Model: eleven_turbo_v2_5 (fast, natural)
  • Cost: ~$0.04 per 30s message

Expressive tags:

  • [laughs] — natural laughter
  • [sighs] — expressive sigh
  • [excited] — energetic delivery

Example: "[excited] Oh my god, it worked!"

For full voice list and IDs, see VOICES.md.
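
For orientation, a TTS request with these settings might look like the sketch below. It assumes the ElevenLabs endpoint `POST /v1/text-to-speech/{voice_id}` with an `xi-api-key` header and a JSON body carrying `text` and `model_id`. The helper names are hypothetical, and the escaping covers only backslashes and double quotes in single-line text (use a JSON tool such as `jq` for arbitrary input). Expressive tags travel as ordinary text in the `text` field; how they are rendered depends on the model:

```bash
# Escape backslashes and double quotes for use inside a JSON string.
json_escape() {
  printf '%s' "$1" | sed 's/\\/\\\\/g; s/"/\\"/g'
}

# Build the JSON request body from text and model ID.
tts_body() {
  printf '{"text":"%s","model_id":"%s"}' "$(json_escape "$1")" "$2"
}

# Request speech for the given text and write the audio to out_file.
synthesize_speech() {
  text="$1"
  out_file="$2"
  voice_id="${ELEVENLABS_VOICE_ID:-21m00Tcm4TlvDq8ikWAM}"
  model_id="${ELEVENLABS_MODEL_ID:-eleven_turbo_v2_5}"
  curl -sS --fail \
    -H "xi-api-key: $ELEVENLABS_API_KEY" \
    -H "Content-Type: application/json" \
    -d "$(tts_body "$text" "$model_id")" \
    -o "$out_file" \
    "https://api.elevenlabs.io/v1/text-to-speech/$voice_id"
}
```

Usage: `synthesize_speech "[excited] Oh my god, it worked!" speech.mp3`.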

Bidirectional Voice Memos

Sending (Amz → Amy):

Use this skill. Native voice bubbles appear with waveform UI.

Receiving (Amy → Amz):

BlueBubbles auto-converts incoming voice memos to MP3. OpenClaw transcribes via Whisper. Transcribed text flows into conversation context automatically.

Memory note: Incoming voice memo transcriptions flow into conversation context like any text message. They are NOT automatically persisted to memory or files — the agent must explicitly choose to store them, same as any conversation content. If you want to prevent transcriptions from being retained, instruct the agent not to record voice memo content in memory.

Troubleshooting

Voice bubble arrives as file attachment:

  • Check method=private-api is set
  • Verify chatGuid uses any;-; prefix (not iMessage;-;)
  • Check response has "isAudioMessage": true
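
The last check can be scripted. A small sketch, assuming the send response is JSON on stdin and that a plain text match is good enough (it does not parse the JSON, and the function name is hypothetical):

```bash
# Succeed only if the response text contains "isAudioMessage": true,
# allowing optional whitespace around the colon.
response_is_audio_message() {
  grep -Eq '"isAudioMessage"[[:space:]]*:[[:space:]]*true'
}
```

Usage: pipe the saved response through it, for example `response_is_audio_message < response.json || echo "sent as a regular attachment" >&2`.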

API times out:

  • Use any;-;+PHONE format for chatGuid
  • Verify BlueBubbles Private API is enabled
  • Restart BlueBubbles if consistently slow

Audio is 0 seconds / unplayable:

  • Ensure pre-conversion to Opus @ 24kHz
  • Don't let BlueBubbles convert (uses wrong codec)
  • Verify with: afinfo output.caf (should show opus @ 24000 Hz)
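
The `afinfo` check can also be automated. This sketch reads `afinfo` output on stdin and looks for the two markers named above. The exact layout of `afinfo` output varies, so it matches only the substrings `opus` and `24000 Hz`; the function name is hypothetical:

```bash
# Succeed only if the afinfo text on stdin mentions both the Opus codec
# and a 24000 Hz sample rate.
is_voice_caf() {
  info="$(cat)"
  printf '%s\n' "$info" | grep -qi "opus" || return 1
  printf '%s\n' "$info" | grep -q "24000 Hz" || return 1
}
```

Usage: `afinfo output.caf | is_voice_caf || echo "re-run the conversion" >&2`.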

Version history

1 version in total

  • v0.1.2 (current)
    2026-03-29 18:32, marked safe

Security checks

Tencent Cloud Security (Keen): safe, no risk

Tencent Cloud Security (Sanbu): safe, no risk
