
voiceclaw

Voice conversation interface for OpenClaw using wake word detection, streaming LLM responses, and text-to-speech. Use when a user wants to talk to their Open...
kentoku24

Overview

voiceclaw

Voice conversation skill for OpenClaw: wake word → STT → LLM (streaming) → TTS → playback.

Requirements

  • OpenClaw running locally (gateway with chatCompletions enabled)
  • Node.js 18+
  • VOICEVOX running on localhost:50021
  • Chrome/Edge (Web Speech API for STT)
  • HTTPS for remote mic access (localhost works without HTTPS)

Quick Start

# Install
git clone https://github.com/kentoku24/voiceclaw.git
cd voiceclaw
npm install

# Start (no .env needed if OpenClaw is running locally)
npm start
# → [voiceclaw] OpenClaw config loaded from ~/.openclaw/openclaw.json
# → [voiceclaw] listening on http://127.0.0.1:8788

# Open browser
open http://127.0.0.1:8788

Press 開始 ("Start"), say the wake word (default: アリス, "Alice"), then speak your command.
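
To confirm the server is reachable before opening the browser, a small check against the documented GET /health endpoint can help. The following is a minimal sketch, not part of voiceclaw: `checkHealth` is a hypothetical helper, and it looks only at the HTTP status because the response body of /health is not described here.

```javascript
// Hypothetical helper: returns true when GET /health answers with a 2xx status.
// fetchImpl is injectable so the function can be exercised without a live server;
// it defaults to the global fetch available in Node.js 18+.
async function checkHealth(baseUrl = "http://127.0.0.1:8788", fetchImpl = fetch) {
  try {
    const res = await fetchImpl(new URL("/health", baseUrl));
    return res.ok;
  } catch {
    // Connection refused, DNS failure, and similar errors count as "not running".
    return false;
  }
}
```

Calling `checkHealth().then(console.log)` after `npm start` should print `true` if the server is listening on the default address.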

Configuration

All settings are optional. Set them in .env or as environment variables:

| Variable | Default | Description |
| --- | --- | --- |
| WAKE_WORDS | アリスちゃん,アリス,... | Comma-separated wake words |
| STT_LANG | ja-JP | Speech recognition language |
| OPENCLAW_MODEL | openclaw | LLM model name |
| VOICEVOX_URL | http://127.0.0.1:50021 | VOICEVOX endpoint |
| VOICEVOX_SPEAKER | 1 | VOICEVOX speaker ID |
| HOST | 127.0.0.1 | Server bind address |
| PORT | 8788 | Server port |

Gateway token is auto-detected from ~/.openclaw/openclaw.json. Override with OPENCLAW_GATEWAY_TOKEN if needed.
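
As an illustration of how these optional settings could be resolved, here is a minimal sketch that falls back to the documented defaults when a variable is unset. `resolveConfig` and the returned field names are hypothetical and are not voiceclaw's actual loader. The WAKE_WORDS default is truncated in the table, so only its two visible entries are used, and the token lookup in ~/.openclaw/openclaw.json is left out because that file's layout is not described here.

```javascript
// Defaults copied from the configuration table. The documented WAKE_WORDS
// default is truncated ("..."), so only the two visible entries appear here.
const DEFAULTS = {
  WAKE_WORDS: "アリスちゃん,アリス",
  STT_LANG: "ja-JP",
  OPENCLAW_MODEL: "openclaw",
  VOICEVOX_URL: "http://127.0.0.1:50021",
  VOICEVOX_SPEAKER: "1",
  HOST: "127.0.0.1",
  PORT: "8788",
};

// Hypothetical helper: merges an env-like object with the defaults above.
function resolveConfig(env = {}) {
  const get = (key) => {
    const value = env[key];
    // Treat unset and empty variables the same way: use the default.
    return value === undefined || value === "" ? DEFAULTS[key] : value;
  };
  return {
    wakeWords: get("WAKE_WORDS").split(",").map((w) => w.trim()).filter(Boolean),
    sttLang: get("STT_LANG"),
    model: get("OPENCLAW_MODEL"),
    voicevoxUrl: get("VOICEVOX_URL"),
    voicevoxSpeaker: Number(get("VOICEVOX_SPEAKER")),
    host: get("HOST"),
    port: Number(get("PORT")),
    // Explicit override only; auto-detection from openclaw.json is not sketched.
    gatewayToken: env.OPENCLAW_GATEWAY_TOKEN || null,
  };
}
```

In a Node.js process this would typically be called as `resolveConfig(process.env)`.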

Architecture

Wake word (browser STT) → voiceclaw server → OpenClaw Gateway (streaming)
                                           → sentence-level TTS (VOICEVOX)
                                           → audio playback (Web Audio API)

See docs/architecture.md for the full sequence diagram.
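
The "sentence-level TTS" step implies that streamed LLM text is cut into sentences before each piece is sent to VOICEVOX. The following is a rough sketch of that idea under stated assumptions, not the project's actual code: `createSentenceBuffer` is a hypothetical name, and the set of sentence-ending characters is an assumption chosen for Japanese and English text.

```javascript
// Assumed sentence terminators: Japanese and ASCII end marks plus newline.
// "." is deliberately left out so decimals such as "3.14" are not split.
const SENTENCE_END = /[。!?!?\n]/;

// Hypothetical helper: accumulates streamed chunks and calls onSentence
// once for every complete sentence found so far.
function createSentenceBuffer(onSentence) {
  let pending = "";
  return {
    push(chunk) {
      pending += chunk;
      let idx;
      while ((idx = pending.search(SENTENCE_END)) !== -1) {
        const sentence = pending.slice(0, idx + 1).trim();
        pending = pending.slice(idx + 1);
        if (sentence) onSentence(sentence);
      }
    },
    // Call when the stream ends to emit any trailing text without a terminator.
    flush() {
      const rest = pending.trim();
      pending = "";
      if (rest) onSentence(rest);
    },
  };
}
```

Emitting sentences as soon as they complete lets speech synthesis start while the model is still generating, which is the usual motivation for splitting at this granularity.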

API Endpoints

| Method | Path | Description |
| --- | --- | --- |
| GET | /health | Health check |
| GET | /api/config | Client-safe settings (wake words, STT lang) |
| POST | /api/chat-stream | Streaming LLM → sentence-level SSE |
| POST | /api/chat | Non-streaming LLM (fallback) |
| POST | /api/tts | Text → VOICEVOX → WAV audio |
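
Since /api/chat-stream responds with server-sent events, a client has to reassemble events from arbitrary network chunks. The sketch below handles only the generic SSE framing (`data:` lines, events separated by a blank line). The request body and the payload inside each event are not documented here, so they are not assumed, and `createSseParser` is a hypothetical helper name.

```javascript
// Hypothetical helper: returns a feed(text) function that buffers incoming
// text and calls onData with the joined data payload of each complete event.
function createSseParser(onData) {
  let buffer = "";
  return function feed(text) {
    buffer += text;
    // Events are terminated by a blank line; keep the unfinished tail buffered.
    const events = buffer.split(/\r?\n\r?\n/);
    buffer = events.pop();
    for (const event of events) {
      const data = event
        .split(/\r?\n/)
        .filter((line) => line.startsWith("data:"))
        // Drop the field name and the single optional space after the colon.
        .map((line) => line.slice(5).replace(/^ /, ""))
        .join("\n");
      if (data) onData(data);
    }
  };
}
```

A caller would decode each chunk of the response body to text and pass it to `feed`; how each payload is then interpreted depends on the server's event format.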

Version history

1 version in total

  • v0.1.0 (current)
    2026-03-30 14:18, both security scans: safe

Security checks

Tencent Cloud Security (Keen): safe, no risk

Tencent Cloud Security (Sanbu): safe, no risk
