← 返回
AI智能

Voice Agent

Local Voice Input/Output for Agents using the AI Voice Agent API.
利用 AI Voice Agent API 实现代理的本地语音输入/输出。
ricardotrevisan
AI智能 clawhub v1.1.0 1 版本 99337.9 Key: 无需
★ 1
Stars
📥 4,331
下载
💾 29
安装
1
版本
#latest

概述

Voice Agent

This skill allows you to speak and listen to the user using a local Voice Agent API.

It is client-only and does not start containers or services.

It uses local Whisper for Speech-to-Text transcription and AWS Polly for Text-to-Speech generation.

Prerequisite

Requires a running backend API at http://localhost:8000.

Backend setup instructions are in this repository:

  • README.md
  • walkthrough.md
  • DOCKER_README.md

Behavior Guidelines

  • Audio First: When the user communicates via audio (files), your PRIMARY mode of response is Audio File.
  • Silent Delivery: When sending an audio response, DO NOT send a text explanation like "I sent an audio". Just send the audio file.
  • Workflow:
  1. User sends audio.
  2. Use transcribe to read it.
  3. You think of a response.
  4. Use synthesize to generate the audio file.
  5. You send the file.
  6. STOP. Do not add text commentary.
    • Failure Handling: If health fails or connection errors occur, do not attempt service management from this skill. Ask the user to start or fix the backend using the repository docs.

Tools

Transcribe File

To transcribe an audio file with local Whisper STT, run the client script with the transcribe command.

python3 {baseDir}/scripts/client.py transcribe "/path/to/audio/file.ogg"

Synthesize to File

To generate audio from text with AWS Polly TTS and save it to a file, run the client script with the synthesize command.

python3 {baseDir}/scripts/client.py synthesize "Text to speak" --output "/path/to/output.mp3"

Health Check

To check if the voice agent API is running and healthy:

python3 {baseDir}/scripts/client.py health

版本历史

共 1 个版本

  • v1.1.0 当前
    2026-03-28 10:44 安全 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

ai-intelligence

Self-Improving + Proactive Agent

ivangdavila
自我反思+自我批评+自我学习+自组织记忆。智能体评估自身工作、发现错误并持续改进。
★ 1,349 📥 317,700
ai-intelligence

self-improving agent

pskoett
捕获经验教训、错误和纠正,以实现持续改进。使用时机:(1)命令或操作意外失败;(2)用户纠正……
★ 4,055 📥 795,992
developer-tools

Garmin Tracker

ricardotrevisan
从2026-02-01起,根据固定架构,利用Garmin网页数据(活动+训练计划)重建并维护garmin_tracking.。
★ 0 📥 1,182