Gemini Voice Assistant

Voice-to-voice AI assistant using Gemini Live API. Speak to the AI and get spoken responses. Use when you want to have natural voice conversations with an AI...

{"answer":"基于Gemini Live API的语音AI助手。语音交流，实时回复。适合与AI进行自然语音对话..."}

alimostafaradwan

开发者工具 clawhub v1.0.0 1 版本 100000 Key: 需要

★ 1

Stars

📥 1,077

下载

💾 26

安装

版本

#latest

概述

Gemini Voice Assistant

A voice-to-voice AI assistant powered by Google's Gemini Live API. Speak to the AI and it responds with natural-sounding voice.

Usage

Text Mode

cd ~/.openclaw/agents/kashif/skills/gemini-assistant && python3 handler.py "Your question or message"

Voice Mode

cd ~/.openclaw/agents/kashif/skills/gemini-assistant && python3 handler.py --audio /path/to/audio.ogg "optional context"

Response Format

The handler returns a JSON response:

{
  "message": "[[audio_as_voice]]\nMEDIA:/tmp/gemini_voice_xxx.ogg",
  "text": "Text response from Gemini"
}

Configuration

Set your Gemini API key:

export GEMINI_API_KEY="your-api-key-here"

Or create a .env file in the skill directory:

GEMINI_API_KEY=your-api-key-here

Model Options

The default model is gemini-2.5-flash-native-audio-preview-12-2025 for audio support.

To use a different model, edit handler.py:

MODEL = "gemini-2.0-flash-exp"  # For text-only

Requirements

google-genai>=1.0.0
numpy>=1.24.0
soundfile>=0.12.0
librosa>=0.10.0 (for audio input)
FFmpeg (for audio conversion)

Features

🎙️ Voice input/output support
💬 Text conversations
🔧 Configurable system instructions
⚡ Fast responses with Gemini Flash

版本历史

共 1 个版本

v1.0.0 当前

2026-03-29 10:51 安全安全

安全检测

腾讯云安全 (Keen)

安全，无风险

查看报告

腾讯云安全 (Sanbu)

安全，无风险

查看报告

🔗 相关推荐

developer-tools

Github

steipete

使用 `gh` CLI 与 GitHub 交互，通过 `gh issue`、`gh pr`、`gh run` 和 `gh api` 管理议题、PR、CI 运行及高级查询。

★ 669 📥 324,254

developer-tools

CodeConductor.ai

larsonreever

AI驱动平台，提供快速全栈开发、智能体、工作流自动化及低代码AI集成的可扩展产品创建。

★ 68 📥 180,261

developer-tools

Agent Browser

matrixy

专为AI智能体优化的无头浏览器自动化CLI，支持无障碍树快照和基于引用的元素选择。

★ 427 📥 118,250