← 返回
AI智能 Key 中文

Jetson CUDA Voice

High-performance offline voice pipeline for NVIDIA Jetson with wake word detection, STT, and TTS.
高性能离线语音管线,面向 NVIDIA Jetson,包含唤醒词检测、语音识别(STT)和语音合成(TTS)。
nikil511
AI智能 clawhub v1.2.0 2 版本 99821.6 Key: 需要
★ 0
Stars
📥 1,119
下载
💾 23
安装
2
版本
#latest

概述

Jetson CUDA Voice Skill

A high-performance, bilingual (English/Greek) local voice assistant pipeline designed specifically for resource-constrained platforms like the NVIDIA Jetson Xavier NX.

Architecture

┌─────────────────────────────────────────────────┐
│ JETSON CUDA VOICE PIPELINE (COMPLETE STACK)     │
├─────────────────────────────────────────────────┤
│                                                  │
│ 1. ReSpeaker Microphone (hw:Array,0)            │
│    └─ Always listening                          │
│                                                  │
│ 2. Silero VAD (330ms)                           │
│    └─ "Is there speech?"                        │
│                                                  │
│ 3. Hey Jarvis Detection (137ms)                 │
│    └─ "Is wake word detected?"                  │
│                                                  │
│ 4. ReSpeaker LED Feedback                       │
│    └─ 🟣 Purple pulse = Listening               │
│                                                  │
│ 5. Fast-Path LLM Query (0.26s)                  │
│    └─ Groq LLaMA 3.1 8B instant query           │
│                                                  │
│ 6. Slow-Path LLM Query (with tools)             │
│    └─ OpenClaw Gateway Session (HA, Web, Weather)│
│                                                  │
│ 7. Edge TTS (el-GR-NestorasNeural / Edge)        │
│    └─ Generate Greek/English response           │
│                                                  │
│ 8. Speaker Output (hw:C2c,0)                    │
│    └─ Play response                             │
│                                                  │
└─────────────────────────────────────────────────┘

Features

  • Double-Path Architecture:
  • Fast-Path: High-performance Groq-based direct query with ultra-low latency (~260ms) for casual queries.
  • Slow-Path: Fully-featured OpenClaw gateway connection with device control, web search, and weather integration.
  • Bilingual Processing: Seamless support and automatic language matching for Greek and English.
  • Local Wake Word Detection: Employs openWakeWord with a custom pre-trained model (hey_jarvis_v0.1.onnx) optimized for Jetson ARM64 architecture.
  • Systemd Integration: Includes service templates for background daemonization and boot-persistent listening.
  • Visual LED Feedback: Direct support for the ReSpeaker Mic Array LED indicators (purple pulse on listening).

Installation & Setup

  1. System Dependencies:

Ensure arecord and Python 3.11 are installed.

  1. Required Environment Variables:

Configure the following environment variables in your systemd unit or .bashrc:

  • OPENCLAW_GATEWAY_TOKEN: OpenClaw Gateway authorization token.
  • GATEWAY_URL: Base URL of the OpenClaw instance (e.g., http://127.0.0.1:18789).
  • GROQ_API_KEY: API Key for fast-path inference.
  • ALFRED_MIC: Audio recording hardware interface (default: hw:Array,0).
  • ALFRED_SPEAKER: Audio output speaker interface (default: hw:C2c,0).
  1. Running the Daemon:

To start the pipeline daemon:

```bash

./scripts/manage.sh start

```

  1. Running as a Systemd Service:

Copy the service template to your user-level systemd directory (~/.config/systemd/user/voice-pipeline.service) and manage it via:

```bash

systemctl --user daemon-reload

systemctl --user enable voice-pipeline.service

systemctl --user start voice-pipeline.service

```

版本历史

共 2 个版本

  • v1.2.0 当前
    2026-06-01 12:03
  • v1.1.0
    2026-03-29 15:01 安全 安全

安全检测

腾讯云安全 (Keen)

队列中

腾讯云安全 (Sanbu)

队列中

🔗 相关推荐

ai-intelligence

ontology

oswalpalash
类型化知识图谱,用于结构化智能体记忆与可组合技能。支持创建/查询实体(人员、项目、任务、事件、文档)及关联...
★ 712 📥 243,773
ai-intelligence

Proactive Agent

halthelobster
将AI智能体从任务执行者升级为主动预判需求、持续优化的智能伙伴。集成WAL协议、工作缓冲区、自主定时任务及实战验证模式。Hal Stack核心组件 🦞
★ 836 📥 213,076
ai-intelligence

Nano Banana Pro

steipete
使用 Nano Banana Pro (Gemini 3 Pro Image) 生成或编辑图像。支持文生图、图生图及 1K/2K/4K 分辨率,适用于图像创建、修改及编辑请求,使用 --input-image 指定输入图像。
★ 417 📥 115,182