
Openclaw Voice Gpt Realtime

Make real phone calls through your OpenClaw agent via OpenAI's Realtime API. ~200-300ms latency, natural voice, IVR navigation, voicemail detection.
Author: connorcallison
Source: clawhub · Category: uncategorized · Version: v0.1.4 (latest, 1 version) · Key: required
Stars: 2 · Downloads: 453 · Installs: 0

Overview

Voice Calls (OpenAI Realtime)

Make real phone calls through your OpenClaw agent. Ask it to book a restaurant, check store hours, schedule an appointment — it dials the number, handles the conversation, and reports back with structured results.

Uses OpenAI's Realtime API for single-model speech-to-speech with ~200-300ms response latency. No separate STT or TTS — one model does it all.

Setup

This skill requires a Twilio account and an OpenAI API key with Realtime API access.

  1. Set your credentials in the plugin config (via OpenClaw settings or openclaw.json):
    • twilio.accountSid — your Twilio Account SID
    • twilio.authToken — your Twilio Auth Token
    • fromNumber — a Twilio voice-capable phone number (E.164 format, e.g. +17075551234)
    • openai.apiKey — your OpenAI API key
    • publicUrl — a public HTTPS origin that routes to the plugin's server (port 3335 by default). Must not be localhost/private/internal.
  2. Set up a tunnel (Cloudflare Tunnel, ngrok, Tailscale Funnel, etc.) so Twilio can reach the webhook server.
  3. Verify setup:
       openclaw voicecall-rt status
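
Before running the status check, it can help to sanity-check the two values that are easiest to get wrong: fromNumber (E.164) and publicUrl (public HTTPS, not localhost/private/internal). The sketch below is only an illustration of those two constraints. The function names are hypothetical, and the plugin's own validation may be stricter or different.

```python
import ipaddress
import re
from urllib.parse import urlparse

# E.164: a leading "+", then up to 15 digits, the first of which is not 0.
E164 = re.compile(r"\+[1-9]\d{1,14}")


def check_from_number(number: str) -> bool:
    """Return True if the number looks like E.164, e.g. +17075551234."""
    return bool(E164.fullmatch(number))


def check_public_url(url: str) -> bool:
    """Return True if the URL is HTTPS and not obviously local or private.

    Hypothetical helper: DNS names are accepted without being resolved,
    so this cannot catch a public hostname that points at a private IP.
    """
    parsed = urlparse(url)
    host = parsed.hostname
    if parsed.scheme != "https" or not host:
        return False
    if host == "localhost" or host.endswith((".localhost", ".local", ".internal")):
        return False
    try:
        ip = ipaddress.ip_address(host)
    except ValueError:
        return True  # a DNS name, not an IP literal
    return ip.is_global  # rejects loopback, RFC 1918, link-local, etc.
```

For example, check_public_url("https://127.0.0.1:3335") is rejected because the host is a loopback address, while a tunnel hostname served over HTTPS passes.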
    

Usage

Just tell your agent what to call and why:

> "Call Tony's Pizza at +14155551234 and reserve a table for 4 on Friday at 7pm"

> "Call the barbershop at +14155559876 and book a haircut for Saturday morning"

> "Call +14155550000 and ask if they have the iPhone 16 Pro in stock"

The agent writes a system prompt for the voice AI, dials the number, and the voice AI handles the conversation autonomously — including navigating phone menus (DTMF), detecting voicemail, and reporting the outcome. The plugin wraps prompts with safety guardrails and blocks deceptive identity behavior.

CLI

openclaw voicecall-rt call -n +14155551234 -t "Check store hours"
openclaw voicecall-rt status
openclaw voicecall-rt active
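
If you want to trigger a call from a script, one option is to shell out to the CLI shown above. This is a minimal sketch that uses only the `call` subcommand and the `-n` / `-t` flags from the example. The helper names are hypothetical, and the format of the command's output is not documented here, so the result is returned unparsed.

```python
import subprocess


def build_call_command(number: str, task: str) -> list[str]:
    """Build the argv for `openclaw voicecall-rt call`.

    Passing a list (not a shell string) avoids quoting problems in the task text.
    """
    return ["openclaw", "voicecall-rt", "call", "-n", number, "-t", task]


def place_call(number: str, task: str) -> subprocess.CompletedProcess:
    """Run the CLI and return the raw result; requires openclaw on PATH."""
    return subprocess.run(
        build_call_command(number, task),
        capture_output=True,
        text=True,
        check=False,  # inspect returncode/stderr instead of raising
    )
```

Calling place_call("+14155551234", "Check store hours") would place a real, billable call, so it is worth testing build_call_command on its own first.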

Inbound calls

Inbound calls are disabled by default. To receive calls, set inbound.enabled and choose a policy (open or allowlist).

Cost

~$0.31/min total (~$0.06 OpenAI input + ~$0.24 OpenAI output + ~$0.014 Twilio). At that rounded rate, a typical 5-minute call costs ~$1.55.
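
The per-minute figures above can be turned into a quick estimate for calls of other lengths. This sketch simply sums the three approximate rates quoted in this section. The constant and function names are illustrative, and actual billing depends on current OpenAI and Twilio pricing and on how much each side talks.

```python
# Approximate per-minute rates quoted above (USD).
OPENAI_INPUT_PER_MIN = 0.06
OPENAI_OUTPUT_PER_MIN = 0.24
TWILIO_PER_MIN = 0.014


def estimate_call_cost(minutes: float) -> float:
    """Estimate the total cost in USD of a call lasting `minutes`."""
    per_minute = OPENAI_INPUT_PER_MIN + OPENAI_OUTPUT_PER_MIN + TWILIO_PER_MIN
    return round(per_minute * minutes, 2)


if __name__ == "__main__":
    for length in (1, 5, 15):
        print(f"{length:>2} min: ~${estimate_call_cost(length):.2f}")
```

Summing the unrounded rates gives about $0.314/min, so a 5-minute call comes out a couple of cents above the $1.55 obtained from the rounded $0.31 figure.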

Notes

  • The voice AI waits for the callee to speak before talking ("listen first") — no awkward overlap on pickup.
  • The server binds to 127.0.0.1 by default, so it is reachable from outside only through your tunnel.
  • Max 5 concurrent calls by default (configurable via calls.maxConcurrent).
  • Debug mode (debug: true) enables call recording, verbose logging, and latency metrics; recordings/transcripts may contain sensitive data.

Version history

1 version in total

  • v0.1.4 (current)
    2026-05-12 04:59, security status: safe

Security checks

Tencent Cloud Security (Keen): safe, no risk

Tencent Cloud Security (Sanbu): safe, no risk
