← 返回
数据分析 中文

Browser Audio Capture

Capture audio from any browser tab — meetings, YouTube, podcasts, courses, webinars — and stream to any AI agent. Zero API keys, works with any framework.
从任意浏览器标签页(会议、视频、播客、课程、网络研讨会)捕获音频并流式传输至任何AI代理。无需API密钥,兼容任意框架。
jarvis563
数据分析 clawhub v1.1.0 1 版本 99879.7 Key: 无需
★ 0
Stars
📥 830
下载
💾 10
安装
1
版本
#latest

概述

Browser Audio Capture

Give any AI agent ears for the browser. One Chrome extension captures audio from any tab — meetings, YouTube, podcasts, webinars, courses, earnings calls — and streams it to your AI pipeline.

Why Use This

Your AI agent can't hear anything happening in your browser. This skill fixes that. Capture audio from any Chrome tab and stream it to your agent — no API keys, no OAuth, no per-platform integrations.

Use cases: meeting summaries, YouTube/podcast notes, competitive intel from earnings calls, auto-notes from online courses, customer call analysis — anything that plays audio in a browser tab.

Works with any AI agent — Claude, ChatGPT, OpenClaw, LangChain, CrewAI, or your own. If your agent can run shell commands or receive HTTP, it gets browser audio.

Prerequisites

Chrome with remote debugging:

# macOS
/Applications/Google\ Chrome.app/Contents/MacOS/Google\ Chrome \
  --remote-debugging-port=9222 --user-data-dir=$HOME/.chrome-debug-profile &

Python 3.9+ with aiohttp: pip install aiohttp

Quick Start

CLI (any agent that can exec)

# List tabs — meetings flagged with 🎙️
python3 -m browser_capture.cli tabs

# Auto-detect and capture meeting tab
python3 -m browser_capture.cli capture

# Continuous watch mode
python3 -m browser_capture.cli watch --interval 15

# Stop
python3 -m browser_capture.cli stop

Chrome Extension (one-click, persistent)

  1. chrome://extensions/ → Developer mode → Load unpacked → scripts/extension/
  2. Join a meeting → click Percept icon → Start Capturing
  3. Close popup — capture continues in background

Supported Platforms

Google Meet • Zoom (web) • Microsoft Teams • Webex • Whereby • Around • Cal.com • Riverside • StreamYard • Ping • Daily.co • Jitsi • Discord — plus any future platform that runs in a browser.

Audio Output

Streams to http://127.0.0.1:8900/audio/browser as JSON:

{
  "sessionId": "browser_1709234567890",
  "audio": "<base64 PCM16>",
  "sampleRate": 16000,
  "format": "pcm16",
  "tabUrl": "https://meet.google.com/...",
  "tabTitle": "Weekly Standup"
}

Configure endpoint in scripts/extension/offscreen.js (PERCEPT_URL). Point it at Whisper, Deepgram, NVIDIA Riva, or any transcription service.

Troubleshooting

  • No tabs: Chrome needs --remote-debugging-port=9222
  • Button won't click: Remove + re-add extension (MV3 caches aggressively)
  • Audio not arriving: Check receiver on port 8900. Extension sends to /audio/browser

版本历史

共 1 个版本

  • v1.1.0 当前
    2026-03-29 18:11 安全 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

communication-collaboration

Percept Voice Commands

jarvis563
检测语音中的唤醒词并将邮件、短信、提醒、搜索和笔记等语音指令路由到OpenClaw代理执行。
★ 0 📥 1,183
data-analysis

A股量化 AkShare

mbpz
A股量化数据分析工具,基于AkShare库获取A股行情、财务数据、板块信息等。用于回答关于A股股票查询、行情数据、财务分析、选股等问题。
★ 165 📥 60,173
data-analysis

Data Analysis

ivangdavila
{"answer":"数据分析与可视化。查询数据库、生成报告、自动化电子表格,将原始数据转化为清晰可行的见解。适用于:(1) 您……"}
★ 199 📥 65,221