
whisperx

WhisperX provides local speech-to-text transcription using OpenAI Whisper, with high-quality offline recognition, no API key required, word-level timestamps,...
Author: niuzb
Category: Uncategorized · Source: clawhub · Version: v1.0.0 (1 version, #latest) · API key: not required
Stars: 0 · Downloads: 162 · Installs: 0

Overview

WhisperX Speech Recognition Skill

Local offline speech-to-text - A WhisperX-powered speech recognition skill for OpenClaw. Up to 30x faster than standard OpenAI Whisper, runs fully offline with no API key required.

Features

  • Pure ASR: Converts voice messages to text only; no voice replies are generated
  • Fully offline: The model runs locally with no API key; an internet connection is needed only for the one-time model download on first run
  • Word-level timestamps: Precise per-word time alignment
  • 90+ languages: Includes auto language detection
  • Speaker diarization: Optional, requires a HuggingFace token
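
The optional diarization feature can be sketched as a small wrapper around the CLI. This is a minimal sketch, assuming the token is supplied in an environment variable named HF_TOKEN (a naming choice made here, not something the skill prescribes). WHISPERX_BIN is a hypothetical override used only so the wrapper can be dry-run; by default it calls the real whisperx command with its --diarize and --hf_token flags.

```shell
# Transcribe one file with speaker labels.
# HF_TOKEN: assumed environment variable holding a Hugging Face token.
# WHISPERX_BIN: hypothetical override for the executable (defaults to whisperx).
transcribe_with_speakers() {
    audio="$1"
    if [ -z "${HF_TOKEN:-}" ]; then
        echo "HF_TOKEN is not set; diarization needs a Hugging Face token" >&2
        return 1
    fi
    "${WHISPERX_BIN:-whisperx}" "$audio" --diarize --hf_token "$HF_TOKEN"
}
```

With the token exported, calling transcribe_with_speakers path/to/audio.wav runs the transcription; without it, the function stops early instead of failing partway through a run.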

Installation

# Install ffmpeg (macOS)
brew install ffmpeg

# Install ffmpeg (Ubuntu/Debian)
sudo apt-get update && sudo apt-get install -y ffmpeg

# Install WhisperX
pip install whisperx
# or run it without a permanent install, using uvx:
uvx whisperx

GPU users: ensure CUDA 12.8 is installed for faster inference.
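
Before the first transcription it can help to confirm the prerequisites. The following is a rough sketch, not part of the skill: the function names have, pick_device, and preflight are made up for illustration, and finding nvidia-smi on PATH is only a hint that a CUDA GPU is usable, not proof that the right CUDA version is installed.

```shell
# have: succeeds if the named command is on PATH.
have() { command -v "$1" >/dev/null 2>&1; }

# pick_device: prints a value suitable for whisperx --device.
# Assumption: nvidia-smi being present suggests a usable NVIDIA GPU.
pick_device() {
    if have nvidia-smi; then echo cuda; else echo cpu; fi
}

# preflight: fails if ffmpeg is missing, otherwise reports the device guess.
preflight() {
    have ffmpeg || { echo "ffmpeg not found; install it first" >&2; return 1; }
    echo "ffmpeg ok, device: $(pick_device)"
}
```

Running preflight once after installation gives an early, readable error when ffmpeg is missing.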

Usage

# Basic transcription (auto-detect language)
whisperx path/to/audio.wav

# Specify model and language
whisperx --model small --language zh path/to/audio.wav

# CPU mode (low memory)
whisperx --model small --device cpu --compute_type int8 path/to/audio.wav
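
The single-file commands above extend naturally to a whole folder. Here is a minimal sketch, assuming the inputs are .wav files and that subtitle output is wanted; it relies on the CLI's --output_dir and --output_format options. The function name transcribe_dir and the WHISPERX_BIN override are hypothetical, introduced only for illustration and dry-running.

```shell
# Transcribe every .wav file in a directory, writing one .srt per input.
# WHISPERX_BIN: hypothetical override for the executable (defaults to whisperx).
transcribe_dir() {
    in_dir="$1"
    out_dir="$2"
    mkdir -p "$out_dir" || return 1
    for f in "$in_dir"/*.wav; do
        [ -e "$f" ] || continue   # the glob matched nothing
        "${WHISPERX_BIN:-whisperx}" "$f" --output_dir "$out_dir" --output_format srt
    done
}
```

For example, transcribe_dir recordings transcripts processes recordings/*.wav in alphabetical order. Setting WHISPERX_BIN=echo prints the commands instead of running them.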

Notes

  • Dependencies: whisperx, ffmpeg
  • Supported formats: MP3, WAV, OGG, FLAC, M4A, OPUS, AAC, and all other ffmpeg-supported formats
  • Model cache: Downloaded automatically to ~/.cache/whisper/ on first run
  • Recommended models: base or small for CPU; large-v3 for GPU
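
The model recommendation above can be captured in a small helper. This is an illustrative sketch only: recommended_args is a made-up name, small is picked from the two CPU options as an arbitrary choice, and the CPU flags mirror the low-memory command shown under Usage.

```shell
# recommended_args: prints whisperx flags for the given device ("cpu" or "cuda").
# The mapping follows the note above; it is a suggestion, not a requirement.
recommended_args() {
    case "$1" in
        cpu)  echo "--model small --device cpu --compute_type int8" ;;
        cuda) echo "--model large-v3 --device cuda" ;;
        *)    echo "unknown device: $1" >&2; return 1 ;;
    esac
}
```

One possible use is whisperx $(recommended_args cpu) path/to/audio.wav, where the command substitution is left unquoted on purpose so that the flags split into separate arguments.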

Version history

1 version in total

  • v1.0.0 (current)
    2026-05-12 06:21

Security scan

Tencent Cloud Security (Keen)

Queued

Tencent Cloud Security (Sanbu)

Queued
