← 返回
效率工具 中文

Faster Whisper Local

Local speech-to-text using faster-whisper. High-performance transcription with GPU acceleration support. Includes word-level timestamps and distilled models....
{"answer":"基于 faster-whisper 的本地语音转文本,支持 GPU 加速的高性能转录,包含词级时间戳与蒸馏模型..."}
damirikys
效率工具 clawhub v1.0.0 1 版本 99854.5 Key: 无需
★ 2
Stars
📥 1,333
下载
💾 39
安装
1
版本
#latest

概述

Faster-Whisper

High-performance local speech-to-text using faster-whisper.

Setup

1. Run Setup Script

Execute the setup script to create a virtual environment and install dependencies. It will automatically detect NVIDIA GPUs for CUDA acceleration.

./setup.sh

Requirements:

  • Python 3.10 or later
  • ffmpeg (installed on the system)

Usage

Use the transcription script to process audio files.

Basic Transcription

./scripts/transcribe audio.mp3

Advanced Options

  • Specific Model: ./scripts/transcribe audio.mp3 --model large-v3-turbo
  • Word Timestamps: ./scripts/transcribe audio.mp3 --word-timestamps
  • JSON Output: ./scripts/transcribe audio.mp3 --json
  • VAD (Silence Removal): ./scripts/transcribe audio.mp3 --vad

Available Models

  • distil-large-v3 (default): Best balance of speed and accuracy.
  • large-v3-turbo: Recommended for multilingual or highest accuracy tasks.
  • medium.en, small.en: Faster, English-only versions.

Troubleshooting

  • No GPU detected: Ensure NVIDIA drivers and CUDA are correctly installed. CPU transcription is significantly slower.
  • OOM Error: Use a smaller model (e.g., small or base) or use --compute-type int8.

版本历史

共 1 个版本

  • v1.0.0 当前
    2026-03-29 03:41 安全 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

productivity

Word / DOCX

ivangdavila
创建、检查和编辑 Microsoft Word 文档及 DOCX 文件,支持样式、编号、修订记录、表格、分节符及兼容性检查等功能。
★ 440 📥 147,769
productivity

Weather

steipete
获取当前天气和预报(无需API密钥)
★ 445 📥 226,333
productivity

Nano Pdf

steipete
使用nano-pdf CLI通过自然语言指令编辑PDF
★ 275 📥 114,870