概述

Whisper Voice Transcription with whisper.cpp

When to use

You want local speech-to-text without sending audio to a third party.
You need a fallback workflow when a built-in transcription tool fails.
You want an operator guide for compiling and running whisper.cpp.

Prerequisites

git
cmake
a C or C++ compiler
ffmpeg

Build steps

git clone --depth 1 https://github.com/ggerganov/whisper.cpp.git ~/whisper.cpp
cd ~/whisper.cpp
cmake -B build -DCMAKE_BUILD_TYPE=Release
cmake --build build -j4

Download a model from the official ggerganov/whisper.cpp releases or Hugging Face repository and place it under ~/whisper.cpp/models/.

Standard transcription flow

ffmpeg -y -i input_audio.ogg -ar 16000 -ac 1 -f wav /tmp/voice.wav
~/whisper.cpp/build/bin/whisper-cli \
  -m ~/whisper.cpp/models/ggml-large-v3.bin \
  -f /tmp/voice.wav \
  -l auto \
  --no-timestamps

Fallback workflow

If a higher-level tool fails, first locate the exact cache or upload path used by that tool. Search only within the expected application cache directory instead of scanning the entire home directory.

Cloud fallback

If local transcription is too slow or unavailable, use an approved speech API and tell the user that audio will leave the machine.

Guardrails

Download binaries and models only from official sources.
Verify hashes when possible.
Do not search unrelated directories for audio files.
Be explicit when using a cloud provider because that changes the privacy model.

版本历史

共 1 个版本

v1.1.1 当前

2026-05-07 05:56 安全安全

安全检测

腾讯云安全 (Keen)

安全，无风险

查看报告

腾讯云安全 (Sanbu)