whisper.cpp.gitcmakeffmpeggit clone --depth 1 https://github.com/ggerganov/whisper.cpp.git ~/whisper.cpp
cd ~/whisper.cpp
cmake -B build -DCMAKE_BUILD_TYPE=Release
cmake --build build -j4
Download a model from the official ggerganov/whisper.cpp releases or Hugging Face repository and place it under ~/whisper.cpp/models/.
ffmpeg -y -i input_audio.ogg -ar 16000 -ac 1 -f wav /tmp/voice.wav
~/whisper.cpp/build/bin/whisper-cli \
-m ~/whisper.cpp/models/ggml-large-v3.bin \
-f /tmp/voice.wav \
-l auto \
--no-timestamps
If a higher-level tool fails, first locate the exact cache or upload path used by that tool. Search only within the expected application cache directory instead of scanning the entire home directory.
If local transcription is too slow or unavailable, use an approved speech API and tell the user that audio will leave the machine.
共 1 个版本