High-performance local speech-to-text using faster-whisper.
Execute the setup script to create a virtual environment and install dependencies. It will automatically detect NVIDIA GPUs for CUDA acceleration.
./setup.sh
Requirements:
Use the transcription script to process audio files.
./scripts/transcribe audio.mp3
./scripts/transcribe audio.mp3 --model large-v3-turbo./scripts/transcribe audio.mp3 --word-timestamps./scripts/transcribe audio.mp3 --json./scripts/transcribe audio.mp3 --vaddistil-large-v3 (default): Best balance of speed and accuracy.large-v3-turbo: Recommended for multilingual or highest accuracy tasks.medium.en, small.en: Faster, English-only versions.small or base) or use --compute-type int8.共 1 个版本