Transcribe an audio file (wav/mp3/ogg...) to text using Qwen ASR. No configuration or API key required.
uv run scripts/main.py -f audio.wav
cat audio.wav | uv run scripts/main.py > transcript.txt
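To transcribe several files in one go, the command above can be wrapped in a small script. This is only a sketch: the helper names (build_command, transcribe, transcribe_all) are hypothetical, and it assumes that the -f form prints the transcript to stdout, as the piped form does.

```python
import subprocess
from pathlib import Path


def build_command(audio_path, script="scripts/main.py"):
    """Build the argv list for the documented `uv run scripts/main.py -f <file>` call."""
    return ["uv", "run", script, "-f", str(audio_path)]


def transcribe(audio_path, script="scripts/main.py"):
    """Run the CLI on one file and return whatever it prints to stdout.

    Assumption: the transcript is written to stdout when -f is used.
    """
    result = subprocess.run(
        build_command(audio_path, script),
        capture_output=True,
        text=True,
        check=True,  # raise CalledProcessError if the CLI exits non-zero
    )
    return result.stdout.strip()


def transcribe_all(directory, pattern="*.wav"):
    """Write a .txt transcript next to every audio file matching `pattern`."""
    for audio in sorted(Path(directory).glob(pattern)):
        audio.with_suffix(".txt").write_text(transcribe(audio), encoding="utf-8")
```

Calling transcribe_all("recordings") would then leave recordings/a.txt next to recordings/a.wav, and so on; the directory name is just an example.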
Qwen ASR is a free, open-source speech-to-text model.
It is trained on a large dataset of audio collected from the web.
It supports transcription in multiple languages.
This skill is based on the Qwen ASR Demo service (qwen-qwen3-asr-demo.ms.show).