This skill provides transcription capabilities for audio and video files using the OpenAI Whisper API endpoint.
Send me an audio or video file and I'll transcribe it automatically! Just attach the file to your message.
You can also run the script directly from the command line:
cd /home/openclaw/.openclaw/workspace/skills/transcription/scripts
python3 transcribe_audio.py inputfile.ogg
For video files:
python3 transcribe_audio.py video.mp4
Audio: mp3, wav, mp4, mpeg, mpga, m4a, ogg, webm, flac, aac, wma
Video: mp4, mov, avi, mkv, webm (audio extracted automatically)
How to use the python scripts for transcription:
python3 transcribe_audio.py inputfile.ogg
http://192.168.0.11:8080/v1 (local Whisper endpoint)Scripts: scripts/transcribe_audio.py, scripts/transcribe_simple.py
References: references/transcription_guide.md
共 1 个版本