Process audio messages and execute them as commands.
User sends an audio file without any text instruction:
ifly-speed-transcription skillUser sends an audio file WITH a text instruction:
ifly-speed-transcription skilluploader skill, return URLpython3 ~/.openclaw/workspace/skills/ifly-speed-transcription/scripts/transcribe.py /path/to/audio.mp3
python3 ~/.openclaw/workspace/skills/uploader/scripts/upload_media.py /path/to/file.txt
┌─────────────────┐
│ Audio Message │
└────────┬────────┘
│
▼
┌─────────────────┐
│ Transcribe │
│ (ifly-speed- │
│ transcription) │
└────────┬────────┘
│
▼
┌─────────────────┐ NO ┌──────────────┐
│ Has Text Cmd? │────────────►│ Use Transcrip│
└────────┬────────┘ │ as Command │
│ YES └──────┬───────┘
▼ │
┌─────────────────┐ │
│ Execute Text │ │
│ Cmd with Trans │ │
│ Context │ │
└────────┬────────┘ │
│ │
│ ▼
│ ┌──────────────┐
│ │ Return Direct│
│ │ to User │
│ │ (no upload) │
│ └──────────────┘
│
▼
┌─────────────────┐
│ Result > 58 ch? │
└────────┬────────┘
│
┌─────────────┴─────────────┐
│ YES │ NO
▼ ▼
┌─────────────────┐ ┌──────────────┐
│ Save to File │ │ Return Direct│
│ Upload via │ │ to User │
│ uploader skill │ └──────────────┘
└────────┬────────┘
│
▼
┌─────────────────┐
│ Return URL to │
│ User │
└─────────────────┘
User sends: 🎤 audio file (speech: "帮我查一下明天上海的天气")
Flow:
User sends: 🎤 audio file + text "帮我总结这段录音"
Flow:
User sends: 🎤 audio file + text "帮我根据这段录音写一篇文章"
Flow:
~/.openclaw/workspace/ before upload共 1 个版本