Voice Transcribe

Transcribe audio files using OpenAI's gpt-4o-mini-transcribe model with vocabulary hints and text replacements. Requires uv (https://docs.astral.sh/uv/).


darinkishore

Content creation · clawhub · v1.0.1 · 1 version · API key: required

13 stars · 7,219 downloads · 1,548 installs


voice-transcribe

transcribe audio files using openai's gpt-4o-mini-transcribe model.

when to use

when receiving voice memos (especially via whatsapp), just run:

uv run /Users/darin/clawd/skills/voice-transcribe/transcribe <audio-file>

then respond based on the transcribed content.

fixing transcription errors

if darin says a word was transcribed wrong, add it to vocab.txt (a hint to the model) or replacements.txt (a guaranteed fix applied to the output). see the custom vocabulary and text replacements sections below.

supported formats

mp3, mp4, mpeg, mpga, m4a, wav, webm, ogg, opus
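a quick pre-flight check can reject unsupported files before any api call is made. the sketch below is only an illustration: `is_supported` and `SUPPORTED_EXTENSIONS` are hypothetical names, and matching on the file extension (rather than inspecting the file contents) is an assumption, not something the skill is documented to do.

```python
from pathlib import Path

# Extensions copied from the supported formats list above.
SUPPORTED_EXTENSIONS = {
    "mp3", "mp4", "mpeg", "mpga", "m4a", "wav", "webm", "ogg", "opus",
}


def is_supported(path: str) -> bool:
    """Return True if the file extension is in the supported list.

    Hypothetical helper: compares only the extension, case-insensitively.
    """
    suffix = Path(path).suffix.lower().lstrip(".")
    return suffix in SUPPORTED_EXTENSIONS


if __name__ == "__main__":
    for candidate in ("/tmp/voice-memo.ogg", "/tmp/notes.txt"):
        print(candidate, is_supported(candidate))
```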

examples

# transcribe a voice memo
transcribe /tmp/voice-memo.ogg

# pipe to other tools
transcribe /tmp/memo.ogg | pbcopy

setup

add your openai api key to /Users/darin/clawd/skills/voice-transcribe/.env:

```
OPENAI_API_KEY=sk-...
```
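how the script reads this file is not shown here. as a rough sketch, a `KEY=value` file like the one above could be parsed as follows; `parse_env` and `api_key_from` are hypothetical helpers, and falling back to the process environment is an assumption.

```python
import os


def parse_env(text: str) -> dict[str, str]:
    """Parse KEY=value lines, skipping blanks and '#' comments (hypothetical helper)."""
    values: dict[str, str] = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Strip optional surrounding quotes from the value.
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def api_key_from(env_text: str) -> str:
    """Return OPENAI_API_KEY from the .env text, else from the process environment."""
    key = parse_env(env_text).get("OPENAI_API_KEY") or os.environ.get("OPENAI_API_KEY", "")
    if not key:
        raise RuntimeError("OPENAI_API_KEY is not set")
    return key
```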

custom vocabulary

add words to vocab.txt (one per line) to help the model recognize names/jargon:

Clawdis
Clawdbot
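one plausible way to turn vocab.txt into a hint is to join the words into a short string and pass it as the `prompt` parameter that openai's transcription endpoint accepts. whether the skill does exactly this is an assumption; `load_vocab` and `build_prompt` are hypothetical names, and the sketch stops short of calling the api.

```python
def load_vocab(text: str) -> list[str]:
    """Return non-empty vocab lines in file order, with duplicates dropped."""
    seen: set[str] = set()
    words: list[str] = []
    for line in text.splitlines():
        word = line.strip()
        if word and word not in seen:
            seen.add(word)
            words.append(word)
    return words


def build_prompt(words: list[str]) -> str:
    """Join vocabulary words into one hint string (the format is an assumption)."""
    return ", ".join(words)


if __name__ == "__main__":
    vocab_text = "Clawdis\nClawdbot\n"
    print(build_prompt(load_vocab(vocab_text)))
```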

text replacements

if the model still gets something wrong, add a replacement to replacements.txt:

wrong spelling -> correct spelling
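the `wrong -> correct` format can be applied as a post-processing pass over the transcript. the sketch below assumes literal, case-sensitive substring replacement applied in file order; the skill's actual matching rules are not documented here, and `parse_replacements` and `apply_replacements` are hypothetical names.

```python
def parse_replacements(text: str) -> list[tuple[str, str]]:
    """Parse 'wrong -> correct' lines into (wrong, correct) pairs."""
    pairs: list[tuple[str, str]] = []
    for line in text.splitlines():
        if "->" not in line:
            continue  # skip blank or malformed lines
        wrong, _, right = line.partition("->")
        wrong, right = wrong.strip(), right.strip()
        if wrong:
            pairs.append((wrong, right))
    return pairs


def apply_replacements(transcript: str, pairs: list[tuple[str, str]]) -> str:
    """Apply each replacement in order (assumed: literal, case-sensitive)."""
    for wrong, right in pairs:
        transcript = transcript.replace(wrong, right)
    return transcript


if __name__ == "__main__":
    rules = parse_replacements("claud bot -> Clawdbot\n")
    print(apply_replacements("ask claud bot to summarize", rules))
```

because replacements run on the finished text, they always take effect, whereas a vocabulary hint only makes the right spelling more likely.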

notes

assumes english (no language detection)
uses gpt-4o-mini-transcribe model specifically
caches by sha256 of audio file
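caching by content hash means the same audio bytes always map to the same cache entry, regardless of filename. a minimal sketch of that idea follows; the cache directory layout, the `.txt` suffix, and the `cached_transcribe` helper are assumptions for illustration, and `transcribe_fn` stands in for the real api call.

```python
import hashlib
from pathlib import Path
from typing import Callable


def cache_key(audio_bytes: bytes) -> str:
    """Return the hex sha256 digest of the audio content."""
    return hashlib.sha256(audio_bytes).hexdigest()


def cached_transcribe(
    audio_bytes: bytes,
    cache_dir: Path,
    transcribe_fn: Callable[[bytes], str],
) -> str:
    """Return a cached transcript if one exists, else transcribe and store it."""
    cache_dir.mkdir(parents=True, exist_ok=True)
    cache_file = cache_dir / f"{cache_key(audio_bytes)}.txt"  # assumed layout
    if cache_file.exists():
        return cache_file.read_text(encoding="utf-8")
    text = transcribe_fn(audio_bytes)
    cache_file.write_text(text, encoding="utf-8")
    return text
```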

version history

1 version total

v1.0.1 (current)

2026-03-28 09:47, status: safe

security checks

Tencent Cloud Security (Keen): safe, no risk

Tencent Cloud Security (Sanbu): safe, no risk
