← 返回
未分类

Douyin Transcriber

Transcribe speech from audio or video files, automatically extracting audio and converting to text using Docker Whisper ASR for Douyin/TikTok media.
使用 Docker Whisper ASR 自动从音视频文件中提取音频并转写为文字,适用于抖音/TikTok 媒体。
don068589
未分类 clawhub v1.0.5 1 版本 100000 Key: 无需
★ 0
Stars
📥 344
下载
💾 1
安装
1
版本
#latest

概述

Douyin Transcriber

Transcribe audio/video files to text using local Docker Whisper ASR.

Quick Start

curl -X POST "http://localhost:PORT/asr" -F "audio_file=@/path/to/video.mp4"

The container has built-in ffmpeg for automatic audio extraction.

Prerequisites

ToolPurposeInstall
------------------------
DockerWhisper ASRDocker Desktop
ffmpegAudio extractionwinget install Gyan.FFmpeg

Deploy Whisper ASR:

docker run -d -p PORT:PORT -e ASR_MODEL=small -e ASR_ENGINE=faster_whisper --name whisper-asr onerahmet/openai-whisper-asr-webservice:latest

Workflow

Step 1: Extract Audio from Video

ffmpeg -i video.mp4 -ar 16000 -ac 1 -c:a pcm_s16le audio.wav -y

Parameters:

  • -ar 16000: 16kHz sample rate
  • -ac 1: Mono channel
  • -c:a pcm_s16le: 16-bit PCM

Step 2: Transcribe

curl -X POST "http://localhost:PORT/asr" -F "audio_file=@audio.wav"

Optional: specify language

curl -X POST "http://localhost:PORT/asr" -F "audio_file=@audio.wav" -F "language=zh"

Step 3: Parse Result

Response format:

{
  "text": "Transcribed content...",
  "segments": [
    {"start": 0.0, "end": 2.5, "text": "First sentence"},
    {"start": 2.5, "end": 5.0, "text": "Second sentence"}
  ],
  "language": "zh"
}

Model Selection

ModelSize5-min videoAccuracy
------------------------------------
tiny75MB~30sFair
base142MB~1minGood
small466MB~3minBetter (recommended)
medium1.5GB~8minBest

Change model via environment variable: -e ASR_MODEL=medium

Supported Formats

Video: mp4, mkv, avi, mov, flv, wmv, webm, m4v

Audio: wav, m4a, mp3, aac, ogg, flac, wma, opus

Troubleshooting

IssueSolution
-----------------
Docker not availableInstall Docker Desktop
Container start failsCheck port availability
Transcription timeoutUse smaller model or split audio
ffmpeg not foundwinget install Gyan.FFmpeg

Related Modules

  • douyin-fetcher - Video download
  • douyin-analyzer - Content analysis
  • douyin-orchestrator - Workflow coordination

版本历史

共 1 个版本

  • v1.0.5 当前
    2026-05-07 09:40 安全 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

ai-intelligence

Douyin Video Transcribe

don068589
抖音视频转录套件。提取抖音/TikTok中国区视频音频,使用Whisper转录并分析内容。支持视频链接和本地文件。
★ 0 📥 877

Project Management System

don068589
面向AI智能体的综合项目管理平台,覆盖从项目启动到交付的全流程,支持结构化工作流、模板、质量关卡等。
★ 0 📥 558

Self-Improve

don068589
可插拔的AI智能体自我改进框架,自动学习错误、纠正和反馈,持续提升执行质量。
★ 0 📥 1,362